Loading

stack es inference put-amazonsagemaker cli command

Auth required Idempotent Scope: global
elastic stack es inference put-amazonsagemaker \
  --task-type <task-type> \
  --amazonsagemaker-inference-id <amazonsagemaker-inference-id> \
  --service <service> \
  --service-settings <service-settings> \
  [options]
		

Create an Amazon SageMaker inference endpoint.

Behaviour flags:

--dry-run — validate all inputs and exit without performing any action

--task-type enum required

The type of the inference task that the model will perform.

Values: text_embedding, completion, chat_completion, sparse_embedding, rerank

--amazonsagemaker-inference-id string required
The unique identifier of the inference endpoint.
--service enum required

The type of service supported for the specified task type. In this case, amazon_sagemaker.

Values: amazon_sagemaker

--service-settings string required
Settings used to install the inference model. These settings are specific to the amazon_sagemaker service and service_settings.api you specified.
--timeout string
Specifies the amount of time to wait for the inference endpoint to be created.
--chunking-settings string
The chunking configuration object. Applies only to the sparse_embedding or text_embedding task types. Not applicable to the rerank, completion, or chat_completion task types.
--task-settings string
Settings to configure the inference task. These settings are specific to the task type and service_settings.api you specified.
--input-file string
path to a JSON file to use as command input
--[no-]dry-run
validate all inputs and exit without performing any action (preview changes without applying them)
--[no-]json

output as JSON