stack es inference put-amazonsagemaker cli command

Auth required Idempotent Scope: global

		elastic stack es inference put-amazonsagemaker \
  --task-type <task-type> \
  --amazonsagemaker-inference-id <amazonsagemaker-inference-id> \
  --service <service> \
  --service-settings <service-settings> \
  [options]
		
	

Create an Amazon SageMaker inference endpoint.

Behaviour flags:

--dry-run — validate all inputs and exit without performing any action

Options

--task-type enum required

The type of the inference task that the model will perform.

Values: text_embedding, completion, chat_completion, sparse_embedding, rerank

--amazonsagemaker-inference-id string required

The unique identifier of the inference endpoint.

--service enum required

The type of service supported for the specified task type. In this case, amazon_sagemaker.

Values: amazon_sagemaker

--service-settings string required

Settings used to install the inference model. These settings are specific to the amazon_sagemaker service and service_settings.api you specified.

--timeout string

Specifies the amount of time to wait for the inference endpoint to be created.

--chunking-settings string

The chunking configuration object. Applies only to the sparse_embedding or text_embedding task types. Not applicable to the rerank, completion, or chat_completion task types.

--task-settings string

Settings to configure the inference task. These settings are specific to the task type and service_settings.api you specified.

--input-file string

path to a JSON file to use as command input

--[no-]dry-run

validate all inputs and exit without performing any action (preview changes without applying them)

Global Options

--[no-]json: output as JSON