stack es inference put-amazonsagemaker cli command
Auth required
Idempotent
Scope: global
elastic stack es inference put-amazonsagemaker \
--task-type <task-type> \
--amazonsagemaker-inference-id <amazonsagemaker-inference-id> \
--service <service> \
--service-settings <service-settings> \
[options]
Create an Amazon SageMaker inference endpoint.
Behaviour flags:
--dry-run — validate all inputs and exit without performing any action
--task-typeenumrequired-
The type of the inference task that the model will perform.
Values: text_embedding, completion, chat_completion, sparse_embedding, rerank
--amazonsagemaker-inference-idstringrequired- The unique identifier of the inference endpoint.
--serviceenumrequired-
The type of service supported for the specified task type. In this case,
amazon_sagemaker.Values: amazon_sagemaker
--service-settingsstringrequired- Settings used to install the inference model. These settings are specific to the
amazon_sagemakerservice andservice_settings.apiyou specified. --timeoutstring- Specifies the amount of time to wait for the inference endpoint to be created.
--chunking-settingsstring- The chunking configuration object. Applies only to the
sparse_embeddingortext_embeddingtask types. Not applicable to thererank,completion, orchat_completiontask types. --task-settingsstring- Settings to configure the inference task. These settings are specific to the task type and
service_settings.apiyou specified. --input-filestring- path to a JSON file to use as command input
--[no-]dry-run- validate all inputs and exit without performing any action (preview changes without applying them)
--[no-]json-
output as JSON