stack es inference put-alibabacloud cli command
Auth required
Idempotent
Scope: global
elastic stack es inference put-alibabacloud \
--task-type <task-type> \
--alibabacloud-inference-id <alibabacloud-inference-id> \
--service <service> \
--service-settings <service-settings> \
[options]
Create an AlibabaCloud AI Search inference endpoint.
--task-typeenumrequired-
The type of the inference task that the model will perform.
Values: completion, rerank, sparse_embedding, text_embedding
--alibabacloud-inference-idstringrequired- The unique identifier of the inference endpoint.
--serviceenumrequired-
The type of service supported for the specified task type. In this case,
alibabacloud-ai-search.Values: alibabacloud-ai-search
--service-settingsstringrequired- Settings used to install the inference model. These settings are specific to the
alibabacloud-ai-searchservice. --timeoutstring- Specifies the amount of time to wait for the inference endpoint to be created.
--chunking-settingsstring- The chunking configuration object. Applies only to the
sparse_embeddingortext_embeddingtask types. Not applicable to thererankorcompletiontask types. --task-settingsstring- Settings to configure the inference task. These settings are specific to the task type you specified.
--input-filestring- path to a JSON file to use as command input
-V--[no-]version- Print the Elastic CLI version
--config-filestring- path to a config file (default: ~/.elasticrc.yml)
--use-contextstring- override the active context from the config file
--command-profilestring- restrict available commands to a deployment profile (serverless, stack, default)
--[no-]json- output as JSON
--output-fieldsstring- comma-separated list of fields to include in output (dot-notation supported)
--output-templatestring- Mustache-like template for custom text output (e.g. "{{id}}: {{name}}")
--[no-]dry-run-
validate all inputs and exit without performing any action (preview changes without applying them)