Loading

Elastic Inference Service supported models

The following tables list the models supported by Elastic Inference Service by model type.

The corresponding Kibana connectors and inference endpoints for these models are created automatically. To customize the configuration, you can create your own connectors or inference endpoints.

Note

The Inference Regions column shows the regions where inference requests are processed and where data is sent.

For region availability and request routing, refer to Region and hosting. For rate limits, refer to Rate limits.

Scroll horizontally to view more information.
Author Name ID Model Card Provider Terms Input Modalities Output Modalities EOL Date Data Retention Period (Days) Data Used To Train Models? Inference Regions Release Status Stack Version
Anthropic Claude Haiku 4.5 anthropic-claude-4.5-haiku Claude Haiku 4.5 System Card Google Terms
AWS Terms
Text, Image, File Text 0 No US Generally Available 9.3
Anthropic Claude Opus 4.5 anthropic-claude-4.5-opus Claude Opus 4.5 System Card Google Terms
AWS Terms
Text, Image, File Text 2026-11-24 0 No US Legacy (EOL Soon) 9.3
Anthropic Claude Opus 4.6 anthropic-claude-4.6-opus Claude Opus 4.6 System Card Google Terms
AWS Terms
Text, Image, File Text 2027-02-05 0 No US Generally Available 9.3
Anthropic Claude Opus 4.7 anthropic-claude-4.7-opus Anthropic Claude Opus 4.7 Google Terms
AWS Terms
Text, Image, File Text 0 No US Generally Available 9.4
Anthropic Claude Sonnet 4.5 anthropic-claude-4.5-sonnet Anthropic Claude 4.5 Sonnet AWS terms Text, Image, File Text 2026-09-29 0 No US Legacy (EOL Soon) 9.2
Anthropic Claude Sonnet 4.6 anthropic-claude-4.6-sonnet Claude Sonnet 4.6 System Card Google Terms
AWS Terms
Text, Image, File Text 0 No US, SG Generally Available 9.3
Google Gemini 2.5 Flash google-gemini-2.5-flash Google Gemini 2.5 Flash Google terms Text, Image, File Text 2026-06-17 0 No US Legacy (EOL Soon) 9.3
Google Gemini 2.5 Flash Lite google-gemini-2.5-flash-lite Gemini 2.5 Flash Lite Google terms Text, Image, File Text 0 No US Generally Available 9.4
Google Gemini 2.5 Pro google-gemini-2.5-pro Google Gemini 2.5 Pro Google terms Text, Image, File Text 2026-06-17 0 No US Legacy (EOL Soon) 9.3
Google Gemini 3 Flash google-gemini-3-flash Gemini 3 Flash Google terms Text, Image, File Text 0 No US Tech Preview 9.4
Google Gemini 3.1 Pro google-gemini-3.1-pro Gemini 3.1 Pro Google terms Text, Image, File Text 0 No US Tech Preview 9.4
OpenAI GPT-4.1 openai-gpt-4.1 OpenAI GPT 4.1 Microsoft Terms Text, Image, File Text 2026-07-01 0 No US Legacy (EOL Soon) 9.3
OpenAI GPT-4.1 Mini openai-gpt-4.1-mini OpenAI GPT 4.1 Mini Microsoft Terms Text, Image, File Text 2026-07-01 0 No US Legacy (EOL Soon) 9.3
OpenAI GPT-5.2 openai-gpt-5.2 OpenAI GPT 5.2 Microsoft Terms Text, Image, File Text 2027-05-12 0 No US Generally Available 9.3
OpenAI GPT-5.4 openai-gpt-5.4 OpenAI GPT 5.4 Microsoft Terms Text, Image, File Text 0 No US Generally Available 9.4
OpenAI GPT-5.4 Mini openai-gpt-5.4-mini OpenAI GPT 5.4 Mini Microsoft Terms Text, Image, File Text 0 No US Generally Available 9.4
OpenAI GPT-5.4 Nano openai-gpt-5.4-nano OpenAI GPT 5.4 Nano Microsoft Terms Text, Image, File Text 0 No US Generally Available 9.4
OpenAI GPT-OSS 120B openai-gpt-oss-120b OpenAI GPT-OSS-120B Google Terms
Together AI Terms
DeepInfra Terms
AWS Terms
Text Text 0 No US Generally Available 9.3
OpenAI GPT-OSS 20B openai-gpt-oss-20b OpenAI GPT-OSS-20B Together terms
DeepInfra terms
Groq terms
Text Text 0 No US Generally Available 9.4
Scroll horizontally to view more information.
Author Name ID Model Card Provider Terms Input Modalities Output Modalities EOL Date Data Retention Period (Days) Data Used To Train Models? Inference Regions Release Status Stack Version
Jina CLIP v2 jina-clip-v2 jina-clip-v2 Elastic Terms Text, Image Embedding 0 No US, SG, EU Generally Available 9.3
Elastic ELSER v2 elser_model_2 ELSER docs Elastic Terms Text Embedding 0 No US, SG, EU Generally Available 9.1
Jina Embeddings v3 jina-embeddings-v3 jina-embeddings-v3 Elastic Terms Text Embedding 0 No US, SG, EU Generally Available 9.3
Jina Embeddings v5 Omni Nano jina-embeddings-v5-omni-nano jina-embeddings-v5-omni-nano Elastic Terms Text, Image, Video, Audio, File Embedding 0 No US, SG, EU Generally Available 9.4
Jina Embeddings v5 Omni Small jina-embeddings-v5-omni-small jina-embeddings-v5-omni-small Elastic Terms Text, Image, Video, Audio, File Embedding 0 No US, SG, EU Generally Available 9.4
Jina Embeddings v5 Text Nano jina-embeddings-v5-text-nano jina-embeddings-v5-text-nano Elastic Terms Text Embedding 0 No US, SG, EU Generally Available 9.3
Jina Embeddings v5 Text Small jina-embeddings-v5-text-small jina-embeddings-v5-text-small Elastic Terms Text Embedding 0 No US, SG, EU Generally Available 9.3
Google Gemini Embedding 1 google-gemini-embedding-001 Gemini Embedding 001 Google terms Text Embedding 55 days No US Generally Available 9.3
Microsoft Multilingual E5 Large microsoft-multilingual-e5-large Multilingual E5 Large System Card DeepInfra terms Text Embedding 0 No US Generally Available 9.3
OpenAI Text Embedding 003 Large openai-text-embedding-3-large Text Embedding 003 Large OpenAI terms Text Text Unknown No US Generally Available 9.3
OpenAI Text Embedding 003 Small openai-text-embedding-3-small Text Embedding 003 Small OpenAI terms Text Text Unknown No US Generally Available 9.3
Scroll horizontally to view more information.
Author Name ID Model Card Provider Terms Input Modalities Output Modalities EOL Date Data Retention Period (Days) Data Used To Train Models? Inference Regions Release Status Stack Version
Jina Reranker v2 jina-reranker-v2-base-multilingual jina-reranker-v2-base-multilingual Elastic Terms Text Text 0 No US, SG, EU Generally Available 9.3
Jina Reranker v3 jina-reranker-v3 jina-reranker-v3 Elastic Terms Text Text 0 No US, SG, EU Generally Available 9.3
Important
  • The applicable terms of use, uptime, and performance for each of the AI models available with EIS are each described in the applicable AI model's Provider Terms and Model Card.
  • Prior to using the AI model with EIS, Customers are responsible for reviewing and agreeing to the chosen AI model's Provider Terms to understand the availability and data practices of the AI model's provider.
  • After the listed end-of-life (EOL) date, the model is no longer available for inference use and requests will fail. You need to actively transition to another model before the EOL date, there is no automated migration.
  • Elastic makes every effort to use third party providers who do not use inputs to train models, and do not retain any data (zero data retention). Browse the tables on this page to double-check the status of a specific model.