Elastic Inference Service supported models
The following tables list the models supported by Elastic Inference Service by model type.
The corresponding Kibana connectors and inference endpoints for these models are created automatically. To customize the configuration, you can create your own connectors or inference endpoints.
Note
The Inference Regions column shows the regions where inference requests are processed and where data is sent.
For region availability and request routing, refer to Region and hosting. For rate limits, refer to Rate limits.
| Author | Name | ID | Model Card | Provider Terms | Input Modalities | Output Modalities | EOL Date | Data Retention Period (Days) | Data Used To Train Models? | Inference Regions | Release Status | Stack Version |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anthropic | Claude Haiku 4.5 | anthropic-claude-4.5-haiku | Claude Haiku 4.5 System Card | Google Terms AWS Terms |
Text, Image, File | Text | 0 | No | US | Generally Available | 9.3 | |
| Anthropic | Claude Opus 4.5 | anthropic-claude-4.5-opus | Claude Opus 4.5 System Card | Google Terms AWS Terms |
Text, Image, File | Text | 2026-11-24 | 0 | No | US | Legacy (EOL Soon) | 9.3 |
| Anthropic | Claude Opus 4.6 | anthropic-claude-4.6-opus | Claude Opus 4.6 System Card | Google Terms AWS Terms |
Text, Image, File | Text | 2027-02-05 | 0 | No | US | Generally Available | 9.3 |
| Anthropic | Claude Opus 4.7 | anthropic-claude-4.7-opus | Anthropic Claude Opus 4.7 | Google Terms AWS Terms |
Text, Image, File | Text | 0 | No | US | Generally Available | 9.4 | |
| Anthropic | Claude Sonnet 4.5 | anthropic-claude-4.5-sonnet | Anthropic Claude 4.5 Sonnet | AWS terms | Text, Image, File | Text | 2026-09-29 | 0 | No | US | Legacy (EOL Soon) | 9.2 |
| Anthropic | Claude Sonnet 4.6 | anthropic-claude-4.6-sonnet | Claude Sonnet 4.6 System Card | Google Terms AWS Terms |
Text, Image, File | Text | 0 | No | US, SG | Generally Available | 9.3 | |
| Gemini 2.5 Flash | google-gemini-2.5-flash | Google Gemini 2.5 Flash | Google terms | Text, Image, File | Text | 2026-06-17 | 0 | No | US | Legacy (EOL Soon) | 9.3 | |
| Gemini 2.5 Flash Lite | google-gemini-2.5-flash-lite | Gemini 2.5 Flash Lite | Google terms | Text, Image, File | Text | 0 | No | US | Generally Available | 9.4 | ||
| Gemini 2.5 Pro | google-gemini-2.5-pro | Google Gemini 2.5 Pro | Google terms | Text, Image, File | Text | 2026-06-17 | 0 | No | US | Legacy (EOL Soon) | 9.3 | |
| Gemini 3 Flash | google-gemini-3-flash | Gemini 3 Flash | Google terms | Text, Image, File | Text | 0 | No | US | Tech Preview | 9.4 | ||
| Gemini 3.1 Pro | google-gemini-3.1-pro | Gemini 3.1 Pro | Google terms | Text, Image, File | Text | 0 | No | US | Tech Preview | 9.4 | ||
| OpenAI | GPT-4.1 | openai-gpt-4.1 | OpenAI GPT 4.1 | Microsoft Terms | Text, Image, File | Text | 2026-07-01 | 0 | No | US | Legacy (EOL Soon) | 9.3 |
| OpenAI | GPT-4.1 Mini | openai-gpt-4.1-mini | OpenAI GPT 4.1 Mini | Microsoft Terms | Text, Image, File | Text | 2026-07-01 | 0 | No | US | Legacy (EOL Soon) | 9.3 |
| OpenAI | GPT-5.2 | openai-gpt-5.2 | OpenAI GPT 5.2 | Microsoft Terms | Text, Image, File | Text | 2027-05-12 | 0 | No | US | Generally Available | 9.3 |
| OpenAI | GPT-5.4 | openai-gpt-5.4 | OpenAI GPT 5.4 | Microsoft Terms | Text, Image, File | Text | 0 | No | US | Generally Available | 9.4 | |
| OpenAI | GPT-5.4 Mini | openai-gpt-5.4-mini | OpenAI GPT 5.4 Mini | Microsoft Terms | Text, Image, File | Text | 0 | No | US | Generally Available | 9.4 | |
| OpenAI | GPT-5.4 Nano | openai-gpt-5.4-nano | OpenAI GPT 5.4 Nano | Microsoft Terms | Text, Image, File | Text | 0 | No | US | Generally Available | 9.4 | |
| OpenAI | GPT-OSS 120B | openai-gpt-oss-120b | OpenAI GPT-OSS-120B | Google Terms Together AI Terms DeepInfra Terms AWS Terms |
Text | Text | 0 | No | US | Generally Available | 9.3 | |
| OpenAI | GPT-OSS 20B | openai-gpt-oss-20b | OpenAI GPT-OSS-20B | Together terms DeepInfra terms Groq terms |
Text | Text | 0 | No | US | Generally Available | 9.4 |
| Author | Name | ID | Model Card | Provider Terms | Input Modalities | Output Modalities | EOL Date | Data Retention Period (Days) | Data Used To Train Models? | Inference Regions | Release Status | Stack Version |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jina | CLIP v2 | jina-clip-v2 | jina-clip-v2 | Elastic Terms | Text, Image | Embedding | 0 | No | US, SG, EU | Generally Available | 9.3 | |
| Elastic | ELSER v2 | elser_model_2 | ELSER docs | Elastic Terms | Text | Embedding | 0 | No | US, SG, EU | Generally Available | 9.1 | |
| Jina | Embeddings v3 | jina-embeddings-v3 | jina-embeddings-v3 | Elastic Terms | Text | Embedding | 0 | No | US, SG, EU | Generally Available | 9.3 | |
| Jina | Embeddings v5 Omni Nano | jina-embeddings-v5-omni-nano | jina-embeddings-v5-omni-nano | Elastic Terms | Text, Image, Video, Audio, File | Embedding | 0 | No | US, SG, EU | Generally Available | 9.4 | |
| Jina | Embeddings v5 Omni Small | jina-embeddings-v5-omni-small | jina-embeddings-v5-omni-small | Elastic Terms | Text, Image, Video, Audio, File | Embedding | 0 | No | US, SG, EU | Generally Available | 9.4 | |
| Jina | Embeddings v5 Text Nano | jina-embeddings-v5-text-nano | jina-embeddings-v5-text-nano | Elastic Terms | Text | Embedding | 0 | No | US, SG, EU | Generally Available | 9.3 | |
| Jina | Embeddings v5 Text Small | jina-embeddings-v5-text-small | jina-embeddings-v5-text-small | Elastic Terms | Text | Embedding | 0 | No | US, SG, EU | Generally Available | 9.3 | |
| Gemini Embedding 1 | google-gemini-embedding-001 | Gemini Embedding 001 | Google terms | Text | Embedding | 55 days | No | US | Generally Available | 9.3 | ||
| Microsoft | Multilingual E5 Large | microsoft-multilingual-e5-large | Multilingual E5 Large System Card | DeepInfra terms | Text | Embedding | 0 | No | US | Generally Available | 9.3 | |
| OpenAI | Text Embedding 003 Large | openai-text-embedding-3-large | Text Embedding 003 Large | OpenAI terms | Text | Text | Unknown | No | US | Generally Available | 9.3 | |
| OpenAI | Text Embedding 003 Small | openai-text-embedding-3-small | Text Embedding 003 Small | OpenAI terms | Text | Text | Unknown | No | US | Generally Available | 9.3 |
| Author | Name | ID | Model Card | Provider Terms | Input Modalities | Output Modalities | EOL Date | Data Retention Period (Days) | Data Used To Train Models? | Inference Regions | Release Status | Stack Version |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jina | Reranker v2 | jina-reranker-v2-base-multilingual | jina-reranker-v2-base-multilingual | Elastic Terms | Text | Text | 0 | No | US, SG, EU | Generally Available | 9.3 | |
| Jina | Reranker v3 | jina-reranker-v3 | jina-reranker-v3 | Elastic Terms | Text | Text | 0 | No | US, SG, EU | Generally Available | 9.3 |
Important
- The applicable terms of use, uptime, and performance for each of the AI models available with EIS are each described in the applicable AI model's Provider Terms and Model Card.
- Prior to using the AI model with EIS, Customers are responsible for reviewing and agreeing to the chosen AI model's Provider Terms to understand the availability and data practices of the AI model's provider.
- After the listed end-of-life (EOL) date, the model is no longer available for inference use and requests will fail. You need to actively transition to another model before the EOL date, there is no automated migration.
- Elastic makes every effort to use third party providers who do not use inputs to train models, and do not retain any data (zero data retention). Browse the tables on this page to double-check the status of a specific model.