Elastic Inference Service for self-managed clusters

Elastic Inference Service (EIS) is available with zero setup on Elastic Cloud Hosted and Serverless deployments. To use EIS with other deployment types, you can use Cloud Connect. Cloud Connect enables you to use Elastic Cloud services in your self-managed cluster without having to install and maintain their infrastructure yourself.

You can use EIS to enable features such as:

For a full list of EIS-powered features, refer to AI features powered by EIS.

Prerequisites

Before you can use EIS with your self-managed cluster, ensure you meet the following requirements:

Your self-managed cluster is on an Enterprise self-managed license or an active self-managed trial license
You have an Elastic Cloud account with either an active Cloud Trial or billing information configured

Set up EIS with Cloud Connect

Open Cloud Connect

In your self-managed Kibana instance, navigate to the Cloud Connect page using the search bar.

×
Get your Cloud Connect API key

Sign up or log in to Elastic Cloud and get the Cloud Connect API key:
- If you don’t have an account yet, click Sign up and follow the prompts to create your account and start a free Cloud Trial.
- If you already have an Elastic Cloud account, click Log in.
Connect your cluster

Copy the Cloud Connect API key, paste it into your self-managed cluster's Cloud Connect page, then click Connect.
Enable Elastic Inference Service

On the Cloud connected services page, click Connect for Elastic Inference Service.

×

After you connect Elastic Inference Service through Cloud Connect, Elasticsearch automatically creates multiple inference endpoints for search and chat use cases, along with corresponding Kibana AI connectors. Supported Kibana features now use these connectors automatically.

Test EIS through Cloud Connect with semantic search

In this example, you create an index with a semantic_text field, index a document, then run a query that returns a semantically related match.

In Dev Tools, run the following requests:

Create an index with a semantic_text field
```
				PUT /semantic-search-eis
					{
  "mappings": {
    "properties": {
      "text": {
        "type": "semantic_text"
      }
    }
  }
}
		
```
1. Because you already enabled EIS, the semantic_text field type uses EIS through the default inference endpoint (.elser-2-elastic). To learn more, refer to semantic_text.

Index a document

						POST /semantic-search-eis/_doc
					{
  "text": "Aberdeen Football Club"
}
		
	

Run a search query

						GET /semantic-search-eis/_search
					{
  "query": {
    "match": {
      "text": "soccer"
    }
  }
}
		
	

The response should include the indexed document:

		{
  "took": 161,
  "timed_out": false,
  "_shards": {
    "total": 1,
    "successful": 1,
    "skipped": 0,
    "failed": 0
  },
  "hits": {
    "total": {
      "value": 1,
      "relation": "eq"
    },
    "max_score": 4.729913,
    "hits": [
      {
        "_index": "semantic-search-eis",
        "_id": "oyH935sBG2FaZ-zOMrer",
        "_score": 4.729913,
        "_source": {
          "text": "Aberdeen Football Club"
        }
      }
    ]
  }
}
		
	

Elastic Inference Service for self-managed clusters

Prerequisites

Set up EIS with Cloud Connect

Open Cloud Connect

Get your Cloud Connect API key

Connect your cluster

Enable Elastic Inference Service

Test EIS through Cloud Connect with semantic search

Create an index with a `semantic_text` field

Index a document

Run a search query

Supported models with EIS through Cloud Connect

LLMs

Embedding and rerank models

Regions and billing