Set up and configure semantic_text fields
This page provides instructions for setting up and configuring semantic_text fields. Learn how to configure inference endpoints, including default and preconfigured options, ELSER on EIS, custom endpoints, and dedicated endpoints for ingestion and search operations.
You can configure inference endpoints for semantic_text fields in the following ways:
- Use the default endpoint or a preconfigured endpoint, including ELSER on EIS.
- Use a custom inference endpoint.
- Use dedicated inference endpoints for ingestion and search.
If you use a custom inference endpoint through your ML node and not through Elastic Inference Service (EIS), the recommended method is to use dedicated endpoints for ingestion and search.
This section shows you how to set up semantic_text with different default and preconfigured endpoints.
To use the default .elser-2-elasticsearch endpoint, you can set up semantic_text with the following API request:
PUT my-index-000001
{
  "mappings": {
    "properties": {
      "inference_field": {
        "type": "semantic_text"
      }
    }
  }
}
If you don't specify an inference endpoint, the inference_id field defaults to
.elser-2-elasticsearch, a preconfigured endpoint for the elasticsearch service.
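Once the mapping is in place, you can index documents as usual and Elasticsearch generates the embeddings at ingest time. The document text below is illustrative:
POST my-index-000001/_doc
{
  "inference_field": "Elasticsearch stores and searches your data at scale."
}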
To use the preconfigured .elser-2-elastic endpoint that utilizes the ELSER model as a service through the Elastic Inference Service (ELSER on EIS), specify its inference_id when you set up semantic_text:
PUT my-index-000001
{
  "mappings": {
    "properties": {
      "inference_field": {
        "type": "semantic_text",
        "inference_id": ".elser-2-elastic"
      }
    }
  }
}
To use a custom inference endpoint instead of the default endpoint, you
must create it with the Create inference API
and specify its inference_id when setting up the semantic_text field type.
PUT my-index-000002
{
"mappings": {
"properties": {
"inference_field": {
"type": "semantic_text",
"inference_id": "my-openai-endpoint"
}
}
}
}
- The inference_id of the inference endpoint to use to generate embeddings.
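Before referencing my-openai-endpoint in a mapping, you need to create it with the Create inference API. A minimal sketch using the openai service follows; the model and API key placeholder are illustrative:
PUT _inference/text_embedding/my-openai-endpoint
{
  "service": "openai",
  "service_settings": {
    "api_key": "<api_key>",
    "model_id": "text-embedding-3-small"
  }
}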
If you use a custom inference endpoint through your ML node and not through Elastic Inference Service, the recommended way to use semantic_text is by having dedicated inference endpoints for ingestion and search.
This ensures that search speed remains unaffected by ingestion workloads, and vice versa. After creating dedicated inference endpoints for both, you can reference them using the inference_id
and search_inference_id parameters when setting up the index mapping for an index that uses the semantic_text field.
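For example, the ingest endpoint referenced below might be created like this (a sketch using the elasticsearch service; the model ID and allocation settings are illustrative, and the search endpoint is created the same way under its own name):
PUT _inference/sparse_embedding/my-elser-endpoint-for-ingest
{
  "service": "elasticsearch",
  "service_settings": {
    "model_id": ".elser_model_2",
    "num_allocations": 1,
    "num_threads": 1
  }
}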
PUT my-index-000003
{
"mappings": {
"properties": {
"inference_field": {
"type": "semantic_text",
"inference_id": "my-elser-endpoint-for-ingest",
"search_inference_id": "my-elser-endpoint-for-search"
}
}
}
}
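With this mapping, searches against the field automatically use the endpoint configured in search_inference_id, while ingestion uses inference_id. A sketch of a semantic query (the query text is illustrative):
GET my-index-000003/_search
{
  "query": {
    "semantic": {
      "field": "inference_field",
      "query": "How do I configure inference endpoints?"
    }
  }
}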
Configuring index_options for sparse vector fields lets you configure token pruning, which controls whether non-significant or overly frequent tokens are omitted to improve query performance.
The following example enables token pruning and sets pruning thresholds for a sparse_vector field:
PUT semantic-embeddings
{
"mappings": {
"properties": {
"content": {
"type": "semantic_text",
"index_options": {
"sparse_vector": {
"prune": true,
"pruning_config": {
"tokens_freq_ratio_threshold": 10,
"tokens_weight_threshold": 0.5
}
}
}
}
}
}
}
- (Optional) prune enables pruning. Default is true.
- (Optional) tokens_freq_ratio_threshold prunes tokens whose frequency is more than 10 times the average token frequency in the field. Default is 5.
- (Optional) tokens_weight_threshold prunes tokens whose weight is lower than 0.5. Default is 0.4.
Learn more about sparse_vector index options settings and token pruning.
Configuring index_options for dense vector fields lets you control how dense vectors are indexed for kNN search. You can select the indexing algorithm, such as int8_hnsw, int4_hnsw, or disk_bbq, among other available index options.
The following example shows how to configure index_options for a dense vector field using the int8_hnsw indexing algorithm:
PUT semantic-embeddings
{
"mappings": {
"properties": {
"content": {
"type": "semantic_text",
"index_options": {
"dense_vector": {
"type": "int8_hnsw",
"m": 15,
"ef_construction": 90,
"confidence_interval": 0.95
}
}
}
}
}
}
- (Optional) type selects the int8_hnsw vector quantization strategy. Learn about default quantization types.
- (Optional) m controls how many neighbors each node connects to in the HNSW graph; here it is set to 15. Default is 16.
- (Optional) ef_construction controls how many candidate neighbors are considered during graph construction; here it is set to 90. Default is 100.
- (Optional) confidence_interval limits the value range used during quantization to balance accuracy with memory efficiency; here it is set to 0.95.