Region and hosting
This page lists the AWS and GCP regions where Elastic Inference Service (EIS) is available and explains how inference requests are routed.
AWS:
us-east-1(Virginia)
GCP:
asia-southeast1(Singapore)europe-west1(Belgium)us-east4(Virginia)
All inference requests sent through EIS are routed to the nearest region, regardless of where your Elasticsearch deployment or Serverless project is hosted.
Depending on the model being used, request processing may involve Elastic inference infrastructure and, in some cases, trusted third-party model providers. For example, ELSER and Jina requests are processed entirely within Elastic inference infrastructure. Other models, such as large language models or third-party embedding models, may involve additional processing by their respective model providers, which can operate in different cloud platforms or regions.