Reading indices from older Elasticsearch versions
Elastic Stack Elastic Cloud Hosted Self Managed
Elasticsearch has full query and write support for indices created in the previous major version. If you have indices created in Elasticsearch versions 5 or 6, you can use the archive functionality to import them into newer Elasticsearch versions as well.
The archive functionality provides slower read-only access to older Elasticsearch data, for compliance or regulatory reasons, the occasional lookback or investigation, or to rehydrate parts of it. Access to the data is expected to be infrequent, and can therefore happen with limited performance and query capabilities.
For this, Elasticsearch can access older snapshot repositories going back to version 5. The legacy indices in the snapshot repository can either be restored or directly accessed through searchable snapshots so that the archived data won’t need to fully reside on local disks for access.
Old mappings are imported as close as possible into Elasticsearch 9, but only provide regular query capabilities on a select subset of fields:
- Numeric types
boolean
typeip
typegeo_point
typedate
types: the dateformat
setting on date fields is supported as long as it behaves similarly across these versions. If it is not, for example, when using custom date formats, this field can be updated on legacy indices so a user can change it if needed.keyword
type: Thenormalizer
setting on keyword fields is supported as long as it behaves similarly across these versions. If they do not, they can be updated on legacy indices if necessary.text
type: Scoring capabilities are limited, and all queries return constant scores equal to 1.0. Theanalyzer
settings on text fields are supported as long as they behave similarly across these versions. If they don't, they can be updated on legacy indices.- Multi-fields
- Field aliases
object
fields- some basic metadata fields, such as
_type
for querying Elasticsearch 5 indices - runtime fields
_source
field
Elasticsearch 5 indices with mappings that have multiple mapping types are collapsed together on a best-effort basis before being imported.
If auto-importing mappings does not work, or the new Elasticsearch version doesn't understand the mapping, it falls back to importing the index without the mapping, but stores the original mapping in the _meta section of the imported index. Users can then examine the legacy mapping using the GET mapping API, and manually update the mapping using the update mapping API, copying and adapting relevant sections of the legacy mapping to work with the current Elasticsearch version. While auto-import is expected to work in most cases, any failures should be raised with the Elastic team for future improvements.
Archive indices are read-only and provide data access through the search and field capabilities APIs. They do not support the Get API or any write APIs.
Archive indices allow running queries and aggregations if the field type supports them.
Due to _source
access, the data can also be reindexed to a new index that's fully compatible with the current Elasticsearch version.
To upgrade older Elasticsearch 5 or 6 clusters: