Initial setup
- Set up a remote cluster on both clusters.
### On cluster A ###
PUT _cluster/settings { "persistent": { "cluster": { "remote": { "clusterB": { "mode": "proxy", "skip_unavailable": true, "server_name": "clusterb.es.region-b.gcp.elastic-cloud.com", "proxy_socket_connections": 18, "proxy_address": "clusterb.es.region-b.gcp.elastic-cloud.com:9400" } } } } } ### On cluster B ### PUT _cluster/settings { "persistent": { "cluster": { "remote": { "clusterA": { "mode": "proxy", "skip_unavailable": true, "server_name": "clustera.es.region-a.gcp.elastic-cloud.com", "proxy_socket_connections": 18, "proxy_address": "clustera.es.region-a.gcp.elastic-cloud.com:9400" } } } } }
- Set up bi-directional cross-cluster replication.
### On cluster A ###
PUT /_ccr/auto_follow/logs-generic-default { "remote_cluster": "clusterB", "leader_index_patterns": [ ".ds-logs-generic-default-20*" ], "leader_index_exclusion_patterns":"*-replicated_from_clustera", "follow_index_pattern": "{{leader_index}}-replicated_from_clusterb" } ### On cluster B ### PUT /_ccr/auto_follow/logs-generic-default { "remote_cluster": "clusterA", "leader_index_patterns": [ ".ds-logs-generic-default-20*" ], "leader_index_exclusion_patterns":"*-replicated_from_clusterb", "follow_index_pattern": "{{leader_index}}-replicated_from_clustera" }
Important
Existing data on the cluster will not be replicated by_ccr/auto_follow
even though the patterns may match. This function will only replicate newly created backing indices (as part of the data stream).Important
Useleader_index_exclusion_patterns
to avoid recursion.Tip
follow_index_pattern
allows lowercase characters only.Tip
This step cannot be executed via the Kibana UI due to the lack of an exclusion pattern in the UI. Use the API in this step. - Set up the Logstash configuration file. This example uses the input generator to demonstrate the document count in the clusters. Reconfigure this section to suit your own use case.
### On Logstash server ### ### This is a logstash config file ### input { generator{ message => 'Hello World' count => 100 } } output { elasticsearch { hosts => ["https://clustera.es.region-a.gcp.elastic-cloud.com:9243","https://clusterb.es.region-b.gcp.elastic-cloud.com:9243"] user => "logstash-user" password => "same_password_for_both_clusters" } }
Important
The key point is that whencluster A
is down, all traffic will be automatically redirected tocluster B
. Oncecluster A
comes back, traffic is automatically redirected back tocluster A
again. This is achieved by the optionhosts
where multiple ES cluster endpoints are specified in the array[clusterA, clusterB]
.Tip
Set up the same password for the same user on both clusters to use this load-balancing feature. - Start Logstash with the earlier configuration file.
### On Logstash server ### bin/logstash -f multiple_hosts.conf
- Observe document counts in data streams. The setup creates a data stream named
logs-generic-default
on each of the clusters. Logstash will write 50% of the documents tocluster A
and 50% of the documents tocluster B
when both clusters are up. Bi-directional cross-cluster replication will create one more data stream on each of the clusters with the-replication_from_cluster{a|b}
suffix. At the end of this step:- data streams on cluster A contain:
- 50 documents in
logs-generic-default-replicated_from_clusterb
- 50 documents in
logs-generic-default
- 50 documents in
- data streams on cluster B contain:
- 50 documents in
logs-generic-default-replicated_from_clustera
- 50 documents in
logs-generic-default
- 50 documents in
- data streams on cluster A contain:
- Queries should be set up to search across both data streams. A query on
logs*
, on either of the clusters, returns 100 hits in total.GET logs*/_search?size=0