AEROSPIKE_ERR_CLUSTER_CHANGE when querying or scanning a namespace

Aerospike_Knowledge · March 3, 2016, 8:15pm

The Aerospike Knowledge Base has moved to https://support.aerospike.com. Content on https://discuss.aerospike.com is being migrated to either https://support.aerospike.com or https://docs.aerospike.com. Maintenance on articles stored in this repository ceased on December 31st 2022 and this article may be stale. If you have any questions, please do not hesitate to raise a case via https://support.aerospike.com.

Synopsis

When I query or scan a namespace, why does aql return “Error: (7) AEROSPIKE_ERR_CLUSTER_CHANGE”? I get the same error when I attempt to run the operation from C or Java client.

In Java, the exception is:

com.aerospike.client.AerospikeException: Error Code 7: Cluster key mismatch

I know that the node is alive, the node belongs to a cluster, and the namespace holds records. I have run queries and scans in the past. Why do I see this error?

Answer

Scans during cluster changes will not result in accurate data (potentially duplicate or incomplete results). Thus, Aerospike prevents the scans from proceeding if the cluster is currently rebalancing data.

aql returns the error “Error: (7) AEROSPIKE_ERR_CLUSTER_CHANGE” because the cluster is migrating partitions.

In the following example, we see that the nodes are migrating. Notice the non-zero values under the ‘Migrates’ column. The aql scan cannot complete successfully:

Monitor> info node
===NODES===
2015-10-14 15:57:11.885725
Sorting by IP, in Ascending order: 
ip:port                 Build   Cluster      Cluster   Free   Free   Migrates              Node         Principal   Replicated    Sys
                            .      Size   Visibility   Disk    Mem          .                ID                ID      Objects   Free
                            .         .            .    pct    pct          .                 .                 .            .    Mem
192.168.160.129:3000    3.6.2         2         true      0     99      (0,1)   BB9DD05EF290C00   BB9F06318290C00       84,028     75
192.168.160.132:3000    3.6.2         2         true      0     98   (6244,0)   BB9F06318290C00   BB9F06318290C00      542,292     69
Number of nodes displayed: 2

aql> select * from test
Error: (7) AEROSPIKE_ERR_CLUSTER_CHANGE

You see the same error when you query a secondary index while the nodes are migrating.

After the nodes complete migrations, run the command again or issue a scan using the clients and you should be able to get results as expected.

Workaround

You can however disable this check from the client policies if you still prefer the scans going through even if they may give inaccurate results.

Example in AQL,

aql> set FAIL_ON_CLUSTER_CHANGE false
FAIL_ON_CLUSTER_CHANGE = false

Some older versions of aql may not have that flag, you can use “HELP SET” to confirm.

Example in Java, the following policy can be switched to false. https://www.aerospike.com/apidocs/java/com/aerospike/client/policy/ScanPolicy.html#failOnClusterChange

public boolean failOnClusterChange

Example in Python, the scan policy can be set to false on ‘fail_on_cluster_change’

       s.foreach(callback,{'fail_on_cluster_change':False})

Mannoj · March 8, 2018, 9:57pm

I removed the arguement - --no-cluster-change . Its going on. Do you think apps are not responding to their GETS at this moment?

rbotzer · March 8, 2018, 10:09pm

This only refers to scans during migration, not to reading single records (i.e. GET).

Albot · March 8, 2018, 10:37pm

What if migration threads are set to 0? Will the results still be inaccurate?

Topic		Replies	Views
Error: (7) AEROSPIKE_ERR_CLUSTER_CHANGE	4	1641	May 11, 2017
Inconsistent result if fetching a key when 1 node crashed on 4 node Aerospike cluster (3.9.0) AQL	31	3879	October 14, 2016
Configuration review for file backed namespace Configuration	1	1267	May 7, 2015
Aerospike_err_cluster AQL	2	1948	July 6, 2017
ASBackup fails with Error while running node scan for BB94735220A0102 - code 7: AEROSPIKE_ERR_CLUSTER_CHANGE at src/main/aerospike/aerospike_scan.c:197	3	941	June 13, 2019

AEROSPIKE_ERR_CLUSTER_CHANGE when querying or scanning a namespace

Synopsis

Answer

Workaround

Related Topics