Data integrity (select aggregation incorrect?) during node restart

moreno · June 4, 2015, 7:51am

Hi all, I have a cluster with 3 nodes.

I inserted 1 million records in a Set.

With AMC I see that:

Node1 has 340000 Master Objects and 337000 Replica Objects

Node2 has 340000 Master Objects and 335000 Replica Objects

Node3 has 320000 Master Objects and 328000 Replica Objects

Then I stop the Aerospike process (service aerospike stop) on Node3.

Immediately I see:

Node1 has 510000 Master Objects and few Replica Objects

Node2 has 490000 Master Objects and few Replica Objects

In this phase I also see the Replica Objects increasing (at the end of the migration phase Replica Objects are, as expected, 490000 for Node1 and 510000 for Node2).

During the migration phase a select aggregation (a select count) of the records in the Set returns the correct value: 1 million.

When I restart Node3, Master Objects immediately decrease for Node1 and Node2 (to 340000 and 330000) and are 2000 for Node3.

In this situation “select aggregation” returns about 700000 records, and the result stays wrong until migration is over.

The question is: why is the data immediately consistent when a node is stopped, but inconsistent when a node is restarted? (In the first case I don’t need to wait until the migration phase is over.)

Many thanks for any help.

Moreno

raj · June 4, 2015, 10:38am

Moreno,

This is expected behavior:

http://www.aerospike.com/docs/operations/manage/scans/

Look at recommendation 2 at the bottom. It is not advisable to run a scan while migrations are going on. We are working on improving the behavior there … Will keep you posted.

R
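A minimal sketch of the "wait for migrations before scanning" advice above. It assumes you already have each node's statistics as the `name=value;name=value` string that `asinfo -v statistics` prints. The counter names checked here (`migrate_progress_send`, `migrate_progress_recv`, `migrate_partitions_remaining`) are assumptions that depend on the server version, and the function names are hypothetical.

```python
# Assumed migration counters; which ones exist depends on the server version.
MIGRATION_COUNTERS = (
    "migrate_progress_send",
    "migrate_progress_recv",
    "migrate_partitions_remaining",
)


def parse_stats(raw):
    """Turn a 'k1=v1;k2=v2' statistics string into a dict of strings."""
    stats = {}
    for pair in raw.strip().split(";"):
        if "=" in pair:
            key, value = pair.split("=", 1)
            stats[key] = value
    return stats


def migrations_done(raw_stats_per_node):
    """Return True only if every node reports zero for all migration counters.

    raw_stats_per_node maps a node name to its raw statistics string.
    Raises ValueError if a node exposes none of the assumed counters,
    so a renamed statistic is not silently read as "no migrations".
    """
    for node, raw in raw_stats_per_node.items():
        stats = parse_stats(raw)
        present = [name for name in MIGRATION_COUNTERS if name in stats]
        if not present:
            raise ValueError("no known migration counter for " + node)
        if any(int(stats[name]) > 0 for name in present):
            return False
    return True
```

A scan or aggregation job could poll `migrations_done` and start only once it returns True. Key-based reads do not need this check.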

moreno · June 4, 2015, 12:46pm

Thank you raj (as usual a diligent and accurate answer!)

Two more little questions (the last ones):

1. During migration, is a correct result guaranteed if I insert/delete/select a specific record?
2. If a node is restarting, is it possible to prevent the insertion of new records from involving that node (until the end of migration)?

(I’d like new records, both master and backup copies, to be stored on the other nodes of the cluster.)

Thanks again

Moreno

raj · June 4, 2015, 6:55pm

1. Yes, when you do a key-based read.
2. No! As soon as the node starts and joins the cluster, it starts taking reads/writes like the others. There is no way to explicitly control which node data is stored on; it is determined by the system based on a hash.

– R
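To illustrate why a client cannot steer records away from a restarted node, here is a rough sketch of hash-based placement. Aerospike hashes each key into one of 4096 partitions and the cluster assigns partitions to nodes. The sketch uses SHA-1 as a stand-in for the RIPEMD-160 digest the server uses, and a round-robin partition map that is purely hypothetical, not the real assignment algorithm.

```python
import hashlib

N_PARTITIONS = 4096  # number of partitions per namespace


def partition_id(set_name, key):
    """Map a (set, key) pair to a partition id in [0, N_PARTITIONS).

    SHA-1 is a stand-in hash chosen for portability; it is not the
    digest function the server uses.
    """
    digest = hashlib.sha1((set_name + ":" + key).encode("utf-8")).digest()
    return int.from_bytes(digest[:2], "little") % N_PARTITIONS


def build_partition_map(nodes):
    """Hypothetical round-robin assignment of partitions to master nodes."""
    ordered = sorted(nodes)
    return {pid: ordered[pid % len(ordered)] for pid in range(N_PARTITIONS)}


def master_for(set_name, key, partition_map):
    """The master node follows from the key's hash and the cluster's map."""
    return partition_map[partition_id(set_name, key)]
```

In this sketch, once `node3` rejoins, `build_partition_map` hands it a share of the partitions again, and any new key that hashes into one of them lands there. The writer has no say in it.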

moreno · June 5, 2015, 6:41am

Thank you again, raj!

Have a good weekend.

Moreno
