I’m playing around with Aerospike to see how it behaves. It works well, but I don’t understand some syncing behavior. I’m running 3 (virtual) machines with debian, aerospike 3.4.1. I load some data in the database with the benchmark util, about a million keys, and keep an eye on things with the management console. All good so far.
Now I stop a machine. The other two take over the keys for the dead machine. They go from about 300K master and 300K replica objects each to about 500K master and 150K replica objects each. Still all good so far, since some replica keys will have become master keys, and they start syncing the missing keys.
But now I restart the machine I removed earlier. It comes back nicely, and all three machines have 300K master objects right away, but, and now here’s my question, the replica object counts have dropped to 0 on all three machines. They do start syncing, and everything goes back to normal after a while, but I don’t get why they go to 0. Is that expected?
prole-objects started as 0 after the restart and grows slowly to the expected, normal number (~300K). It’s the same on all three machines. That matches with what I see in AMC.
(asinfo on a remote host gives me trouble. If I do asinfo -h 192.168.2.77 -v namespace/bench -l I get request to 192.168.2.77 : 3000 returned error. Is the port number non-obvious?)
It being copy does not necessarily mean it is available on the designated replica hence prole_objects is zero. That said it sounds peculiar that it happened for all the partition.
I will check this. So what it means your copies are intact it is just they are not on the designated replica. Thanks for pointing this out.