Dun was removed in 3.9.1 because the enhanced paxos algorithm made it unnecessary. The cluster should now auto-heal and auto-rebalance on its own.
A few questions:
- Are all your nodes on the same version? (220.127.116.11)
- What is your paxos-recovery-policy set to? It should be auto-reset-master, which is the default in the version you’re using.
- If you run asadm -e info, do all nodes agree on the size of the cluster, and do they all show cluster visibility as true?
- Do you have any errors in your logs, particularly network errors?
I suspect you have connectivity problems between your nodes, caused either by the network itself or by a misconfigured Aerospike. Can you share some information about your deployment: (a) bare metal vs cloud, (b) the number of NICs in the nodes, and (c) how those NICs are used?
I would also note that your stop-writes, high-water-mark-memory and high-water-mark-disk parameters are set oddly. These are typically 90%, 60% and 50% respectively; yours are 80%, 80% and 99%. Misconfiguring these has serious ramifications, so make sure you understand what they are before they bite you in production.
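For reference, here is a rough sketch of how the typical values mentioned above might look in aerospike.conf. It assumes the config-file spellings of these parameters are stop-writes-pct, high-water-memory-pct and high-water-disk-pct, and the namespace name "test" is only a placeholder; please check the names against the configuration reference for your version before copying anything.

```
service {
    # Default in this version, shown only to make the setting explicit
    paxos-recovery-policy auto-reset-master
}

namespace test {                  # placeholder namespace name
    high-water-memory-pct 60      # eviction starts once memory use passes 60%
    high-water-disk-pct 50        # eviction starts once disk use passes 50%
    stop-writes-pct 90            # writes are refused once memory use passes 90%
}
```

With both high-water marks below stop-writes, eviction has room to free space before writes are refused. With your values (80%, 80%, 99%) memory eviction only starts at the same point where writes stop, and the disk is allowed to fill almost completely.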