One of Three Nodes went down abruptly and writes are not happening

dhanasekaran1980 · January 23, 2019, 11:27am

We are running 3 Node Cluster, data in memory on version 4.2.0.4 CE. We recently noticed writes are not happening and found one down. Ideally write should happen. Once we start the node which was down, the writes resumed.

Found below INFO Logs being printed continuosly on two nodes.
INFO (hb): (hb.c:4319) found redundant connections to same node, fds 101 31 - choosing at random

On the other node, no logs being printed and no read/writes happening on adadm stats. Also we have observed that the records are unevenly distributed across the nodes.

Please help.

pgupta · January 23, 2019, 7:01pm

check if the other two nodes are publishing a private ip address not accessible to client and only one node (that went down) is publishing an accessible ip address. (network stanza, service sub-context)

dhanasekaran1980 · January 24, 2019, 10:13am

Gupta, Thanks for your apt reply. Yes this is what is happening. But in the configuration file I have provided the public IP addresses. These three nodes are on AWS.

Please help.

pgupta · January 24, 2019, 4:15pm

Discussion continuing here: Aerospike: One of Three Nodes went down abruptly and writes are not happening - Stack Overflow

Topic		Replies	Views
Aerospike Crash	4	1918	January 9, 2016
Aerospike cluster behavior in different consistency mode? Configuration	6	1614	September 28, 2018
Unexpected behavior on EC2 Installation	1	1422	August 18, 2014
Aerospike cluster sync issues Operations	2	105	May 29, 2025
After restarting one node there is extreme decrease in Ops/sec and even timeout Configuration query	7	569	April 27, 2023

One of Three Nodes went down abruptly and writes are not happening

Related topics