Summary
I’d like to construct the aerospike cluster with 3 servers(=aerospike101,102,103), however, it does not go well.
Could you take an advice?
Conditions
- aerospike version: aerospike server community edition v3.15.01
- OS: CentOS 7.4.1708
Symptom
When the 3 aerospike servers(=aerospike101,102,103) are started initially, asadm shows that the cluster is normal. IP addresses are hidden with * because of the privacy.
[cocoamaemae@aerospike101 ~]$ asadm
Aerospike Interactive Shell, version 0.1.13
Found 3 nodes Online: *******(node1 ip address):3000, *******(node2 ip address):3000, *******(node3 ip address):3000
Admin> info network
Node Node Ip Build Cluster Cluster Cluster Principal Client Uptime . Id . . Size Key Integrity . Conns .
aerospike101:3000 *BB9CEDA634C9A50 *******:3000 C-3.15.0.1 3 7158AB0DB916 True BB9CEDA634C9A50 3 00:00:09
aerospike102:3000 BB9C2DA634C9A50 1*******:3000 C-3.15.0.1 3 7158AB0DB916 True BB9CEDA634C9A50 3 00:00:10
aerospike103:3000 BB902A5634C9A50 *******:3000 C-3.15.0.1 3 7158AB0DB916 True BB9CEDA634C9A50 4 00:00:11 Number of rows: 3
Once the construction is succeeded, however, the cluster would be broken after a few minutes passed. IP addresses are hidden with * because of the privacy.
[cocoamaemae@aerospike101 ~]$ asadm
Aerospike Interactive Shell, version 0.1.13
Found 1 nodes
Online: *******:3000
Extra nodes in alumni list: ********(node2 ip address):3000, ********(node3 ip address):3000
Admin> info network
~~~~~~~~~~Network Information~~~~~~~~~~~~~~~~~~~
Node Node Ip Build Cluster Cluster Cluster Principal Client Uptime
. Id . . Size Key Integrity . Conns .
aerospike101:3000 *BB9CEDA634C9A50 ******:3000 C-3.15.0.1 1 DEC994B610E8 True BB9CEDA634C9A50 2 00:18:16
Number of rows: 1
After the cluster was broken, the below message was confirmed in /etc/aerospike/aerospike.log
WARNING (as): (as.c:337) will not write PID file in new-style daemon mode
Nov 10 2017 06:45:05 GMT: INFO (fabric): (fabric.c:2520) fabric: node bb9ceda634c9a50 departed
Nov 10 2017 06:45:05 GMT: INFO (fabric): (fabric.c:935) fabric_node_disconnect(bb9ceda634c9a50)
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5520d848 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef57a0db88 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef55a0dac8 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef58a0d948 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef57e0d548 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef54a0d208 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5960db88 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef58e0dd08 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5560d588 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5740d148 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef56c0d948 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef53a0de48 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef54e0d5c8 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5420d508 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef55e0dac8 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef59a0d388 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5860d588 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5660de08 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5920df88 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5820d0c8 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5460d8c8 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5620db48 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef53e0da48 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1599) fc 0x7fef5700da08 is not in (node 0x7fef5994f008)->fc_hash
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:1159) fabric_node_destructor(0x7fef5994f008)
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:2219) 20: epoll : error, will close: fc 0x7fef4b80ddc8 fd 80 errno 11 signal {err:0, hup:0, rdhup:1}
Nov 10 2017 06:45:05 GMT: DETAIL (fabric): (fabric.c:2219) 25: epoll : error, will close: fc 0x7fef4b40d408 fd 81 errno 11 signal {err:0, hup:0, rdhup:1}
The aerospike.conf is like the blow. The only network setting is showed, the other settings are hidden
network {
service {
address any
port 3000
reuse-address
access-address *******(node ip address)
alternate-access-address *******(ip address entering into node)
}
heartbeat {
mode multicast
multicast-group 239.1.1.1
port 9918
address *******(node ip address)
interval 150
timeout 10
}
fabric {
address *******(node ip address)
port 3001
}
info {
address *******(node ip address)
port 3003
}
}
The multicast connection surely has been already checked, and it is normal.
Why the aerospike cluster is broken?
FYI When using mesh, the aerospike cluster was not broken. It was normal.