Reboot 2 node cluster

Ivan44785372 · March 6, 2019, 8:17pm

Hello.

I have 2 node cluster. What happens if I reboot both of them simultaneously ? Is it safe ? or it is better to reboot one of them, wait till synchronization and then reboot the second one.

kporter · March 6, 2019, 8:35pm

Depends on what you mean by ‘safe’. You will obviously lose availability which for most users isn’t considered ‘safe’. But if you use persistence, i.e. ‘storage-engine device’, you shouldn’t lose any data, assuming that is your concern. Also note that cold-start is much slower than the ‘Fast Start’ in Enterprise so depending on the amount of data and your situation, you could be unavailable for a significant amount of time.

Ivan44785372 · March 6, 2019, 8:43pm

Thank you for you answer. Yes My concern is about data consistency. now I understand that nothing bad should happen. Could you please clarify what about synchronization. Does the speed depend on whether I reboot 2 node simultaneously or one by one ?

kporter · March 6, 2019, 10:00pm

In Enterprise, we store the primary index in shared memory and re-attach on restart making restarts much faster. When Enterprise editions reboot or shutdown in an unsafe manner they will not be able to Fast-Start and will resort to a cold-start. In Community edition, we always cold-start since the index isn’t stored in shared memory thus always lost on restart. Cold-start is the process of rebuilding the primary index from records found in storage, this requires fully reading the storage layer which can take a significant amount of time. Restarting one by one doesn’t avoid this issue but you will be able to wait for a restart and migration completion before restarting the next node which will minimize availability. Also if you had a 3rd node, you wouldn’t need to wait for migrations to complete after a node starts since the latest data will be available between the two remaining nodes.

Also note that I have made the assumption we are discussing AP mode and not strong-consistency mode which is an Enterprise feature. With a bit of effort, you can violate consistency in AP, if you are very sensitive to consistency violations, you should consider using strong-consistency.

Albot · March 7, 2019, 5:21am

We can’t really give you a straight answer without more details. What version/edition of Aerospike are you using? Whats your config like? In general, though, you really dont want to reboot the entire cluster at once… in almost every situation. Now the maintenance process outside of that can differ depending on how things are setup.

system · March 14, 2019, 6:20am

This topic was automatically closed 6 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Will Data Recover on the Other Cluster or on the Local HDD? How Aerospike Works	6	2353	August 3, 2015
Cluster upgrade	7	1248	May 29, 2017
Adding multiple nodes to a running cluster simultaneously Upgrading	4	3020	June 27, 2018
Can we change the time one node take to join cluster after restart? Monitoring	5	771	June 3, 2022
Speed up re-joining a cluster Operations	7	819	January 31, 2020

Reboot 2 node cluster

Related topics