Outage is causing users to see old data (potential data loss)

Theo_Cincotta · July 22, 2015, 11:37pm

We had two nodes come up with the same multicast address and they put our production servers into a bad state. It took us a while to track down what was causing this issue and the logs said to run this and we did.

Jul 21 2015 20:07:51 GMT: INFO (paxos): (paxos.c::2412) CLUSTER INTEGRITY FAULT. [Phase 1 of 2] To fix, issue this command across all nodes: dun:nodes=bb9fc7396171500,bb98ce7ef902500,bb955e9ef902500,bb9327296171500

Now people are seeing old data coming from the database. What could be causing this and how can we put it back to where it was before the outage?

Thanks,

Theo

kporter · July 23, 2015, 5:10pm

Can you describe how you recovered from this outage?

Topic		Replies	Views
Data inconsistency after failed node back Tuning	7	4203	November 14, 2014
Cluster integrity fault Operations	1	2162	January 24, 2016
One node cluster visibility false (a node became invisible to other nodes over the network)	5	2761	June 12, 2016
One node (of 6) has integrity problem after a crash and reboot. Will not recover	4	506	January 17, 2023
Cluster not syncing back: try rolling restart or fast restart (AER-4500)	10	2912	November 21, 2015

Outage is causing users to see old data (potential data loss)

Related topics