EhmeedRls
02/21/2025, 12:33 PMsort_by
clause?
I am getting different results from the master node than from the 2 slave nodes.Kishore Nallan
02/21/2025, 12:34 PMEhmeedRls
02/21/2025, 12:55 PMFanis Tharropoulos
02/21/2025, 12:56 PMKishore Nallan
02/21/2025, 12:57 PMEhmeedRls
02/21/2025, 12:58 PMKishore Nallan
02/21/2025, 1:08 PMTomas Hauk
02/21/2025, 2:10 PMRepeat steps 2 and 3 for the other _followers_, leaving the leader node uninterrupted for now.
2. we definitively did the restarts of the nodes only AFTER the restarted node rejoined the cluster AND restarted node was healthy AND restarted node started to serve requests => i.e. there was at most one node out of cluster, syncing
3. and also made sure that the master was the last node to restart (and upgrade)
our expectation would be that since there is some kind of WAL log/checkpointing:
• there will be created some inconsistency when the master as the last node is going down (there may be some ingest going on there)
• but then some of the former follower nodes will become new master and the state from the new master node will get replicated into new followers (which probably means that some state will get rollbacked on the node that was the original master before the upgrade)
can you please comment on the expectation above and/or correct it?
we can definitely try to force reelection (e.g. make the original master master again, but it does not seem to explain what actually happened)Tomas Hauk
02/21/2025, 2:16 PMcommited
number in these logs which seem to be consistent (from all our nodes):
I20250221 14:14:13.597611 184 raft_server.cpp:683] Term: 13, pending_queue: 0, last_index: 6148621, committed: 6148621, known_applied: 6148621, applying: 0, pending_writes: 0, queued_writes: 0, local_sequence: 36410614
I20250221 14:14:11.064772 184 raft_server.cpp:683] Term: 13, pending_queue: 0, last_index: 6148621, committed: 6148621, known_applied: 6148621, applying: 0, pending_writes: 0, queued_writes: 0, local_sequence: 36405023
I20250221 14:13:57.792254 185 raft_server.cpp:683] Term: 13, pending_queue: 0, last_index: 6148621, committed: 6148621, known_applied: 6148621, applying: 0, pending_writes: 0, queued_writes: 0, local_sequence: 36410627
Kishore Nallan
02/21/2025, 2:58 PMTomas Hauk
02/23/2025, 12:36 PMKishore Nallan
02/23/2025, 1:50 PM