Scott Nei
03/14/2025, 7:01 PMScott Nei
03/14/2025, 7:07 PMScott Nei
03/14/2025, 7:21 PMScott Nei
03/14/2025, 7:27 PMScott Nei
03/14/2025, 7:40 PMScott Nei
03/14/2025, 8:13 PMScott Nei
03/14/2025, 8:17 PMJason Bosco
03/14/2025, 10:35 PMUsing the /stats.json endpoint, we see that pending writes are stuck at 1606.
Could this be blocking even API keys from syncing to this node?This is the core of the issue. Somehow there seems to be some bad write that got into the write queue on that node which is stalling the rest of the writes. Historically this happens when we have a missing validation for some shape of malformed data and end up accepting the write instead of throwing an error up front. So we need to take a look at the raw data on disk to see what's stalling the write queue. Would you be ok if we took a snapshot of the data on disk and loaded it into a debug cluster to take a closer look?
Scott Nei
03/15/2025, 1:15 AMJason Bosco
03/15/2025, 1:30 AMJason Bosco
03/15/2025, 1:31 AMScott Nei
03/15/2025, 1:32 AMJason Bosco
03/15/2025, 1:32 AMJason Bosco
03/15/2025, 1:33 AMScott Nei
03/15/2025, 2:14 AMScott Nei
03/26/2025, 4:38 PM