Self-Hosted Cluster Down, Logs Reveal "No matching field found for token"
TLDR Imtiaz's self-hosted cluster of 3 instances went down after a flood of "No matching field found for token" log entries. Kishore Nallan suggested it could be a bug or edge case, and recommended enabling slow request logging in Typesense to help diagnose future incidents.
Dec 13, 2022 (10 months ago)
Imtiaz
02:15 PM I just had a self-hosted cluster of 3 instances go down. It was preceded by a flood of "No matching field found for token: xxxxxx" entries in the logs. The servers have been running stably for close to a year prior to this, with the occasional restart of just the service for upgrades. Currently on 0.23.1. Got the cluster back up by restarting the nodes. Any hint as to what I could look for as a root cause? (For context, the log entry was occurring around 32 times per second when the server went down.)
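To put a number on a burst like the roughly 32 entries per second mentioned above, the log can be bucketed by second. This is a minimal sketch, not an official tool: it assumes a glog-style line prefix (severity letter, date, time with microseconds), and the sample lines, including their timestamps and file names, are made up for illustration. Adjust the pattern if your log lines look different.

```python
import re
from collections import Counter

# ASSUMPTION: glog-style prefix such as
# "E20221213 14:15:01.000100 1234 example.cpp:1] message".
PREFIX = re.compile(r"^[IWEF](\d{8}) (\d{2}:\d{2}:\d{2})\.\d+ ")
NEEDLE = "No matching field found for token"


def matches_per_second(lines):
    """Count lines containing NEEDLE, grouped by (date, HH:MM:SS)."""
    counts = Counter()
    for line in lines:
        if NEEDLE not in line:
            continue
        m = PREFIX.match(line)
        if m:
            counts[(m.group(1), m.group(2))] += 1
    return counts


# Made-up sample lines (hypothetical timestamps and source locations).
sample = [
    "E20221213 14:15:01.000100 1234 example.cpp:1] No matching field found for token: xxxxxx",
    "E20221213 14:15:01.500200 1234 example.cpp:1] No matching field found for token: xxxxxx",
    "I20221213 14:15:01.700000 1234 example.cpp:2] unrelated line",
    "E20221213 14:15:02.000300 1234 example.cpp:1] No matching field found for token: xxxxxx",
]
counts = matches_per_second(sample)

# Show the busiest seconds first.
for (day, second), n in counts.most_common(10):
    print(day, second, n)
```

For a real log, pass an open file object to matches_per_second instead of the sample list; the busiest seconds show when the burst started and how it grew.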
Kishore Nallan
02:17 PM
Imtiaz
02:17 PM /health endpoint on all 3 servers
Kishore Nallan
02:17 PM
Kishore Nallan
02:18 PM
Imtiaz
02:19 PM
Imtiaz
02:20 PM
Kishore Nallan
02:21 PM "No matching field found for token" need not always be an error, but since it was logged so many times here, it could be.
Kishore Nallan
02:22 PM for the slow request logging: https://typesense.org/docs/0.23.1/api/cluster-operations.html#toggle-slow-request-log
Imtiaz
02:25 PM
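The two steps that come up in the thread, checking /health on all 3 servers and turning on slow request logging, could be scripted roughly as follows. This is a sketch under stated assumptions: the node addresses, the API key, and the 2000 ms threshold are placeholders, and the helper names are hypothetical. The request shapes (GET /health, and POST /config with a log-slow-requests-time-ms field and the X-TYPESENSE-API-KEY header) follow the cluster operations page linked above.

```python
import json
import urllib.request

# Hypothetical values for illustration; replace with your own.
NODES = ["http://node1:8108", "http://node2:8108", "http://node3:8108"]
API_KEY = "xyz"


def health_request(node):
    """Build a GET request for the node's /health endpoint."""
    return urllib.request.Request(f"{node}/health", method="GET")


def slow_log_request(node, api_key, threshold_ms):
    """Build a POST /config request that sets log-slow-requests-time-ms."""
    body = json.dumps({"log-slow-requests-time-ms": threshold_ms}).encode()
    return urllib.request.Request(
        f"{node}/config",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "X-TYPESENSE-API-KEY": api_key,
        },
    )


def send(request, timeout=5):
    """Send a request and return the decoded JSON body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode())


def check_cluster(nodes, api_key, threshold_ms, sender=send):
    """Return {node: healthy?}; enable slow request logging on healthy nodes."""
    status = {}
    for node in nodes:
        try:
            status[node] = bool(sender(health_request(node)).get("ok"))
            if status[node]:
                sender(slow_log_request(node, api_key, threshold_ms))
        except OSError:  # covers connection errors and non-2xx HTTP responses
            status[node] = False
    return status


# Example call (performs real network requests, so it is left commented out):
# print(check_cluster(NODES, API_KEY, 2000))
```

The setting is applied per node, which is why the sketch loops over every server instead of calling only one of them.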