# community-help
t
Hey guys, I'm using this scraper tool to index our documentation site, and noticed that sometimes when Typesense restarts, it logs this message:
Write to disk failed. Will restore old document
Does this mean our collection schema/data structure is wrong, or did the in-memory indexing fail on startup? https://github.com/typesense/typesense/blob/v29.0/src/collection.cpp#L738-L744
I also got this earlier. The stack trace seems to point at the store: it can't get the store.
DEFAULT 2025-07-17T05:49:43.080379Z E20250717 05:49:43.079523 14 backward.hpp:4200] Stack trace (most recent call last):
DEFAULT 2025-07-17T05:49:43.080395Z E20250717 05:49:43.079578 14 backward.hpp:4200] #7 Object "/usr/lib/x86_64-linux-gnu/ld-linux-x86-64.so.2", at 0xffffffffffffffff, in
DEFAULT 2025-07-17T05:49:43.080399Z E20250717 05:49:43.079586 14 backward.hpp:4200] #6 Object "/opt/typesense-server", at 0x55941e7d002d, in _start
DEFAULT 2025-07-17T05:49:43.080403Z E20250717 05:49:43.079591 14 backward.hpp:4200] #5 Object "/usr/lib/x86_64-linux-gnu/libc.so.6", at 0x7f1edd100e3f, in __libc_start_main
DEFAULT 2025-07-17T05:49:43.080407Z E20250717 05:49:43.079596 14 backward.hpp:4200] #4 Object "/usr/lib/x86_64-linux-gnu/libc.so.6", at 0x7f1edd100d8f, in
DEFAULT 2025-07-17T05:49:43.080411Z E20250717 05:49:43.079602 14 backward.hpp:4200] #3 Source "src/main/typesense_server.cpp", line 224, in main [0x55941e7db1bd]
DEFAULT 2025-07-17T05:49:43.080415Z E20250717 05:49:43.079608 14 backward.hpp:4200] #2 Source "src/typesense_server_utils.cpp", line 506, in run_server [0x55941ec943cf]
DEFAULT 2025-07-17T05:49:43.080419Z E20250717 05:49:43.079613 14 backward.hpp:4200] #1 Source "src/ratelimit_manager.cpp", line 498, in init [0x55941ec0a447]
DEFAULT 2025-07-17T05:49:43.080422Z E20250717 05:49:43.079618 14 backward.hpp:4200] #0 Source "src/store.cpp", line 101, in get [0x55941ec3302f]
DEFAULT 2025-07-17T05:49:43.080426Z Segmentation fault (Address not mapped to object [(nil)])
Hey, sorry @Fanis Tharropoulos, do you have any idea on this one 🙌
I constantly get this one on each reboot:
Write to disk failed. Will restore old document
f
I'm thinking this has to do with Google Cloud and the machine you're hosted on
Try dropping the index and re-indexing
Which Typesense version are you using?
t
image.png
I'm using v29
f
Have you changed anything in the data directory Typesense is using?
t
Nah, I don't.
Try dropping the index and re-indexing
For now it's just testing, so should I take a snapshot and restore from it, or should I crawl all the data again?
f
From the logs it seems like RocksDB didn't initialize at all, so the process is crashing. Could you try grepping the server logs for these lines:
grep "Initializing DB by opening state dir" 
grep "Error while initializing store"
t
Hey, that time the init was successful.
I also encountered an error where the store couldn't be initialized, but it didn't happen this time when the write to disk failed: https://typesense-community.slack.com/archives/C01P749MET0/p1752726993065609?thread_ts=1752658980.051089&cid=C01P749MET0
f
I'm guessing it has to do with file permissions under /usr/. You should have it under the home directory instead.
t
It's fully configurable in my template, I can change it to any path. Hmm, and there's no permission error or warning; when initializing Typesense I think it starts some processes that read/write files, and I also run it at the root level:
consumers:
          - name: application
            mountPath: /usr/share/typesense/data
I can try as a non-root user.
f
Try as a non-root on the home dir
t
Hey, as the console here shows, HOME=/root, and GCS is mounted with a non-group user in the home dir /root/lib/typesense/data. You can see the error in the stack trace still happens. Do you need to run the Typesense server as non-root? I think we still need to run the Typesense server as root, since it seems to need to do something with /usr/, as the stack trace shows.
Error while initializing store: IO error: Stale file handle
Error while initializing store: IO error: No such file or directory: While opening a file for sequentially reading: /root/lib/typesense/data/db/MANIFEST-000035: No such file or directory
Does it cause an issue when a MANIFEST file doesn't exist? Does that then lead to the issue in the stack trace?
But in that case, why didn't it re-init a fresh DB?
image.png
OK, from the logs I guess there were 2 instances that started up at nearly the same time and mounted a single GCS bucket. One of them starts up, and the other one shows:
It seems like the data directory /root/lib/typesense/data/meta is already being used by another Typesense server.
But I'm not clear on why it's causing a stack trace error and the container was force-exited because of this:
DEFAULT 2025-07-17T08:45:56.169490Z E20250717 08:45:56.168789 13 typesense_server.cpp:159] Typesense 29.0 is terminating abruptly.
DEFAULT 2025-07-17T08:45:56.178313Z Segmentation fault
WARNING 2025-07-17T08:45:56.190706Z Container called exit(139).
Just a question: I didn't manually delete anything. Was it intentionally deleted by a Typesense process?
] Error while initializing store: IO error: No such file or directory: While opening a file for sequentially reading: /root/lib/typesense/data/db/MANIFEST-000020: No such file or directory
I wrote a small test that performs 100 API request calls across the 3 instances, and yeah, obviously the data is not in sync between them, even though all of them mount the same single GCS bucket.
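[Editor's note: the "already being used by another Typesense server" message above comes from the server guarding its data directory so that only one process can claim it. A minimal sketch of that guard pattern, assuming a POSIX system and using `fcntl.flock` — an illustration only, not Typesense's actual locking code:]

```python
# Illustration of a data-dir guard: two "instances" open the same lock
# file, and only the first one wins. Not Typesense's actual code.
import fcntl
import os
import tempfile

lock_path = os.path.join(tempfile.mkdtemp(), "LOCK")

# First "instance" takes an exclusive, non-blocking lock.
first = open(lock_path, "w")
fcntl.flock(first, fcntl.LOCK_EX | fcntl.LOCK_NB)

# Second "instance" (a second open file description on the same file)
# is rejected immediately, analogous to "data directory ... is already
# being used by another Typesense server."
second = open(lock_path, "w")
try:
    fcntl.flock(second, fcntl.LOCK_EX | fcntl.LOCK_NB)
    holds_lock = True
except BlockingIOError:
    holds_lock = False

print(holds_lock)  # False: the second claimant is turned away
```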
f
This leans closer to a Google Cloud issue, rather than Typesense itself. I don't have much context on how GCS works, and how to deal with permissions and multiple processes. Maybe the official docs could help here
t
Yeah, thanks Fanis for all the help so far. Could I ask how Typesense works when multiple instances are spun up at the same time, in case I don't set --nodes or peering-address?
f
The second process will fail as it will try to bind to the same address using the same data directory
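[Editor's note: the bind conflict described above can be sketched in a few lines — a second socket binding an address that's already listening fails with "Address already in use", analogous to a second Typesense instance reusing the same api-port. Illustrative only; port 0 asks the OS for a free port:]

```python
# Sketch of the bind conflict: two listeners on the same address/port.
import socket

first = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
first.bind(("127.0.0.1", 0))   # stands in for the first instance on 8108
first.listen()
addr = first.getsockname()     # the concrete (host, port) the OS picked

second = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
try:
    second.bind(addr)          # second instance, same address: fails
    bind_succeeded = True
except OSError:                # EADDRINUSE, "Address already in use"
    bind_succeeded = False

print(bind_succeeded)  # False
```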
t
All addresses use the default config, 127.0.0.1. So even in these cases, let's say:
• instance1: `--api-address=0.0.0.0`, node_peer auto-resolves to 127.0.0.1:8108:8107
• instance2: `--api-address=0.0.0.0`, node_peer auto-resolves to 127.0.0.1:8108:8107
• instance3: `--api-address=0.0.0.0`, node_peer auto-resolves to 127.0.0.1:8108:8107
they will fail, is that right?
f
The port is set in the nodes file, which is needed for High Availability. May I know the reason for running multiple Typesense processes on the same machine that don't interact in HA?
t
It's running under a Cloud Run service, so it's really difficult for us to obtain an internal IP (like you would with a GKE Pod ID). Because of that, I expected it to be running in single-node mode. In this setup, I rely on a persistent volume, specifically GCS, for syncing data during cold starts. That's why I didn't set any --nodes parameter, assuming it wouldn't attempt any automatic node_peer resolution. However, based on the logs I've reviewed so far, it still seems to attempt auto-resolution by default, and in my case, it always resolves to 127.0.0.1.
image.png
Hey @Fanis Tharropoulos, thanks for all your help. Just wanted to give back to the community in case someone else wants to try this approach like I did. That's just how Google Cloud Run works:
• We can only expose a single port. So if we expose 8108 for the API address, we can't also expose 8107 for peering.
• *Instance-to-instance communication* isn't possible in Cloud Run natively.
My workaround was to deploy two separate Cloud Run services:
• One Cloud Run service acts as the leader, handling write requests. It's configured to run as a single instance.
• Another Cloud Run service handles read-only requests, using the --skip-writes argument. This one can scale out to multiple instances.
❤️ 1
Hey @Fanis Tharropoulos, do we have any way to talk to Raft or the DB to trigger this indexing whenever the DB data dir changes?
raft_server.cpp:692] Term: 3, pending_queue: 0, last_index: 243, committed: 243, known_applied: 243, applying: 0, pending_writes: 0, queued_writes: 0, local_sequence: 15364
Since I start up a read-only server and have an external mount that's shared with the other Typesense (read-write) server.
f
There's no way to interact with RocksDB directly apart from compacting the database: https://typesense.org/docs/29.0/api/cluster-operations.html#compacting-the-on-disk-database CC: @Kishore Nallan
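[Editor's note: a minimal sketch of calling that compaction endpoint, with a placeholder host and API key. It only builds the request; actually sending it requires a running Typesense server:]

```python
# Build (but don't send) a request to the documented compaction endpoint.
# Host and API key are placeholders for your own deployment.
import urllib.request

TYPESENSE_HOST = "http://localhost:8108"  # placeholder
API_KEY = "xyz"                           # placeholder

req = urllib.request.Request(
    url=f"{TYPESENSE_HOST}/operations/db/compact",
    method="POST",
    headers={"X-TYPESENSE-API-KEY": API_KEY},
)
# urllib.request.urlopen(req) would trigger the on-disk compaction.
print(req.get_method(), req.full_url)
```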
k
Yeah the access to rocksdb is not directly exposed.
👍 1