# community-help
d
We have a collection with 10+ fields and about 300k records. We have a rare use case where semantic search would be quite useful for our users, but most of the queries are still keyword-only. So we decided to enable auto-embedding with a local model for our collection. The process went smoothly and all features are working, but we found that latencies for keyword search in the same collection increased by a factor of 5 (p95 went from 120ms to 600ms). Do you have any hypothesis as to why keyword-only searches might slow down in a collection with embeddings?
Configuration:
• 3 nodes, 3 GB memory, 3 vCPU
• Memory consumption before embedding ~500 MB, after ~1.6 GB
• Increasing memory to 5 GB didn't help
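For context, the setup looks roughly like this (a minimal sketch with the Python client; collection, field, and model names here are illustrative, not our real schema):

```python
import typesense

# Hypothetical client setup; host/port/key are placeholders.
client = typesense.Client({
    "nodes": [{"host": "localhost", "port": 8108, "protocol": "http"}],
    "api_key": "xyz",
    "connection_timeout_seconds": 5,
})

# Illustrative schema: regular keyword/facet fields plus an
# auto-embedding field backed by a locally-run built-in model.
client.collections.create({
    "name": "items",
    "fields": [
        {"name": "title", "type": "string"},
        {"name": "description", "type": "string"},
        {"name": "category", "type": "string", "facet": True},
        {
            "name": "embedding",
            "type": "float[]",
            "embed": {
                "from": ["title", "description"],
                "model_config": {"model_name": "ts/all-MiniLM-L12-v2"},
            },
        },
    ],
})
```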
I have a blind guess that it could be related to some of the `index: false` fields in the collection 🤔
k
Check if you are returning the embedding fields in the response. That will increase I/O latency, and those fields are also large, so they take time to process.
d
Nope, we're using a strict list of `include_fields`.
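Roughly like this, reusing the illustrative names from above (our real field list is different):

```python
# Keyword-only search that never touches the embedding field and
# whitelists exactly which fields come back in the response.
params = {
    "q": "red running shoes",
    "query_by": "title,description",
    "include_fields": "id,title,category",
}
results = client.collections["items"].documents.search(params)
```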
k
So you are saying that a query that doesn't even use the embedding field is now taking that much longer?
What's your per_page? Can you try running the same keyword search query but with per_page set to 1?
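i.e. keep everything identical and only shrink the page, something like:

```python
# Same query as before, but fetch a single document per page to
# isolate the per-document disk-fetch cost.
params["per_page"] = 1
results = client.collections["items"].documents.search(params)
```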
d
> So you are saying that a query that doesn't even use the embedding field is now taking that much longer?

Yes, exactly.

> What's your per_page? Can you try running the same keyword search query but with per_page set to 1?

Will try this week.
k
I wonder if somehow it's taking a long time to read that record from disk now. The per_page of 1 will help us figure out if that's the issue.
d
BTW, how do we ensure that no reading from disk is happening? If I use `index: true` for all fields in the collection, should that make it truly in-memory?
k
`index: true` just means we enable in-memory indices. There is `store: false` for not storing the data.
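Roughly like this (a sketch with hypothetical field names; note that an unindexed field also has to be marked optional):

```python
# index and store are independent per-field toggles.
fields = [
    # Default behavior: in-memory index built, raw value stored on disk.
    {"name": "title", "type": "string", "index": True},
    # No in-memory index, but the value is still stored on disk
    # (unindexed fields must be marked optional).
    {"name": "raw_payload", "type": "string", "index": False, "optional": True},
    # Indexed in memory, but the raw value is not persisted to disk.
    {"name": "embedding", "type": "float[]", "store": False},
]
```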
👀 1
d
Hm, I got this insight from the docs:

> You want to NOT mention these fields in the collection's schema or mark these fields as `index: false` (see `fields` schema parameter below) to mark it as an unindexed field. You can have any number of these additional unindexed fields in the documents when adding them to a collection - they will just be stored on disk, and will not take up any memory.
k
Yes. Having them in the schema with `index: false` just ensures that the data in the records is validated (e.g. for presence of mandatory fields), which won't happen if the field is not part of the schema at all.
Also, are you doing any facets?
d
Yes, we have one facet field in the collection
k
Do you change the `max_facet_values` default?
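i.e. whether the query passes something like this (sketch):

```python
# max_facet_values caps how many distinct values are returned per
# facet field; the default is 10.
params["facet_by"] = "category"
params["max_facet_values"] = 100
```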
d
No, it's a small set of values, around 10
k
I ran into an issue with another customer where larger docs have noticeably slower faceting performance, because we rely on fetching the doc from disk for a part of the facet computation, which gets slow for large docs. Will be releasing a patch for it in a day or so.
This is a problem that has surfaced as people introduced embedding vector fields, which are very large.
d
Sounds similar, yes 👍
k
Btw, the `store: false` approach I suggested above won't help here, because that would lose the embedding index on restart.
👌 1
d
Disabling facets helped decrease p95 from 600ms to 345ms, still not back to the old 120ms 🤔
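(By disabling I just mean dropping the facet params from the same query, e.g.:)

```python
# Re-run the identical query with faceting turned off.
params.pop("facet_by", None)
params.pop("max_facet_values", None)
results = client.collections["items"].documents.search(params)
```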
k
Yes, because there is probably some latency involved in fetching the page from disk as well, because of the size of the docs.
Also, additional cycles to parse the JSON string from disk into a JSON record in the program to do field inclusion, exclusion, etc.