Does anybody know if it s possible to fetch all docs gt 1k f typesense #community-help

Does anybody know if it's possible to fetch all do...

Julian

09/09/2022, 10:33 AM

Does anybody know if it's possible to fetch all docs (>1k) from a collection in one request? Or is 250 hits/page the request limit?

Kishore Nallan

09/09/2022, 10:36 AM

You can use a multi_search to paginate in a single request.

Julian

09/09/2022, 10:56 AM

Mhm, I see. This is a little bit of a pain when using with instantsearch.js though. Any plans to increase the limit of 250 results or any reasons why not to?

Kishore Nallan

09/09/2022, 10:58 AM

Primarily because Typesense is a search engine and optimized very much for fetching the "top" records for a given query. So while we can fetch greater than 250 results on pagination, that's going to become slower as you progress through. That limit exists to be a reminder of this limitation. We don't want people accidentally blowing up a search with a 1000 page fetch on a large dataset.

Kishore Nallan

09/09/2022, 10:59 AM

Maybe we should add a flag for overriding this behavior.

Julian

09/09/2022, 10:59 AM

Fair point. Thanks for explaining 👍

Julian

09/09/2022, 11:00 AM

Maybe we should add a flag for overriding this behavior.

That could be a decent compromise, yes 👍

Andrew Sittermann

09/11/2022, 8:09 AM

@Julian we use the export function for this

Julian

09/12/2022, 6:10 PM

@Andrew Sittermann I see. Do you use this for runtime tasks with your whole data set? May I ask how many docs/collection we are talking?

Andrew Sittermann

09/12/2022, 6:12 PM

Yes. Very roughly it's 1.5 million docs

Andrew Sittermann

09/12/2022, 6:12 PM

@Julian

Julian

09/12/2022, 6:13 PM

OK, that's quite a lot. And you are not experiencing bandwidth issues or the like? Do you perform any further client-sided mutations to the data after fetching?

Andrew Sittermann

09/12/2022, 6:13 PM

Wait..Iam getting mixed up between threads. We export roughly 1000 docs

Julian

09/12/2022, 6:17 PM

OK, that's indeed another cup size. May I ask for a (very brief!) outline of your use case? Just so I know if it's anything like what we are dealing with?

Andrew Sittermann

09/12/2022, 6:21 PM

We actually have built a kind of proxy which uses a template which contains a load of Typesense requests, executes all those requests, collates the responses, throws away unnecessary data (we'll need to keep throwing data away until Typesense implements a feature for doing this inside of complex objects), .... And then returns it all to the client in one go

Andrew Sittermann

09/12/2022, 6:22 PM

We're basically using Typesense as an in-memory DB. (We're also using it the normal way ... I.e for typo tolerant search)

Julian

09/12/2022, 6:26 PM

Thanks for the insight ✅ And performance for this "expensive" proxy operation is still OK?

Andrew Sittermann

09/12/2022, 6:45 PM

It's obviously nowhere near as fast as calling Typesense directly. But it's still amazing compared to using PostgreSQL which is what we were doing before. With "one-step" templates, where Typesense executes all the queries at the same time, and then the collation runs, we are usually sub 100ms. With "dynamic" templates, where we call Typesense initially with a bunch of queries, and then call it a 2nd time using results from the first query as inputs, we are usually sub 200ms. (Example: we retrieve a court judgement, which includes a list of the judges who participated, then in the second step, we query for other judgements where those judges participated)

Julian

09/14/2022, 10:14 AM

Alright, got it. Thanks for sharing, really appreciated.

Open in Slack

Previous Next