Hello @Kishore Nallan Is there a way to consider prefix ma typesense #community-help

Sidharth Aggarwal

08/08/2023, 10:47 AM

Hello @Kishore Nallan Is there a way to consider prefix match for each token separately?

e.g. query -> rel fut
expected output: document -> Reliance Future

Kishore Nallan

08/08/2023, 10:56 AM

No, this is not possible, primarily because implementing it would be very intensive computationally. Each prefix could produce tens of matching words. With two prefixes, if each produces ten words, the total number of combinations is 10 x 10 = 100.
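A rough sketch of the blow-up described above, not Typesense's actual implementation. The vocabulary and the helper names `expand` and `candidate_queries` are made up for illustration. Each prefix expands to every indexed word that starts with it, and the engine would then have to consider every combination of those expansions.

```python
from itertools import product

# Hypothetical vocabulary of indexed words (illustration only).
vocabulary = ["reliance", "relaxo", "religare", "future", "futura", "fusion"]


def expand(prefix, words):
    """Return every word in `words` that starts with `prefix`."""
    return [w for w in words if w.startswith(prefix)]


def candidate_queries(prefixes, words):
    """Return all word combinations produced by expanding each prefix."""
    expansions = [expand(p, words) for p in prefixes]
    return list(product(*expansions))


combos = candidate_queries(["rel", "fut"], vocabulary)
# The number of combinations is the product of the per-prefix expansion counts.
print(len(combos), combos)
```

With ten expansions per prefix, two prefixes give 100 combinations and three give 1,000, which is why the cost grows so quickly.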

Sidharth Aggarwal

08/08/2023, 11:01 AM

Is there any way to solve this, in case someone has faced a similar issue?

Kishore Nallan

08/08/2023, 11:25 AM

Should they always be adjacent words?

Kishore Nallan

08/08/2023, 11:25 AM

Infix search will help, but again that's exhaustive, so it won't support high concurrency.

Kishore Nallan

08/08/2023, 11:54 AM

The only workaround I can think of is storing n-grams of the words in an array. So for reliance industries you would store: [r, re, rel, reli, relia, relian, relianc, reliance, i, in, ind, ...]

Kishore Nallan

08/08/2023, 11:55 AM

This way rel ind will produce results fast. This might work for you since you are perhaps indexing stock symbols? There are not many companies so this should not take too much memory.
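A minimal sketch of how such an array could be generated before indexing. The function name `prefix_ngrams` is hypothetical, and splitting on non-alphanumeric characters (so that "Reliance-Future" yields two words) is an assumption, not something Typesense does for you.

```python
import re


def prefix_ngrams(text):
    """Return every leading prefix of every word in `text`.

    Words are lowercased and split on non-alphanumeric characters.
    Duplicates are dropped while keeping first-seen order.
    """
    seen = set()
    result = []
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        for end in range(1, len(word) + 1):
            prefix = word[:end]
            if prefix not in seen:
                seen.add(prefix)
                result.append(prefix)
    return result


print(prefix_ngrams("Reliance-Future"))
```

The array grows with the total length of the words in the field, so this is best suited to short fields such as company names.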

Sidharth Aggarwal

08/08/2023, 3:07 PM

@Kishore Nallan With the infix parameter, for

eg. query -> rel fut
and: document -> Reliance-Future

will it match both rel & fut in the document?

Sidharth Aggarwal

08/08/2023, 3:48 PM

@Kishore Nallan How many input tokens will be searched when using INFIX?

eg. query -> rel fut
Will both rel & fut be searched in the fields?

Kishore Nallan

08/08/2023, 3:52 PM

will it match both rel & fut in the document?

Yes it will.

Kishore Nallan

08/08/2023, 3:53 PM

You can play around with it to get a feel. Some additional details are under the infix parameter here: https://typesense.org/docs/0.24.1/api/search.html#search-parameters
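For readers trying this out, a hedged sketch of what enabling infix search might look like, written as plain Python dictionaries. The collection name `companies` and the field name `name` are hypothetical. The `infix` field option and the `infix` search parameter follow the linked documentation, but check the exact values against the docs for your Typesense version.

```python
# Hypothetical collection schema: infix search is enabled per field.
schema = {
    "name": "companies",  # hypothetical collection name
    "fields": [
        {"name": "name", "type": "string", "infix": True},
    ],
}

# Search parameters: the `infix` value applies to the field in `query_by`.
search_parameters = {
    "q": "rel fut",
    "query_by": "name",
    "infix": "always",  # documented values: "off", "always", "fallback"
}

print(schema)
print(search_parameters)
```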

Sidharth Aggarwal

08/08/2023, 3:54 PM

To how many tokens will INFIX be applied?

Sidharth Aggarwal

08/08/2023, 3:54 PM

Is it possible to connect over a short huddle?

Kishore Nallan

08/08/2023, 3:55 PM

All tokens

Sidharth Aggarwal

08/08/2023, 3:57 PM

In our scenario, we are not getting the top matches based on both keywords.

Sidharth Aggarwal

08/08/2023, 3:58 PM

Ideally, for the example below,

eg. query -> rel fut
and: document -> Reliance-Future

Kishore Nallan

08/08/2023, 4:03 PM

Difficult to say without looking at the overall dataset, your schema, your query, etc. We do community support in a public Slack channel so that other users looking for similar information can find this conversation helpful. Here's more info if you need private/prioritized support when self-hosting: https://typesense.org/support/

Sidharth Aggarwal

08/08/2023, 4:07 PM

Sure Thanks @Kishore Nallan

Sidharth Aggarwal

08/08/2023, 4:09 PM

One last query @Kishore Nallan Is there any feature to apply prefix on all the query tokens?

Sidharth Aggarwal

08/08/2023, 5:03 PM

@Kishore Nallan Can you please further guide me on the ngram solution

Kishore Nallan

08/09/2023, 5:33 AM

The ngram solution is pretty much what I've described above. You generate ngrams of the words in a field, store them as a string array field in Typesense, and then search on it.
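One way this could be laid out, as a sketch under stated assumptions. The collection name `companies`, the field names `name` and `name_prefixes`, and the helper `prefixes` are all hypothetical. Setting `prefix` to `false` is also an assumption: because the stored tokens are already prefixes, each query token can be matched as a whole word.

```python
import re


def prefixes(text):
    """Return the sorted set of leading prefixes of each word in `text`."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return sorted({w[:i] for w in words for i in range(1, len(w) + 1)})


# Hypothetical schema: the original name plus a string array of its prefixes.
schema = {
    "name": "companies",
    "fields": [
        {"name": "name", "type": "string"},
        {"name": "name_prefixes", "type": "string[]"},
    ],
}

# Document to index: the prefixes are computed on the client before upload.
document = {
    "name": "Reliance-Future",
    "name_prefixes": prefixes("Reliance-Future"),
}

# Search against the prefix array instead of the original field.
search_parameters = {
    "q": "rel fut",
    "query_by": "name_prefixes",
    "prefix": "false",  # assumption: whole-token matching is enough here
}

# Local sanity check: every query token appears in the stored array.
tokens = search_parameters["q"].split()
print(all(t in document["name_prefixes"] for t in tokens))
```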

Sidharth Aggarwal

08/10/2023, 5:53 AM

@Kishore Nallan Thanks a lot for suggesting a great solution. Most likely it will solve the problem in our use-case.

👍 1
