Discussing Prefix-Match for Multiple Tokens
TLDR Sidharth asked whether prefix matching could be applied to each query token separately. Kishore Nallan explained that this would be computationally intensive and suggested an ngram-based approach instead, which appeared to meet Sidharth's needs.
Aug 08, 2023 (1 month ago)
Sidharth
10:47 AM Is there a way to apply prefix matching to each token separately?
e.g. query -> rel fut
expected output: document -> Reliance Future
Kishore Nallan
10:56 AM
Sidharth
11:01 AM
Kishore Nallan
11:25 AM
Kishore Nallan
11:25 AM
Kishore Nallan
11:54 AM [r, re, rel, reli, relia, relian, relianc, reliance, i, in, ind, ...]
Kishore Nallan
11:55 AM
Sidharth
03:07 PM With the `infix` parameter, e.g.
query -> rel fut
document -> Reliance-Future
will it match both rel and fut in the document?
Sidharth
03:48 PM How many input tokens are searched when `infix` is used?
e.g. query -> rel fut
Will both rel and fut be searched in the fields?
Kishore Nallan
03:52 PM Yes it will.
Kishore Nallan
03:53 PM `infix` column here: https://typesense.org/docs/0.24.1/api/search.html#search-parameters
Sidharth
03:54 PM
Sidharth
03:54 PM
Kishore Nallan
03:55 PM
Sidharth
03:57 PM
Sidharth
03:58 PM e.g. query -> rel fut
and document -> Reliance-Future
Kishore Nallan
04:03 PM
Sidharth
04:07 PM
Sidharth
04:09 PM Is there any feature to apply `prefix` to all the query tokens?
Sidharth
05:03 PM Can you please guide me further on the ngram solution?
Aug 09, 2023 (1 month ago)
Kishore Nallan
05:33 AM
Aug 10, 2023 (1 month ago)
Sidharth
05:53 AM Thanks a lot for suggesting a great solution. Most likely it will solve the problem in our use-case.
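
The details of the ngram solution are not preserved in this thread, so the following is only a minimal sketch of one plausible reading: generate every prefix of each word at indexing time (as in the `[r, re, rel, ...]` list above) and store them alongside the document, so that each query token can be matched as a whole word. The function names, the `name_prefixes` field, and the choice to split on non-alphanumeric characters are assumptions made for illustration, not something confirmed by Typesense or by the participants.

```python
import re


def tokenize(text):
    # Assumption: lowercase and split on any non-alphanumeric character,
    # so "Reliance-Future" yields ["reliance", "future"].
    return re.findall(r"[a-z0-9]+", text.lower())


def edge_ngrams(text, min_len=1):
    """Return every prefix (edge ngram) of each token in `text`."""
    grams = []
    for token in tokenize(text):
        for end in range(min_len, len(token) + 1):
            grams.append(token[:end])
    return grams


def matches_every_token(query, text):
    """True if each query token is a prefix of some word in `text`."""
    grams = set(edge_ngrams(text))
    return all(tok in grams for tok in tokenize(query))


# Hypothetical document with an extra field holding the precomputed prefixes.
doc = {"name": "Reliance Future"}
doc["name_prefixes"] = edge_ngrams(doc["name"])

print(doc["name_prefixes"])
print(matches_every_token("rel fut", doc["name"]))  # True
```

The trade-off in this sketch is index size: a word of length n contributes n entries, which is the cost paid up front so that no per-token prefix expansion is needed at query time. Raising `min_len` (for example to 2 or 3) would reduce the number of stored prefixes at the expense of very short queries.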