Discussing Improved Multifield Search Options and Text Score Feature
TLDR bnfd asked Kishore Nallan about issue #516's improvements to multi-field searches and text score feature. Kishore Nallan explained how typo and drop tokens are now considered globally. They also discussed the text match score bucketing.
2
Feb 07, 2022 (23 months ago)
bnfd
02:39 PMKishore Nallan
02:42 PMfoo
if there was a name
field with that value but no description
field containing that word, Typesense used to query for typo variations of the word in the description
index. This often confused people. Now, we are considering the typo and drop tokens threshold at the global level. If a record with atleast one field matches the query word, typo/drop token variations are not looked for in the other fields.1
Kishore Nallan
02:45 PM_text_match(buckets: 10):desc
The
buckets
parameter indicates the number of intervals that the text match scores should be divided into, such that the documents falling within these buckets would be deemed to have the same text matching score. When there are fewer documents than the number of buckets, then all documents will belong to the same bucket. Only the first 250 documents and their scores are bucketed this way. A value of 0
or 1
disables bucketing.bnfd
02:46 PMKishore Nallan
02:47 PMbnfd
02:49 PMKishore Nallan
02:56 PMtext_match
value that we return today into components that make it easier to relate to.1
Typesense
Indexed 3011 threads (79% resolved)
Similar Threads
Understanding Typesense Query Fuzziness and Thresholds
Ashraful was confused about different query results when applying filters in Typesense. Jason clarified the function of `drop_tokens_threshold` and `typo_tokens_threshold` options, explaining their effect on search results and their precedence.
Troubleshooting "drop_tokens_threshold" and Typo Tolerance in Typesense
Joe had issues with "drop_tokens_threshold" = 0 and typo tolerance in Typesense, after which Kishore Nallan provided solutions and clarifications on feature functionality. Their issues with the search result limit and tokens were resolved after discussion and testing.
Issue with Typo Correction/Prefix Search and the Role of max_candidates
John noticed inconsistent search results based on max_candidates settings, and Kishore Nallan clarified its role for multi-word queries. They resolved that increasing max_candidates ensures the query isn't prematurely limited.