Discussing Improved Multifield Search Options and Text Score Feature
TLDR: bnfd asked Kishore Nallan about the improvements to multi-field search and the text match score made in issue #516. Kishore Nallan explained that the typo and drop tokens thresholds are now considered globally, across all queried fields. They also discussed text match score bucketing.
Feb 07, 2022
Kishore Nallan 02:42 PM
`foo` if there was a `name` field with that value but no `description` field containing that word, Typesense used to query for typo variations of the word in the `description` index. This often confused people. Now, we are considering the typo and drop tokens thresholds at the global level. If at least one field of a record matches the query word, typo/drop token variations are not looked for in the other fields.
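To make the global threshold concrete, here is a toy model in Python. It is only a sketch of the behaviour described above, not Typesense's implementation: the `search` and `within_one_edit` helpers, the sample documents, and the threshold values are all hypothetical, and a "typo variation" is simplified to a token within one edit of the query word.

```python
def within_one_edit(a, b):
    """True if a and b differ by at most one insertion, deletion or substitution."""
    if a == b:
        return True
    if abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make `a` the shorter (or equal-length) string
    i = j = 0
    edited = False
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            i += 1
            j += 1
            continue
        if edited:
            return False  # a second mismatch means more than one edit
        edited = True
        if len(a) == len(b):
            i += 1  # substitution: advance both strings
        j += 1      # insertion into `a`: advance only the longer string
    return True


def search(docs, word, fields, typo_tokens_threshold=1):
    """Toy search: typo variations are tried only when the exact matches
    counted across ALL fields fall below the threshold (assumed behaviour)."""

    def field_has(doc, field, pred):
        return any(pred(tok) for tok in doc.get(field, "").lower().split())

    exact = [d for d in docs
             if any(field_has(d, f, lambda t: t == word) for f in fields)]
    if len(exact) >= typo_tokens_threshold:
        return exact  # enough results globally: no typo lookup in any field

    fuzzy = [d for d in docs
             if d not in exact
             and any(field_has(d, f, lambda t: within_one_edit(t, word))
                     for f in fields)]
    return exact + fuzzy


docs = [
    {"name": "foo", "description": "a small widget"},
    {"name": "bar", "description": "fob holder"},
]
strict = search(docs, "foo", ["name", "description"])
relaxed = search(docs, "foo", ["name", "description"], typo_tokens_threshold=2)
```

With a threshold of 1, the exact match on `name` is enough and the `fob` variation in `description` is never considered; raising the threshold to 2 lets the second record in through the typo lookup.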
Kishore Nallan 02:45 PM
`buckets` parameter indicates the number of intervals that the text match scores should be divided into, such that the documents falling within these buckets would be deemed to have the same text matching score. When there are fewer documents than the number of buckets, then all documents will belong to the same bucket. Only the first 250 documents and their scores are bucketed this way. A value of
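The bucketing rule described in this message can be sketched in a few lines of Python. This is an illustration only: the `bucket_scores` function is hypothetical, and splitting the ranked documents into equal-sized blocks is an assumption about how the intervals are formed, not a statement of how Typesense computes them.

```python
import math

MAX_BUCKETED = 250  # per the message above, only the first 250 documents are bucketed


def bucket_scores(scores, buckets):
    """Return a bucket index (0 = best) for each of the top-ranked scores.

    Assumption: documents are ranked by descending score and split into
    equal-sized blocks; documents in one block count as equally relevant.
    """
    if buckets < 1:
        raise ValueError("buckets must be at least 1")
    ranked = sorted(scores, reverse=True)[:MAX_BUCKETED]
    n = len(ranked)
    if n < buckets:
        # Fewer documents than buckets: everything shares a single bucket.
        return [0] * n
    block_len = math.ceil(n / buckets)
    return [rank // block_len for rank in range(n)]


six_docs = bucket_scores([90, 80, 70, 60, 50, 40], buckets=3)
two_docs = bucket_scores([90, 80], buckets=5)
```

Here the six scores fall into three buckets of two documents each, while the two-document case collapses into one bucket. Documents that share a bucket tie on text match, so a secondary sort field can decide their order, for example `sort_by=_text_match(buckets: 10):desc,weighted_score:desc`, where `weighted_score` stands in for a numeric field in your own schema.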
Kishore Nallan 02:47 PM
Kishore Nallan 02:56 PM
`text_match` value that we return today into components that make it easier to relate to.
Understanding Typesense Query Fuzziness and Thresholds
Ashraful was confused about different query results when applying filters in Typesense. Jason clarified the function of `drop_tokens_threshold` and `typo_tokens_threshold` options, explaining their effect on search results and their precedence.
Troubleshooting "drop_tokens_threshold" and Typo Tolerance in Typesense
Joe had issues with "drop_tokens_threshold" = 0 and typo tolerance in Typesense, after which Kishore Nallan provided solutions and clarifications on feature functionality. Their issues with the search result limit and tokens were resolved after discussion and testing.
Issue with Typo Correction/Prefix Search and the Role of max_candidates
John noticed inconsistent search results based on max_candidates settings, and Kishore Nallan clarified its role for multi-word queries. They resolved that increasing max_candidates ensures the query isn't prematurely limited.