Benchmarking Typesense Search Results
TLDR: Narayan asked about the search algorithm in Typesense and how it compares to TF-IDF/BM25. Kishore Nallan explained that Typesense focuses on smaller documents and provided links to their test suites.
Oct 30, 2023
Kishore Nallan
05:33 AM
By smaller docs, I mean ones that include a description as well, but descriptions are typically never used standalone; they are weighted lower than other fields like title. None of the MTEB-type benchmarks really reflect searches for these use cases.
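The field weighting described above can be sketched as a search request. This is only an illustration: `query_by` and `query_by_weights` are Typesense search parameters, but the helper `build_search_params`, the field names `title` and `description`, the sample query, and the weights 3 and 1 are assumptions made for this example and do not come from the thread.

```python
def build_search_params(query, field_weights):
    """Build Typesense-style search parameters from an ordered {field: weight} mapping.

    Hypothetical helper for illustration; it only assembles a dict and does not
    contact a Typesense server.
    """
    fields = list(field_weights)
    return {
        "q": query,
        # Fields to search, in priority order.
        "query_by": ",".join(fields),
        # One weight per field, in the same order as query_by.
        "query_by_weights": ",".join(str(field_weights[f]) for f in fields),
    }


# Assumed schema: title matches count for more than description matches.
params = build_search_params("wireless headphones", {"title": 3, "description": 1})
print(params)
```

A dict like this would typically be passed to a Typesense client's document search call. The point is only that the description is searched alongside the title, with a lower weight, rather than on its own.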
Kishore Nallan
05:45 AM
These types of tests are present throughout the test suite, as we test ranking together with various other subfeatures like filtering.
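As a rough picture of what a ranking test combined with a filter can look like, here is a toy sketch. It is not Typesense's ranking algorithm and is not taken from its test suite: the scorer `toy_search`, the sample documents, the weights, and the `in_stock` filter are all assumptions invented for this example.

```python
# Toy data: hypothetical documents with a short title and description.
DOCS = [
    {"id": "1", "title": "Red running shoes", "description": "Lightweight trainers", "in_stock": True},
    {"id": "2", "title": "Trail backpack", "description": "Fits running shoes and a jacket", "in_stock": True},
    {"id": "3", "title": "Running shoes for kids", "description": "Durable soles", "in_stock": False},
]
WEIGHTS = {"title": 3, "description": 1}  # assumed weights: title outranks description


def toy_search(query, docs, weights, filter_fn=lambda d: True):
    """Rank docs by weighted count of query tokens per field (toy scorer, not Typesense's)."""
    tokens = query.lower().split()
    scored = []
    for doc in docs:
        if not filter_fn(doc):
            continue  # the filter is applied before scoring
        score = sum(
            weight * sum(tok in doc[field].lower().split() for tok in tokens)
            for field, weight in weights.items()
        )
        if score > 0:
            scored.append((score, doc["id"]))
    # Highest score first; ties broken by id so the order is deterministic.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]


def test_title_match_outranks_description_match_with_filter():
    ranked = toy_search("running shoes", DOCS, WEIGHTS, lambda d: d["in_stock"])
    # Doc 3 is filtered out; the title match (doc 1) ranks above the description match (doc 2).
    assert ranked == ["1", "2"]


test_title_match_outranks_description_match_with_filter()
```

The test asserts the full expected order rather than only the top hit, so a change in either the weighting or the filter handling would cause it to fail.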