Discussing Large Document Indexing in Word Files
TLDR robert asked about indexing large word files. Kishore Nallan advised splitting into smaller documents for improved performance.
1
Aug 03, 2022 (17 months ago)
robert
02:29 PMSnippet
feature. What's the performance implication?Kishore Nallan
02:31 PM1
Typesense
Indexed 3005 threads (79% resolved)
Similar Threads
Optimal Indexing and Querying of Large Documents
Robert asks about the best practice for indexing large documents and the ideal size of subdocuments. Jason suggests experimenting with 10K words in a single document and performance testing.
Discussing Maximum Words Per Field and Performance Impacts
robert sought advice on the performance impacts of large text fields in search response, and Kishore Nallan advised reading from disk would be slow. They also highlighted a limitation in Typesense.
Estimating RAM Requirements for Indexing Documents
Epi asked about index sizes in relation to document sizes and RAM requirements for their dataset. Kishore Nallan suggested indexing a sample and extrapolating results, and confirmed suitability for indexing large documents like Wikipedia articles in Typesense.