Docsearch Custom Settings in Typesense Scraper
TLDR Marcos wants to specify token_separators and symbols_to_index without forking the docsearch scraper. Jason suggests opening a GitHub issue to add support for custom settings in the scraper.
Mar 29, 2023
Marcos
10:17 PM
Marcos
10:18 PM [...] token_separators and symbols_to_index using the docsearch scraper? I can't just update the schema since it'll be overridden next update.
Jason
10:22 PM
Jason
10:23 PM
Marcos
11:29 PM [...] custom_settings like the Algolia's?
Jason
11:30 PM
Marcos
11:33 PM
Marcos
11:33 PM [...] custom_settings options in the docsearch.config.json: https://github.com/algolia/docsearch-configs/blob/master/configs/docusaurus-2.json#L29-L30
Marcos
11:34 PM
Marcos
11:35 PM [...] custom_settings are Algolia-specific collection settings
Jason
11:38 PM
Jason
11:39 PM
Marcos
11:39 PM
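The request in this thread can be illustrated with a minimal Python sketch. It assumes a hypothetical custom_settings section in the scraper config whose keys are merged into the collection schema each time the scraper regenerates it, so the values survive the next update. The function name apply_custom_settings, the example schema, and the example config values are all assumptions for illustration; this is not the scraper's actual implementation. Only token_separators and symbols_to_index are taken from the thread.

```python
import copy

# Collection-level schema fields named in the thread.
ALLOWED_SETTINGS = {"token_separators", "symbols_to_index"}


def apply_custom_settings(schema, config):
    """Return a copy of `schema` with the keys from the hypothetical
    `custom_settings` section of the scraper config merged in."""
    merged = copy.deepcopy(schema)
    custom = config.get("custom_settings", {})
    unknown = set(custom) - ALLOWED_SETTINGS
    if unknown:
        raise ValueError(f"unsupported custom_settings keys: {sorted(unknown)}")
    merged.update(copy.deepcopy(custom))
    return merged


# Hypothetical schema the scraper would regenerate on every run.
generated_schema = {
    "name": "docs",
    "fields": [{"name": "content", "type": "string"}],
}

# Hypothetical docsearch.config.json contents, loaded as a dict.
scraper_config = {
    "index_name": "docs",
    "custom_settings": {
        "token_separators": ["-", "_"],
        "symbols_to_index": ["@", "#"],
    },
}

# The merged schema carries the custom settings; the generated one is untouched.
schema = apply_custom_settings(generated_schema, scraper_config)
```

Because the merge runs on every scrape, the settings would not need to be patched onto the collection by hand after each update, which is the problem Marcos describes.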
Similar Threads
Solving Typesense Docsearch Scraper Issues
Sandeep was having issues with Typesense's docsearch scraper and getting fewer results than with Algolia's scraper. Jason helped by sharing the query they use and advised checking the running version of the scraper. The issue was resolved when Sandeep ran the non-base regular docker image.
Docsearch Scraper Metadata Configuration and Filter Problem
Marcos faced issues with the Docsearch scraper not adding metadata attributes and filtering out documents without content. Jason helped fix the issue by updating the scraper and providing filtering instructions.

Trouble with DocSearch Scraper and Pipenv Across Multiple OSs
James ran into errors when trying to build the Typesense DocSearch Scraper from scratch and believed the cause was a bad Pipfile.lock. Jason tried to replicate the error, spent hours isolating the issue, ultimately fixed the problem, and copied his bash history for future reference. The conversation also touches briefly on using a virtual machine for testing.

Crawler Deleting Old Collection and Creating New Name
James faced issues with Typesense as the crawler changed collection names, breaking their production website. Jason suggested changing "index_name" in their config file to their desired name and explained the reason behind the name combination differences.

Issues with migrating search from Algolia to Typesense
Nacho encountered issues with migrating search from Algolia to Typesense, including a custom component that interfered with the search and an incorrect element being used as the title. Jason provided guidance on removing the component and adjusting search configuration.

