Indexing Contact Information with Separators
TLDR: CDoe is indexing contact info and asked whether combining the separators recommended in the article for phone numbers and emails would work. @4L5c7 confirms and suggests also adding `_` for emails.
Jul 14, 2023 (4 months ago)
I have a `profiles` collection where I'm looking to index basic contact information like people's names, phone numbers, emails, physical addresses, and general notes. I read through the 'tips for common data types' article. Would combining the recommended separators for phone and email work for a use case like this?
token_separators: ['(', ')', '-', '+', '@', '.']
Yes. I'd add `_` as well for email addresses.
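For readers who want to see how this fits together, here is a minimal Python sketch using the separator list from the question plus `_`. The field names in the schema are assumptions, since the thread only names the kinds of data being indexed. `split_tokens` is a hypothetical helper that only approximates the effect of the separators; it is not Typesense's actual tokenizer.

```python
import re

# Hypothetical schema for the `profiles` collection. Field names are
# assumptions; the thread does not list them.
profiles_schema = {
    "name": "profiles",
    "fields": [
        {"name": "name", "type": "string"},
        {"name": "phone", "type": "string"},
        {"name": "email", "type": "string"},
        {"name": "address", "type": "string"},
        {"name": "notes", "type": "string"},
    ],
    # Collection-level setting, so it applies to every field above.
    "token_separators": ["(", ")", "-", "+", "@", ".", "_"],
}


def split_tokens(text, separators):
    """Rough illustration only: split on whitespace and the given separators."""
    pattern = "[" + re.escape("".join(separators)) + r"\s]+"
    return [t.lower() for t in re.split(pattern, text) if t]


seps = profiles_schema["token_separators"]
print(split_tokens("jane_doe@example.com", seps))
print(split_tokens("+1 (555) 010-9999", seps))
```

With the official Python client, a schema dictionary like this would typically be passed to `client.collections.create(profiles_schema)`. Because the setting is collection-wide, the same separators also split text in the address and notes fields, which is worth keeping in mind when choosing them.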
Restricting `token_separators` to a Specific Field in Typesense
Loic asked Jason about applying `token_separators` to a specific field in Typesense. Jason suggested opening a GitHub issue to request this feature.
Dealing with Hyphenated Search Results in Typesense
Martin faced issues with hyphenated search results in Typesense. Jainil suggested using `token_separators` to index such search terms separately.
Handling Zero-width Non-joiner in Typesense
Arad asks about handling a Persian character equivalence in Typesense. Jason advises adding the Unicode character to `token_separators` but acknowledges that only single-byte characters are currently supported. A GitHub issue was created.