Understanding Locales Supported by Typesense
TLDR: Carl-Gerhard asked which locales Typesense supports. Kishore Nallan confirmed support for some locales and clarified that Typesense does not handle singular/plural differences.
Nov 11, 2022 (13 months ago)
Carl-Gerhard
11:14 AM I have been trying to figure out which locales Typesense supports. Can't find anything documented about different locales. The only thing I can find is that, when creating a new collection, each field has a "locale": "". I tried using this with the JS client but I'm getting a TypeScript error that locale does not exist on CollectionFieldSchema.
Kishore Nallan
11:15 AM There's decent support for Korean, Thai and Cyrillic. What locale are you looking for?
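For readers following along, here is a minimal sketch of a collection schema that sets a per-field locale. It is an illustration rather than the client's official typing: the interfaces FieldWithLocale and CollectionSchemaSketch, the collection name, the field names, and the locale codes "ko" and "th" are all assumptions made for this example. Local types are declared because the thread reports that CollectionFieldSchema in the JS client did not include locale at the time.

```typescript
// Hypothetical local types (not the typesense-js typings): declared here
// because the thread reports that `locale` was missing from the client's
// CollectionFieldSchema type.
interface FieldWithLocale {
  name: string;
  type: string;
  locale?: string; // left out for fields that use the default tokenizer
}

interface CollectionSchemaSketch {
  name: string;
  fields: FieldWithLocale[];
}

// Assumed example data: one field per language, each with its own locale.
const productsSchema: CollectionSchemaSketch = {
  name: "products",
  fields: [
    { name: "title", type: "string" },
    { name: "title_ko", type: "string", locale: "ko" }, // Korean (assumed code)
    { name: "title_th", type: "string", locale: "th" }, // Thai (assumed code)
  ],
};

// List the fields that carry an explicit locale.
const localizedFields = productsSchema.fields
  .filter((f) => f.locale !== undefined)
  .map((f) => `${f.name}:${f.locale}`);

// JSON body that could be sent when creating the collection.
const payload = JSON.stringify(productsSchema);
console.log(localizedFields, payload.length);
```

If the installed client typings still reject locale, one possible workaround is to pass an object like this through a type cast when calling the client's collection create method, or to send the JSON body to the collections endpoint directly.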
Kishore Nallan
11:16 AM
Carl-Gerhard
11:19 AM
Kishore Nallan
11:20 AM
Carl-Gerhard
11:26 AM And search for a pluralized word when it's indexed as singular
Carl-Gerhard
11:26 AM
Kishore Nallan
11:27 AM
Carl-Gerhard
11:34 AM
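Since the summary notes that singular/plural differences are not handled, one possible workaround (not something proposed in this thread) is to register each singular word and its plural as synonyms of each other. The sketch below only builds the payloads. The helper names naivePlural and pluralSynonym are hypothetical, the pluralization rules are deliberately naive and English-only, and the { synonyms: [...] } shape is an assumption about what a multi-way synonym entry looks like.

```typescript
// Hypothetical helper: very naive English pluralization, for illustration only.
// Irregular nouns (e.g. "mouse" -> "mice") are not handled.
function naivePlural(word: string): string {
  if (/(s|x|z|ch|sh)$/.test(word)) return word + "es";
  if (/[^aeiou]y$/.test(word)) return word.slice(0, -1) + "ies";
  return word + "s";
}

// Assumed payload shape for a multi-way synonym entry.
interface SynonymPayload {
  synonyms: string[];
}

// Pair a singular word with its naive plural so either form matches the other.
function pluralSynonym(singular: string): SynonymPayload {
  return { synonyms: [singular, naivePlural(singular)] };
}

const entries: SynonymPayload[] = ["shoe", "box", "city"].map(pluralSynonym);
console.log(JSON.stringify(entries));
```

Each entry would then be stored under its own ID through whatever synonym mechanism the server version in use provides. A vocabulary-driven list is likely to be more reliable than rule-based pluralization for real data.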
Similar Threads
Understanding Typesense Language Search and Locale
Juri inquired about language search in Typesense, and Kishore Nallan clarified it requires a locale to index certain languages. Discussion also included how to specify these locales and that multiple languages can be searched without decreasing performance.
Typesense Support for Non-English Languages
omega enquired about Typesense's support for non-English languages. Kishore Nallan suggested using separate fields for different languages.
Typesense Support for Japanese and Chinese Languages
Juri asked whether Typesense might not support Japanese-only and Chinese-only applications. Kishore Nallan said that support for CJK languages is in alpha testing and is already partly usable, though not yet publicly documented.
Implementing Multilingual Texts in Typesense
Juri wanted to implement bilingual text for a blog platform in Typesense. Kishore Nallan suggested creating different fields for each language. After some discussion, they agreed to use separate fields for complex languages that require tokenization, while combining other languages. They also discussed possibly disabling typo tolerance for some languages.
Typesense Support for Chinese, Japanese, and Korean Languages
Pete inquired about support for logographic languages in Typesense. Jason advised that the support is evolving based on feedback and suggested indexing data for testing and providing feedback.