Korean Language Classification and Support in Typesense
TLDR Pete asked if Typesense classified Korean as logographic. Jason clarified that the software works with all languages that use word spaces, and that special support for Korean was added recently. Advised Pete to test the system with a Korean dataset.
1
1
Nov 09, 2022 (13 months ago)
Pete
09:09 PMJason
09:14 PM1
Jason
09:15 PMlocale
for each field, to use the improved Korean tokenizerPete
09:17 PMThere are definitely spaces between words. I can see how it could seem different to English and romantic languages though.
Jason
09:19 PM1
Typesense
Indexed 3015 threads (79% resolved)
Similar Threads
Typesense Support for Chinese, Japanese, and Korean Languages
Pete inquired about logographic support in typesense. Jason advised that the support is evolving based on feedback and suggested indexing data for testing and providing feedback.
Request for Adding Japanese Language Support in Typesense
Steffen inquired about anticipated support for logographic languages in Typesense. Jason provided updates on Thai, Vietnamese, and Korean support and asked about specific languages. Steffen expressed user interest in Japanese.
Troubleshooting Typo Tolerance Issue with Typesense for Korean
Minyong informed Kishore Nallan about a typo tolerance issue in Typesense with Korean text. Kishore Nallan suggested adjusting the byte difference limit for Korean, but warned this could slow down the search function. Minyong approved testing the solution.