#community-help

Typesense Support for Chinese, Japanese, and Korean Languages

TLDR Pete inquired about logographic support in typesense. Jason advised that the support is evolving based on feedback and suggested indexing data for testing and providing feedback.

Powered by Struct AI

1

7
13mo
Solved
Join the chat
Sep 29, 2022 (13 months ago)
Pete
Photo of md5-76926e1c8a72128d7fee4a61950cfd89
Pete
04:19 PM
Hello! Our company is really interested in using typesense for our search solution. I am a Sr. PM for a large SAAS company that provides service for the health care industry. Anyone able to give me an indication when logographic support will be available?
Jason
Photo of md5-8813087cccc512313602b6d9f9ece19f
Jason
04:20 PM
👋 May I know which specific languages you’re looking for?
Pete
Photo of md5-76926e1c8a72128d7fee4a61950cfd89
Pete
04:20 PM
Chinese, Japanese, and Korean
04:20
Pete
04:20 PM
Kanji specifically for Japanese
Jason
Photo of md5-8813087cccc512313602b6d9f9ece19f
Jason
04:24 PM
We’ve been iterating on these languages slowly over the last year based on community feedback, and we’ve heard that Typesense now works reasonably well for Chinese and Korean (as of v0.24.0.rcn15). So any improvements from here would require more feedback from you, so we can address them as we go along.

For Japanese, we made some improvements last year, but we haven’t heard any feedback from users about how it works…

So long story short, we’ve been making iterative improvements for CJK based on community feedback. I’d recommend indexing your data in Typesense (specifically using the locale setting for each field when creating a collection) and seeing how it works. We can then improve further based on your feedback.
04:24
Jason
04:24 PM
We don’t have native speakers of these languages in our core team, so definitely need this feedback to be able to make progress on this front.
Pete
Photo of md5-76926e1c8a72128d7fee4a61950cfd89
Pete
04:51 PM
Cool this helps thanks Jason

1