Memory Leak Issue with Typesense in Cyrillic
TLDR Alexander found a memory leak when using Typesense's docker image that was rectified by Kishore Nallan with a new build (0.23.0.rc53), which also supports Cyrillic search.
Mar 31, 2022 (18 months ago)
Alexander
12:59 PM
I’m using the typesense/typesense:0.23.0.rc8 docker image (an rc version, for locale-aware indexing of queries in Cyrillic), but I recently found that the typesense container apparently has a memory leak: it takes ~24h to reach 8 GB of RAM usage (the server limit) with a total index size of 30 MB and ~20,000 documents across multiple indexes.
I tried to upgrade to the latest rc version (rc49), but with this version Cyrillic is not supported at all (neither querying nor faceting).
Am I missing some configuration introduced in the latest rc’s?
Kishore Nallan
01:01 PM
Alexander
04:41 PM
{
  "searches": [
    {
      "per_page": 24,
      "query_by": "Код,Артикул,Наименование,Бренд",
      "sort_by": "",
      "highlight_full_fields": "Код,Артикул,Наименование,Бренд",
      "collection": "all",
      "q": "*",
      "facet_by": "Категория,ВНаличии,ДилерскаяЦенаРуб,Бренд,Страна",
      "filter_by": "",
      "max_facet_values": 300,
      "page": 1
    }
  ]
}
Response:
{
  "results": [
    {
      "facet_counts": [],
      "found": 0,
      "hits": [],
      "out_of": 11653,
      "page": 1,
      "request_params": {
        "collection_name": "all",
        "per_page": 5,
        "q": "стол"
      },
      "search_cutoff": false,
      "search_time_ms": 1
    }
  ]
}
Indexed document:
{
  "id": "5692",
  "Артикул": "282701",
  "Бренд": "brand",
  "Категория": "Столы холодильные",
  "КатегорияЛатиницей": "stoly-kholodilnye",
  "Код": "5692",
  "Модель": "GN 11/TN",
  "Наименование": "GN 11/TN стол охл. (-2+10), 2 двери, 1390х700х850мм, HICOLD RUS",
  "НаименованиеПолное": "GN 11/TN стол охл. (-2+10), 2 двери, 1390х700х850мм, HICOLD RUS",
  "Популярность": 1000000051,
  "Статус": "Основной ассортимент",
  "Страна": "Россия"
}
Alexander
04:44 PM
rc49: empty search results for Cyrillic queries (querying against fields indexed with ‘locale: “ru”’)
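For context, here is a rough sketch of what a collection schema with `locale: "ru"` on the searched fields might look like. The collection name and field names are taken from the request and document above; the field types, the choice of which fields carry the locale, and the helper `build_schema` itself are assumptions for illustration, not Alexander's actual schema.

```python
import json


def build_schema(name, text_fields, locale="ru"):
    """Hypothetical helper: build a schema dict whose string fields
    all carry the given locale. Field types here are assumed."""
    return {
        "name": name,
        "fields": [
            {"name": field, "type": "string", "locale": locale}
            for field in text_fields
        ],
    }


# Field names come from the "query_by" parameter in the request above.
schema = build_schema("all", ["Код", "Артикул", "Наименование", "Бренд"])

# ensure_ascii=False keeps the Cyrillic field names readable in the output.
print(json.dumps(schema, ensure_ascii=False, indent=2))
```

The resulting dict is the kind of body that would be sent when creating the collection; facet fields and numeric fields are left out because their types are not stated in the thread.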
Apr 04, 2022 (17 months ago)
Kishore Nallan
03:00 PM
Apr 14, 2022 (17 months ago)
Kishore Nallan
07:08 AM
I've also published a new build, 0.23.0.rc53, that contains this fix, and I also tested that ru search works fine on this build; see this gist: https://gist.github.com/kishorenc/b207c23ff59198dbaecca7007fdea0db