Dian Alhusari
05/15/2025, 8:05 AMembed
feature in Typesense 0.25 for semantic search.
Here’s a part of my schema:
{
"name" : "embedding_multi",
"type" : "float[]",
"embed": {
"from": [
"description",
"item_details_en"
],
"model_config": {
"model_name": "ts/multilingual-e5-base"
}
}
},
In this setup:
• description
is a regular string (e.g., product description)
• item_details_en
is an array of objects like:
[
{ "name": "Material", "title": "Stainless Steel" },
{ "name": "Color", "title": "Black" }
]
I would like to ask:
• How does Typesense internally convert these values into a single string before sending them to the embedding model?
• Will the values in the object array be concatenated, and in what format (e.g., "Material: Stainless Steel\nColor: Black"
or just raw values)?
• Do I need to pre-process item_details_en
into a string manually, or is it handled internally?