Using Multilingual-e5-Base Model in Huggingface
TLDR Bill asked about using the Multilingual-e5-Base model and the required structure. Kishore Nallan advised that prefixes are handled automatically in Typesense. The user had error issues but resolved them, and Kishore Nallan mentioned a recent fix.
2
Sep 06, 2023 (3 months ago)
Bill
01:15 PMKishore Nallan
01:17 PMBill
01:18 PMBill
01:19 PM"model_name": "ts/multilingual-e5-base",
"indexing_prefix": "query:",
"query_prefix": "query:"
}
Bill
01:20 PMKishore Nallan
01:26 PMBill
01:27 PMKishore Nallan
01:34 PMBill
01:35 PM{
"name": "embedding",
"type": "float[]",
"embed": {
"from": ["product_name"],
"model_config": {
"model_name": "ts/multilingual-e5-base",
"indexing_prefix": "passage:",
"query_prefix": "query:"
}
}
}
Am i right?
Kishore Nallan
01:37 PMBill
01:38 PMKishore Nallan
04:17 PM1
Sep 07, 2023 (3 months ago)
Bill
01:32 PM{
"name": "products",
"fields": [
{
"name": "brand",
"type": "string"
},
{
"name": "embedding",
"type": "float[]",
"embed": {
"from": [
"brand"
],
"model_config": {
"model_name": "ts/multilingual-e5-base"
}
}
}
]
}
Bill
01:32 PM"message": "Bad request."
}
Bill
01:32 PMBill
01:33 PMJason
03:13 PMSep 08, 2023 (3 months ago)
Bill
06:18 AMKishore Nallan
06:19 AM1
Typesense
Indexed 3015 threads (79% resolved)
Similar Threads
Utilizing Vector Search and Word Embeddings for Comprehensive Search in Typesense
Bill sought clarification on using vector search with multiple word embeddings in Typesense and using them instead of OpenAI's embedding. Kishore Nallan and Jason informed him that their development version 0.25 supports open source embedding models. They also resolved Bill's concerns regarding search performance, language support, and limitations in the search parameters.
Issue with Creating Embeddings using Specific Model Spec on Typesense Cloud
Joel shared issues while creating embeddings with `ts/multilingual-e5-large` model on Typesense Cloud v0.25.1. Jason suggested an upgrade, resolving the issue.
Issue with Embedding Error in Version 0.25.0.rc63
Bill reported a bug in version 0.25.0.rc63 regarding a problem with updating or emplacing a document and receiving an embedding error. This was resolved in version 0.25.0.rc65, but further discussion ensued regarding the function of 'index' in the update feature.
Resolving Multilingual Search Function in Typesense Software
Bill is having difficulty with multilingual search functionality in Typesense software. Developer Kishore Nallan suggested setting a language locale and provided a demo build. The build solution had some issues, and after multiple rounds of software updates and troubleshooting, the problem still persists.
Discussions on Typesense, Collections, and Dynamic Fields
Tugay shares plans to use Typesense for their SaaS platform and asks about collection sizes and sharding. Jason clarifies Typesense's capabilities and shares a beta feature. They discuss using unique collections per customer and new improvements. Kishore Nallan and Gabe comment on threading and data protection respectively.