# community-help
n
About the conversation API again: we're having some difficulty getting it to work for our use case. It kind of works, but it's not great. I was wondering if you had more documentation about how it actually works? For example, how do you manage the interaction between Typesense's collection and the LLM? What is the flow of data when the user makes a request?
Before we build our own version, I want to make sure we're not missing some basic tuning we could do...
Maybe most of what we need can be achieved with a better system prompt, but it's hard to design it without knowing how the data flows
j
We take the question, along with all the search parameters you specified (and you want to make sure you're using `query_by: embedding` to do a semantic search, so retrieval works properly), fetch the top results for that search query, and then send them to the LLM along with the system prompt
For follow-up questions we do this:
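The details of the follow-up step aren't preserved above, but the documented mechanism is the `conversation_id` returned with each response. Here's a minimal sketch of both the initial request and a follow-up, assuming a local Typesense server, a `docs` collection with an auto-embedding field, and a conversation model id of `conv-model-1` (all placeholder values, not from this thread):

```python
import requests

TYPESENSE_HOST = "http://localhost:8108"
HEADERS = {"X-TYPESENSE-API-KEY": "xyz"}  # placeholder admin key

# Initial question: semantic retrieval via query_by=embedding, with
# conversation=true so the top hits plus the model's system prompt
# are sent to the LLM.
resp = requests.get(
    f"{TYPESENSE_HOST}/collections/docs/documents/search",
    headers=HEADERS,
    params={
        "q": "how do I configure backups?",
        "query_by": "embedding",
        "conversation": "true",
        "conversation_model_id": "conv-model-1",
    },
).json()

answer = resp["conversation"]["answer"]
conversation_id = resp["conversation"]["conversation_id"]

# Follow-up question: pass the conversation_id back so the prior
# exchange is included as context for the LLM.
followup = requests.get(
    f"{TYPESENSE_HOST}/collections/docs/documents/search",
    headers=HEADERS,
    params={
        "q": "and how often should I run them?",
        "query_by": "embedding",
        "conversation": "true",
        "conversation_model_id": "conv-model-1",
        "conversation_id": conversation_id,
    },
).json()
```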
One key thing to take into account is the `max_bytes` setting in the conversation model resource. If that context window is too low, then you might get odd results
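For reference, a minimal sketch of creating that conversation model resource with `system_prompt` and `max_bytes` set explicitly; the model name, key, and history collection below are placeholder assumptions:

```python
import requests

# Sketch only: field values below are placeholders. max_bytes caps how
# much search-result and history context gets packed into the LLM
# prompt, i.e. the context window mentioned above.
resp = requests.post(
    "http://localhost:8108/conversations/models",
    headers={"X-TYPESENSE-API-KEY": "xyz"},
    json={
        "id": "conv-model-1",
        "model_name": "openai/gpt-4o",
        "api_key": "OPENAI_API_KEY_HERE",
        "history_collection": "conversation_store",
        "system_prompt": "Answer only from the provided search results.",
        "max_bytes": 16384,  # raise this if answers seem to lose context
    },
)
print(resp.json())
```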
Separately, here's another thing to consider: https://threads.typesense.org/2J4577a
n
Yes, thanks, that's super helpful. I had seen that other thread but wanted to make sure I understood the limitations before going that way.
Do you know of examples out there that use your flow as-is?
j
This is a demo we just wrapped up last week: https://conversational-search-pg-essays.typesense.org
🚀 1
👍 1
s
@Jason Bosco Is it also possible to add Azure OpenAI? Furthermore, before sending the search results to the LLM, is it possible to somehow send them to a reranker (open-source, or e.g. Cohere)? Another major thing is the search itself: since you mentioned using `query_by: embedding`, is it generally possible to also add additional factors, other than just vector search?
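The Azure OpenAI and reranker parts of this question go unanswered below, but the last part (combining other factors with vector search) maps to Typesense's documented hybrid search: list both keyword fields and the embedding field in `query_by`. A sketch with placeholder collection and field names:

```python
import requests

# Hybrid search sketch: keyword relevance on make/model/description is
# fused with semantic similarity on the embedding field, and regular
# filters still apply. All names here are placeholders.
resp = requests.get(
    "http://localhost:8108/collections/cars/documents/search",
    headers={"X-TYPESENSE-API-KEY": "xyz"},
    params={
        "q": "family SUV with good mileage",
        "query_by": "make,model,description,embedding",  # keyword + semantic
        "filter_by": "price:<40000",
    },
).json()
```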
n
Super helpful, thanks for the demo; it helps anchor the core use case. I think I gravitate more towards the other approach you highlighted, where a user actually wants a search but does not necessarily know how to set the correct filters, or when a follow-up search depends on previous results
f
We actually have a demo that may match your use case. Instead of returning human-readable responses, the LLM formats the correct query to Typesense. Here's a link to the source code and here are the live versions
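The demo's actual code lives in the linked repo; as a rough, hypothetical sketch of the idea (the LLM emits search parameters instead of prose), using the OpenAI Python client and an invented car-listings schema:

```python
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Invented schema hint for illustration; a real version would be
# generated from the actual collection schema.
SCHEMA_HINT = (
    "Collection fields: make (string), model (string), price (int), "
    "year (int), description (string)."
)

def question_to_search_params(question: str) -> dict:
    """Ask the LLM to turn a natural-language question into Typesense
    search parameters (q, filter_by, sort_by) expressed as JSON."""
    completion = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {
                "role": "system",
                "content": (
                    "Translate the user's question into Typesense search "
                    "parameters. " + SCHEMA_HINT + " Respond with JSON only, "
                    'e.g. {"q": "...", "filter_by": "...", "sort_by": "..."}.'
                ),
            },
            {"role": "user", "content": question},
        ],
        response_format={"type": "json_object"},
    )
    return json.loads(completion.choices[0].message.content)

# e.g. {"q": "SUV", "filter_by": "price:<20000 && year:>2020", ...},
# which can then be passed straight to the regular search endpoint.
params = question_to_search_params("cheap SUVs from after 2020")
```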
🙌 1
n
Really cool thanks!
Did you try to add a conversation mode on top that would help the user refine results or ask for associated results? For example, once you have a first set of results, asking for cars in a similar price range but a different brand?
f
This demo is limited to just generating queries, and does not keep track of the current convo context, so all subsequent queries won't keep the previous ones in mind
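One hypothetical way around that limitation, continuing the query-generation sketch above: keep a running history of (question, generated params) pairs and feed the recent ones back into the prompt, so a follow-up like "similar price range but a different brand" can be resolved against the previous query:

```python
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
SCHEMA_HINT = (
    "Collection fields: make (string), model (string), price (int), "
    "year (int), description (string)."
)
history: list[dict] = []  # (question, params) pairs from earlier turns

def followup_to_search_params(question: str) -> dict:
    """Generate search params, resolving references against recent turns."""
    context = json.dumps(history[-5:])  # last few turns as context
    completion = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {
                "role": "system",
                "content": (
                    "Translate the user's question into Typesense search "
                    "parameters. " + SCHEMA_HINT + " Earlier turns as "
                    "(question, params) pairs: " + context + ". Resolve "
                    "references to previous results against them. "
                    "Respond with JSON only."
                ),
            },
            {"role": "user", "content": question},
        ],
        response_format={"type": "json_object"},
    )
    params = json.loads(completion.choices[0].message.content)
    history.append({"question": question, "params": params})
    return params
```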
n
Ok, I'll see what we can do with the previous responses' context
🙌 1