Hello, I'm trying Typesense as a search solution for my app (Typesense #community-help)

Rolando Guedes

07/05/2021, 9:07 AM

Hello, I'm trying Typesense as a search solution for my app. I'm searching by location address and I ran this search: q=maria&query_by=address. I got these results:

"highlights": [
        {
          "field": "address",
          "matched_tokens": [
            "Maria"
          ],
          "snippet": "Praça Dona Maria II 1714, Portugal"
        }
      ],

"highlights": [
        {
          "field": "address",
          "matched_tokens": [
            "Carvalho"
          ],
          "snippet": "R. José Carvalho Sá Miranda 44, Portugal"
        }
      ],

The first result is correct, but the second is completely wrong. I have more wrong results like that. Why? What am I doing wrong? I don't have any synonyms.

Kishore Nallan

07/05/2021, 9:12 AM

Hi Rolando. Can you please tell me what version of Typesense you are using?

Rolando Guedes

07/05/2021, 9:16 AM

v0.20.0. I'm trying both self-hosted and cloud.typesense.org.

Kishore Nallan

07/05/2021, 9:21 AM

I think the issue might be simply because the word "Carvalho" is within a typo distance of 2 from "Maria" if you are making a prefix search, because "Carva" is compared with "Maria" (replace C with M and V with I). There are two options. You can try setting num_typos to 1, or you can try setting the typo_tokens_threshold parameter to 1.
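The arithmetic behind that explanation can be checked with a small sketch. This is a plain Levenshtein distance written for illustration, not Typesense's actual matching code; the helper names `edit_distance` and `prefix_distance` are hypothetical, and lowercasing both words is an assumption about normalization.

```python
def edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance: insert, delete, substitute each cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute (or match)
        prev = curr
    return prev[-1]


def prefix_distance(query: str, indexed_word: str) -> int:
    """Compare the query with the same-length prefix of the indexed word.

    Illustrative assumption: a prefix search only looks at the first
    len(query) characters of the indexed word.
    """
    return edit_distance(query.lower(), indexed_word.lower()[: len(query)])


# "carva" vs "maria": c->m and v->i, so two substitutions.
print(prefix_distance("maria", "Carvalho"))  # 2
# Without the prefix truncation, the whole words are much further apart.
print(edit_distance("maria", "carvalho"))    # 5
```

Under these assumptions, a limit of 2 typos lets "Carvalho" through on a prefix search, while a limit of 1 would not.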

Kishore Nallan

07/05/2021, 9:21 AM

Having said that, we've also fixed a few issues with typo correction on the pre-release builds. Can you please try it on the typesense/typesense:0.21.0.rc20 Docker image locally once?

Rolando Guedes

07/05/2021, 9:33 AM

After updating to 0.21 it was even worse. With 0.21 + typo_tokens_threshold it was a little better: at least Carvalho disappeared. With num_typos the results are OK.

Kishore Nallan

07/05/2021, 9:34 AM

Can you give me an example where it does not work with 0.21 + typo_tokens_threshold=1?

Rolando Guedes

07/05/2021, 9:37 AM

The same query got this result:

"highlights": [
        {
          "field": "address",
          "matched_tokens": [
            "Mal."
          ],
          "snippet": "Av. Mal. Humberto Delgado 206, 4760-012 Vila Nova de Famalicão, Portugal"
        }
      ],

Rolando Guedes

07/05/2021, 9:38 AM

With num_typos I got the right number of results.

Kishore Nallan

07/05/2021, 9:40 AM

Thanks for the example. It is just the way fuzzy prefix matching works on single-word queries. The first 3 letters of "mal" and "maria" are considered, and they tend to match. If you remove the prefix searching option, it won't match even with num_typos:2.
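For reference, the options discussed so far (num_typos, typo_tokens_threshold, and turning off prefix search) are all query parameters on the search request. Below is a minimal sketch that only builds the request URLs and sends nothing. The server address `http://localhost:8108`, the collection name `locations`, and the helper `build_search_url` are assumptions for illustration; the fallback values in the function signature are just this sketch's choices.

```python
from urllib.parse import urlencode

BASE_URL = "http://localhost:8108"  # assumption: a local self-hosted server
COLLECTION = "locations"            # hypothetical collection name


def build_search_url(q, query_by, *, num_typos=2, prefix=True,
                     typo_tokens_threshold=None):
    """Build a search URL for the documents/search endpoint (no request is sent)."""
    params = {
        "q": q,
        "query_by": query_by,
        "num_typos": num_typos,
        "prefix": "true" if prefix else "false",
    }
    # Only include the threshold when the caller sets it explicitly.
    if typo_tokens_threshold is not None:
        params["typo_tokens_threshold"] = typo_tokens_threshold
    return f"{BASE_URL}/collections/{COLLECTION}/documents/search?{urlencode(params)}"


# Option 1 from the thread: allow at most one typo.
strict_url = build_search_url("maria", "address", num_typos=1)
# Option 2 from the thread: set typo_tokens_threshold to 1.
threshold_url = build_search_url("maria", "address", typo_tokens_threshold=1)
# Keep num_typos at 2 but turn prefix matching off.
no_prefix_url = build_search_url("maria", "address", num_typos=2, prefix=False)

for url in (strict_url, threshold_url, no_prefix_url):
    print(url)
```

A real request would also need the API key, sent in the X-TYPESENSE-API-KEY header.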

Rolando Guedes

07/05/2021, 9:47 AM

More examples 😄 The same query + 0.21.0.rc20 + num_typos:2 + prefix=false got this:

"highlights": [
        {
          "field": "name",
          "matched_tokens": [
            "Bar"
          ],
          "snippet": "PRÍNCIPE - Restaurante e Snack Bar"
        }
      ],

Kishore Nallan

07/05/2021, 9:48 AM

This is something for me to investigate. I will take a look.

Rolando Guedes

07/05/2021, 9:49 AM

The same query + 0.20.0 + num_typos:2 + prefix=false gives the right results.

👍 1

Kishore Nallan

07/06/2021, 2:52 PM

@Rolando Guedes Can you please try against this Docker build: typesense/typesense:0.21.0.rc21
