Pavel Koroteev
01/17/2025, 3:53 PMPavel Koroteev
01/17/2025, 3:55 PMuniversal search
. It’s the name of my docs search, so I don’t want to stem it actually. It was stemmed to uni
, and found words like unit
, united
, etc.
• rhy
. We have internal service name rhythm
, so before stemming it was found by prefix search. Now it was stemmed to rhi
, therefore it found some another results (rhino
), but not rhythm
.
• ingres grpc
. Without stemming typesense fixes a typo to ingress
and found many related results by correct name of k8s concept and grpc framework. With stemming it gets ingr
stem, and it is too far from ingress
by typos, the results are worse.
• pulse
. We have exactly the same name for a service, so before stemming the service was on the top position by exact match
mechanism. Now it was stemmed to puls
and with typo found non-related results because of word pull
in one typo distance.
• add new limit
. It’s something straightforward_._ So, before it has good results, because we have doc about the topic. Now it found words like: adding, added, or so and results a bit worse.
• adaptive layouts
. Here we have great improvements, because of layouts
-> layout
, and adaptive
-> adapt
.
• deployment times
. Here we have improvement, because of times
-> time
.
• coffee
. It’s our internal term, so before it was found by exact match
or so, now it stemmed to coffe
and has poor results.
• sops
. We use the tool, so before it was found by exact match. Now it stemmed to sop
and has poor results.
• apps responsible
. I guess the intent of search is straight: user wants to find responsible
for every app
. So, the only stemming which is needed is apps
-> app
maybe, but it also has responsible
-> respons
, so overall the results worse.
• localization
. There is a pretty straightforward search term, but it’s stemmed to local
, so the results is much worse.
• analy
. It’s not completed word of analytics
, before stemming it was handled by prefix search, but now it’s stemmed to anali
, and it has poor results.
• multisession
. Better, because multisession
-> multisessions
.
• cookie
. Better, because, cookie
-> cookies
.Pavel Koroteev
01/17/2025, 3:57 PMKishore Nallan
01/17/2025, 4:26 PMPavel Koroteev
01/17/2025, 4:29 PMKishore Nallan
01/17/2025, 4:47 PM28.0.rc35
Kishore Nallan
01/17/2025, 4:48 PM28.0.rc32
as well.Dima
01/20/2025, 10:43 AMuniversal
stems to uni
instead of univers
🤔 Snowball demo shows different resultsKishore Nallan
01/20/2025, 10:46 AMKishore Nallan
01/20/2025, 10:47 AM