Óscar Vicente
11/07/2024, 12:09 PMÓscar Vicente
11/07/2024, 12:10 PMKishore Nallan
11/07/2024, 12:18 PMKishore Nallan
11/07/2024, 12:18 PMÓscar Vicente
11/07/2024, 12:19 PMKishore Nallan
11/07/2024, 12:24 PMk
value that pre-fetches the result. This is because due to the approximate nature of search, as the search radius (k
) expands, more relevant documents could be found which affects overall ranking, which leads to a duplication effect.Kishore Nallan
11/07/2024, 12:25 PMÓscar Vicente
11/07/2024, 12:25 PMKishore Nallan
11/07/2024, 12:26 PMÓscar Vicente
11/07/2024, 12:29 PMk
you will start to see this issues, even if you limit the pagination to found / pageSize
pages, right? So without knowing the aproximate results, you can't really guess a good k
for the search. Can I mitigate it by tweaking ef and M parameters?Óscar Vicente
11/07/2024, 12:39 PMÓscar Vicente
11/07/2024, 12:40 PMKishore Nallan
11/07/2024, 12:40 PMÓscar Vicente
11/07/2024, 12:42 PMef
and M
for index, I can improve the quality of the index avoiding some but not all of the duplicates, right?
As a side note, If the index fit within a gpu memory, could it speed up the operations so we can play with higher k
and mitigate this further? I mean, in the futureKishore Nallan
11/07/2024, 12:46 PM