#community-help

Duplication Issue in Document Return

TLDR Carl asked why the same document was returned twice, showing missing values in some properties. Kishore Nallan suggested checking the indexing method and use of curation overrides. Carl opted to provide a query for in-depth troubleshooting.

Powered by Struct AI
Sep 07, 2021 (29 months ago)
Carl
Photo of md5-b4b5ae6f21ade3ef3d7824c4b63090c3
Carl
07:41 AM
Hey 🙂 What would cause results to bring back the same document twice? The returned documents have the same id . One document looks compete but the other is missing values in some of the properties. Any suggestions on where I can begin to investigate the cause?
Kishore Nallan
Photo of md5-4e872368b2b2668460205b409e95c2ea
Kishore Nallan
07:42 AM
:thinking_face: Are you sending the "id" field as part of the document while indexing or is it auto generated by Typesense?
Carl
Photo of md5-b4b5ae6f21ade3ef3d7824c4b63090c3
Carl
07:42 AM
We send the ID as part of the document
Kishore Nallan
Photo of md5-4e872368b2b2668460205b409e95c2ea
Kishore Nallan
07:43 AM
Do you have any overrides defined?
07:43
Kishore Nallan
07:43 AM
Even with that, it should not be repeating. We will need access to the dataset either via a reproduceable sample set or to your data, if that is public data that can be accessed.
Carl
Photo of md5-b4b5ae6f21ade3ef3d7824c4b63090c3
Carl
07:44 AM
I'm unsure what you mean by that. We're only setting up a collection with a bunch of facet fields
07:44
Carl
07:44 AM
I'll PM you with a query you can perform
Kishore Nallan
Photo of md5-4e872368b2b2668460205b409e95c2ea
Kishore Nallan
07:44 AM
Carl
Photo of md5-b4b5ae6f21ade3ef3d7824c4b63090c3
Carl
07:46 AM
Ah okay, no. We're not that fancy yet 😄

Typesense

Lightning-fast, open source search engine for everyone | Knowledge Base powered by Struct.AI

Indexed 3015 threads (79% resolved)

Join Our Community

Similar Threads

Understanding Document ID Fields and Rectifying Duplicate Document Error

John queried about understanding document ID fields and fixed a duplication issue with guidance from Jason. They discovered a bug preventing document deletion due to a URL encodable character, with John opening a GitHub issue for it.

2

49
21mo

Threading Problem During Multiple Collection Creation and Batch Insertion in Typesense

Johan has a problem with creating multiple collections and batch-inserting documents into Typesense, which is returning results from different collections. Kishore Nallan helps troubleshoot the issue and suggests a potential local race condition, which is fixed in a later build.

35
17mo

Troubleshooting Typesense Document Import Error

Christopher had trouble importing 2.1M documents into Typesense due to memory errors. Jason clarified the system requirements, explaining the correlation between RAM and dataset size, and ways to tackle the issue. They both also discussed database-like query options.

3

30
11mo

Typesense Filter Bug Involving Similar Facets

SamHendley reported a bug in Typesense where filtering by facet returns wrong documents, providing a reproduction case. Jason and Kishore Nallan recognized the issue, tracked it on GitHub, and implemented a fix in a new Docker build.

1

17
13mo

Typesense Server Bulk Import/Upsert Issue Resolved

Adam was confused about the discrepancy between the successful responses and the actual indexed data while working with a custom WP plugin integrating with Typesense. The issue was a bug related to fetching documents in the wrong order, not a Typesense problem.

2

22
7mo