Hi Team, could it be that a "special character" is...
# community-help
s
Hi Team, could it be that a "special character" is messing up indexing or querying? Although documentation mention that "Typesense will remove special characters", could be that this special character got through, and querying was returning empty values.
We indexed
Apple TV 128GB 4K
as a string which contained some weird split between Apple and TV. (Not visible in Slack). And later querying by Apple TV the document was not being returned.
k
Interesting, is it possible for you to post a code snippet that I can run to index that data?
s
message has been deleted
Pasting the code here seems to clean it 😕
Maybe it's a edge case
k
Ok we might have to handle this explicitly, can you please create an issue on GH?
s
Do you have where do you sanitize the input? Maybe I can take a look
k
I Just checked, we explicitly split on space and newline. Because non visible white space is not differentiated from regular unicode which we can't split on because that's valid.