# community-help
a
Hi @Kishore Nallan, I just checked your answer under https://github.com/typesense/typesense/issues/329. Is there any documentation or examples available for this feature?
k
Sorry, let me post a comment to that issue.
a
And one more question: I do need to install the v0.22.0 server to use this feature, correct? If so, where or when can I get it?
k
Yes. Is the Linux binary fine?
I've updated my comment on that issue with usage instructions.
a
Yes, I need the Linux binary.
k
Download here:
https://dl.typesense.org/releases/0.22.0.rcs10/typesense-server-0.22.0.rcs10-linux-amd64.tar.gz
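For reference, a minimal sketch of fetching and starting that build on Linux; the data directory path and API key below are illustrative placeholders, not values from this thread:

```sh
# Download and unpack the 0.22.0.rcs10 Linux build linked above
curl -O https://dl.typesense.org/releases/0.22.0.rcs10/typesense-server-0.22.0.rcs10-linux-amd64.tar.gz
tar -xzf typesense-server-0.22.0.rcs10-linux-amd64.tar.gz

# Start the server; the --data-dir and --api-key values are placeholders
./typesense-server --data-dir /tmp/typesense-data --api-key my-api-key
```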
a
thank you!
šŸ‘ 1
Hello Kishore, I have a couple of updates:
1. Geosearch seems to be working fine and fast. I had no time to debug it properly though...
2. With the v0.22.0 server I hit a bad issue when indexing documents (using upsert). We have a lot of documents (about 75k) in our DB that need to be imported into TS. The documents themselves are pretty big; in my current scenario they contain a lot of text and some big non-indexed JSON fields. We import them in batches (5k records per batch), and at the end of each batch's import I analyze the response returned by the TS client. It turned out that when I sent a 5k-record payload, I got back a response containing only ~4.8k "ImportDocumentResponse" objects, with no errors, so my code throws an exception at this stage. I checked whether this also happens with smaller batches, and it did: on a 2k-record batch I got the same problem, with 2k documents sent and 1996 ImportDocumentResponse items back. I also checked whether the documents at the tail of the payload are really missing from the TS collection, and it seems they are there (they do exist on the TS side)! Very weird... I should also mention that some time ago I had a problem with similar symptoms, which I was able to solve by setting the ConnectionTimeout setting to 0 for my TS client. But now I'm a bit lost... I will try to find a more precise testcase, but for now I'm just letting you know that v0.22.0 behaves differently during import.
k
@Anton Khatunzev We've made imports atomic in 0.22.0 -- earlier, if your connection timed out mid-way, the import could end mid-way too. With 0.22.0, however, the entire request body is buffered first and only then does indexing begin, so it is atomic.
Can you please try using curl and see if you find a similar mismatch in the response?
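For reference, a minimal sketch of the curl check suggested here, assuming a local server on the default port 8108; the collection name `docs`, the API key variable, and the file names are placeholders:

```sh
# Import a JSONL batch with upsert, saving the per-document status lines
curl -s -H "X-TYPESENSE-API-KEY: ${TYPESENSE_API_KEY}" \
  -X POST --data-binary @batch.jsonl \
  "http://localhost:8108/collections/docs/documents/import?action=upsert" \
  > response.jsonl

# The import endpoint returns one JSON status object per input document,
# so these two line counts should match exactly
wc -l batch.jsonl response.jsonl
```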
a
@Kishore Nallan Hi, yes, I think I will have to... As I said, the payloads are quite big, so I'll try to find a more precise testcase first.
Hello @Kishore Nallan
I got the testcase.
I tried it with a 3k batch using curl, and I got 2997 items in the response.
It happens from time to time.
Would it be possible to do a screenshare?
k
Oh cool, is it possible to share that dataset with me?
a
Yes.
But it is a 100 MB JSONL file. How would you like to get it?
Can we do a screenshare maybe, so you can see the problem?
k
Do you have a public S3 bucket I can download from? You can encrypt it with my GPG ID.
I can do a screenshare in 30 mins.
a
OK, we don't use our S3 for such things, so I'll prepare a Google Drive link if you don't mind.
k
Yes that works!
k
Downloading
Done, I will try reproducing locally.
a
You may need to trigger it several times. Sometimes it returns 3k items as it should, but sometimes fewer than 3k, which is not correct. The numbers are different every time, btw.
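Since the mismatch is intermittent, one way to chase it is to rerun the same import until the counts diverge; a sketch, reusing the placeholder names from the curl example above:

```sh
# Repeat the import and stop at the first response-line mismatch
for i in $(seq 1 20); do
  curl -s -H "X-TYPESENSE-API-KEY: ${TYPESENSE_API_KEY}" \
    -X POST --data-binary @batch.jsonl \
    "http://localhost:8108/collections/docs/documents/import?action=upsert" \
    > response.jsonl
  sent=$(wc -l < batch.jsonl)
  got=$(wc -l < response.jsonl)
  echo "run $i: sent $sent, got $got"
  [ "$sent" -ne "$got" ] && break
done
```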
k
@Anton Khatunzev Do you use HTTPS?
a
At the moment, no.
k
Okay, I will try with HTTP then.
Unable to reproduce. Tried 10 times šŸ¤” Are you running this against a server or on local dev?
a
Against the server.
I can show you, I think.
k
Okay let's do that.
k
@Anton Khatunzev I tried uploading multiple times to a server in Frankfurt, which is pretty far from me, but I could not reproduce the line count mismatch. Can I give you a debug build with the `id` also returned in the response? That way we can confirm whether only the last few documents' success statuses are dropped, or whether documents in the middle of the response are affected too.
a
Hi, yes, sure. But I don't know when I will be able to test this; I'm a bit busy with other stuff at the moment...
k
Okay, no worries, please ping me when you are free sometime this week. In the meantime, I will try launching another server even further away and see if that helps to reproduce it.
Also, are you sure that the typesense process did not crash? A truncated HTTP response can also happen if the Typesense server crashed and was subsequently restarted by systemd. Checking the process age will confirm whether the process crashed.
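Two quick ways to do the process-age check mentioned here, assuming typesense-server runs under systemd with that unit name:

```sh
# Elapsed running time of the process; a very young process hints at a recent restart
ps -o etime= -p "$(pidof typesense-server)"

# The systemd view also shows the last start time and any restarts
systemctl status typesense-server
```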
a
OK, next time I'll take a look at the process age. Yesterday it looked like it did not crash, but I'm not 100% sure.
k
šŸ‘
@Anton Khatunzev I've been able to reproduce this issue. Working on a fix. Will keep you posted.
a
Great! Thank you
k
@Anton Khatunzev Did you get a chance to try with `rcs14`?
a
Hi @Kishore Nallan, sorry, I forgot to give you feedback. Yes, I installed rcs14, and it seems to work fine with 500-item batches. No errors. Previously I had problems even with smaller ones. At the moment I can't check it with bigger batches, but I will probably be able to do that soon.
But I think the problem has gone away.
k
Thanks for confirming!
a
Hello @Kishore Nallan, I just hit another weird problem with TS. I have a collection with a field named "displaydata_title" (of type string), and when I try to include this field in a sort_by statement I get this error: "message": "Could not find a field named `displaydata_title` in the schema for sorting."
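For context, a sketch of the kind of search request that triggers this error, with placeholder collection, API key, and query values; as the reply below confirms, string fields could not be used in sort_by at this version:

```sh
# Sorting on a string field fails with "Could not find a field named ... for sorting"
curl -s -H "X-TYPESENSE-API-KEY: ${TYPESENSE_API_KEY}" \
  "http://localhost:8108/collections/docs/documents/search?q=test&query_by=displaydata_title&sort_by=displaydata_title:asc"
```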
k
We don't support sorting by string fields yet.
a
oh...
k
Supporting it is on our roadmap.
a
OK, thank you.
k
šŸ‘
Can you please give me a sample dataset to reproduce?