was wondering whether its good for timeseries data typesense #community-help

Join Slack

was wondering whether its good for timeseries data...

# community-help

Josh Benaron

02/05/2023, 8:06 PM

was wondering whether its good for timeseries data?

Jason Bosco

02/05/2023, 8:07 PM

It depends on how you plan to use/query the timeseries data… Could you expand on your use-case?

Josh Benaron

02/05/2023, 8:21 PM

Josh Benaron

02/05/2023, 8:21 PM

saw your tweet

Josh Benaron

02/05/2023, 8:21 PM

gm @Jason Bosco

Josh Benaron

02/05/2023, 8:22 PM

Jason Bosco

02/05/2023, 8:22 PM

Oh haha! Hey Josh!

Josh Benaron

02/05/2023, 8:23 PM

I’m founder of Bundlr a blockchain based startup (immutable dataset) context on what we’re trying to achieve: - build an index/query service for bundlr txs (can be in the billions in next year) - each tx has key-value metadata attached to it (like S3) - each tx has a unique timestamp attached to it (i.e. not per block) - building an api which lets you query in time order with filters on metadata + sender address graphql API we’re building

Copy code

query {
 transactions(
  owners: ["0x..."], 
  tags: [{ name: ..., key: ... }],
  order: "ASC"
 ) {
  edges {
   node {
    id
    address
    receipt {
     timestamp
    }
   }
  }
 }
}

so multivariat timeseries

Josh Benaron

02/05/2023, 8:23 PM

fairly high qps

Josh Benaron

02/05/2023, 8:23 PM

hopefully can handle 40TB+ (10B documents)

Josh Benaron

02/05/2023, 8:24 PM

lmk if that makes sense

Jason Bosco

02/05/2023, 8:27 PM

Yup, that query pattern should be fine with Typesense, you’re essentially filtering on numeric values + string values from Typesense’s perspective. Now in terms of dataset size, Typesense is an in-memory datastore (optimized for performance), and it typically takes 2x-3x RAM to index a dataset of size X. So you’d have to weigh the cost-benefit of putting 1B 10B documents in memory vs the performance gain / UX improvement you get for that

Josh Benaron

02/05/2023, 8:27 PM

Josh Benaron

02/05/2023, 8:27 PM

right

Josh Benaron

02/05/2023, 8:27 PM

40TB

Josh Benaron

02/05/2023, 8:27 PM

of RAM

Josh Benaron

02/05/2023, 8:27 PM

ouch

Josh Benaron

02/05/2023, 8:27 PM

im not that rich

Jason Bosco

02/05/2023, 8:27 PM

Hahaha!

Josh Benaron

02/05/2023, 8:28 PM

…yet

😄 1

Jason Bosco

02/05/2023, 8:28 PM

Sadly, I think ES might be your best bet for that scale

Josh Benaron

02/05/2023, 8:28 PM

yeah i figured

Josh Benaron

02/05/2023, 8:28 PM

im surprised nothing else has come out

Josh Benaron

02/05/2023, 8:28 PM

most java written applications have been beaten by now

Josh Benaron

02/05/2023, 8:28 PM

lol

Jason Bosco

02/05/2023, 8:28 PM

There’s also Zinc search, but they’re tackling the log-search use-case primarily

Jason Bosco

02/05/2023, 8:29 PM

Might want to check it out to see how it works for your use case

2 Views

Open in Slack

Previous Next