was wondering whether its good for timeseries data...
# community-help
j
was wondering whether its good for timeseries data?
j
It depends on how you plan to use/query the timeseries data… Could you expand on your use-case?
j
ha
saw your tweet
gm @Jason Bosco
So
j
Oh haha! Hey Josh!
j
I’m founder of Bundlr a blockchain based startup (immutable dataset) context on what we’re trying to achieve: - build an index/query service for bundlr txs (can be in the billions in next year) - each tx has key-value metadata attached to it (like S3) - each tx has a unique timestamp attached to it (i.e. not per block) - building an api which lets you query in time order with filters on metadata + sender address graphql API we’re building
Copy code
query {
 transactions(
  owners: ["0x..."], 
  tags: [{ name: ..., key: ... }],
  order: "ASC"
 ) {
  edges {
   node {
    id
    address
    receipt {
     timestamp
    }
   }
  }
 }
}
so multivariat timeseries
fairly high qps
hopefully can handle 40TB+ (10B documents)
lmk if that makes sense
j
Yup, that query pattern should be fine with Typesense, you’re essentially filtering on numeric values + string values from Typesense’s perspective. Now in terms of dataset size, Typesense is an in-memory datastore (optimized for performance), and it typically takes 2x-3x RAM to index a dataset of size X. So you’d have to weigh the cost-benefit of putting 1B 10B documents in memory vs the performance gain / UX improvement you get for that
j
ah
right
40TB
of RAM
ouch
im not that rich
j
Hahaha!
j
…yet
😄 1
j
Sadly, I think ES might be your best bet for that scale
j
yeah i figured
im surprised nothing else has come out
most java written applications have been beaten by now
lol
j
There’s also Zinc search, but they’re tackling the log-search use-case primarily
Might want to check it out to see how it works for your use case