We imported ~1.7M book records into typesense for ...
# community-help
m
We imported ~1.7M book records into typesense for our project... unfortunately, the data was from OpenLibrary and book data is notoriously messy and inconsistent. Any recommendations on a clean data source for books? Primarily looking for children's books through YA and novels. Thanks in advance!
j
I know this is not what you were asking for, but in case you haven't seen this before, we have a demo that uses OpenLibrary data: https://books-search.typesense.org You'll see the source code linked under the title, and the scripts/indexer directory has code that shows how we parsed this dataset before pushing it into Typesense
1
m
that's awesome! how frequently do you update the data from OL? or is it live?
j
That's a static snapshot from 2021 IIRC
1