Faceting High Cardinality Field for Grouping in Data
TLDR Jack questioned faceting a high cardinality field for grouping, Kishore Nallan confirmed it is fine except for additional memory use.
Powered by Struct AI
6
2mo
Sep 14, 2023 (2 months ago)
Jack
Jack
12:40 AMis it a smart idea to facet a somewhat unique field?
like for child-parent relationship, where 3 or 4 items will share the parent_id, is it okay to group by the parent_id to display only one child item per parent_id?
My assumption was that faceting over a largely varying field is not a good idea, but I just realized I may be wrong on this.
like for child-parent relationship, where 3 or 4 items will share the parent_id, is it okay to group by the parent_id to display only one child item per parent_id?
My assumption was that faceting over a largely varying field is not a good idea, but I just realized I may be wrong on this.
Kishore Nallan
Kishore Nallan
02:25 AMGroup by parent ID is fine but that's different from faceting.
Jack
Jack
05:29 PMI see. In the docs, it says that
> To group on a particular field, it must be a faceted field.
so, youโre saying is that having a parent_id a faceted field is fine, right?
> To group on a particular field, it must be a faceted field.
so, youโre saying is that having a parent_id a faceted field is fine, right?
Sep 15, 2023 (2 months ago)
Kishore Nallan
Kishore Nallan
12:20 AMAre you looking to count the occurrences of field values like brands, colors etc.?
Jack
Jack
12:34 AMyes
Kishore Nallan
Kishore Nallan
01:49 AMFacets on high cardinality field should not be a problem except for extra memory usage.
Typesense
Lightning-fast, open source search engine for everyone | Knowledge Base powered by Struct.AI
Indexed 3015 threads (79% resolved)
Similar Threads
Nested Facets and Collection Limits
Tuhin asks about hierarchical/nested facets and collection limits. Kishore Nallan shares a sample app and suggests having 100-1000 collections with user assignments.
2
6mo
Handling Large Amount of Facet Values
Dima asked about handling large amounts of facet values. Jason assured that it is fine and suggested setting `max_facet_values` to a large number.
13
9mo
Understanding Facet Results in Typesense
Prabhu had difficulty understanding the count results of his facet results in Typesense. Kishore Nallan explained the behaviour and suggested creating a Github issue for a feature request to modify count behaviour.
10
23mo