Classification · dimension · returns varchar

BUCKET

Classify text into user-specified buckets via zero-shot NLI

Per-row classifier — stable across GROUP BY.

classification · nli · specialist-zoo · text

Arguments

name     type     description
text     VARCHAR
buckets  VARCHAR  Comma-separated list of bucket names
strict   BOOLEAN

About

General-purpose closed-label classifier: it assigns each text to one of the labels you provide as a comma-separated string. It is the most flexible dimension operator, since you bring the label set and it buckets the rows.

Backend: specialist zoo zero-shot NLI via /classify_batch. User-provided labels are passed straight through to the cross-encoder hypothesis templating.

When `strict=false` (default), an "Other" bucket is added for rows that don't fit any user label well; the cell assigns it when the top entailment score is low across all provided labels.

For LLM-style bucketing with nuanced label semantics, use BUCKET_LLM (see bucket_dimension_llm.cascade.yaml).
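To make the `strict` behaviour concrete, here is a minimal sketch that calls the operator both ways on the same column. It assumes a hypothetical table `tickets` with a `body` column, that `strict` is the third positional argument and may be omitted, and that `strict = true` forces a choice among the provided labels. None of these details are confirmed by this page beyond the argument list above.

```sql
-- Hypothetical table: tickets(id, body).
-- Assumption: arguments are positional in the order text, buckets, strict.
SELECT
  id,
  body,
  -- strict omitted (default false): low-confidence rows may come back as 'Other'
  BUCKET(body, 'billing,bug,praise')       AS topic_default,
  -- strict = true (assumed): every row is assigned one of the three labels
  BUCKET(body, 'billing,bug,praise', true) AS topic_strict
FROM tickets;
```

Comparing the two columns side by side is a quick way to see which rows the default mode considers a poor fit for every label.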

Examples

Value bucketed

SELECT
  bucket (75, 'numeric,text,date')
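
Because the classifier is per-row, its result can also serve as a grouping key. The following sketch counts rows per bucket; the table `tickets` and its `body` column are hypothetical, and the CTE is used only to avoid depending on whether the dialect allows grouping by a select alias.

```sql
-- Hypothetical table: tickets(id, body).
WITH labeled AS (
  SELECT
    BUCKET(body, 'billing,bug,praise') AS topic  -- may include 'Other' when strict is false
  FROM tickets
)
SELECT
  topic,
  COUNT(*) AS ticket_count
FROM labeled
GROUP BY topic
ORDER BY ticket_count DESC;
```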
