Standardize a value to its canonical form for a given type
Per-row — runs once for each row.
NORMALIZE({{ value }}, '{{ type }}')
NORMALIZE({{ value }}, '{{ type }}', '{{ context }}')
{{ value }} NORMALIZE AS '{{ type }}'

| name | type | description |
|---|---|---|
| value | VARCHAR | — |
| type | VARCHAR | Entity type: company, address, name, phone, email, country, state |
| context (optional) | VARCHAR | Optional context for disambiguation |
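
The three call forms listed above can appear side by side in one query. The following is a minimal sketch, not a definitive usage pattern: the `customers` table and its columns are hypothetical, the `'US'` context value is an assumption (accepted context values are not listed here), and wrapping the infix form in parentheses before aliasing is a precaution rather than a documented requirement.

```sql
-- Hypothetical table: customers(company_name, email, state)
SELECT
  -- Two-argument form: value plus entity type
  NORMALIZE(company_name, 'company') AS company_norm,
  -- Three-argument form: the context value 'US' is an assumption
  NORMALIZE(state, 'state', 'US') AS state_norm,
  -- Infix form, parenthesized so the column alias is unambiguous
  (email NORMALIZE AS 'email') AS email_norm
FROM customers;
```

Because the function is per-row, each call above is evaluated once for every row of `customers`.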

Company name normalized
SELECT
normalize('MICROSOFT CORPORATION', 'company')

Email lowercased
SELECT
normalize('john.smith@GMAIL.COM', 'email')

State abbreviation expanded or kept
SELECT
normalize('CA', 'state')

Remove or mask personally identifiable information from text
Type-cast messy real-world values that trip up standard CAST
Return the canonical/official form of a value (auto-detects entity type)
Extract a 4-digit year from messy text; return -1 if undetermined
LLM-backed year extraction (escape hatch for CLEAN_YEAR)
Pick the best non-null value from a group (quality-aware COALESCE)