Deduplicates texts by semantic similarity (embeddings + threshold graph)
Per-group — reads the whole group in one call.
DEDUPE({{ texts }})DEDUPE({{ texts }}, '{{ criteria }}')| name | type | description |
|---|---|---|
| texts | JSON | — |
| criteria | VARCHAR | — |
| threshold(optional) | DOUBLE | — |
Deduplicates similar names
WITH
test_data AS (
SELECT
*
FROM
(
VALUES
('John Smith'),
('Jon Smith'),
('Johnny Smith'),
('Jane Doe'),
('J. Doe'),
('Michael Johnson')
) AS t (name)
)
SELECT
DEDUPE (name)
FROM
test_dataLLM-backed deduplication (escape hatch for DEDUPE)
Merge duplicate records into a composite golden record
Merge records with an explicit conflict-resolution strategy
Check if two values refer to the same entity (fuzzy equality)
Deduplicates texts by semantic similarity (embeddings + threshold graph)
Per-group — reads the whole group in one call.
DEDUPE({{ texts }})DEDUPE({{ texts }}, '{{ criteria }}')| name | type | description |
|---|---|---|
| texts | JSON | — |
| criteria | VARCHAR | — |
| threshold(optional) | DOUBLE | — |
Deduplicates similar names
WITH
test_data AS (
SELECT
*
FROM
(
VALUES
('John Smith'),
('Jon Smith'),
('Johnny Smith'),
('Jane Doe'),
('J. Doe'),
('Michael Johnson')
) AS t (name)
)
SELECT
DEDUPE (name)
FROM
test_dataLLM-backed deduplication (escape hatch for DEDUPE)
Merge duplicate records into a composite golden record
Merge records with an explicit conflict-resolution strategy
Check if two values refer to the same entity (fuzzy equality)