LLM-backed deduplication (escape hatch for DEDUPE)
Per-group — reads the whole group in one call.
DEDUPE_LLM({{ texts }})DEDUPE_LLM({{ texts }}, '{{ criteria }}')| name | type | description |
|---|---|---|
| texts | JSON | — |
| criteria | VARCHAR | — |
LLM escape hatch dedups abbreviations with expanded forms
WITH
test_data AS (
SELECT
*
FROM
(
VALUES
('IBM'),
('International Business Machines'),
('Apple Inc.'),
('Apple Computer')
) AS t (name)
)
SELECT
DEDUPE_LLM (name)
FROM
test_dataDeduplicates texts by semantic similarity (embeddings + threshold graph)
Merge duplicate records into a composite golden record
Merge records with an explicit conflict-resolution strategy
Check if two values refer to the same entity (fuzzy equality)
LLM-backed deduplication (escape hatch for DEDUPE)
Per-group — reads the whole group in one call.
DEDUPE_LLM({{ texts }})DEDUPE_LLM({{ texts }}, '{{ criteria }}')| name | type | description |
|---|---|---|
| texts | JSON | — |
| criteria | VARCHAR | — |
LLM escape hatch dedups abbreviations with expanded forms
WITH
test_data AS (
SELECT
*
FROM
(
VALUES
('IBM'),
('International Business Machines'),
('Apple Inc.'),
('Apple Computer')
) AS t (name)
)
SELECT
DEDUPE_LLM (name)
FROM
test_dataDeduplicates texts by semantic similarity (embeddings + threshold graph)
Merge duplicate records into a composite golden record
Merge records with an explicit conflict-resolution strategy
Check if two values refer to the same entity (fuzzy equality)