CHECKS DATASET FOR UNIQUE COLUMN NAME¶
Extracts all unique values from a specified column in a dataset. Use this worker to deduplicate a column and obtain its distinct value set for downstream filtering, validation, or enumeration tasks.
When to use¶
Classification: process.
Tagged: column, dataset, deduplicate, distinct, transform, unique.
Inputs¶
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset Name | datasetname | dataset | — | The input dataset to inspect; must contain the column specified in column_name. | |
| Column Name | column_name | scalar | — | Name of the column whose values will be deduplicated; provide as a quoted string matching the exact column header in the dataset. |
Outputs¶
| Label | ID | Type | Description |
|---|---|---|---|
| dataset_unique_output_1 | dataset_unique_output_1 | set | An ordered set (array) of distinct values found in the specified column, with duplicates removed. |
Disciplines¶
- data.dataset.transform
Runnable example¶
A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_unique
Auto-generated from transformation schema. Worker id: dataset_unique. Schema hash: 5a1da8d6a231. Hand-curated docs in workerexamples/ override this page when present.