DATASET KEEP COLUMNS
Filters a dataset down to a specified subset of columns, discarding all others. Provide the input dataset and a comma-separated list (or regex-style selection) of column names to retain; the resulting dataset contains only those columns in their original order.
When to use
Classification: process.
Tagged: column_filter, dataset, keep_columns, projection, subset.
Inputs
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset | dataset_1 | dataset | — | | The source dataset whose columns will be filtered; accepts any tabular dataset object available in the workflow. |
| CSV Column Names | column_names_regex | scalar | — | | Comma-separated list of column names (or regex patterns) to retain from the dataset; columns not matched are dropped. Leave blank to pass all columns through unchanged. |
Outputs
| Label | ID | Type | Description |
|---|---|---|---|
| dataset_keep_columns_output_1 | dataset_keep_columns_output_1 | dataset | Filtered dataset containing only the columns specified in CSV Column Names, preserving the original row order and data types. |
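The selection behavior described in the tables above can be illustrated with a minimal sketch in plain Python. This is not the worker's implementation: the function `keep_columns` and the sample rows are hypothetical, the dataset is modeled as a list of dictionaries, and the sketch assumes each pattern must match a column name in full. How the worker itself applies regex patterns (full match versus partial match) is not stated on this page.

```python
import re


def keep_columns(rows, column_spec):
    """Return copies of rows restricted to the columns selected by column_spec.

    rows: list of dicts sharing the same keys (a stand-in for a tabular dataset).
    column_spec: comma-separated column names or regex patterns.
    """
    patterns = [p.strip() for p in column_spec.split(",") if p.strip()]

    # A blank spec passes every column through unchanged.
    if not patterns or not rows:
        return [dict(row) for row in rows]

    # Assumption: a pattern must match the whole column name (re.fullmatch).
    # Iterating over the first row's keys keeps the original column order,
    # regardless of the order in which patterns are listed.
    kept = [
        name for name in rows[0]
        if any(re.fullmatch(p, name) for p in patterns)
    ]
    return [{name: row[name] for name in kept} for row in rows]


# Hypothetical sample data.
rows = [
    {"time": 0.0, "force_x": 1.5, "force_y": 2.5, "label": "a"},
    {"time": 0.1, "force_x": 1.7, "force_y": 2.4, "label": "b"},
]

filtered = keep_columns(rows, "time, force_.*")
print(list(filtered[0].keys()))  # ['time', 'force_x', 'force_y']
```

Because the sketch splits the specification on commas, a regex that itself contains a comma (for example a `{1,2}` quantifier) would be split incorrectly; whether the worker has the same limitation is not documented here.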
Disciplines
- data.dataset.transform
Runnable example
A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_keep_columns
Auto-generated from transformation schema. Worker id: dataset_keep_columns. Schema hash: 92c80e594937. Hand-curated docs in workerexamples/ override this page when present.