DATASET FILL NA¶
Fills missing values in a dataset by replacing NAs in numeric and categorical columns using configurable strategies (average, min, max, mode, or a custom value). Optionally restricts the operation to a subset of columns and supports user-defined null-value mappings per column.
When to use¶
Classification: process.
Tagged: categorical, dataset, fill_na, imputation, missing_values, null_handling, numeric, preprocessing.
Inputs¶
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset | dataset | dataset | — | Input dataset (tabular) whose missing values will be filled; accepts any d3VIEW dataset object. | |
| Columns To Check | columnstocheck | scalar | — | Comma-separated list of column names to target for NA-filling; leave empty to apply the fill strategy to all columns. | |
| Numeric Fill Type | numeric_fill_type | select | avg | Strategy for imputing missing values in numeric columns: ‘avg’ (default), ‘min’, ‘max’, ‘mode’, or ‘custom’ (uses the custom numeric value input below). | |
| Categorical Fill Type | cat_fill_type | select | most | Strategy for imputing missing values in categorical columns: ‘most’ (most-frequent, default), ‘least’ (least-frequent), or ‘custom’ (uses the custom categorical value input below). | |
| Null Values | what_is_null | textarea | — | Per-column declaration of additional values to treat as null, specified as a comma-separated list in the format ‘COL1=>a b c,COL2=>d e f’; leave blank if only standard NA/NaN should be treated as null. | |
| Numeric Custom Value | numeric_custom_value | textarea | — | Default numeric custom value when type of fill is set to custom | |
| Categorical Custom Value | cat_custom_value | textarea | — | Default numeric custom value when type of fill is set to custom |
Outputs¶
| Label | ID | Type | Description |
|---|---|---|---|
| dataset_fill_na_output_1 | dataset_fill_na_output_1 | dataset | Output dataset identical in schema to the input but with all targeted missing values replaced according to the chosen numeric and categorical fill strategies. |
Disciplines¶
- ai_ml.preprocessing
- data.dataset.transform
Runnable example¶
A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_fill_na
Auto-generated from transformation schema. Worker id: dataset_fill_na. Schema hash: b5dcacb93f4e. Hand-curated docs in workerexamples/ override this page when present.