GET STATS FOR NUMERIC COLUMNS
Computes descriptive statistics (count, mean, std, min, max, quartiles, etc.) for the selected numeric columns of a dataset. Use this worker whenever you need a quick numerical summary of one or more quantitative fields before further analysis or modelling.
When to use
Classification: process.
Tagged: column_stats, descriptive_statistics, eda, numeric, stats, summary.
Inputs
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset | dataset_1 | dataset | — | | Input dataset containing the columns to analyse. Accepts any tabular dataset available in the workflow context; leave unconnected only if the dataset is injected dynamically at runtime. |
| Choose Numeric Columns | numeric_columns | scalar | — | | One or more numeric column names from dataset_1 for which statistics are computed. The multi-select list is populated automatically from the connected dataset; leave blank to compute statistics for all numeric columns. |
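
The column-selection rule described for numeric_columns can be illustrated outside the platform. The following is a minimal pandas sketch, not the worker's actual implementation: the helper name `resolve_numeric_columns`, the sample data, and the error handling for unknown or non-numeric names are all assumptions made for illustration.

```python
import pandas as pd


def resolve_numeric_columns(df, selected=None):
    """Return the columns to summarise (hypothetical helper).

    An empty or missing selection falls back to every numeric column,
    mirroring the "leave blank" behaviour described in the inputs table.
    """
    numeric = df.select_dtypes(include="number").columns.tolist()
    if not selected:
        return numeric
    # Assumed behaviour: reject names that are absent or not numeric.
    unknown = [c for c in selected if c not in df.columns]
    if unknown:
        raise KeyError(f"columns not in dataset: {unknown}")
    non_numeric = [c for c in selected if c not in numeric]
    if non_numeric:
        raise TypeError(f"columns are not numeric: {non_numeric}")
    return list(selected)


# Made-up sample data standing in for dataset_1.
df = pd.DataFrame(
    {"mass": [1.0, 2.0, 3.0], "speed": [10, 20, 30], "label": ["a", "b", "c"]}
)
all_cols = resolve_numeric_columns(df)           # blank selection
picked = resolve_numeric_columns(df, ["speed"])  # explicit selection
```

Here a blank selection resolves to the two numeric columns, while the text column `label` is skipped.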
Outputs
| Label | ID | Type | Description |
|---|---|---|---|
| dataset_stats_1 | dataset_stats_1 | dataset | Tabular dataset where each row corresponds to a selected numeric column and each column represents a descriptive statistic (e.g., count, mean, std, min, 25th percentile, median, 75th percentile, max). |
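
To picture the layout of dataset_stats_1, the sketch below reproduces a comparable table with pandas. It is an analogy under stated assumptions, not the worker's code: the sample data is made up, and the statistic names and the sample standard deviation (ddof=1) are pandas defaults that may differ from what the worker emits.

```python
import pandas as pd

# Made-up sample data standing in for dataset_1.
df = pd.DataFrame(
    {
        "mass": [1.0, 2.0, 3.0, 4.0],
        "speed": [10, 20, 30, 40],
        "label": ["a", "b", "c", "d"],
    }
)

# describe() returns one row per statistic; transposing gives one row per
# selected column and one column per statistic, as in the output description.
stats = df[["mass", "speed"]].describe().T

print(stats)
```

The resulting frame has the rows `mass` and `speed` and the columns `count`, `mean`, `std`, `min`, `25%`, `50%`, `75%`, `max`, where `50%` is the median.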
Disciplines
- data.statistics
Runnable example
A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_get_numeric_column_stats
Auto-generated from transformation schema. Worker id: dataset_get_numeric_column_stats. Schema hash: 07b98e144113. Hand-curated docs in workerexamples/ override this page when present.