GET STATS FOR NUMERIC COLUMNS

Computes descriptive statistics (such as count, mean, standard deviation, min, max, and quartiles) for the selected numeric columns of a dataset.

When to use

Use this worker when you need a quick numerical summary of one or more quantitative fields before further analysis or modelling.

Classification: process.

Tagged: column_stats, descriptive_statistics, eda, numeric, stats, summary.

Inputs

| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset | dataset_1 | dataset |  |  | Input dataset containing the columns to be analysed. Accepts any tabular dataset available in the workflow context; leave unconnected only if the dataset is injected dynamically at runtime. |
| Choose Numeric Columns | numeric_columns | scalar |  |  | One or more numeric column names from dataset_1 for which statistics are computed. The multi-select list is populated automatically from the connected dataset; leave blank to compute statistics for all numeric columns. |
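
The column-selection rule described above (a blank selection means all numeric columns) can be sketched in plain Python. This is only an illustration, not the worker's implementation: the helper names `is_numeric_column` and `resolve_numeric_columns`, the dict-of-lists dataset representation, and the errors raised for unknown or non-numeric columns are all assumptions made for the example.

```python
from numbers import Real


def is_numeric_column(values):
    """True if every non-missing value is a real number (bools excluded)."""
    present = [v for v in values if v is not None]
    return bool(present) and all(
        isinstance(v, Real) and not isinstance(v, bool) for v in present
    )


def resolve_numeric_columns(dataset, numeric_columns=None):
    """Hypothetical helper: decide which columns to summarise.

    `dataset` is a plain dict mapping column name -> list of values.
    """
    if not numeric_columns:
        # Blank selection: fall back to every numeric column, in dataset order.
        return [name for name, values in dataset.items() if is_numeric_column(values)]
    unknown = [name for name in numeric_columns if name not in dataset]
    if unknown:
        # Assumed behaviour: reject names that are not in the dataset.
        raise KeyError(f"columns not in dataset: {unknown}")
    non_numeric = [n for n in numeric_columns if not is_numeric_column(dataset[n])]
    if non_numeric:
        # Assumed behaviour: reject columns that hold non-numeric values.
        raise TypeError(f"columns are not numeric: {non_numeric}")
    return list(numeric_columns)


dataset = {
    "mass": [1.0, 2.0, 3.0, 4.0],
    "speed": [10, 20, 30, 40],
    "label": ["a", "b", "c", "d"],
}
print(resolve_numeric_columns(dataset))             # blank selection
print(resolve_numeric_columns(dataset, ["speed"]))  # explicit selection
```

How the worker itself treats a non-numeric or missing column name is not stated on this page, so the error handling here should be read as one possible choice.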

Outputs

| Label | ID | Type | Description |
|---|---|---|---|
| dataset_stats_1 | dataset_stats_1 | dataset | Tabular dataset where each row corresponds to a selected numeric column and each column represents a descriptive statistic (e.g., count, mean, std, min, 25th percentile, median, 75th percentile, max). |
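
To make the output shape concrete, here is a minimal sketch that builds one row per column and one field per statistic using Python's standard `statistics` module. It is not the worker's code. The function names `describe_column` and `describe_dataset`, the field names, the use of the sample standard deviation, and the "inclusive" quartile method are assumptions; the worker may use different conventions.

```python
import statistics


def describe_column(values):
    """Descriptive statistics for one numeric column (missing values skipped).

    Needs at least two non-missing values, because the sample standard
    deviation is undefined for a single point.
    """
    data = sorted(v for v in values if v is not None)
    # Assumption: quartiles by linear interpolation over the observed range.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "count": len(data),
        "mean": statistics.fmean(data),
        "std": statistics.stdev(data),  # sample standard deviation (n - 1)
        "min": data[0],
        "25%": q1,
        "50%": median,
        "75%": q3,
        "max": data[-1],
    }


def describe_dataset(dataset, columns):
    """Return one row (dict) per selected column, one key per statistic."""
    return [{"column": name, **describe_column(dataset[name])} for name in columns]


# Illustrative data only; the column names are made up for this example.
dataset = {"x": [1, 2, 3, 4, 5], "y": [2.0, None, 4.0, 6.0, 8.0]}
for row in describe_dataset(dataset, ["x", "y"]):
    print(row)
```

Each printed dict corresponds to one row of the output dataset. Quartile values can differ slightly between tools because several interpolation conventions exist, so small mismatches against the worker's output would not necessarily indicate an error.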

Disciplines

  • data.statistics

Runnable example

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_get_numeric_column_stats


Auto-generated from transformation schema. Worker id: dataset_get_numeric_column_stats. Schema hash: 07b98e144113. Hand-curated docs in workerexamples/ override this page when present.