DATASET GROUP AND AGGREGATE BY¶

Groups a dataset by a specified column and computes an aggregate (average, min, max, or sum) of a second column within each group. Use this worker to summarize or roll up tabular data by category before downstream analysis or reporting.

When to use¶

Classification: process.

Tagged: aggregate, avg, dataset_transform, group_by, max, min, rollup, sum.

Inputs¶

Label	ID	Type	Default	Description
Choose Dataset	dataset	dataset	—	Input tabular dataset to be grouped and aggregated; accepts any d3VIEW dataset object — leave empty only if the dataset is provided dynamically by a preceding worker.
Group By	groupby	scalar	unclassified	Name of the column whose distinct values define the groups (e.g., a category or label column); defaults to ‘unclassified’ — must be changed to a valid column name in the connected dataset.
Aggregate By	aggregateby	scalar	unclassified	Name of the numeric column whose values will be aggregated within each group (e.g., a measurement or KPI column); defaults to ‘unclassified’ — must be changed to a valid numeric column name.
Aggregation Type	aggregation_type	string	avg	Aggregation function applied to the target column within each group — choose from ‘avg’ (mean), ‘min’, ‘max’, or ‘sum’; defaults to ‘avg’.

Outputs¶

Label	ID	Type	Description
dataset_group_and_aggregate_by_output_1	dataset_group_and_aggregate_by_output_1	dataset	Resulting dataset with one row per unique group-by value and a single aggregated column, ready for further processing or visualization.

Disciplines¶

data.dataset.transform
data.statistics

Runnable example¶

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_group_and_aggregate_by

Auto-generated from transformation schema. Worker id: dataset_group_and_aggregate_by. Schema hash: eeebdc47584e. Hand-curated docs in workerexamples/ override this page when present.