DATASET GROUP AND AGGREGATE BY

Groups a dataset by a specified column and computes an aggregate (average, min, max, or sum) of a second column within each group. Use this worker to summarize or roll up tabular data by category before downstream analysis or reporting.

When to use

Classification: process.

Tagged: aggregate, avg, dataset_transform, group_by, max, min, rollup, sum.

Inputs

Label | ID | Type | Default | Required | Description
Choose Dataset | dataset | dataset | | | Input tabular dataset to group and aggregate. Accepts any d3VIEW dataset object; leave empty only if a preceding worker provides the dataset dynamically.
Group By | groupby | scalar | unclassified | | Name of the column whose distinct values define the groups (e.g., a category or label column). The default, ‘unclassified’, must be changed to a valid column name in the connected dataset.
Aggregate By | aggregateby | scalar | unclassified | | Name of the numeric column whose values are aggregated within each group (e.g., a measurement or KPI column). The default, ‘unclassified’, must be changed to a valid numeric column name.
Aggregation Type | aggregation_type | string | avg | | Aggregation function applied to the Aggregate By column within each group: ‘avg’ (mean), ‘min’, ‘max’, or ‘sum’. Defaults to ‘avg’.

Outputs

Label | ID | Type | Description
dataset_group_and_aggregate_by_output_1 | dataset_group_and_aggregate_by_output_1 | dataset | Resulting dataset with one row per unique group-by value and a single aggregated column, ready for further processing or visualization.
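
The relationship between the inputs and the output can be illustrated with a minimal Python sketch. This is not the worker's implementation: the function name group_and_aggregate, the list-of-dicts row representation, and the sample columns material and stress are all hypothetical, chosen only to mirror the groupby, aggregateby, and aggregation_type inputs described above. The error handling is likewise an assumption about how an invalid column name or aggregation type might reasonably be treated.

```python
from collections import defaultdict
from statistics import fmean

# Hypothetical mapping from the documented aggregation_type values to Python functions.
AGGREGATORS = {"avg": fmean, "min": min, "max": max, "sum": sum}


def group_and_aggregate(rows, groupby, aggregateby, aggregation_type="avg"):
    """Return one row per distinct groupby value, holding the aggregated column."""
    if aggregation_type not in AGGREGATORS:
        raise ValueError(f"unsupported aggregation_type: {aggregation_type!r}")

    # Collect the numeric values of the aggregateby column for each group.
    groups = defaultdict(list)
    for row in rows:
        if groupby not in row or aggregateby not in row:
            # Assumed behavior: a placeholder such as 'unclassified' that is not
            # a real column name is rejected rather than silently ignored.
            raise KeyError(f"row is missing column {groupby!r} or {aggregateby!r}")
        groups[row[groupby]].append(float(row[aggregateby]))

    # Apply the chosen aggregate to each group, keeping first-seen group order.
    aggregate = AGGREGATORS[aggregation_type]
    return [
        {groupby: key, aggregateby: aggregate(values)}
        for key, values in groups.items()
    ]


# Illustrative data only; not taken from any d3VIEW dataset.
rows = [
    {"material": "steel", "stress": 200.0},
    {"material": "steel", "stress": 300.0},
    {"material": "aluminum", "stress": 100.0},
]

result = group_and_aggregate(rows, "material", "stress", "avg")
print(result)
```

With the sample rows, grouping by material and averaging stress yields two rows: steel with 250.0 and aluminum with 100.0. Passing the unchanged default ‘unclassified’ as either column name raises an error in this sketch, which reflects why those defaults must be replaced before the worker is run.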

Disciplines

  • data.dataset.transform
  • data.statistics

Runnable example

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_group_and_aggregate_by


Auto-generated from transformation schema. Worker id: dataset_group_and_aggregate_by. Schema hash: eeebdc47584e. Hand-curated docs in workerexamples/ override this page when present.