COMBINE DATASETS BASED ON PRIMARY KEYS

Merges two datasets by joining on one or more primary-key columns and appending a specified column from Dataset2 into Dataset1. Use this worker when you need to enrich a dataset with an additional column sourced from a second dataset that shares common key columns.

When to use

Classification: process.

Tagged: column_append, combine, dataset, join, merge, primary_key, process.

Inputs

Label ID Type Default Required Description
Dataset1 dataset1 dataset   The base dataset (tabular) into which the new column will be merged; must contain the primary-key column(s) specified in ‘primarykeys’.
Dataset2 dataset2 dataset Unknown   The secondary dataset (tabular) supplying the column to be combined; must share the same primary-key column(s) with Dataset1.
Primary Keys primarykeys scalar   Comma-separated list of column name(s) used as join keys to match rows between Dataset1 and Dataset2 (e.g., ‘id’ or ‘id,timestamp’).
Column To Combine columntocombine scalar   Name of the single column in Dataset2 that will be appended to Dataset1 after the key-based join (e.g., ‘temperature’).

Outputs

Label ID Type Description
dataset_combine_column_output_1 dataset_combine_column_output_1 dataset The enriched dataset containing all columns from Dataset1 plus the requested column from Dataset2, aligned on the specified primary keys.

Disciplines

  • data.dataset.transform

Runnable example

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_combine_column


Auto-generated from transformation schema. Worker id: dataset_combine_column. Schema hash: 8e5dddb8a588. Hand-curated docs in workerexamples/ override this page when present.