SUBTRACT TWO DATASETS AND RETURN THE DIFFERENCE¶

Subtracts Dataset 2 from Dataset 1 column-by-column and returns the difference as a new dataset. Supports scalar difference modes (raw, absolute, squared) and vector distance metrics (Euclidean, Fréchet, DTW, pDTW). Use when you need to compare or diff two datasets with matching or mapped column structures.
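As a rough illustration of column-by-column subtraction, the sketch below uses plain Python dicts of lists to stand in for datasets. The `subtract_columns` helper, the shape of the `mapper` dict, and the sample column names are assumptions made for this example only; they are not the worker's API, and the worker's handling of unmatched columns may differ.

```python
def subtract_columns(dataset_1, dataset_2, mapper=None):
    """Return {column: [d1 - d2, ...]} for each column pair that can be matched.

    Hypothetical helper: datasets are modelled as dicts of equal-length lists,
    and `mapper` maps a Dataset 1 column name to its Dataset 2 counterpart.
    """
    mapper = mapper or {}
    result = {}
    for name_1, values_1 in dataset_1.items():
        # Use the mapped name if one is given, otherwise assume identical names.
        name_2 = mapper.get(name_1, name_1)
        if name_2 not in dataset_2:
            continue  # no counterpart in Dataset 2, so nothing to subtract
        values_2 = dataset_2[name_2]
        result[name_1] = [a - b for a, b in zip(values_1, values_2)]
    return result


# Sample data (illustrative): "force" is called "force_ref" in the second dataset.
d1 = {"force": [10.0, 12.0], "disp": [1.0, 2.0]}
d2 = {"force_ref": [9.0, 12.5], "disp": [0.5, 2.0]}
diff = subtract_columns(d1, d2, mapper={"force": "force_ref"})
print(diff)  # {'force': [1.0, -0.5], 'disp': [0.5, 0.0]}
```

The result has one entry per matched or mapped column pair, which mirrors the description of the output dataset.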

When to use¶

Classification: process.

Tagged: column_mapping, dataset_comparison, dataset_diff, difference, dtw, euclidean, frechet, pdtw.

Inputs¶

Label	ID	Type	Default	Description
Dataset 1	dataset_1	dataset	—	The minuend (Dataset 1): the tabular dataset that Dataset 2 is subtracted from. Connect the primary or reference dataset here.
Dataset 2	dataset_2	dataset	—	The subtrahend (Dataset 2): the tabular dataset that is subtracted from Dataset 1. It must share compatible columns with Dataset 1 or be aligned via the mapper.
Type Of Difference For Scalars	diff_type	list	raw	Difference mode applied to scalar (non-vector) columns: ‘raw’ (D1−D2), ‘raw_abs’ (|D1−D2|), or ‘squared’ ((D1−D2)²); defaults to ‘raw’.
Type Of Difference For Vectors	vector_diff_type	list	raw	Distance metric applied to vector/curve columns: ‘raw’ (element-wise), ‘euclidean’, ‘frechet’, ‘dtw’ (Dynamic Time Warping), or ‘pdtw’ (partial DTW); defaults to ‘raw’.
Choose Mapper	mapper_input	mapper	—	Optional column-mapping definition that aligns columns between Dataset 1 and Dataset 2 by name when they differ; leave unset if both datasets share identical column names.
Choose Columns To Subtract	columns	text	—	Columns to subtract.
Add Difference As New Column	add_diff_as_new_column	list	no	Whether to add the difference as a new column; defaults to ‘no’.
Vector Operations for First Dataset	vector_operations_1	dataset	—	Vector operations to perform on the first dataset before the subtraction. The operation can be the name of any worker. The parameters are the arguments for that operation, separated by commas when there are several. The first argument to the operation is the curve or curves found in the column. Values of other columns can be used by giving their column names.
Vector Operations for Second Dataset	vector_operations_2	dataset	—	Vector operations to perform on the second dataset before the subtraction. The operation can be the name of any worker. The parameters are the arguments for that operation, separated by commas when there are several. The first argument to the operation is the curve or curves found in the column. Values of other columns can be used by giving their column names.
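
The difference modes listed in the table can be sketched in plain Python as follows. This is a minimal illustration using textbook definitions, not the worker's implementation: the function names are hypothetical, vectors are treated as simple sequences of y-values (real curves may carry x-values as well), the DTW local cost is assumed to be the absolute difference, and Fréchet and pDTW are left out.

```python
import math


def scalar_diff(d1, d2, diff_type="raw"):
    """Apply one of the scalar modes named by `diff_type` to two numbers."""
    delta = d1 - d2
    if diff_type == "raw":
        return delta
    if diff_type == "raw_abs":
        return abs(delta)
    if diff_type == "squared":
        return delta ** 2
    raise ValueError(f"unknown diff_type: {diff_type!r}")


def euclidean(y1, y2):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y1, y2)))


def dtw(y1, y2):
    """Classic O(n*m) Dynamic Time Warping cost (assumed local cost: |a - b|)."""
    n, m = len(y1), len(y2)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of the first i points of y1 with the first j of y2
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(y1[i - 1] - y2[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in y1 only
                cost[i][j - 1],      # advance in y2 only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


print(scalar_diff(3.0, 5.0))             # -2.0
print(scalar_diff(3.0, 5.0, "raw_abs"))  # 2.0
print(scalar_diff(3.0, 5.0, "squared"))  # 4.0
print(euclidean([0.0, 0.0], [3.0, 4.0])) # 5.0
print(dtw([1, 2, 3], [1, 2, 2, 3]))      # 0.0
```

The last call shows why a warping metric can be preferable for curves: the two sequences differ in length and sampling but trace the same shape, so DTW reports zero cost where an element-wise comparison would not apply directly.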

Outputs¶

Label	ID	Type	Description
dataset_subtract_output_1	dataset_subtract_output_1	dataset	Resulting dataset containing the computed difference between Dataset 1 and Dataset 2, with one output column per matched/mapped input column pair using the selected diff and vector-diff modes.

Disciplines¶

data.correlation
data.dataset.transform
data.statistics

Runnable example¶

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_subtract

Auto-generated from transformation schema. Worker id: dataset_subtract. Schema hash: ffcaceb9cc27. Hand-curated docs in workerexamples/ override this page when present.