SUBTRACT TWO DATASETS AND RETURN THE DIFFERENCE

Subtracts Dataset 2 from Dataset 1 column-by-column and returns the difference as a new dataset. Supports scalar difference modes (raw, absolute, squared) and vector distance metrics (Euclidean, Fréchet, DTW, pDTW). Use when you need to compare or diff two datasets with matching or mapped column structures.
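
The three scalar modes can be illustrated with a minimal Python sketch. This is not the worker's implementation: the helper names scalar_diff and subtract_columns are hypothetical, and datasets are assumed to be plain dictionaries mapping a column name to a list of numbers.

```python
# Illustrative only: mimics the three scalar difference modes on plain lists.
# scalar_diff and subtract_columns are hypothetical helpers, not d3VIEW APIs.

def scalar_diff(a, b, mode="raw"):
    """Return the difference of two numbers under the given mode."""
    d = a - b
    if mode == "raw":
        return d          # D1 - D2
    if mode == "raw_abs":
        return abs(d)     # |D1 - D2|
    if mode == "squared":
        return d * d      # (D1 - D2)^2
    raise ValueError(f"unknown mode: {mode}")


def subtract_columns(d1, d2, mode="raw"):
    """Subtract d2 from d1 for every column name the two dicts share."""
    shared = [name for name in d1 if name in d2]
    return {
        name: [scalar_diff(a, b, mode) for a, b in zip(d1[name], d2[name])]
        for name in shared
    }


dataset_1 = {"mass": [10.0, 12.0], "speed": [3.0, 5.0]}
dataset_2 = {"mass": [8.0, 15.0], "speed": [1.0, 5.0]}

raw = subtract_columns(dataset_1, dataset_2, "raw")
absolute = subtract_columns(dataset_1, dataset_2, "raw_abs")
squared = subtract_columns(dataset_1, dataset_2, "squared")
```

With these inputs, the 'raw' mode keeps the sign of each difference, while 'raw_abs' and 'squared' discard it.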

When to use

Classification: process.

Tagged: column_mapping, dataset_comparison, dataset_diff, difference, dtw, euclidean, frechet, pdtw.

Inputs

Label ID Type Default Required Description
Dataset 1 dataset_1 dataset   The minuend dataset (Dataset 1); tabular dataset whose columns will be subtracted from — connect the primary or reference dataset here.
Dataset 2 dataset_2 dataset   The subtrahend dataset (Dataset 2); tabular dataset to subtract from Dataset 1 — must share compatible columns or be aligned via the mapper.
Type Of Difference For Scalars diff_type list raw   Difference mode applied to scalar (non-vector) columns: ‘raw’ (D1−D2), ‘raw_abs’ (|D1−D2|), or ‘squared’ ((D1−D2)²); defaults to ‘raw’.
Type Of Difference For Vectors vector_diff_type list raw   Distance metric applied to vector/curve columns: ‘raw’ (element-wise), ‘euclidean’, ‘frechet’, ‘dtw’ (Dynamic Time Warping), or ‘pdtw’ (partial DTW); defaults to ‘raw’.
Choose Mapper mapper_input mapper   Optional column-mapping definition that aligns columns between Dataset 1 and Dataset 2 by name when they differ; leave unset if both datasets share identical column names.
Choose Columns To Subtract columns text   Columns to subtract.
Add Difference As New Column add_diff_as_new_column list no    
Vector Operations for First Dataset vector_operations_1 dataset   Vector operations to perform on the first dataset before the subtraction. The operation can be the name of any worker. The parameters are the arguments for that operation, separated by commas when there are several. The first argument passed to the operation is the curve or curves found in the column. Values of other columns can be used by referencing their column names.
Vector Operations for Second Dataset vector_operations_2 dataset   Vector operations to perform on the second dataset before the subtraction. The operation can be the name of any worker. The parameters are the arguments for that operation, separated by commas when there are several. The first argument passed to the operation is the curve or curves found in the column. Values of other columns can be used by referencing their column names.
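
To give a feel for how the vector metrics differ, the sketch below implements textbook versions of the Euclidean distance and Dynamic Time Warping in Python. It rests on assumptions: curves are reduced to plain lists of y-values, DTW uses the absolute difference as its local cost, and the function names are hypothetical. The worker's own definitions, including how it handles curves of unequal length and what ‘pdtw’ computes, may differ.

```python
import math

# Textbook sketches, not the worker's implementation.
# Curves are assumed to be plain lists of y-values.

def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("euclidean() needs sequences of equal length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw(a, b):
    """Classic O(n*m) Dynamic Time Warping cost with |x - y| as local cost."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j],      # a advances, b holds
                cost[i][j - 1],      # b advances, a holds
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


curve_1 = [0.0, 1.0, 2.0, 1.0]
curve_2 = [0.0, 0.0, 1.0, 2.0, 1.0]  # same shape, delayed by one sample
curve_3 = [0.0, 1.0, 2.0, 3.0]       # differs only in the last value

dtw_shifted = dtw(curve_1, curve_2)
euclid_last = euclidean(curve_1, curve_3)
```

Here dtw_shifted is zero because warping absorbs the one-sample delay, whereas the Euclidean distance cannot be computed for curve_1 and curve_2 at all without resampling, since their lengths differ.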

Outputs

Label ID Type Description
dataset_subtract_output_1 dataset_subtract_output_1 dataset Resulting dataset containing the computed difference between Dataset 1 and Dataset 2, with one output column per matched/mapped input column pair using the selected diff and vector-diff modes.
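
The idea of one output column per matched or mapped column pair can be sketched as follows. The mapping format (a dictionary from a Dataset 1 column name to a Dataset 2 column name), the function name subtract_mapped, the choice to name each output column after the Dataset 1 column, and the choice to skip unmatched pairs are all assumptions made for illustration; the actual mapper definition and the worker's handling of unmatched columns are not described on this page.

```python
# Hypothetical sketch of mapper-based column alignment before subtraction.

def subtract_mapped(d1, d2, mapping=None):
    """Return d1[col_1] - d2[col_2] for each (col_1, col_2) pair.

    With no mapping, columns are paired by identical name.
    """
    if mapping is None:
        mapping = {name: name for name in d1 if name in d2}
    out = {}
    for col_1, col_2 in mapping.items():
        if col_1 not in d1 or col_2 not in d2:
            continue  # assumption: silently skip pairs that cannot be matched
        out[col_1] = [a - b for a, b in zip(d1[col_1], d2[col_2])]
    return out


test_data = {"peak_force": [5.0, 7.0], "id": [1, 2]}
sim_data = {"force_max": [4.5, 7.5]}

# Column names differ, so an explicit mapping is needed.
mapped = subtract_mapped(test_data, sim_data, {"peak_force": "force_max"})

# Without a mapping there are no shared names, so nothing is subtracted.
unmapped = subtract_mapped(test_data, sim_data)
```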

Disciplines

  • data.correlation
  • data.dataset.transform
  • data.statistics

Runnable example

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_subtract


Auto-generated from transformation schema. Worker id: dataset_subtract. Schema hash: ffcaceb9cc27. Hand-curated docs in workerexamples/ override this page when present.