SUBTRACT TWO DATASETS AND RETURN THE DIFFERENCE¶
Subtracts Dataset 2 from Dataset 1 column-by-column and returns the difference as a new dataset. Supports scalar difference modes (raw, absolute, squared) and vector distance metrics (Euclidean, Fréchet, DTW, pDTW). Use when you need to compare or diff two datasets with matching or mapped column structures.
When to use¶
Classification: process.
Tagged: column_mapping, dataset_comparison, dataset_diff, difference, dtw, euclidean, frechet, pdtw.
Inputs¶
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset 1 | dataset_1 | dataset | — | The minuend dataset (Dataset 1); tabular dataset whose columns will be subtracted from — connect the primary or reference dataset here. | |
| Dataset 2 | dataset_2 | dataset | — | The subtrahend dataset (Dataset 2); tabular dataset to subtract from Dataset 1 — must share compatible columns or be aligned via the mapper. | |
| Type Of Difference For Scalars | diff_type | list | raw | Difference mode applied to scalar (non-vector) columns: ‘raw’ (D1−D2), ‘raw_abs’ (/D1−D2/), or ‘squared’ ((D1−D2)²); defaults to ‘raw’. | |
| Type Of Difference For Vectors | vector_diff_type | list | raw | Distance metric applied to vector/curve columns: ‘raw’ (element-wise), ‘euclidean’, ‘frechet’, ‘dtw’ (Dynamic Time Warping), or ‘pdtw’ (partial DTW); defaults to ‘raw’. | |
| Choose Mapper | mapper_input | mapper | — | Optional column-mapping definition that aligns columns between Dataset 1 and Dataset 2 by name when they differ; leave unset if both datasets share identical column names. | |
| Choose Columns To Subtract | columns | text | — | Columns to Subract | |
| Add Difference As New Column | add_diff_as_new_column | list | no | ||
| Vector Operations for First Dataset | vector_operations_1 | dataset | — | Vector operations that needs to be performed before the subtractions. Operation can be any name of the worker. The parameters are the arguments for that operation separated by a comma for multiple parameters. The first argument to the operation will be the curve or curves found in the column. The value of other column names can be used | |
| Vector Operations for Second Dataset | vector_operations_2 | dataset | — | Vector operations that needs to be performed before the subtractions. Operation can be any name of the worker. The parameters are the arguments for that operation separated by a comma for multiple parameters. The first argument to the operation will be the curve or curves found in the column. The value of other column names can be used |
Outputs¶
| Label | ID | Type | Description |
|---|---|---|---|
| dataset_subtract_output_1 | dataset_subtract_output_1 | dataset | Resulting dataset containing the computed difference between Dataset 1 and Dataset 2, with one output column per matched/mapped input column pair using the selected diff and vector-diff modes. |
Disciplines¶
- data.correlation
- data.dataset.transform
- data.statistics
Runnable example¶
A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_subtract
Auto-generated from transformation schema. Worker id: dataset_subtract. Schema hash: ffcaceb9cc27. Hand-curated docs in workerexamples/ override this page when present.