GET NUMBER OF ROWS FROM A DATASET

Counts the rows in a dataset whose value in a specified column satisfies all of the supplied user-defined conditions. Use this worker when you need a conditional row count as a scalar KPI within a workflow, for example counting defective records or threshold-exceeding events.

When to use

Classification: process.

Tagged: conditional_filter, dataset, kpi, row_count, statistics.

Inputs

| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset | dataset | dataset | | | Input dataset to evaluate; must be a tabular dataset object. Leave empty only if the dataset is injected dynamically by an upstream worker. |
| Column To Check | columntocheck | scalar | | | Name of the column in the dataset against which the conditions are evaluated; populated automatically from the connected dataset's column list. |
| Conditions | conditions | labelcondition | | | One or more label-condition expressions (e.g., >, <, ==, contains) that a row's column value must satisfy to be counted; supports multiple repeating entries for compound filters. |

Outputs

| Label | ID | Type | Description |
|---|---|---|---|
| dataset_get_num_rows_by_condition_output_1 | dataset_get_num_rows_by_condition_output_1 | scalar | Scalar integer representing the number of rows in the input dataset whose specified column satisfies all supplied conditions. |
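
The counting rule described above (a row is counted only when its column value satisfies every condition) can be sketched in plain Python. This is an illustration of the semantics, not the worker's implementation: the function name `count_rows_by_condition`, the `OPERATORS` table, the `(operator, target)` tuple format, and the sample data are all assumptions made for the example, and the worker's actual operator set and type handling may differ.

```python
import operator

# Hypothetical operator table, limited to the operators named in the
# Conditions description; the worker may support others.
OPERATORS = {
    ">": operator.gt,
    "<": operator.lt,
    "==": operator.eq,
    "contains": lambda value, target: str(target) in str(value),
}


def count_rows_by_condition(rows, column, conditions):
    """Count rows whose `column` value satisfies every (op, target) pair."""
    count = 0
    for row in rows:
        value = row[column]
        # all() gives AND semantics: one failing condition excludes the row.
        if all(OPERATORS[op](value, target) for op, target in conditions):
            count += 1
    return count


# Illustrative data only: a small table held as a list of dicts.
rows = [
    {"part": "A-1", "stress": 120.0},
    {"part": "A-2", "stress": 310.5},
    {"part": "B-1", "stress": 275.0},
    {"part": "B-2", "stress": 98.2},
]

# Compound filter: 100 < stress < 300 matches A-1 and B-1.
in_band = count_rows_by_condition(rows, "stress", [(">", 100), ("<", 300)])
print(in_band)
```

In this sketch an empty condition list counts every row, because `all()` of an empty sequence is true; whether the worker behaves the same way with no conditions is not stated in the schema.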

Disciplines

  • data.dataset.transform
  • data.statistics

Runnable example

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_get_num_rows_by_condition


Auto-generated from transformation schema. Worker id: dataset_get_num_rows_by_condition. Schema hash: ad467de34be9. Hand-curated docs in workerexamples/ override this page when present.