GET NUMBER OF ROWS FROM A DATASET¶
Counts the number of rows in a dataset that satisfy one or more user-defined conditions on a specified column. Use this worker when you need a conditional row count as a scalar KPI within a workflow — for example, counting defective records or threshold-exceeding events.
When to use¶
Classification: process.
Tagged: conditional_filter, dataset, kpi, row_count, statistics.
Inputs¶
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Dataset | dataset | dataset | — | Input dataset to evaluate; must be a tabular dataset object — leave empty only if the dataset is injected dynamically by an upstream worker. | |
| Column To Check | columntocheck | scalar | — | Name of the column in the dataset against which the conditions are evaluated; populated automatically from the connected dataset’s column list. | |
| Conditions | conditions | labelcondition | — | One or more label-condition expressions (e.g., >, <, ==, contains) that a row’s column value must satisfy to be counted; supports multiple repeating entries for compound filters. |
Outputs¶
| Label | ID | Type | Description |
|---|---|---|---|
| dataset_get_num_rows_by_condition_output_1 | dataset_get_num_rows_by_condition_output_1 | scalar | Scalar integer representing the number of rows in the input dataset whose specified column satisfies all supplied conditions. |
Disciplines¶
- data.dataset.transform
- data.statistics
Runnable example¶
A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: /api/workflow/example?id=dataset_get_num_rows_by_condition
Auto-generated from transformation schema. Worker id: dataset_get_num_rows_by_condition. Schema hash: ad467de34be9. Hand-curated docs in workerexamples/ override this page when present.