.. _auto_dataset_get_summary:

*GET DATASET SUMMARY*
=====================

Computes a statistical summary (count, mean, min, max, std, etc.) for a dataset, optionally restricted to a selected subset of columns. Use this worker to quickly profile a dataset's schema and descriptive statistics before further processing or modelling.

When to use
-----------

Classification: **process**. Tagged: ``descriptive_stats``, ``eda``, ``profiling``, ``schema``, ``summary``.

Inputs
------

.. list-table::
   :header-rows: 1
   :widths: 20 20 20 20 20 20

   * - Label
     - ID
     - Type
     - Default
     - Required
     - Description
   * - Dataset
     - dataset
     - dataset
     - —
     -
     - Input dataset to be summarised. Accepts any tabular dataset available in the workflow context; leave empty only if the dataset is piped implicitly from an upstream worker.
   * - Choose Columns
     - columns
     - scalar
     - —
     -
     - Optional comma-separated or multi-select list of column names to restrict the summary to; leave blank to include all columns in the dataset.

Outputs
-------

.. list-table::
   :header-rows: 1
   :widths: 20 20 20 20

   * - Label
     - ID
     - Type
     - Description
   * - dataset_get_summary_output_1
     - dataset_get_summary_output_1
     - dataset
     - Tabular dataset containing per-column descriptive statistics (e.g. count, mean, std, min, 25/50/75 percentiles, max) for the selected columns of the input dataset.

Disciplines
-----------

- data.statistics

Runnable example
----------------

A runnable example is registered for this worker. Open the example workflow on the d3VIEW canvas: `/api/workflow/example?id=dataset_get_summary </api/workflow/example?id=dataset_get_summary>`_

.. raw:: html

   Auto-generated from transformation schema. Worker id: dataset_get_summary. Schema hash: 9e0258aa4ca2. Hand-curated docs in workerexamples/ override this page when present.
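
To illustrate the kind of per-column statistics this worker reports, the following is a minimal sketch using only Python's standard library. It is not the worker's implementation. The function name ``summarise``, the list-of-dicts row format, the use of the sample standard deviation, and the inclusive quartile method are all assumptions made for illustration; the worker's exact statistics and interpolation rules may differ.

```python
import statistics


def summarise(rows, columns=None):
    """Hypothetical helper: per-column descriptive statistics.

    rows    -- list of dicts, one dict per record (assumed input format)
    columns -- optional comma-separated string or list of column names;
               None or empty means "all columns", mirroring Choose Columns
    """
    if not rows:
        return {}
    if isinstance(columns, str):
        columns = [c.strip() for c in columns.split(",") if c.strip()]
    selected = columns or list(rows[0].keys())

    summary = {}
    for name in selected:
        # Keep numeric values only; bool is excluded although it subclasses int.
        values = [
            row[name]
            for row in rows
            if isinstance(row.get(name), (int, float))
            and not isinstance(row.get(name), bool)
        ]
        if not values:
            continue  # non-numeric columns are skipped in this sketch
        if len(values) > 1:
            q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
            std = statistics.stdev(values)  # sample standard deviation
        else:
            q1 = q2 = q3 = values[0]
            std = float("nan")  # undefined for a single observation
        summary[name] = {
            "count": len(values),
            "mean": statistics.fmean(values),
            "std": std,
            "min": min(values),
            "25%": q1,
            "50%": q2,
            "75%": q3,
            "max": max(values),
        }
    return summary


# Made-up sample data, for illustration only.
rows = [
    {"x": 1, "y": 10.0, "label": "a"},
    {"x": 2, "y": 20.0, "label": "b"},
    {"x": 3, "y": 30.0, "label": "c"},
    {"x": 4, "y": 40.0, "label": "d"},
]
summary_x = summarise(rows, "x")   # restricted to one column
summary_all = summarise(rows)      # all numeric columns
```

Passing a column list restricts the result to those columns, while omitting it summarises every numeric column. Non-numeric columns such as ``label`` are skipped in this sketch.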