USE REGULAR EXPRESSION TO EXTRACT THE DATA FROM A TEXT

Applies a regular expression pattern to an input text string and extracts matching content. Use this worker whenever you need to parse structured or semi-structured text (e.g., log lines, simulation output headers, report strings) and capture specific substrings via regex groups.

When to use

Classification: process.

Tagged: pattern_matching, regex, string_extraction, text_parsing, transformations.

Inputs

Label | ID | Type | Default | Required | Description
Text To Be Parsed | texttobeparsed | text |  |  | The raw text string to search. Accepts any plain-text content, such as a simulation log, file output, or free-form string. Leave empty only if the text will be injected at runtime.
Parsertype | parsertype | string | regex |  | Parsing strategy to apply. Only ‘regex’ is currently supported, so leave this at the default.
Expression | expression | string |  |  | The regular expression pattern used to match and extract data from the input text (e.g., ‘([0-9]+\.?[0-9]*)’ to capture numeric values). Must be a valid regular expression; without it the worker produces no output.
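
The effect of the dot in a numeric pattern like the one in the Expression description can be checked outside the worker. The sketch below uses Python's re module purely as an illustration; this page does not state which regex engine or dialect the worker uses, and the sample text is made up. It shows that an unescaped dot matches any character, while an escaped dot matches only a literal decimal point.

```python
import re

# Hypothetical sample text, not real worker input.
text = "iter 12a7 done"

# Unescaped dot: '.' matches any single character, so the letter 'a'
# is swallowed into the "number".
loose = re.search(r"([0-9]+.?[0-9]*)", text)

# Escaped dot: '\.' matches only a literal decimal point.
strict = re.search(r"([0-9]+\.?[0-9]*)", text)

print(loose.group(1))   # 12a7
print(strict.group(1))  # 12
```

If the worker's engine follows the same conventions, the escaped form is the safer choice for capturing plain decimal numbers.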

Outputs

Label | ID | Type | Description
text_parser_output_1 | text_parser_output_1 | text | The first match (or captured group) extracted from the input text by the supplied regular expression, returned as a plain-text string.
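
One plausible reading of "first match (or captured group)" is sketched below in Python. This is an assumption for illustration, not the worker's actual implementation: the helper name extract_first is hypothetical, and the rule "return group 1 if the pattern defines a group, otherwise the whole match" is a guess at the behavior described above.

```python
import re
from typing import Optional


def extract_first(text: str, expression: str) -> Optional[str]:
    """Return the first capture group of the first match, or the whole
    match when the pattern defines no groups. Returns None if nothing
    matches. (Hypothetical helper; assumed semantics.)"""
    match = re.search(expression, text)
    if match is None:
        return None
    if match.re.groups >= 1:
        # The pattern has at least one capturing group: return the first.
        return match.group(1)
    # No capturing groups: fall back to the full matched text.
    return match.group(0)


log_line = "Solver finished: residual = 3.2e-05 after 140 iterations"

print(extract_first(log_line, r"after ([0-9]+) iterations"))  # 140
print(extract_first(log_line, r"[0-9]+ iterations"))          # 140 iterations
print(extract_first("no digits here", r"[0-9]+"))             # None
```

Wrapping the part you want in parentheses keeps the extracted string free of surrounding context, which is usually what a downstream worker expects.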

Disciplines

  • data.dataset.transform

Auto-generated from transformation schema. Worker id: text_parser. Schema hash: 4f0a0e1e5d1f. Hand-curated docs in workerexamples/ override this page when present.