USE REGULAR EXPRESSION TO EXTRACT THE DATA FROM A TEXT¶

Applies a regular expression pattern to an input text string and extracts matching content. Use this worker whenever you need to parse structured or semi-structured text (e.g., log lines, simulation output headers, report strings) and capture specific substrings via regex groups.

When to use¶

Classification: process.

Tagged: pattern_matching, regex, string_extraction, text_parsing, transformations.

Inputs¶

Label	ID	Type	Default	Description
Text To Be Parsed	texttobeparsed	text	—	The raw text string to search; accepts any plain-text content such as a simulation log, file output, or free-form string — leave empty only if the text will be injected at runtime.
Parsertype	parsertype	string	regex	Parsing strategy to apply; currently only ‘regex’ is supported — leave at the default value ‘regex’ unless additional parser types become available.
Expression	expression	string	—	The regular expression pattern used to match and extract data from the input text (e.g., ‘([0-9]+.?[0-9]*)’ to capture numeric values); must be a valid regex string — required for the worker to produce any output.

Outputs¶

Label	ID	Type	Description
text_parser_output_1	text_parser_output_1	text	The first match (or captured group) extracted from the input text by the supplied regular expression; returned as a plain-text string.

Disciplines¶

data.dataset.transform

Auto-generated from transformation schema. Worker id: text_parser. Schema hash: 4f0a0e1e5d1f. Hand-curated docs in workerexamples/ override this page when present.