USE REGULAR EXPRESSION TO EXTRACT THE DATA FROM A TEXT¶
Applies a regular expression pattern to an input text string and extracts matching content. Use this worker whenever you need to parse structured or semi-structured text (e.g., log lines, simulation output headers, report strings) and capture specific substrings via regex groups.
When to use¶
Classification: process.
Tagged: pattern_matching, regex, string_extraction, text_parsing, transformations.
Inputs¶
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| Text To Be Parsed | texttobeparsed | text | — | The raw text string to search; accepts any plain-text content such as a simulation log, file output, or free-form string — leave empty only if the text will be injected at runtime. | |
| Parsertype | parsertype | string | regex | Parsing strategy to apply; currently only ‘regex’ is supported — leave at the default value ‘regex’ unless additional parser types become available. | |
| Expression | expression | string | — | The regular expression pattern used to match and extract data from the input text (e.g., ‘([0-9]+.?[0-9]*)’ to capture numeric values); must be a valid regex string — required for the worker to produce any output. |
Outputs¶
| Label | ID | Type | Description |
|---|---|---|---|
| text_parser_output_1 | text_parser_output_1 | text | The first match (or captured group) extracted from the input text by the supplied regular expression; returned as a plain-text string. |
Disciplines¶
- data.dataset.transform
Auto-generated from transformation schema. Worker id: text_parser. Schema hash: 4f0a0e1e5d1f. Hand-curated docs in workerexamples/ override this page when present.