CONVERTS PDF TO TEXT
Extracts all text content from an uploaded PDF file using the platform’s FileHelper utility and returns it as a plain-text string. Use this worker whenever downstream steps require searchable or parseable text from a PDF document.
When to use
Tagged: file_conversion, pdf, pdf_to_text, text_extraction.
Inputs
| Label | ID | Type | Default | Required | Description |
|---|---|---|---|---|---|
| PDF File | pdf_file | file | — | ✓ | The PDF file to convert. It must be uploaded to the vault or provided as an attachment. Any standard PDF document is accepted. |
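
Before supplying a file as `pdf_file`, a caller may want to confirm that it looks like a PDF. The sketch below is illustrative only and is not part of the worker or of FileHelper: `looks_like_pdf` is a hypothetical helper, and it relies only on the assumption that a standard PDF carries the `%PDF-` header marker near the start of the file.

```python
PDF_MARKER = b"%PDF-"  # header marker found at the start of standard PDF files


def looks_like_pdf(path, probe_bytes=1024):
    """Return True if the PDF header marker appears in the first probe_bytes bytes.

    Hypothetical pre-check; it does not validate the rest of the file.
    """
    with open(path, "rb") as fh:
        head = fh.read(probe_bytes)
    return PDF_MARKER in head
```

A check like this catches only obviously wrong uploads, such as a renamed text file. A file that passes may still be corrupt or contain no extractable text.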
Outputs
| Label | ID | Type | Description |
|---|---|---|---|
| Converted Text | converted_text | textarea | Full plain-text content extracted from the PDF, returned as a single string. Whitespace and line breaks are kept as produced by the underlying PDF-to-text utility. |
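
Because `converted_text` is a single string with line breaks preserved, a downstream step can search it line by line. The following is a minimal sketch of that idea, not platform code: `find_lines` is a hypothetical helper, and `sample` is a made-up stand-in for the worker's output.

```python
def find_lines(converted_text, keyword):
    """Return (line_number, line) pairs whose text contains keyword, ignoring case."""
    needle = keyword.lower()
    return [
        (number, line)
        for number, line in enumerate(converted_text.splitlines(), start=1)
        if needle in line.lower()
    ]


# Made-up stand-in for the worker's converted_text output.
sample = "Invoice 1042\nTotal due: 118.50\n\nThank you."
matches = find_lines(sample, "total")
print(matches)  # [(2, 'Total due: 118.50')]
```

Line numbers here count lines in the extracted string, which need not correspond to page or paragraph positions in the original PDF.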
Disciplines
- data.io.file
Auto-generated from platform schema. Worker id: pdf_to_text. Schema hash: baace46a00e0. Hand-curated docs in workerexamples/ override this page when present.