CONVERTS PDF TO TEXT

Extracts all text content from an uploaded PDF file using the platform’s FileHelper utility and returns it as a plain-text string. Use this worker whenever downstream steps require searchable or parseable text from a PDF document.

When to use

Tagged: file_conversion, pdf, pdf_to_text, text_extraction.

Inputs

Label ID Type Default Required Description
PDF File pdf_file file The PDF file to convert; must be uploaded to the vault or provided as an attachment — accepts any standard PDF document.

Outputs

Label ID Type Description
Converted Text converted_text textarea Full plain-text content extracted from the PDF, returned as a single string with whitespace and line breaks preserved as parsed by the underlying PDF-to-text utility.

Disciplines

  • data.io.file

Auto-generated from platform schema. Worker id: pdf_to_text. Schema hash: baace46a00e0. Hand-curated docs in workerexamples/ override this page when present.