.. _auto_pdf_to_text:

*CONVERTS PDF TO TEXT*
======================

Extracts all text content from an uploaded PDF file using the platform's FileHelper utility and returns it as a plain-text string. Use this worker whenever downstream steps require searchable or parseable text from a PDF document.

When to use
-----------

Tagged: ``file_conversion``, ``pdf``, ``pdf_to_text``, ``text_extraction``.

Inputs
------

.. list-table::
   :header-rows: 1
   :widths: 20 20 20 20 20 20

   * - Label
     - ID
     - Type
     - Default
     - Required
     - Description
   * - PDF File
     - pdf_file
     - file
     - —
     - ✓
     - The PDF file to convert; must be uploaded to the vault or provided as an attachment. Accepts any standard PDF document.

Outputs
-------

.. list-table::
   :header-rows: 1
   :widths: 20 20 20 20

   * - Label
     - ID
     - Type
     - Description
   * - Converted Text
     - converted_text
     - textarea
     - Full plain-text content extracted from the PDF, returned as a single string with whitespace and line breaks preserved as parsed by the underlying PDF-to-text utility.

Disciplines
-----------

- data.io.file

.. raw:: html

   Auto-generated from platform schema. Worker id: pdf_to_text. Schema hash: baace46a00e0. Hand-curated docs in workerexamples/ override this page when present.