# Intelligent Document Reader

The Intelligent Document Reader turns PDFs, Word files, and images into clean, structured text that Duvo can work with. It handles both text-based and scanned documents, extracting text and tables so your assignments can process real-world paperwork automatically.

## Setup

No setup is needed. The Intelligent Document Reader is automatically available to all assignments.

## Capabilities

* **PDF extraction** — Read and extract content from both text-based and scanned PDF documents.
* **Word document processing** — Parse Word files and extract their text and structure.
* **Image text recognition** — Use optical character recognition (OCR) to read text from photos, screenshots, and image files (JPG, PNG, GIF, WebP).
* **Scanned document handling** — Convert scanned paperwork into usable text through OCR.
* **Table extraction** — Recognize and extract tabular data from documents while preserving structure.
* **Complex layout handling** — Process documents with multiple columns, forms, and mixed content while keeping their layout intact.

## Key Benefits

* **Universal document reading** — Handle PDFs, Word files, images, and scanned paperwork through a single connection.
* **No manual data entry** — Automate the extraction of text and tables from documents that would otherwise require manual processing.
* **Scanned document support** — Process paper-based documents and photos just as easily as digital files, using built-in OCR.
* **Structured output** — Get clean, organized text that your assignments can immediately use for analysis, comparison, or reporting.

## Works Well With

* **Google Sheets** — Extract data from PDF invoices or scanned forms and write the results into a spreadsheet for tracking.
* **Gmail / Outlook** — Process document attachments from emails automatically, extracting key information without opening each file manually.
* **Browser** — Download documents from web portals, then extract and analyze their contents in a single workflow.

## Not for Spreadsheets

Do not use the Intelligent Document Reader for spreadsheet files such as XLSX, XLS, or CSV. These are structured data formats and should be handled by a dedicated spreadsheet connection, such as Google Sheets.
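
If you pre-sort files before handing them to an assignment, the rule above can be expressed as a simple extension check. The following is a minimal sketch, not part of Duvo: the function name `pick_connection` and the returned labels are hypothetical, and the `.docx`, `.doc`, and `.jpeg` extensions are assumptions, since this page names the formats only as "Word files" and "JPG".

```python
from pathlib import Path

# File types this page lists for the Intelligent Document Reader.
# ".docx", ".doc", and ".jpeg" are assumptions; the page says "Word files" and "JPG".
DOCUMENT_READER_EXTENSIONS = {
    ".pdf", ".docx", ".doc", ".jpg", ".jpeg", ".png", ".gif", ".webp",
}

# Spreadsheet formats this page says to route elsewhere.
SPREADSHEET_EXTENSIONS = {".xlsx", ".xls", ".csv"}


def pick_connection(filename: str) -> str:
    """Return a hypothetical label for the connection that should handle a file."""
    suffix = Path(filename).suffix.lower()  # case-insensitive extension match
    if suffix in SPREADSHEET_EXTENSIONS:
        return "spreadsheet"
    if suffix in DOCUMENT_READER_EXTENSIONS:
        return "document_reader"
    return "unsupported"
```

For example, `pick_connection("invoice.PDF")` yields the `document_reader` label, while `pick_connection("report.xlsx")` yields `spreadsheet`. Checking spreadsheets first keeps the exclusion explicit even if the two sets are later edited.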
