Intelligent Document Reader
What is the Intelligent Document Reader?
The Intelligent Document Reader is a system-managed tool that turns PDFs, Word files, and images into clean text that Duvo can easily work with. It extracts text, tables, and structure from documents — whether text-based or scanned.
What Can It Do?
The Intelligent Document Reader enables your assignments to:
Read PDFs: Extract content from both text-based and scanned PDF documents
Process images: Read text from photos, screenshots, and image files
Handle scanned documents: Use OCR to extract text from scanned paperwork
Extract tables: Recognize and extract tabular data from documents
Interpret visual content: Read charts, graphs, and embedded images
Supported Formats
PDFs (text-based and scanned)
Word documents
Images (JPG, PNG, GIF, WebP)
Photos of documents
Not For Spreadsheets
Important: Do not use the Intelligent Document Reader for spreadsheet files like XLSX, XLS, or CSV. These are structured data formats, not documents.
When to Use It
Use the Intelligent Document Reader when your assignment needs to:
Process PDF documents: Extract text and tables from any PDF
Extract text from photos: Read text from photos of documents or screenshots
Handle complex layouts: Work with documents that have tables, columns, or mixed content
Process scanned paperwork: Convert paper documents into usable text
Read visual content: Interpret charts, diagrams, or annotated images
When NOT to Use It
XLSX/XLS files: These are spreadsheets, not documents
CSV files: These are structured data, not documents
Google Sheets: Use the Google Sheets connection instead
How It Works
When your assignment encounters a document, the Intelligent Document Reader analyzes its content and structure. For text-based documents, it extracts and organizes the text and tables. For scanned documents or images, it uses OCR to recognize text and structure. The result is clean, structured data your assignment can use.
Key Benefits
Universal document reading: Handle PDFs, Word files, images, and scanned paperwork
Table extraction: Recognize and extract tabular data from documents
Visual interpretation: Process photos, screenshots, and scanned content that other tools can't read
Automation enabler: Bring document-heavy processes into your automated workflows
The Intelligent Document Reader is essential for organizations that deal with diverse document types, enabling automation of processes that involve real-world paperwork and files.
Last updated