How OCR works
The scanner uses Tesseract.js to extract text from scanned PDFs and document images. It processes each page locally in your browser — no data is sent to any server. Supports standard document formats and handles mixed text and image content.