Extract text from scanned PDFs
Run OCR directly in your browser — files never leave your device.
① Source
Drop a PDF or image here
or click to browse · .pdf, .jpg, .png, .webp
Quality
Preprocessing
Layout
Parallel workers
Pages
ℹ First run downloads ~22 MB of OCR models. Cached by your browser —
works offline after that.
Drop a file to get started.
💡 Tip: for PDFs with mixed content (some scanned pages, some with real text), Folio-OCR uses the embedded text layer where available and only OCRs the image-only pages — saving time.
② Extracted text