🔍

PDF OCR Scanner

Extract text from scanned PDFs using Tesseract OCR. Upload a scanned PDF, choose language, and get the recognized text for each page. Runs entirely in your browser — no server.

📄
Drop a scanned PDF here or click to browse
Works best on image-based PDFs
No upload · Powered by Tesseract.js · Free
Language:
Initializing Tesseract…
Advertisement
How to use
  1. Drop a scanned or image-based PDF onto the upload zone.
  2. Pick the document language for best accuracy, then click Run OCR.
  3. Copy individual pages, or download the full text as a .txt file.
FAQ

Tesseract's language model (~5–20 MB depending on language) downloads on the first run. Subsequent runs use the cached model and are much faster.

Very good on clean, high-contrast scans. For complex layouts, handwriting or low-quality scans, cloud OCR (Google Vision, AWS Textract) may do better.

Yes, but it's overkill — use a text-extraction tool for native PDFs. OCR is intended for image-only scans.