Convert scanned PDF pages to searchable text using optical character recognition. Free, browser-based, 100+ languages.
OCR (Optical Character Recognition) is technology that reads text from images and scanned documents. Our free OCR tool uses Tesseract — the world's most accurate open-source OCR engine — to extract text from your scanned PDFs and images directly in your browser.
Recognise text in English, Hindi, French, German, Spanish, Arabic, Chinese, Japanese and many more.
OCR runs entirely in your browser using WebAssembly. Your documents are never uploaded anywhere.
Extract to plain text for copying, or create a searchable PDF that preserves your original layout.
Works directly in Chrome, Firefox and Safari. No plugins, no downloads, no registration.