OCR: PDF & Image to Text

Extract text from PDFs and images using AI-powered OCR. Supports scanned documents, photos of text, and native PDFs. Outputs Markdown, plain text, or HTML with preserved formatting.

Processed on our servers. Requires a free account.

How to Use OCR: PDF & Image to Text

  1. Upload a PDF or image file containing text
  2. Choose your output format: Markdown, Plain Text, or HTML
  3. Click Process to extract text with AI OCR
  4. Download the extracted text file

Features

  • AI-powered OCR using Marker for state-of-the-art document understanding
  • Supports scanned PDFs, photos of text, native PDFs, and images
  • Preserves tables, headings, lists, and document structure
  • Output as Markdown, plain text, or HTML
  • 90+ languages supported with auto-detection

Frequently Asked Questions

What types of documents can I OCR?
Scanned PDFs, photos of documents, screenshots, native PDFs, and images (JPG, PNG, WebP, TIFF, BMP). The AI handles handwriting, tables, equations, and complex layouts.

Does it preserve formatting?
Yes. Markdown output preserves headings, tables, lists, bold/italic text, and document structure. Plain text strips all formatting. HTML gives you web-ready formatted output.

What languages are supported?
Over 90 languages, including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, Hindi, and Russian. The language is auto-detected.
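
If you prepare files in bulk, it can help to check them against the supported types listed in the first answer above before uploading. The following is a minimal sketch, not part of this tool: the function name `is_supported` is hypothetical, and treating `.jpeg` and `.tif` as aliases of JPG and TIFF is an assumption.

```python
from pathlib import Path

# File types named in the FAQ: PDF, JPG, PNG, WebP, TIFF, BMP.
# Assumption: ".jpeg" and ".tif" are accepted as aliases of ".jpg" and ".tiff".
SUPPORTED_EXTENSIONS = {
    ".pdf", ".jpg", ".jpeg", ".png", ".webp", ".tiff", ".tif", ".bmp",
}


def is_supported(filename: str) -> bool:
    """Return True if the file extension matches a supported type.

    The check is case-insensitive and looks only at the extension,
    not at the actual file contents.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

For example, `is_supported("scan.PDF")` is true, while `is_supported("notes.docx")` is false. An extension check cannot tell whether a file is actually readable, so treat it as a quick filter only.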