Is my PDF uploaded anywhere?

No. The OCR runs completely in your browser using a self-hosted engine, so your PDF and its images are never uploaded to any server. The file stays on your device the whole time.

How is this different from the PDF to Text tool?

PDF to Text is instant but only works on PDFs that already contain a text layer. This OCR tool is for scanned or image-based PDFs, where it reads the picture of each page and recognises the words, which takes a bit longer.

Why does the first run take a while?

The first time you use it, your browser downloads the OCR engine, which is around 5MB. This only happens once; after that it is cached, so future runs start much faster.

Does it work on handwriting?

It is designed for printed text and works best on clear scans of it. Handwriting is generally not recognised reliably, so results on handwritten pages will usually be poor.

How accurate is the text I get?

Accuracy is high on clean, sharp scans of printed documents, but it is not perfect. Faint print, low-resolution scans, skewed pages, or unusual fonts can cause misreads, so it is worth reviewing the output before you use it.

What can I do with the extracted text?

You can copy it to your clipboard or download it as a plain .txt file. From there it is fully editable in any text editor or word processor, and you can search, quote, or reuse it however you need.

OCR PDF (Scanned to Text)

Extract text from a scanned PDF using OCR that runs on your device.

100% private
No uploads
Free & instant

Drop your files here

or browse, or paste from clipboard

PDF files

Processed in your browser - never uploaded

About OCR PDF (Scanned to Text)

Some PDFs are just pictures. When a document is scanned or photographed, each page is stored as an image, so the words you see on screen are not actually text a computer can select, search, or copy. Try to highlight them and you get nothing. The OCR PDF tool solves this by reading the image and turning it back into real, editable text you can copy or save as a .txt file.

OCR stands for Optical Character Recognition. This tool renders every page of your scanned PDF, looks at the shapes of the letters, and recognises the words, all on your own device. It runs entirely in your browser using a self-hosted engine (tesseract.js), which means your document is never uploaded to a server. The file stays on your computer from start to finish, so even sensitive scans like contracts, bank statements, or ID documents never leave your hands.

Think of it as the companion to our PDF to Text tool. PDF to Text is instant, but it only works when a PDF already contains a hidden text layer. If that tool comes back empty, your PDF is image-based, and this OCR tool is what you need instead. It is free, needs no sign-up, and works best on clear scans of printed text.

How to Extract Text from a Scanned PDF with OCR

1
Open the OCR PDF tool
Go to the OCR PDF (Scanned to Text) tool in your browser. There is no account to create and nothing to install. On your very first run, the tool downloads a small OCR engine (around 5MB); this happens once and your browser keeps it cached for next time.
2
Add your scanned PDF
Drag your PDF onto the page or click to choose it from your device. Because everything runs locally, your file is loaded straight into your browser and is never sent anywhere. This tool is meant for scanned or image-based PDFs; if your PDF already has selectable text, the faster PDF to Text tool will do the job instantly.
3
Let it read each page
The tool renders every page and recognises the text on it, one page at a time. Larger documents and higher-resolution scans take a little longer, since all the processing happens on your own hardware rather than a remote server.
4
Review the recognised text
When it finishes, the extracted text appears on screen ready to check. OCR is very good with clear printed scans but not perfect, so it is worth a quick read to catch the occasional misread character, especially with faint print or unusual fonts.
5
Copy or download your text
Copy the text to your clipboard to paste into a document or email, or download it as a plain .txt file to keep. From there you can edit it freely in any text editor or word processor.

Why use OCR PDF (Scanned to Text)

Your document never leaves your device

The OCR engine is self-hosted and runs entirely in your browser. Your scanned PDF is not uploaded, stored, or seen by anyone else, which makes it safe for private paperwork like statements, agreements, and personal records.

Works where plain extraction fails

Normal PDF text tools only work when a hidden text layer exists. This tool actually reads the image, so it can pull text out of scans, photos of documents, and old photocopies that would otherwise come back completely empty.

Free, editable output

You get real, editable text you can copy or download as a .txt file, with no watermarks, no sign-up, and no page limits imposed by a paid plan. Once it is text, you can search it, edit it, or reuse it however you like.

No installs or accounts

Everything works from a web page. After the one-time engine download, you can OCR documents whenever you need without extra software, subscriptions, or uploads.

Common uses

Pulling the text out of a scanned contract or agreement so you can search and quote from it
Converting scanned bank or brokerage statements into text you can copy into a spreadsheet or note
Digitising old photocopied documents, letters, or printed notes that have no digital text
Extracting details from scanned invoices, receipts, or forms to reuse elsewhere
Making a scanned printed report or study material selectable and searchable

Your files never leave your device

Most online PDF tools upload your document to a server to process it. This one does not. Every operation runs inside your own browser using WebAssembly, so your files are read, processed, and saved locally - they are never transmitted, stored, or seen by Sapphire Broking or anyone else. That makes it safe to use with financial statements, contracts, and identity documents.

Frequently asked questions

Keep going

Related tools

All PDF tools

PDF to JPG

Convert each PDF page into a high-quality JPG image.

Open tool

PDF to PNG

Convert PDF pages into crisp, lossless PNG images.

Open tool

JPG to PDF

Turn JPG, PNG, or WebP images into a single PDF document.

Open tool

PDF to Text

Extract selectable text from a PDF into a plain .txt file.

Open tool

Connecting to Sapphire

Aligning the stars

About OCR PDF (Scanned to Text)

How to Extract Text from a Scanned PDF with OCR

Open the OCR PDF tool

Go to the OCR PDF (Scanned to Text) tool in your browser. There is no account to create and nothing to install. On your very first run, the tool downloads a small OCR engine (around 5MB); this happens once and your browser keeps it cached for next time.

Add your scanned PDF

Drag your PDF onto the page or click to choose it from your device. Because everything runs locally, your file is loaded straight into your browser and is never sent anywhere. This tool is meant for scanned or image-based PDFs; if your PDF already has selectable text, the faster PDF to Text tool will do the job instantly.

Let it read each page

The tool renders every page and recognises the text on it, one page at a time. Larger documents and higher-resolution scans take a little longer, since all the processing happens on your own hardware rather than a remote server.

Review the recognised text

When it finishes, the extracted text appears on screen ready to check. OCR is very good with clear printed scans but not perfect, so it is worth a quick read to catch the occasional misread character, especially with faint print or unusual fonts.

Copy or download your text

Copy the text to your clipboard to paste into a document or email, or download it as a plain .txt file to keep. From there you can edit it freely in any text editor or word processor.

Why use OCR PDF (Scanned to Text)