PDF Tips & Tricks pdf scanning ocr document management

Best PDF Scanning Tips for Clear, Professional Documents

Get the clearest scanned PDFs possible — the right scanner settings, document preparation, and post-scan processing techniques.

A
Admin
· Jun 3, 2026 · 6 min read · 4 views

Why Scanning Quality Matters

A poorly scanned PDF frustrates everyone who reads it — skewed pages, faint text, blotchy backgrounds, and unreadable fine print. Worse, low-quality scans fail OCR (Optical Character Recognition), making the document unsearchable and defeating the purpose of digitisation.

Good scanning isn't about expensive equipment — it's about technique. Most modern flatbed scanners and even smartphone apps can produce excellent results when used correctly.


Prepare Your Document Before Scanning

The scan quality starts before you touch the scanner.

Flatten Creased Pages

Run creased pages through a book, under a heavy object, or use a bone folder to smooth them. Raised areas create shadows that darken unevenly in the scan.

Remove Staples and Binding

Staples and binding prevent flat contact with the scanner glass, causing blurry or warped areas. Remove them before scanning.

Check for Torn or Damaged Pages

Tears don't always show up in a scan as tears — they can appear as unexpected white patches or shadows. Carefully flatten torn pages and tape from behind if needed.

Clean the Scanner Glass

Dust, smudges, and hair on the scanner glass appear as spots or streaks on every page. Clean the glass with a microfibre cloth and glass cleaner before scanning.

Organise Page Order

Sort pages in the correct order before scanning. Re-ordering 40 scanned pages is tedious work.


Choosing the Right DPI (Dots Per Inch)

DPI determines the resolution — and the file size — of your scan. Choosing the right DPI is the single most important scanning setting.

Content Type Recommended DPI
Text-only documents (standard) 300 DPI
Text with fine print or footnotes 400 DPI
Documents with photos/illustrations 300–600 DPI
Archival / preservation scanning 400–600 DPI
Business cards 600 DPI
Line art / technical drawings 600–1200 DPI
Quick preview or web reference 150 DPI

The golden rule: 300 DPI is the minimum for OCR-capable text. Go lower and text recognition degrades. Go to 600 DPI for fine detail or when archiving originals.

Higher DPI means larger files — a 300 DPI scan of an A4 page is roughly 2–3MB (TIFF) or 200–400KB (compressed PDF). At 600 DPI, expect 4–6× the file size.


Colour Mode: Which to Choose

Black and White (Bitonal / 1-bit)

Converts the scan to pure black and white pixels. Smallest file size. Best for clean, high-contrast text on white paper.

Not suitable for: Documents with greyscale images, photos, or coloured content.

Greyscale (8-bit)

Captures 256 shades of grey. Good for documents with faded text, pencil notes, or greyscale photos. Slightly larger files than black-and-white.

Best for: Most office documents, handwritten notes, newspapers.

Colour (24-bit RGB)

Captures full colour. Largest file size. Necessary for coloured logos, charts, and photos.

Best for: Marketing materials, legal documents with official seals, certificates, and anything with colour content that matters.

Tip: For purely text documents, black-and-white or greyscale at 300 DPI produces the smallest, cleanest, most OCR-friendly result.


Flatbed vs. ADF (Auto Document Feeder)

Flatbed Scanner

You place each page individually on the glass. Slower but produces the best quality — no skewing, no jamming, full control.

Use for: Fragile documents, bound materials (open the book flat), anything requiring maximum quality.

ADF (Auto Document Feeder)

Feeds a stack of pages automatically. Much faster for multi-page documents but can skew pages, miss content at edges, or damage fragile originals.

Use for: Large batches of standard office documents in good condition.

Tip: Even with ADF, run the first page through the flatbed to verify alignment before committing a large batch.


Scanner Software Settings

Most scanner software (NAPS2, ScanSnap, VueScan, Epson Scan, etc.) offers options that dramatically affect output quality.

Deskew / Auto Correct Skew

Automatically detects and corrects tilted pages. Always enable this — even a 2° tilt makes text look unprofessional.

Auto Crop / Auto Border Detection

Trims the white border to match the document edge. Prevents large black borders from appearing around pages.

Blank Page Removal

If scanning double-sided documents on a single-sided scanner, auto blank page removal deletes the empty reverse side pages automatically.

Image Enhancement

  • Auto levels / contrast: Boosts contrast so text is darker and backgrounds appear whiter
  • Despeckle: Removes random noise and scanner dust artifacts
  • Sharpen: Adds definition to text edges (use sparingly — over-sharpening creates halos)
  • Background removal: Neutralises yellowed or coloured paper backgrounds to pure white

Compression

For black-and-white text: CCITT Group 4 (G4) compression provides excellent size reduction with no quality loss. For colour/greyscale: JPEG at 80–90% quality balances size and clarity.


Using a Smartphone to Scan PDFs

Modern smartphone scanning apps produce surprisingly good results for everyday documents.

Best apps:

  • Adobe Scan (iOS/Android) — auto-detects document edges, removes backgrounds, creates searchable PDF via OCR
  • Microsoft Lens (iOS/Android) — excellent for whiteboards and documents, integrates with OneDrive
  • Google Drive (Android) — built-in document scanner, saves directly to Drive
  • Scanner Pro (iOS) — strong image enhancement tools

Tips for phone scanning:

  • Use bright, even lighting — avoid flash glare and shadows from your hands
  • Hold the phone directly above the document (not at an angle)
  • Tap to focus on the page before capturing
  • Use the app's edge detection — let it auto-crop rather than shooting a wide frame

Post-Scan Processing

Run OCR to Make It Searchable

A scanned PDF is just an image — text cannot be selected, copied, or searched. OCR adds a searchable text layer.

Free options:

  • Adobe Acrobat Reader (free for basic OCR)
  • ABBYY FineReader Online (free tier)
  • Google Drive — upload a scanned PDF, right-click → Open with Google Docs (performs OCR automatically)

Command line (Tesseract):

tesseract input.tiff output pdf

Reduce File Size After Scanning

High-DPI scans produce large files. Compress after scanning:

  • Acrobat Pro: File → Save as Other → Reduced Size PDF
  • Ghostscript: gs -sDEVICE=pdfwrite -dPDFSETTINGS=/ebook -o output.pdf input.pdf

Rotate and Order Pages

Check page orientation (portrait vs landscape) and correct any upside-down pages before finalising.

Add Bookmarks and Metadata

For long scanned documents, add bookmarks (chapter markers) and metadata (title, author, date) to make them useful:

  • Acrobat Pro: Tools → Edit PDF → More → Add Bookmarks / Document Properties

Common Scanning Problems and Solutions

Problem: Dark edges or black borders around pages Solution: Enable auto-crop in scanner software, or manually trim in Acrobat (Edit PDF → Crop Pages)

Problem: Text appears grey, not black Solution: Increase contrast in scanner software, or use Enhance Scans in Acrobat (Tools → Enhance Scans)

Problem: Pages are slightly tilted Solution: Enable auto-deskew in scanner settings

Problem: Moiré pattern (wavy texture) on photos Solution: Enable the Descreen option in scanner software if scanning printed photos/magazines

Problem: OCR is inaccurate Solution: Re-scan at 300 DPI minimum, increase contrast, ensure pages are flat with no shadows


Summary

Great scanned PDFs come from preparation (flat, clean documents), correct settings (300 DPI, appropriate colour mode, auto-deskew enabled), and post-scan processing (OCR, compression, rotation). For text documents, 300 DPI greyscale hits the sweet spot between quality and file size. Always run OCR on important documents to make them searchable, and compress large scans before archiving or sharing.