LinuxCommandLibrary

ocrmypdf

Generate a searchable PDF or PDF/A from a scanned PDF or an image of text.

TLDR

Create a new searchable PDF/A file from a scanned PDF or image file

$ ocrmypdf [path/to/input_file] [path/to/output.pdf]
copy


Replace a scanned PDF file with a searchable PDF file
$ ocrmypdf [path/to/file.pdf] [path/to/file.pdf]
copy


Skip pages of a mixed-format input PDF file that already contain text
$ ocrmypdf --skip-text [path/to/input.pdf] [path/to/output.pdf]
copy


Clean, de-skew, and rotate pages of a poor scan
$ ocrmypdf --clean --deskew --rotate-pages [path/to/input_file] [path/to/output.pdf]
copy


Set the metadata of the searchable PDF file
$ ocrmypdf --title "[title]" --author "[author]" --subject "[subject]" --keywords "[keyword; key phrase; ...]" [path/to/input_file] [path/to/output.pdf]
copy


Display help
$ ocrmypdf --help
copy

Copied to clipboard