LinuxCommandLibrary
GitHubF-DroidGoogle Play Store

poppler-utils

PDF manipulation command-line utilities

TLDR

Extract text from PDF
$ pdftotext [document.pdf] [output.txt]
copy
Get PDF information
$ pdfinfo [document.pdf]
copy
Convert PDF to images
$ pdftoppm [document.pdf] [output] -png
copy
Extract images from PDF
$ pdfimages [document.pdf] [prefix]
copy
Merge PDF files
$ pdfunite [file1.pdf] [file2.pdf] [output.pdf]
copy

SYNOPSIS

Collection of PDF utilities from Poppler library

DESCRIPTION

poppler-utils is a collection of command-line utilities for working with PDF files, based on the Poppler PDF rendering library. It provides tools for extraction, conversion, and manipulation.

UTILITIES

pdftotext

Extract text content.
pdfinfo
Display PDF metadata.
pdftoppm
Convert to PPM/PNG/JPEG images.
pdfimages
Extract embedded images.
pdfunite
Merge multiple PDFs.
pdfseparate
Split PDF into pages.
pdffonts
List fonts used.
pdfdetach
Extract attachments.
pdfattach
Attach files to PDFs.
pdftocairo
Convert PDF to image formats using Cairo.
pdftohtml
Convert PDF to HTML.
pdftops
Convert PDF to PostScript.

CAVEATS

Text extraction quality varies by PDF structure. Some PDFs are image-only. Install poppler-utils package.

HISTORY

Poppler was forked from Xpdf by Derek Noonburg and is maintained by the freedesktop.org project.

SEE ALSO

pdftotext(1), pdftoppm(1), pdfinfo(1), pdftk(1), gs(1), qpdf(1)

Copied to clipboard
Kai