LinuxCommandLibrary

pdfid

TLDR

Analyze PDF structure

$ pdfid [file.pdf]
copy
Scan multiple files
$ pdfid [file1.pdf] [file2.pdf]
copy
Extra analysis mode
$ pdfid -e [file.pdf]
copy
Output as JSON
$ pdfid -j [file.pdf]
copy

SYNOPSIS

pdfid [options] file...

DESCRIPTION

pdfid scans PDF files for keywords that indicate potentially malicious content. It counts occurrences of JavaScript, embedded files, launch actions, and other suspicious elements.
Essential tool for PDF malware analysis.

PARAMETERS

-e

Extra data analysis.
-j
JSON output.
-s
Scan subfiles.
-d
Disarm (remove /JS and /Launch).
-p plugin
Use plugin.

OUTPUT

$ PDFiD 0.2.7 document.pdf
 PDF Header: %PDF-1.4
 obj                   15
 endobj                15
 stream                 3
 /Page                  2
 /JS                    0
 /JavaScript            0
 /OpenAction            0
 /Launch                0
copy

SUSPICIOUS KEYWORDS

$ /JS, /JavaScript  - Embedded JavaScript
/OpenAction       - Automatic actions
/Launch           - Launch external programs
/EmbeddedFile     - Embedded files
/AcroForm         - Interactive forms
copy

CAVEATS

Keyword presence doesn't confirm malicious intent. Further analysis with pdf-parser recommended. Part of PDF analysis toolkit.

HISTORY

pdfid was created by Didier Stevens as part of his PDF analysis tools for security research.

SEE ALSO

Copied to clipboard