LinuxCommandLibrary

tabula

tabula

TLDR

Extract all tables from a PDF to a CSV file

$ tabula -o [file.csv] [file.pdf]
copy


Extract all tables from a PDF to a JSON file
$ tabula --format JSON -o [file.json] [file.pdf]
copy


Extract tables from pages 1, 2, 3, and 6 of a PDF
$ tabula --pages [1-3,6] [file.pdf]
copy


Extract tables from page 1 of a PDF, guessing which portion of the page to examine
$ tabula --guess --pages [1] [file.pdf]
copy


Extract all tables from a PDF, using ruling lines to determine cell boundaries
$ tabula --spreadsheet [file.pdf]
copy


Extract all tables from a PDF, using blank space to determine cell boundaries
$ tabula --no-spreadsheet [file.pdf]
copy

Copied to clipboard