csvkit

Convert and work with CSV files

TLDR

Run a command on a CSV file with a custom delimiter

$ [command] [[-d|--delimiter]] [delimiter] [path/to/file.csv]

Run a command on a CSV file with a tab as a delimiter (overrides -d)

$ [command] [[-t|--tabs]] [path/to/file.csv]

Run a command on a CSV file with a custom quote character

$ [command] [[-q|--quotechar]] [quote_char] [path/to/file.csv]

Run a command on a CSV file with no header row

$ [command] [[-H|--no-header-row]] [path/to/file.csv]

SYNOPSIS

Suite of tools invoked individually:
csvcut [-c COLUMNS] [-C COLUMNS] [FILE]
csvlook [OPTIONS] [FILE]
csvstat [OPTIONS] [FILE]
csvgrep [OPTIONS] [PATTERN] [FILE]

-d DELIM, --delimiter DELIM
    Field delimiter (default: ',')

-t, --tabs
    Treat tabs as field delimiters

-q QC, --quotechar QC
    Quote character (default: '"')

-u EC, --escapechar EC
    Escape character for quotes

-z L, --zero-lines L
    Line ending specification (CRLF, LF, etc.)

-e E, --encoding E
    Input/output encoding (default: utf-8)

-b, --blanks
    Blank values as empty strings

--lb, --line-breaks
    Line breaks within fields as empty strings

--date-format FMT
    Format for parsing dates

--zero DELIM_ZERO
    Treat specific delimiter as zero value

DESCRIPTION

csvkit is a powerful open-source collection of command-line tools designed for working with CSV files, the most ubiquitous format for tabular data. It empowers users to manipulate, analyze, and transform data directly in the shell without needing spreadsheets or graphical software.

Key tools include:
csvcut: Select, reorder, or exclude columns.
csvlook: Pretty-print CSV as formatted tables.
csvstat: Compute descriptive statistics like min/max/mean.
csvgrep: Search rows with grep-like patterns.
csvjoin: Join multiple CSV files on key columns.
csvsort: Sort rows by specified columns.
in2csv: Convert Excel, JSON, XML, or fixed-width to CSV.
csvsql: Generate SQL CREATE and INSERT statements.

Tools chain effortlessly via pipes, support huge files, custom delimiters/encodings, quoted fields with embeds, and dialects like Excel. Ideal for data journalists, analysts, sysadmins, and ETL pipelines. Written in Python, extensible via csvpy.

csvkit

Convert and work with CSV files

TLDR

SYNOPSIS

PARAMETERS

DESCRIPTION

CAVEATS

INSTALLATION

QUICK EXAMPLE

PIPING WORKFLOW

HISTORY

SEE ALSO