csvstat
Descriptive statistics calculator for CSV columns
TLDR
Show statistics for all columns
SYNOPSIS
csvstat [options] file
DESCRIPTION
csvstat computes descriptive statistics for columns in CSV files. Part of csvkit, it automatically detects data types and provides appropriate statistics for each.
The tool reports counts, unique values, min/max, mean, median, standard deviation, and frequent values, giving a quick overview of data characteristics.
PARAMETERS
-c columns
Columns to analyze.--type
Show column data types only.--unique
Show unique value counts only.--min
Show minimum values only.--max
Show maximum values only.--mean
Show mean values only.--median
Show median values only.--stdev
Show standard deviation only.--freq
Show frequent values only.--count
Show row count only.-d char
Field delimiter.-y n
Sniff limit for type detection.
CAVEATS
Loads entire file into memory. Large files can be slow. Type detection may misclassify mixed data. Part of csvkit, requires Python.
HISTORY
csvstat is part of csvkit, created by Christopher Groskopf in 2011. It brings pandas-like summary statistics to the command line, essential for initial data exploration and validation.
