LinuxCommandLibrary

parquet-tools

A tool to show, inspect and manipulate Parquet file.

TLDR

Display the content of a Parquet file

$ parquet-tools cat [path/to/parquet]
copy


Display the first few lines of a Parquet file
$ parquet-tools head [path/to/parquet]
copy


Print the schema of a Parquet file
$ parquet-tools schema [path/to/parquet]
copy


Print the metadata of a Parquet file
$ parquet-tools meta [path/to/parquet]
copy


Print the content and metadata of a Parquet file
$ parquet-tools dump [path/to/parquet]
copy


Concatenate several Parquet files into the target one
$ parquet-tools merge [path/to/parquet1] [path/to/parquet2] [path/to/target_parquet]
copy


Print the count of rows in a Parquet file
$ parquet-tools rowcount [path/to/parquet]
copy


Print the column and offset indexes of a Parquet file
$ parquet-tools column-index [path/to/parquet]
copy

Copied to clipboard