I built pq - the jq of Parquet. Here's why data engineers need a better CLI
How-To · Tools


via Dev.to, by Evgenii Orlov

I got tired of spinning up DuckDB or writing throwaway Python just to peek inside a Parquet file. So I built pq, a single-binary CLI written in Rust that handles the full Parquet workflow from your terminal.

Quick taste:

- pq data.parquet: metadata, schema, compression, and row groups at a glance
- pq head -n 5 -c id,name s3://bucket/data.parquet: preview specific columns directly from S3
- pq schema extract --ddl postgres data.parquet: generate a CREATE TABLE statement (supports Postgres, ClickHouse, DuckDB, Spark, BigQuery, Snowflake, Redshift, and MySQL)
- pq check --contract contract.toml data/: validate file structure and data contracts in CI
- pq schema diff a.parquet b.parquet: catch schema drift between files
- pq compact data/ -o s3://bucket/compacted/: merge small files into optimal sizes
- pq convert raw/*.csv -o parquet/: batch-convert CSV/JSON to Parquet

pq auto-detects the output format (a table on a TTY, JSON when piped), supports glob patterns, and works with S3, GCS, Azure Blob, and Cloudflare R2.

Install: brew

Continue reading on Dev.to
