
DuckDB Has a Free API: The SQLite for Analytics
Why DuckDB DuckDB is an in-process analytical database — think SQLite but for OLAP. Query Parquet files, CSV, JSON, even pandas DataFrames with SQL. No server, no setup, blazing fast. Install pip install duckdb # or brew install duckdb Query Files Directly -- Query Parquet files SELECT * FROM read_parquet ( 'data/*.parquet' ) WHERE year = 2024 ; -- Query CSV SELECT category , SUM ( revenue ) as total FROM read_csv ( 'sales.csv' ) GROUP BY category ; -- Query JSON SELECT * FROM read_json ( 'events.jsonl' ) WHERE event_type = 'purchase' ; -- Query remote files SELECT * FROM read_parquet ( 's3://my-bucket/data/*.parquet' ); Python Integration import duckdb import pandas as pd # Query a DataFrame df = pd . read_csv ( ' sales.csv ' ) result = duckdb . sql ( " SELECT category, SUM(amount) FROM df GROUP BY category " ). df () # Query Parquet files result = duckdb . sql ( """ SELECT date_trunc( ' month ' , created_at) as month, COUNT(*) as orders, SUM(total) as revenue FROM read_parquet( ' ord
Continue reading on Dev.to Python
Opens in a new tab

