How to Read Parquet Files in DuckDB
DuckDB is quickly becoming one of the easiest ways to query Parquet files without complex setup. If you want fast, SQL-based access to your data, this guide shows you how to do it step by step.
What is DuckDB?
DuckDB is an in-process SQL database designed for fast analytical queries. It works directly on files like Parquet without needing a server or complex configuration.
This makes it ideal for developers, analysts, and anyone who wants to query large datasets quickly.
Why use DuckDB with Parquet?
No setup
Query Parquet files instantly without running a database server.
Fast queries
DuckDB is optimized for analytics and columnar data like Parquet.
SQL support
Use familiar SQL instead of writing complex scripts.
Step 1: Install DuckDB
You can install DuckDB as a Python package with pip, or download the standalone CLI.
pip install duckdb
You can also use DuckDB directly in your terminal or integrate it into your applications.
Step 2: Read a Parquet file
DuckDB can query Parquet files directly using SQL.
SELECT * FROM 'file.parquet';
That’s it. No import step required.
Step 3: Query specific columns
One of the biggest advantages of Parquet is columnar access.
SELECT column1, column2
FROM 'file.parquet';
This improves performance because Parquet stores each column separately, so DuckDB reads only the columns the query names and skips the rest of the file.
Step 4: Filter your data
You can apply filters just like a normal SQL query.
SELECT *
FROM 'file.parquet'
WHERE column1 > 100;
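DuckDB pushes the filter down into the Parquet scan, so it can use the statistics stored in the file to skip row groups that cannot contain matching rows.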
Step 5: Load into a table (optional)
If you need to run multiple queries, you can load the data into a table.
CREATE TABLE my_table AS
SELECT * FROM 'file.parquet';
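This can improve performance for repeated queries, because the data is stored in DuckDB's own format and the Parquet file is not re-read each time. Keep in mind that the table is a copy: later changes to the file are not reflected in it.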
Need a simpler way to view Parquet files?
DuckDB is powerful, but sometimes you just want to quickly open or convert a file.
Parquet Viewer
Open and inspect Parquet files instantly in your browser.
Parquet to CSV Converter
Convert Parquet files into a readable CSV in seconds.
No setup, no coding, just upload and view your data.
Frequently asked questions
Can DuckDB read Parquet files directly?
Yes. DuckDB can query Parquet files directly without importing them first.
Is DuckDB faster than Pandas for Parquet?
In many cases, yes. DuckDB is optimized for analytical queries, runs them in parallel, and reads only the columns a query needs, so it can outperform pandas, particularly on large files.
Do I need Python to use DuckDB?
No. DuckDB can be used via CLI, embedded in applications, or with Python.
What is the easiest way to view Parquet files?
If you don’t want to set up DuckDB, you can use an online Parquet viewer instead.