
How to Read Parquet Files in DuckDB

DuckDB is one of the easiest ways to query Parquet files without complex setup. If you want fast, SQL-based access to your data, this guide shows you how to do it step by step.

What is DuckDB?

DuckDB is an in-process SQL database designed for fast analytical queries. It works directly on files like Parquet without needing a server or complex configuration.

This makes it ideal for developers, analysts, and anyone who wants to query large datasets quickly.

Why use DuckDB with Parquet?

No setup

Query Parquet files instantly without running a database server.

Fast queries

DuckDB is optimized for analytics and columnar data like Parquet.

SQL support

Use familiar SQL instead of writing complex scripts.

Step 1: Install DuckDB

You can install the DuckDB Python package with pip, or download the standalone CLI.

pip install duckdb

The pip package gives you the Python library. The CLI is a separate download that runs in your terminal, and DuckDB can also be embedded in your own applications.

Step 2: Read a Parquet file

DuckDB can query Parquet files directly using SQL.

SELECT * FROM 'file.parquet';

That’s it. No import step required.

Step 3: Query specific columns

One of the biggest advantages of Parquet is columnar access.

SELECT column1, column2
FROM 'file.parquet';

This improves performance because Parquet stores data column by column, so DuckDB reads only the columns the query names and skips the rest of the file.

Step 4: Filter your data

You can apply filters just like a normal SQL query.

SELECT *
FROM 'file.parquet'
WHERE column1 > 100;

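DuckDB pushes the filter down into the Parquet scan. Using the min/max statistics that Parquet files typically store for each row group, it can skip row groups that cannot contain matching rows.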

Step 5: Load into a table (optional)

If you need to run multiple queries, you can load the data into a table.

CREATE TABLE my_table AS
SELECT * FROM 'file.parquet';

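This can improve performance for repeated queries, because the data is stored in the native DuckDB format and the Parquet file does not have to be read and decoded again each time. Note that the table lives in memory unless you are connected to a database file.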

Need a simpler way to view Parquet files?

DuckDB is powerful, but sometimes you just want to quickly open or convert a file.

An online Parquet viewer needs no setup and no coding: just upload the file and view your data.

Frequently asked questions

Can DuckDB read Parquet files directly?

Yes. DuckDB can query Parquet files directly without importing them first.

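Is DuckDB faster than pandas for Parquet?

In many cases, yes. DuckDB is optimized for analytical queries, runs them on multiple threads, and reads only the columns and row groups a query needs, so it can outperform pandas, especially on large files or selective queries.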

Do I need Python to use DuckDB?

No. DuckDB can be used via CLI, embedded in applications, or with Python.

What is the easiest way to view Parquet files?

If you don’t want to set up DuckDB, you can use an online Parquet viewer instead.
