Read Parquet files using Databricks

This article shows you how to read data from Apache Parquet files using Databricks.

What is Parquet?

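Apache Parquet is a columnar file format with optimizations that speed up queries. Because values are stored column by column, a query can read only the columns it needs, and the files usually compress better than row-oriented text formats such as CSV or JSON.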
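As a rough illustration of how a columnar format can speed up queries, the sketch below reads a Parquet dataset and keeps only the columns a query needs. The helper name read_selected_columns, the path, and the column names are hypothetical. The spark argument is assumed to be the SparkSession that Databricks notebooks predefine.

```python
def read_selected_columns(spark, path, columns):
    """Read a Parquet dataset and return only the named columns.

    `spark` is assumed to be an existing SparkSession, such as the one
    a Databricks notebook provides. `path` and `columns` are supplied
    by the caller.
    """
    # Parquet stores each column separately, so selecting a subset
    # generally lets Spark avoid scanning the columns that are left out.
    return spark.read.parquet(path).select(*columns)


# Hypothetical usage in a notebook (path and column names are placeholders):
# df = read_selected_columns(spark, "/tmp/example/events", ["user_id", "event_time"])
# df.show(5)
```

Taking the session as a parameter keeps the helper independent of any one notebook, which is a design choice for this sketch and not something the article prescribes.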

For more information, see Parquet Files in the Apache Spark documentation.

Options

For supported read and write options, see the Apache Spark reference articles for DataFrameReader.parquet and DataFrameWriter.parquet.
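A minimal sketch of passing read options, assuming spark is the notebook's predefined SparkSession. The helper name read_parquet_with_options and the path are hypothetical. mergeSchema is used here as one example of a Parquet read option documented by Apache Spark; check the reference articles for the options available in your runtime.

```python
def read_parquet_with_options(spark, path, **options):
    """Read a Parquet dataset, forwarding keyword arguments as read options.

    `spark` is assumed to be an existing SparkSession. Each keyword
    argument is passed to the DataFrameReader as an option.
    """
    # options(**kwargs) sets several reader options in one call
    # before the Parquet files at `path` are loaded.
    return spark.read.options(**options).parquet(path)


# Hypothetical usage in a notebook (the path is a placeholder):
# mergeSchema asks Spark to reconcile the schemas of the individual files.
# df = read_parquet_with_options(spark, "/tmp/example/events", mergeSchema="true")
```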

Notebook example: Read and write to Parquet files

The following notebook shows how to read and write data to Parquet files.

Reading Parquet files notebook

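For readers without access to the notebook, the round trip it describes can be sketched roughly as follows. This is not the notebook's code. The helper name write_then_read and the path are hypothetical, and spark is assumed to be the SparkSession that Databricks notebooks predefine.

```python
def write_then_read(spark, df, path):
    """Write a DataFrame to Parquet at `path`, then read it back.

    `spark` is assumed to be an existing SparkSession and `df` an
    existing DataFrame. Any data already at `path` is replaced.
    """
    # mode("overwrite") replaces existing data at the target path
    # instead of raising an error when the path already exists.
    df.write.mode("overwrite").parquet(path)

    # Reading the same path returns a new DataFrame backed by the files
    # that were just written.
    return spark.read.parquet(path)


# Hypothetical usage in a notebook (the data and path are placeholders):
# source = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "label"])
# result = write_then_read(spark, source, "/tmp/example/labels")
# result.show()
```

Overwrite mode is chosen here only so the sketch can be rerun. Use a different save mode if existing data at the path must be preserved.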