Skip to main content

Read Parquet files using Databricks

This article shows you how to read data from Apache Parquet files using Databricks.

What is Parquet?

Apache Parquet is a columnar file format with optimizations that speed up queries. It’s a more efficient file format than CSV or JSON.

For more information, see Parquet Files.

Options

See the following Apache Spark reference articles for supported read and write options.

Notebook example: Read and write to Parquet files

The following notebook shows how to read and write data to Parquet files.

Reading Parquet files notebook

Open notebook in new tab
Was this article helpful?