Parquet file

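Apache Parquet is a columnar file format with optimizations that speed up queries. Because data is stored by column, a query can read only the columns it needs, which makes Parquet far more efficient than row-oriented text formats such as CSV or JSON.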

For further information, see Parquet Files.
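As a minimal sketch of the idea, the following PySpark snippet writes a small DataFrame to Parquet and reads back a subset of its columns. The sample rows, the column names, the helper name `parquet_round_trip`, and the path `/tmp/people.parquet` are all assumptions made for illustration.

```python
# Hypothetical sample data for the sketch; not taken from any real dataset.
SAMPLE_ROWS = [(1, "alice", 34), (2, "bob", 45)]
COLUMNS = ["id", "name", "age"]


def parquet_round_trip(path="/tmp/people.parquet"):
    """Write a small DataFrame to Parquet, then read back two of its columns."""
    # Imported here so the module still loads where PySpark is not installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    df = spark.createDataFrame(SAMPLE_ROWS, COLUMNS)

    # Parquet stores the schema with the data, so none is supplied on read.
    df.write.mode("overwrite").parquet(path)

    # Selecting a subset of columns lets Spark skip the unused column data.
    return spark.read.parquet(path).select("id", "name")
```

Calling `parquet_round_trip().show()` in an environment with Spark available should display the `id` and `name` columns only.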

Options

See the Apache Spark reference articles for the supported read and write options.
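For illustration, options are passed to the reader and writer before the `parquet` call. The sketch below uses two options from the Spark Parquet data source, `mergeSchema` on read and `compression` on write. The helper name `rewrite_with_options`, the option values chosen, and the paths are assumptions, not recommendations.

```python
# Example option sets; the values shown are illustrative assumptions.
READ_OPTIONS = {"mergeSchema": "true"}      # reconcile schemas across files
WRITE_OPTIONS = {"compression": "snappy"}   # codec used for the output files


def rewrite_with_options(spark, source_path, target_path):
    """Read Parquet with read options, then write it back with write options."""
    # `spark` is an existing SparkSession supplied by the caller.
    df = spark.read.options(**READ_OPTIONS).parquet(source_path)
    df.write.options(**WRITE_OPTIONS).mode("overwrite").parquet(target_path)
    return df
```

A single option can also be set with `.option("mergeSchema", "true")` instead of `.options(...)`.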

The following notebook shows how to read and write data to Parquet files.

Reading Parquet files notebook
