ORC file

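Apache ORC is a columnar file format that speeds up queries through optimizations such as column pruning, predicate pushdown, and built-in compression. Because it stores typed, compressed data by column, it is far more efficient to query than row-oriented text formats such as CSV or JSON.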

For more information, see ORC Files.
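
As a minimal sketch of how ORC data might be written and read with the Spark DataFrame API, consider the following. The helper names `write_orc`, `read_orc`, and `demo`, the sample rows, and the path `/tmp/people_orc` are illustrative assumptions and do not come from this article. The `compression` write option is set explicitly only to show where options go.

```python
def write_orc(df, path, compression="snappy"):
    """Write a Spark DataFrame to `path` as ORC, replacing existing data.

    `compression` is passed through as the ORC `compression` write option.
    """
    (
        df.write.format("orc")
        .option("compression", compression)
        .mode("overwrite")
        .save(path)
    )


def read_orc(spark, path):
    """Load the ORC data at `path` into a Spark DataFrame."""
    return spark.read.format("orc").load(path)


def demo(path="/tmp/people_orc"):
    """Round-trip a tiny DataFrame through ORC (hypothetical sample data)."""
    # Imported here so the helpers above do not require pyspark at import time.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("orc-example").getOrCreate()
    df = spark.createDataFrame([(1, "alice"), (2, "bob")], ["id", "name"])
    write_orc(df, path)
    read_orc(spark, path).show()
```

Calling `demo()` in an environment where PySpark is installed writes the two sample rows and reads them back. In a notebook that already provides a `spark` session, the two helpers can be called directly with that session instead.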

Options

See the following Apache Spark reference articles for supported read and write options.