What is DBFS?

The term DBFS is used to describe two parts of the platform:

  • DBFS root

  • DBFS mounts

Storing and accessing data using DBFS root or DBFS mounts is a deprecated pattern and not recommended by Databricks. For recommendations for working with files, see Work with files on Databricks.

What is the Databricks File System?

The term DBFS comes from Databricks File System, which describes the distributed file system used by Databricks to interact with cloud-based storage.

The underlying technology associated with DBFS is still part of the Databricks platform. For example, dbfs:/ is an optional scheme when interacting with Unity Catalog volumes.

Past and current warnings and caveats about DBFS only apply to the DBFS root or DBFS mounts.

How does DBFS work with Unity Catalog?

Databricks recommends using Unity Catalog to manage access to all data.

Unity Catalog adds the concepts of external locations, storage credentials, and volumes to help organizations provide the least privileged access to data in cloud object storage.

Some security configurations provide direct access to Unity Catalog-managed resources and DBFS, primarily for organizations that have completed migrations or partially migrated to Unity Catalog. See Best practices for DBFS and Unity Catalog.

What is the DBFS root?

The DBFS root is a storage location provisioned during workspace creation in the cloud account containing the Databricks workspace. For details on Databricks Filesystem root configuration and deployment, see Create an S3 bucket for workspace deployment.

Databricks does not recommend storing production data, libraries, or scripts in DBFS root. See Recommendations for working with DBFS root.

Mount object storage

Note

DBFS mounts are deprecated. Databricks recommends using Unity Catalog volumes. See What are Unity Catalog volumes?.

Mounting object storage to DBFS allows you to access objects in object storage as if they were on the local file system. Mounts store Hadoop configurations necessary for accessing storage. For more information, see Mounting cloud object storage on Databricks.