Skip to main content

Databricks well-architected framework

This set of articles provides principles and best practices for implementing and operating Databricks.

Databricks well-architected framework

Well-architected framework: Databricks platform diagram.

The Databricks well-architected framework consists of seven pillars that describe different areas of concern for the implementation of Databricks in the cloud.

Pillars shared with cloud frameworks

The following five pillars align with the pillars in the AWS Well-Architected Framework:

For the five shared pillars, the principles and best practices of the cloud framework still apply. The Databricks well-architected framework extends these with Databricks-specific principles and best practices.

Databricks-specific pillars

The following two pillars address concerns unique to the Databricks well-architected framework:

The two Databricks-specific pillars warrant additional context:

Data and AI governance

Databricks unifies data warehousing and AI use cases on a single platform. This eliminates the data silos that traditionally separate data engineering, analytics, BI, data science, and machine learning.

A unified governance solution simplifies data management by consolidating governance controls into a single processing layer. This approach minimizes data copies and improves your ability to maintain compliance and detect potential breaches.

Interoperability and usability

An integrated Databricks platform provides a consistent user experience for all personas and workloads. This consistency reduces training and onboarding costs while improving collaboration between functions. In contrast, assembling separate data tools can lead to high implementation costs, inconsistent user experiences, and limited collaboration capabilities.

Organizations adopt multi-cloud strategies for various reasons, including mergers, acquisitions, or independent business unit decisions. The Databricks platform delivers a unified user experience across all cloud providers in a multi-cloud setup. This reduces system proliferation and lowers the skill requirements for employees working with data.

Modern business processes require systems to work together across organizational boundaries. Databricks supports secure data flow between internal and external partners' systems, enabling effective collaboration in networked business environments.