Data warehousing on Databricks
Data warehousing on Databricks combines cloud data warehousing capabilities with lakehouse architecture. Databricks SQL provides the tools and services to build highly-performant, cost-effective data warehouses that run directly on your data lake.
How to use Databricks SQL
Databricks SQL runs on SQL warehouses and supports ANSI SQL with Delta Lake extensions. You can access Databricks SQL from multiple interfaces for querying, visualization, and automation.
Interface | Description |
|---|---|
Write and execute SQL queries with integrated Databricks Assistant, code comments, and version history for collaborative query development. | |
Attach a notebook to a SQL warehouse to run SQL alongside Python, Scala, or R. See Notebooks and SQL warehouses for limitations. | |
Schedule SQL queries as jobs for automated data processing and reporting workflows. | |
Create interactive AI/BI dashboards with AI-assisted authoring to share insights across your organization. | |
Define business metrics with consistent calculations using a semantic layer. Reuse metrics across queries and dashboards. | |
Schedule automated query runs, evaluate custom conditions, and deliver notifications with alert history tracking. | |
Review query performance, identify bottlenecks, and find optimization opportunities. | |
Automate tasks on Databricks SQL objects programmatically using the REST API. |
To learn more about using Databricks SQL, see:
Get started
New to Databricks SQL? Build your understanding with foundational concepts, then apply what you've learned with hands-on tutorials.
Resource | Description |
|---|---|
Understand lakehouse architecture, medallion layers, and data modeling approaches for building data warehouses. | |
Learn core Databricks SQL concepts including queries, SQL warehouses, dashboards, and data management. | |
Follow a complete walkthrough covering sample dashboards, notebooks, jobs, data ingestion, and SQL warehouse setup. | |
Build and publish your first dashboard with datasets, visualizations, and filters using AI-assisted authoring. | |
Define consistent, reusable business metrics with a semantic layer for use across queries and dashboards. |