Serverless compute release notes

This article explains the features and behaviors that are currently available and upcoming on serverless compute for notebooks and jobs.

For more information on serverless compute, see Connect to serverless compute.

Databricks periodically releases updates to serverless compute, automatically upgrading the serverless compute runtime to support enhancements and upgrades to the platform. All users get the same updates, rolled out over a short period of time.

Serverless environment versions

Serverless compute for notebooks and jobs uses environment versions, which provide a stable client API based on Spark Connect to ensure application compatibility. This allows Databricks to upgrade the server independently, delivering performance improvements, security enhancements, and bug fixes without requiring any code changes to workloads.

Each environment version includes a specific Python version and a set of Python packages with defined versions. Databricks introduces new features and fixes in the latest environment version while applying security updates to all supported environment versions.

For serverless environment version release notes, see Serverless environment versions.

Release notes

This section includes release notes for serverless compute. Release notes are organized by year and week of year. Serverless compute always runs using the most recently released version listed here.

July 6, 2026
Version 18.2
Version 18.1
Version 18.0
Serverless environment version 5 is now available
Version 17.3
Version 17.2
Version 17.1
Serverless environment version 4
Version 17.0
Serverless performance targets is GA
Version 16.4
Performance mode is now configurable on serverless jobs
Version 16.3
Version 16.2
High memory setting available on serverless notebooks (Public Preview)
Version 16.1
Version 15.4
The JDK is upgraded from JDK 8 to JDK 17
Version 15.1
Version 14.3

July 6, 2026

July 6, 2026

This serverless compute release includes updates from Databricks Runtime 18.

New features

Spark Declarative Pipelines on Lakeflow streaming query ID: Spark Declarative Pipelines on Lakeflow can now set the streaming query ID on demand.
IP address functions (Public Preview): New SQL functions are available for working with IPv4 and IPv6 addresses and CIDR blocks, including ip_host, ip_cidr, ip_version, ip_prefix_length, ip_network, ip_network_last, ip_cidr_contains, ip_as_binary, ip_as_string, and try_* variants for null-safe behavior. See ip_host and related functions.
On-demand state repartitioning (Public Preview): Structured Streaming now supports changing the number of shuffle partitions for stateful queries without losing checkpoint state. See On-demand state repartitioning for stateful streaming queries.
Auto CDC from snapshot with SQL syntax: Spark Declarative Pipelines on Lakeflow now supports Auto CDC from snapshot using SQL syntax. Previously, this feature was only available through the Python API. Use CREATE STREAMING TABLE ... FLOW AUTO CDC FROM SNAPSHOT to process snapshot sources (such as Delta tables, cloud storage, or JDBC) one snapshot at a time. Both SCD Type 1 (default) and SCD Type 2 are supported.

Behavior changes

CREATE OR REPLACE TABLE preserves comments: CREATE OR REPLACE TABLE now preserves existing column and table comments by default. Previously, comments were dropped when recreating a table. Managed tables and views now match the existing behavior of materialized views and streaming tables.
DataFrame by-name writes cast compatible columns: writeTo().append(), writeTo().overwrite(), writeTo().overwritePartitions(), and write.mode("append").saveAsTable() now automatically cast type-compatible columns (for example, int to long) to match the target Delta table schema. Previously, these operations failed with a DELTA_FAILED_TO_MERGE_FIELDS error when column types were compatible but not identical. Behavior now matches SQL INSERT INTO ... BY NAME.
ALTER TABLE SET TBLPROPERTIES for pipelines.pipelineId: ALTER TABLE <table> SET TBLPROPERTIES('pipelines.pipelineId' = '<pipeline-id>') now attempts to make the specified table eligible for writes by the pipeline. Previously, setting this property on a regular table had no effect. If the table isn't eligible for pipeline writes, the command throws SETTING_PIPELINES_PIPELINE_ID_NOT_SUPPORTED.
DESCRIBE EXTENDED AS JSON includes predictive optimization results: DESCRIBE EXTENDED ... AS JSON now includes predictive optimization evaluation results in its output. Previously, this information wasn't returned in the JSON output.
Metric view window measures return correct results: Metric view window measures now return correct results when queries use GROUP BY, IN/BETWEEN filters, or mixed predicates on the window's order column. Previously, these filter patterns could produce incorrect results.
Structured Streaming deduplication with NaN keys: Structured Streaming deduplication now treats NaN (Not-a-Number) values that have different bit patterns as duplicates when a double or float column is used as a deduplication key. Previously, NaN values with different internal representations were treated as distinct and were not deduplicated.
NATURAL JOIN case-insensitive column matching: NATURAL JOIN now matches common columns case-insensitively, consistent with the equivalent USING join. Previously, column matching was case-sensitive, causing columns that differ only in case (for example, ID vs id) to not be recognized as common columns, resulting in a silent cross join instead of the expected equi-join.

Version 18.2

May 13, 2026

This serverless compute release roughly corresponds to Databricks Runtime 18.2.

New features

CREATE OR REPLACE support for temporary tables: CREATE OR REPLACE TEMP TABLE syntax is now supported, allowing you to create or replace temporary tables in a single statement. This eliminates the need to explicitly drop and recreate temporary tables.
agg() alias for measure() function: agg() is now available as an alias for the measure() function. This change is fully backward compatible. Existing queries that use measure() continue to work without modification, and agg() produces identical results when used with the same arguments.
Snowflake JDBC driver upgrade: The Snowflake JDBC driver is upgraded from 3.22.0 to 3.28.0.
pyspark.pipelines.testing namespace alias: pyspark.pipelines.testing is now available as a convenience alias for dlt.testing APIs. Import Spark Declarative Pipelines on Lakeflow pipeline testing utilities through either namespace.
Delta table history includes write option flags: Delta table history (DESCRIBE HISTORY) now includes write option flags in the operationParameters column for WRITE and REPLACE TABLE operations. When the following options are explicitly enabled, they appear as boolean flags in the history (only included when true):

For WRITE and REPLACE TABLE operations:
- isDynamicPartitionOverwrite: present when dynamic partition overwrite mode was used
- canOverwriteSchema: present when schema overwrite (overwriteSchema) was enabled
- canMergeSchema: present when schema merge (mergeSchema) was enabled
For REPLACE TABLE operations:
- predicate: present when replaceWhere was used
- isV1WriterSaveAsTableOverwrite: present when the replace was triggered by a .saveAsTable overwrite
Selectively replace data with replaceOn and replaceUsing DataFrame APIs: The replaceOn and replaceUsing options in the Scala and Python DataFrame APIs are now generally available. Use these options to replace part of the table with the result of a DataFrame. replaceOn replaces rows that match a user-defined condition. replaceUsing replaces rows where specified columns are equal. These APIs complement the INSERT REPLACE ON and INSERT REPLACE USING SQL statements. See Selectively overwrite data with Delta Lake.

Behavior changes

NULL struct preservation in INSERT, MERGE, and streaming writes with schema evolution: For INSERT, MERGE, and streaming writes that use schema evolution, a NULL struct in the source is now stored as NULL in the target. Previously, that value was incorrectly materialized as a non-null struct with every field set to NULL, while the same operations without schema evolution preserved NULL structs correctly. If your code relied on receiving a non-null struct whose fields were all NULL, update your code to handle a NULL struct instead.
Fix for LEFT OUTER JOIN LATERAL dropping rows: A bug that incorrectly dropped rows from LEFT OUTER JOIN LATERAL queries is now fixed. Queries using this construct now return the correct results. To temporarily revert to the previous behavior, set spark.databricks.sql.optimizer.lateralJoinPreserveOuterSemantic to true.
NATURAL JOIN respects case-insensitive column matching: NATURAL JOIN now correctly uses case-insensitive column matching when spark.sql.caseSensitive is set to false (the default). Previously, NATURAL JOIN used case-sensitive comparison to identify common columns, causing columns that differed only in case (for example, ID versus id) to not be recognized as matching. This caused NATURAL JOIN to silently produce cross-join results. This fix aligns NATURAL JOIN behavior with USING joins, which already handled case-insensitivity correctly. Queries affected by this bug now return correct results with properly joined columns.
SQL UDF dependency validation in Unity Catalog: Unity Catalog now enforces dependency validation for SQL user-defined functions (UDFs) to prevent access control bypass. Previously, SQL functions created through the REST API could reference dependencies the user did not have access to. SQL UDFs with invalid dependency configurations are now blocked from execution.
AWS SDK v1 dependencies are shaded: AWS SDK v1 dependencies bundled with the Databricks runtime are now shaded and no longer directly available on the classpath. If your code depends on AWS SDK v1 libraries previously provided by the Databricks runtime, add them as explicit dependencies in your project. This change prepares for the migration to AWS SDK v2, following the end of AWS support for SDK v1.
Fix incorrect EPSG authority for ESRI-defined SRID 102100: The Coordinate Reference System (CRS) mapping for SRID 102100 now correctly uses ESRI:102100 instead of the incorrect EPSG:102100. This fix ensures geospatial data is stored with the correct authority for better interoperability with other systems.

Version 18.1

April 20, 2026

This serverless compute release roughly corresponds to Databricks Runtime 18.1.

New features

DATETIMEOFFSET data type support for Microsoft Azure Synapse: The DATETIMEOFFSET data type is supported for Microsoft Azure Synapse connections.
Google BigQuery table comments: Google BigQuery table descriptions are resolved and exposed as table comments.
JDBC connection: Use a JDBC connection to read and write to a data source with the Spark Data Source API or the Databricks Remote Query SQL API.
Schema evolution with INSERT statements: Use the WITH SCHEMA EVOLUTION clause with SQL INSERT statements to automatically evolve the target table's schema during insert operations. The clause is supported for INSERT INTO, INSERT OVERWRITE, and INSERT INTO ... REPLACE forms. See schema evolution.
Preserved NULL struct values in INSERT operations: INSERT operations with schema evolution or implicit casting preserve NULL struct values when the source and target tables have differing struct field orders.
Delta Sharing multi-statement transaction support: Delta Sharing tables that use pre-signed URL or cloud token sharing modes support multi-statement transactions. On first access within a transaction, the table version is pinned and reused for all subsequent reads in that transaction. Time travel, change data feed, and streaming aren't supported.
parse_timestamp SQL function: The parse_timestamp SQL function parses timestamp strings using multiple patterns. The function runs on the Photon engine for improved performance.
max_by and min_by with optional limit: The aggregate functions max_by and min_by now accept an optional third argument limit (up to 100,000), returning an array of top- or bottom-K values without window functions or CTEs.
Vector aggregate and scalar functions: New SQL functions operate on ARRAY<FLOAT> vectors for embedding and similarity workloads, including vector_avg, vector_sum, vector_cosine_similarity, vector_inner_product, vector_l2_distance, vector_norm, and vector_normalize. See Built-in functions.
SQL cursor support in compound statements: SQL scripting compound statements now support cursor processing. Use DECLARE CURSOR to define a cursor, then OPEN statement, FETCH statement, and CLOSE statement to run the query and consume rows one at a time.
Approximate top-k sketch functions: New functions enable building and combining approximate top-K sketches for distributed top-K aggregation: approx_top_k_accumulate, approx_top_k_combine, and approx_top_k_estimate. See approx_top_k aggregate function.
Tuple sketch functions: New aggregate and scalar functions for tuple sketch support distinct counting and aggregation over key-summary pairs. See Built-in functions.
New geospatial functions: The following geospatial functions are now available:
- st_estimatesrid function: Estimates the best projected spatial reference identifier (SRID) for an input geometry.
- st_force2d function: Converts a geography or geometry to its 2D representation.
- st_nrings function: Counts the total number of rings in a polygon or multipolygon, including both exterior and interior rings.
- st_numpoints function: Counts the number of non-empty points in a geography or geometry.
Photon support for geospatial functions: st_difference function, st_intersection function, and st_union function now run on the Photon engine for faster performance.

Behavior changes

Observation metric errors no longer fail queries: Errors during observation metric collection no longer cause query execution failures. Previously, errors in OBSERVE clauses (such as division by zero) could block or fail the entire query. Now, the query completes successfully and the error is raised when you call observation.get.
DESCRIBE FLOW reserved keyword: The DESCRIBE FLOW command is now available. If you have a table named flow, use DESCRIBE schema.flow, DESCRIBE TABLE flow, or DESCRIBE `flow` with backticks.
SpatialSQL boolean set operations: ST_Difference, ST_Intersection, and ST_Union use a new implementation with approximately 2x faster performance. Valid input geometries always produce a result. Results are normalized for consistent output and can differ after the 15th decimal place for line-segment intersections due to different formulas and order of operations.
Exception types for SQLSTATE: Exception types are updated to support SQLSTATE. If your code parses exceptions by string matching or catches specific exception types, update your error handling logic.

Version 18.0

February 27, 2026

This serverless compute release roughly corresponds to Databricks Runtime 18.0.

New features

SQL scripting is now generally available: SQL scripting is now generally available.
Redshift JDBC driver upgraded to 2.1.0.28: The Redshift JDBC driver has been upgraded to version 2.1.0.28.
Shared isolation execution environment for Unity Catalog Python UDFs: Unity Catalog Python UDFs with the same owner can now share an isolation environment by default, improving performance and reducing memory usage. To ensure a UDF always runs in a fully isolated environment, add the STRICT ISOLATION characteristic clause. See Environment isolation.
SQL window functions in metric views: You can now use SQL window functions in metric views to calculate running totals, rankings, and other window-based calculations.
Dynamic shuffle partition adjustment in stateless streaming queries: You can now change the number of shuffle partitions in stateless streaming queries without restarting the query.
Adaptive Query Execution and auto-optimized shuffle in stateless streaming queries: Adaptive Query Execution (AQE) and auto-optimized shuffle (AOS) are now supported in stateless streaming queries.
Literal string coalescing everywhere: The ability to coalesce sequential string literals such as 'Hello' ' World' into 'Hello World' has been expanded to any place string literals are allowed. See STRING type.
Parameter markers everywhere: You can now use named (:param) and unnamed (?) parameter markers virtually anywhere a literal value of the appropriate type can be used, including DDL statements and column types. See Parameter markers.
IDENTIFIER clause everywhere: The IDENTIFIER clause, which casts strings to SQL object names, has been expanded to nearly everywhere an identifier is permitted, including column aliases and column definitions. See IDENTIFIER clause.
New BITMAP_AND_AGG function: A new bitmap_and_agg aggregate function function is now available.
New Theta sketch functions: A new library of functions for approximate distinct count and set operations using Datasketches Theta Sketch is now available, including theta_sketch_agg, theta_union_agg, theta_intersection_agg, and related functions.
New KLL Sketch function library: A new library of functions for building KLL Sketches for approximate quantile computation is now available, including kll_sketch_agg_bigint, kll_sketch_agg_double, and related functions.
Apache Parquet library upgraded to 1.16.0: The Apache Parquet library has been upgraded to version 1.16.0.
New geospatial functions: The following new geospatial functions are now supported:
- st_azimuth function: Returns the north-based azimuth from the first point to the second in radians.
- st_boundary function: Returns the boundary of the input geometry.
- st_closestpoint function: Returns the 2D projection of a point on the first geometry closest to the second geometry.
- st_geogfromewkt function: Parses an Extended Well-Known Text (EWKT) description of a geography.
- st_geomfromewkt function: Parses an Extended Well-Known Text (EWKT) description of a geometry.
Improved spatial join performance: Spatial join performance is now improved by introducing shuffled spatial join support.
Improved performance for geospatial functions: Photon implementations are now available for st_isvalid function, st_makeline function, and st_makepolygon function.
EWKT input support: The try_to_geography, try_to_geometry, to_geography, and to_geometry functions now accept Extended Well-Known Text (EWKT) as input.

Behavior changes

FSCK REPAIR TABLE includes metadata repair by default: The FSCK REPAIR TABLE command now includes an initial metadata repair step before checking for missing data files. The command can work on tables with corrupt checkpoints or invalid partition values.
Proration factors aligned between reads and auto-optimized writes: Proration factors for partition sizing now use fractional values consistently across read operations and auto-optimized writes. This change might result in a different number of tasks for read operations.
Python UDF execution unified across PySpark and Unity Catalog: Unity Catalog Python UDFs now use Apache Arrow as the default interchange format, improving overall performance. As part of this change, TIMESTAMP values passed to Python UDFs no longer include timezone information in the datetime object's tzinfo attribute. If your UDF relies on timezone information, restore it with date = date.replace(tzinfo=timezone.utc). See Timestamp timezone behavior for inputs.
Improved error messages for Kafka connector login module issues: When using the Kafka connector with an unshaded login module class, Databricks now provides error messages that suggest using the correct shaded class prefix (kafkashaded.org.apache.kafka or kafkashaded.software.amazon.msk.auth.iam).
Time travel restrictions and VACUUM retention behavior: Databricks now blocks time travel queries beyond the deletedFileRetentionDuration threshold for all tables. The VACUUM command ignores the retention duration argument except when the value is 0 hours. You can't set deletedFileRetentionDuration larger than logRetentionDuration or vice versa.
BinaryType maps to bytes by default in PySpark: In PySpark, BinaryType now consistently maps to Python bytes. Previously, PySpark mapped BinaryType to either bytes or bytearray depending on the context. To restore the old behavior, set spark.sql.execution.pyspark.binaryAsBytes to false.
Partition columns materialized in Parquet files: Partitioned Delta tables now materialize partition columns in newly written Parquet data files. Previously, partition values were stored in the Delta transaction log metadata and reflected in directory paths, but not written as columns in the Parquet files themselves. This change might affect workloads that directly read Parquet files written by Delta.
DESCRIBE TABLE output includes metadata column: The output of DESCRIBE TABLE [EXTENDED] now includes a new metadata column for all table types, containing semantic metadata (display name, format, and synonyms) defined on the table as a JSON string.

Serverless environment version 5 is now available

February 25, 2026

You can now use serverless environment version 5 in your serverless notebooks and jobs. This includes both the CPU and GPU environment versions. See Serverless environment version 5.

Version 17.3

October 28, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.3 LTS.

New features

LIMIT ALL support for recursive CTEs: You can now use the LIMIT ALL clause with recursive common table expressions (rCTEs) to explicitly specify that no row limit should be applied to the query results. See Common table expression (CTE).
Appending to files in Unity Catalog volumes returns correct error: Attempting to append to existing files in Unity Catalog volumes now returns a more descriptive error message to help you understand and resolve the issue.
st_dump function support: You can now use the st_dump function to decompose a geometry object into its constituent parts, returning a set of simpler geometries. See st_dump function.
Polygon interior ring functions are now supported: You can now use the following functions to work with polygon interior rings:
- st_numinteriorrings: Get the number of inner boundaries (rings) of a polygon. See st_numinteriorrings function.
- st_interiorringn: Extract the n-th inner boundary of a polygon and return it as a linestring. See st_interiorringn function.
EXECUTE IMMEDIATE using constant expressions: The EXECUTE IMMEDIATE statement now supports using constant expressions in the query string, allowing for more flexible dynamic SQL execution. See EXECUTE IMMEDIATE.
Allow spark.sql.files.maxPartitionBytes in serverless compute: You can now configure the spark.sql.files.maxPartitionBytes Spark configuration parameter on serverless compute to control the maximum number of bytes to pack into a single partition when reading files. See Configure Spark properties for serverless notebooks and jobs.

Behavior changes

Support MV/ST refresh information in DESCRIBE EXTENDED AS JSON: The DESCRIBE EXTENDED AS JSON command now includes refresh information for materialized views and streaming tables, providing visibility into the last refresh time and status.
Add metadata column to DESCRIBE QUERY and DESCRIBE TABLE: The DESCRIBE QUERY and DESCRIBE TABLE commands now include a metadata column in their output, providing additional information about each column's properties and characteristics.
Correct handling of null structs when dropping NullType columns: Databricks now correctly handles null struct values when dropping columns with NullType, preventing potential data corruption or unexpected behavior.
Improved handling of null structs in Parquet: This release includes improvements to how null struct values are handled when reading from and writing to Parquet files, ensuring more consistent and correct behavior.
Upgrade aws-msk-iam-auth library for Kafka: The aws-msk-iam-auth library used for Amazon MSK IAM authentication has been upgraded to the latest version, providing improved security and compatibility.

Version 17.2

September 25, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.2.

New features

ST_ExteriorRing function is now supported: You can now use the ST_ExteriorRing function to extract the outer boundary of a polygon and return it as a linestring. See st_exteriorring function.

Support TEMPORARY keyword for metric view creation: You can now use the TEMPORARY keyword when creating a metric view. Temporary metric views are visible only in the session that created them and are dropped when the session ends. See CREATE VIEW.
Use native I/O for LokiFileSystem.getFileStatus on S3: LokiFileSystem.getFileStatus now uses the native I/O stack for Amazon S3 traffic and returns org.apache.hadoop.fs.FileStatus objects instead of shaded.databricks.org.apache.hadoop.fs.s3a.S3AFileStatus.
Auto Loader infers partition columns in singleVariantColumn mode: Auto Loader now infers partition columns from file paths when ingesting data as a semi-structured variant type using the singleVariantColumn option. Previously, partition columns were not automatically detected. See Auto Loader.

Behavior changes

DESCRIBE CONNECTION shows environment settings for JDBC connections: Databricks now includes user-defined environment settings in the DESCRIBE CONNECTION output for JDBC connections that support custom drivers and run in isolation. Other connection types remain unchanged.
Option to truncate uniform history during managed tables migration: You can now truncate uniform history when migrating tables with Uniform/Iceberg enabled using ALTER TABLE...SET MANAGED. This simplifies migrations and reduces downtime compared to disabling and re-enabling Uniform manually.
Correct results for split with empty regex and positive limit: Databricks now returns correct results when using split function with an empty regex and a positive limit. Previously, the function incorrectly truncated the remaining string instead of including it in the last element.
Fix url_decode and try_url_decode error handling in Photon: In Photon, try_url_decode() and url_decode() with failOnError = false now return NULL for invalid URL-encoded strings instead of failing the query.
Shared execution environment for Unity Catalog Python UDTFs: Databricks now shares the execution environment for Python user-defined table functions (UDTFs) from the same owner and Spark session. An optional STRICT ISOLATION clause is available to disable sharing for UDTFs with side effects, such as modifying environment variables or executing arbitrary code.

Version 17.1

August 19, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.1.

New features

Reduced memory usage for wide schemas in Photon writer: Enhancements were made to the Photon engine that significantly reduce memory usage for wide schemas, addressing scenarios that previously resulted in out-of-memory errors.

Behavior changes

Error thrown for invalid CHECK constraints: Databricks now throws an AnalysisException if a CHECK constraint expression cannot be resolved during constraint validation.
Pulsar connector no longer exposes Bouncy Castle: The Bouncy Castle library is now shaded in the Pulsar connector to prevent classpath conflicts. As a result, Spark jobs can no longer access org.bouncycastle.* classes from the connector. If your code depends on Bouncy Castle, install the library manually on serverless environment.
Auto Loader uses file events by default if available: Auto Loader uses file events instead of directory listing when the load path is an external location with file events enabled. The default for useManagedFileEvents is now if_available (was false). This can improve ingestion performance and logs a warning if file events are not yet enabled.
Teradata connector fixes case-sensitive string comparison: The Teradata connector now defaults to TMODE=ANSI, aligning string comparison behavior with Databricks by making it case-sensitive. This change is configurable and does not affect existing users unless they opt in.

Serverless environment version 4

August 13, 2025

Environment version 4 is now available in your serverless notebooks and jobs. This environment version includes library upgrades and API updates. See Serverless environment version 4.

Version 17.0

July 24, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.0.

New features

SQL procedure support: SQL scripts can now be encapsulated in a procedure stored as a reusable asset in Unity Catalog. You can create a procedure using the CREATE PROCEDURE command, and then call it using the CALL command.
Set a default collation for SQL Functions: Using the new DEFAULT COLLATION clause in the CREATE FUNCTION command defines the default collation used for STRING parameters, the return type, and STRING literals in the function body.
Recursive common table expressions (rCTE) support: Databricks now supports navigation of hierarchical data using recursive common table expressions (rCTEs). Use a self-referencing CTE with UNION ALL to follow the recursive relationship.
PySpark and Spark Connect now support the DataFrames df.mergeInto API: PySpark and Spark Connect now support the df.mergeInto API.
Support ALL CATALOGS in SHOW SCHEMAS: The SHOW SCHEMAS syntax is updated to accept ALL CATALOGS, allowing you to iterate through all active catalogs that support namespaces. The output attributes now include a catalog column indicating the catalog of the corresponding namespace.
Liquid clustering now compacts deletion vectors more efficiently: Delta tables with liquid clustering now apply physical changes from deletion vectors more efficiently when OPTIMIZE is running. For more details, see Apply changes to Parquet data files.
Allow non-deterministic expressions in UPDATE/INSERT column values for MERGE operations: Databricks now allows the use of non-deterministic expressions in updated and inserted column values of MERGE operations. For example, you can now generate dynamic or random values for columns using expressions like rand().
Change Delta MERGE Python APIs to return DataFrame instead of Unit: The Python MERGE APIs (such as DeltaMergeBuilder) now also return a DataFrame like the SQL API does, with the same results.

Behavior changes

Behavioral change for the Auto Loader incremental directory listing option: The value of the deprecated Auto Loader cloudFiles.useIncrementalListing option is now set to a default value of false. As a result, this change causes Auto Loader to perform a full directory listing each time it's run. Databricks recommends against using this option. Instead, use file notification mode with file events.
CREATE VIEW column-level clauses now throw errors when the clause would only apply to materialized views: CREATE VIEW commands that specify a column-level clause that is only valid for MATERIALIZED VIEWs now throw an error. The affected clauses include NOT NULL, specified datatypes, DEFAULT, and COLUMN MASK.

Serverless performance targets is GA

June 10, 2025

Selecting the serverless performance setting for jobs and pipelines is now generally available.

When the Performance optimized setting is enabled, your workload is optimized for faster startup and execution time. When disabled, the serverless workload runs on standard performance mode, which is optimized for cost and has a slightly higher launch latency.

For more information, see Select a performance mode and Select a performance mode.

Version 16.4

May 28, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.4 LTS.

Behavior changes

Fix to respect options for data source cached plans: This update ensures table reads respect options set for all data source plans when cached, not just the first cached table read. Previously, data source table reads cached the first plan but failed to account for different options in subsequent queries.
Enable flag to require source materialization for MERGE operations: Previously, users could turn off source materialization in MERGE by setting merge.materializeSource to none. With the new flag enabled, source materialization will always be required, and attempts to disable it will result in an error. Databricks plans to enable this flag only for customers who have not previously changed this configuration, so most users should not experience any change in behavior.

New features

Auto Loader can now clean processed files in the source directory: You can now instruct Auto Loader to automatically move or delete files that have been processed. Opt in to this feature by using the cloudFiles.cleanSource Auto Loader option. See Common, under cloudFiles.cleanSource.
Type widening support added for streaming from Delta tables: This release adds support for streaming from a Delta table that has type-widened column data, and for sharing a Delta table with type widening enabled using Databricks-to-Databricks Delta Sharing. The type widening feature is currently in Public Preview. See Type widening.
IDENTIFIER support now available in DBSQL for catalog operations: You can now use the IDENTIFIER clause when performing the following catalog operations:
- CREATE CATALOG
- DROP CATALOG
- COMMENT ON CATALOG
- ALTER CATALOG
This new syntax allows you to dynamically specify catalog names using parameters defined for these operations, enabling more flexible and reusable SQL workflows. As an example of the syntax, consider CREATE CATALOG IDENTIFIER(:param) where param is a parameter provided to specify a catalog name. See IDENTIFIER clause.
Collated expressions now provide autogenerated transient aliases: Autogenerated aliases for collated expressions now deterministically incorporate COLLATE information. Autogenerated aliases are transient (unstable) and should not be relied on. Instead, as a best practice, use expression AS alias consistently and explicitly.
Add filter pushdown API support to Python data sources: Serverless compute now supports filter pushdown to Python data source batch read as an a API, similar to SupportsPushDownFilters interface. See 16.4 LTS release notes.
Python UDF traceback improvement: The Python UDF traceback now includes frames from both the driver and executor along with client frames, resulting in better error messages that show greater and more relevant details (such as the line content of frames inside a UDF).
UNION/EXCEPT/INTERSECT inside a view and EXECUTE IMMEDIATE now return correct results: Queries for temporary and persistent view definitions with top-level UNION/EXCEPT/INTERSECT and un-aliased columns previously returned incorrect results because UNION/EXCEPT/INTERSECT keywords were considered aliases. Now those queries will correctly perform the whole set operation.
Data source cached plan conf and migration guide: Reading from a file source table will correctly respect query options (for example delimiters). Previously, the first query plan was cached and subsequent option changes ignored. To restore the previous behavior, set spark.sql.legacy.readFileSourceTableCacheIgnoreOptions to true.
New listagg and string_agg functions: Starting with this release you can use the listagg or string_agg functions to aggregate STRING and BINARY values within a group. See string_agg.

Performance mode is now configurable on serverless jobs

April 14, 2025

You can now select the performance mode of a serverless job using the Performance optimized setting in the job details page. Previously, all serverless jobs were performance optimized. Now, you can disable the Performance optimized setting to run the workload on standard performance mode. Standard peformance mode is designed to reduce costs on workloads where a slightly higher launch latency is acceptable.

Standard performance mode is not supported for continuous pipelines, one-time runs created using the runs/submit endpoint, or SQL warehouse job tasks, including materialized views.

For more on performance mode, see Select a performance mode.

Version 16.3

April 9, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.3.

Behavior changes

*Improved error message when kafka.sasl.client.callback.handler.class is assigned an invalid value: This release includes a change to return a more descriptive error message when kafka.sasl.client.callback.handler.class is assigned an invalid value.

New features

State reader support is GA: Support for reading state information for Structured Streaming queries is now generally available on serverless compute. See Read Structured Streaming state information.
Delta table protocol downgrade is GA with checkpoint protection: DROP FEATURE is generally available to remove Delta Lake table features and downgrade the table protocol. By default, DROP FEATURE now creates protected checkpoints for a more optimized and simplified downgrade experience that does not require any waiting time or history truncation. See Drop a Delta Lake table feature and downgrade table protocol.
Write procedural SQL scripts based on ANSI SQL/PSM (Public Preview): You can now use scripting capabilities based on ANSI SQL/PSM to write procedural logic with SQL, including control flow statements, local variables, and exception handling. See SQL scripting.
Table and view level default collation: You can now specify a default collation for tables and views. This simplifies the creation of table and views where all or most columns share the same collation. See Collation.
New H3 functions: Three new H3 functions have been added: h3_try_coverash3, h3_try_coverash3string, and h3_try_tessellateaswkb.
Alter multiple table columns in one ALTER TABLE statement: You can now alter multiple columns in a single ALTER TABLE statement. See ALTER TABLE ... COLUMN clause.

Version 16.2

March 13, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.2.

Behavior changes

In Delta Sharing, table history is enabled by default: Shares created using the SQL command ALTER SHARE <share> ADD TABLE <table> now have history sharing (WITH HISTORY) enabled by default. See ALTER SHARE.
Credential SQL statements return an error when there's a credential type mismatch: Now, if the credential type specified in a credential management SQL statement doesn't match the type of the credential argument, an error is returned and the statement is not run.

New features

Authenticate to Kinesis with service credentials: You can now use Databricks service credentials to authenticate to Kinesis for streaming reads. See Connect to Amazon Kinesis.

Use the timestampdiff & timestampadd in generated column expressions You can now use the timestampdiff and timestampadd functions in Delta Lake generated column expressions. See Delta Lake generated columns.
Update to DESCRIBE TABLE returns metadata as structured JSON: You can now use the DESCRIBE TABLE AS JSON command to return table metadata as a JSON document. The JSON output is more structured than the default human-readable report and can be used to interpret a table's schema programmatically. To learn more, see DESCRIBE TABLE AS JSON.
Trailing blank insensitive collations: Serverless now supports trailing blank insensitive collations. For example, these collations treat 'Hello' and 'Hello ' as equal. To learn more, see RTRIM collation.

Bug fixes

Improved incremental clone processing: This release includes a fix for an edge case where an incremental CLONE might re-copy files already copied from a source table to a target table. See Clone a table on Databricks.

High memory setting available on serverless notebooks (Public Preview)

February 7, 2025

You can now configure a higher memory size for your serverless compute notebook workloads. This setting can be applied to both interactive and scheduled notebook workloads.

Serverless usage with high memory has a higher DBU emission rate than standard memory.

For more information, see Use high memory serverless compute.

Version 16.1

February 5, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.0 and Databricks Runtime 16.1.

New features

Improved error message for incorrectly configured dependencies: If your Structured Streaming application that connects to Amazon Managed Streaming for Kafka (MSK) with IAM is configured with an incorrect dependency, you'll now get a more descriptive error message to help you fix the error. See Connect to Amazon MSK with IAM.

Avro support for recursive schema: You can now use the recursiveFieldMaxDepth option with the from_avro function and the avro data source. This option sets the maximum depth for schema recursion on the Avro data source. See Read and write streaming Avro data.
Expanded support for Confluent Schema Registry for Avro: Serverless now supports Avro schema reference with the Confluent Schema Registry. See Authenticate to an external Confluent Schema Registry.
Force reclustering on tables with liquid clustering: You can now use the OPTIMIZE FULL syntax to force the reclustering of all records in a table with liquid clustering enabled. See Force reclustering.
The Delta APIs for Python now support identity columns: You can now use the Delta APIs for Python to create tables with identity columns. See Identity columns.
Create liquid clustered tables during streaming writes: You can now use clusterBy to enable liquid clustering when creating new tables with Structured Streaming writes. See Enable liquid clustering.
Support for the OPTIMIZE FULL clause: Serverless compute now supports the OPTIMIZE FULL clause. This clause optimizes all records in a table that uses liquid clustering, including data that might have previously been clustered.
Support for WITH options specification in INSERT and table-reference: Serverless compute now supports an options specification for table references and table names of an INSERT statement which can be used to control the behavior of data sources.
New SQL functions: The following SQL functions are now available on serverless compute:
- try_url_decode is an error-tolerant version of url_decode.
- zeroifnull returns 0 if the input expression to the zeroifnull() function is NULL.
- nullifzero returns NULL if the input is 0 or its input if it is not 0.
- dayname(expr) returns the three-letter English acronym for the day of the week for the given date.
- uniform(expr1, expr2 [,seed]) returns a random value with independent and identically distributed values within the specified range of numbers.
- randstr(length) returns a random string of length alpha-numeric characters.
Enable automatic schema evolution when merging data into a Delta table: Support has been added for the withSchemaEvolution() member of the DeltaMergeBuilder class. Use withSchemaEvolution() to enable automatic schema evolution during MERGE operations. For example, mergeBuilder.whenMatched(...).withSchemaEvolution().execute()}}.
Support for collations in Apache Spark is in Public Preview: You can now assign language-aware, case-insensitive, and access-insensitive collations to STRING columns and expressions. These collations are used in string comparisons, sorting, grouping operations, and many string functions. See Collation.
Support for collations in Delta Lake is in Public Preview: You can now define collations for columns when creating or altering a Delta table. See Collation support for Delta Lake.
LITE mode for vacuum is in Public Preview: You can now use VACUUM table_name LITE to perform a lighter-weight vacuum operation that leverages metadata in the Delta transaction log. See Full versus lite mode and VACUUM.
Support for parameterizing the USE CATALOG with IDENTIFIER clause: The IDENTIFIER clause is now supported for the USE CATALOG statement. With this support, you can parameterize the current catalog based on a string variable or parameter marker.
COMMENT ON COLUMN support for tables and views: The COMMENT ON statement now supports altering comments for view and table columns.
Named parameter invocation for more functions: The following functions support named parameter invocation:
The SYNC METADATA parameter to the REPAIR TABLE command is supported with the Hive metastore: You can now use the SYNC METADATA parameter with the REPAIR TABLE command to update the metadata of a Hive metastore managed table. See REPAIR TABLE.
Enhanced data integrity for compressed Apache Arrow batches: To further protect against data corruption, every LZ4 compressed Arrow batch now includes the LZ4 content and block checksums. See LZ4 Frame Format Description.

Read from existing consumers with Kinesis EFO: You can now use Kinesis enhanced-fan out (EFO) mode to configure Spark Structured Streaming to read from existing consumers. See Configure Kinesis enhanced fan-out (EFO) for streaming query reads.
Use ARNs to identify Kinesis streams: You can now use the streamARN option to identify streaming sources when configuring a Kinesis source. See Configure Kinesis options.

Built-in Oracle JDBC Driver: Serverless compute now has the Oracle JDBC Driver built in. If you use a customer-uploaded JDBC driver JAR via DriverManager, you must rewrite scripts to explicitly use the custom JAR. Otherwise, the built-in driver is used. This driver only supports Lakehouse Federation. For other use cases, you must provide your own driver.
More detailed errors for Delta tables accessed with paths: A new error message experience for Delta tables accessed using paths is now available. All exceptions are now forwarded to the user. The exception DELTA_MISSING_DELTA_TABLE is now reserved for when underlying files cannot be read as a Delta table.

Behavior changes

Breaking change: Hosted RStudio is end-of-life: With this release, Databricks-hosted RStudio Server is end-of-life and unavailable on any Databricks workspace running on serverless compute. To learn more and see a list of alternatives to RStudio, see Connect to a Databricks-hosted RStudio Server.

Breaking change: Removal of support for changing byte, short, int and long types to wider types: To ensure consistent behavior across Delta and Apache Iceberg tables, the following data type changes can no longer be applied to tables with the type widening feature enabled:
- byte, short, int and long to decimal.
- byte, short, and int to double.
Correct parsing of regex patterns with negation in nested character grouping: This release includes a change to support the correct parsing of regex patterns with negation in nested character grouping. For example, [^[abc]] will be parsed as “any character that is NOT one of 'abc'”.

Additionally, Photon behavior was inconsistent with Spark for nested character classes. Regex patterns containing nested character classes will no longer use Photon, and instead will use Spark. A nested character class is any pattern containing square brackets within square brackets, such as [[a-c][1-3]].
Improve duplicate match detection in Delta Lake MERGE: MERGE now considers conditions specified in the WHEN MATCHED clause. See Upsert into a Delta Lake table using merge.
The addArtifact() functionality is now consistent across compute types: When you use addArtifact(archive = True) to add a dependency to serverless compute, the archive is automatically unpacked.

Bug fixes

Timezone offsets now include seconds when serialized to CSV, JSON, and XML: Timestamps with timezone offsets that include seconds (common for timestamps from before 1900) were omitting the seconds when serialized to CSV, JSON, and XML. The default timestamp formatter has been fixed and now returns the correct offset values for these timestamps.

Other changes

Renamed error codes for the cloudFiles Structured Streaming source: The following error codes have been renamed:
- _LEGACY_ERROR_TEMP_DBR_0143 is renamed to CF_INCORRECT_STREAM_USAGE.
- _LEGACY_ERROR_TEMP_DBR_0260 is renamed to CF_INCORRECT_BATCH_USAGE .

Version 15.4

October 28, 2024

This serverless compute release roughly corresponds to Databricks Runtime 15.4

New features

UTF-8 validation functions: This release introduces the following functions for validating UTF-8 strings:
- is_valid_utf8 verified whether a string is a valid UTF-8 string.
- make_valid_utf8 converts a potentially invalid UTF-8 string to a valid UTF-8 string using substitution characters.
- validate_utf8 raises an error if the input is not a valid UTF-8 string.
- try_validate_utf8 returns NULL if the input is not a valid UTF-8 string.
Enable UniForm Iceberg using ALTER TABLE: You can now enable UniForm Iceberg on existing tables without rewriting data files. See On an existing table.
try_url_decode function: This release introduces the try_url_decode function, which decodes a URL-encoded string. If the string is not in the correct format, the function returns NULL instead of raising an error.
Optionally allow the optimizer to rely on unenforced foreign key constraints: To improve query performance, you can now specify the RELY keyword on FOREIGN KEY constraints when you CREATE or ALTER a table.
Parallelized job runs for selective overwrites: Selective overwrites using replaceWhere now run jobs that delete data and insert new data in parallel, improving query performance and cluster utilization.
Improved performance for change data feed with selective overwrites: Selective overwrites using replaceWhere on tables with change data feed no longer write separate change data files for inserted data. These operations use a hidden _change_type column present in the underlying Parquet data files to record changes without write amplification.
Improved query latency for the COPY INTO command: This release includes a change that improves the query latency for the COPY INTO command. This improvement is implemented by making the loading of state by the RocksDB state store asynchronous. With this change, you should see an improvement in start times for queries with large states, such as queries with a large number of already ingested files.
Support for dropping the check constraints table feature: You can now drop the checkConstraints table feature from a Delta table using ALTER TABLE table_name DROP FEATURE checkConstraints. See Remove check constraints.

Behavior changes

Schema binding change for views: When the data types in a view's underlying query change from those used when the view was first created, Databricks no longer throws errors for references to the view when no safe cast can be performed.

Instead, the view compensates by using regular casting rules where possible. This change allows Databricks to tolerate table schema changes more readily.
Disallow undocumented ! syntax toleration for NOT outside boolean logic: Databricks will no longer tolerate the use of ! as a synonym for NOT outside of boolean logic. This change reduces confusion, aligns with the SQL standard, and makes SQL more portable. For example:

CREATE ... IF ! EXISTS, IS ! NULL, ! NULL column or field property, ! IN and ! BETWEEN must be replaced with:

CREATE ... IF NOT EXISTS, IS NOT NULL, NOT NULL column or field property, NOT IN and NOT BETWEEN.

The boolean prefix operator ! (e.g. !is_mgr or !(true AND false)) is unaffected by this change.
Disallow undocumented and unprocessed portions of column definition syntax in views: Databricks supports CREATE VIEW with named columns and column comments.

The specification of column types, NOT NULL constraints, or DEFAULT has been tolerated in the syntax without having any effect. Databricks will remove this syntax toleration. Doing so reduces confusion, aligns with the SQL standard, and allows for future enhancements.
Consistent error handling for Base64 decoding in Spark and Photon: This release changes how Photon handles Base64 decoding errors to match the Spark handling of these errors. Before these changes, the Photon and Spark code generation path sometimes failed to raise parsing exceptions, while the Spark interpreted execution correctly raised IllegalArgumentException or ConversionInvalidInputError. This update ensures that Photon consistently raises the same exceptions as Spark during Base64 decoding errors, providing more predictable and reliable error handling.
Adding a CHECK constraint on an invalid column now returns the UNRESOLVED_COLUMN.WITH_SUGGESTION error class: To provide more useful error messaging, in Databricks Runtime 15.3 and above, an ALTER TABLE ADD CONSTRAINT statement that includes a CHECK constraint referencing an invalid column name returns the UNRESOLVED_COLUMN.WITH_SUGGESTION error class. Previously, an INTERNAL_ERROR was returned.

The JDK is upgraded from JDK 8 to JDK 17

August 15, 2024

Serverless compute for notebooks and workflows has migrated from Java Development Kit (JDK) 8 to JDK 17 on the server side. This upgrade includes the following behavioral changes:

Correct parsing of regex patterns with negation in nested character grouping: With this upgrade, Databricks now supports the correct parsing of regex patterns with negation in nested character grouping. For example, [^[abc]] will be parsed as “any character that is NOT one of 'abc'”.

Additionally, Photon behavior was inconsistent with Spark for nested character classes. Regex patterns containing nested character classes will no longer use Photon, and instead will use Spark. A nested character class is any pattern containing square brackets within square brackets, such as [[a-c][1-3]].

Version 15.1

July 23, 2024

This serverless compute release roughly corresponds to Databricks Runtime 15.1

New features

Support for star (*) syntax in the WHERE clause: You can now use the star (*) syntax in the WHERE clause to reference all columns from the SELECT list.

For example, SELECT * FROM VALUES(1, 2) AS T(a1, a2) WHERE 1 IN(T.*).

Changes

Improved error recovery for JSON parsing: The JSON parser used for from_json() and JSON path expressions now recovers faster from malformed syntax, resulting in less data loss.

When encountering malformed JSON syntax in a struct field, an array value, a map key, or a map value, the JSON parser will now return NULL only for the unreadable field, key, or element. Subsequent fields, keys, or elements will be properly parsed. Prior to this change, the JSON parser abandoned parsing the array, struct, or map and returned NULL for the remaining content.

Version 14.3

April 15, 2024

This is the initial serverless compute version. This version roughly corresponds to Databricks Runtime 14.3 with some modifications that remove support for some non-serverless and legacy features.

Supported Spark configuration parameters

To automate the configuration of Spark on serverless compute, Databricks has removed support for manually setting most Spark configurations. To view a list of supported Spark configuration parameters, see Configure Spark properties for serverless notebooks and jobs.

Job runs on serverless compute will fail if you set an unsupported Spark configuration.

input_file functions are deprecated

The input_file_name(), input_file_block_length(), and input_file_block_start() functions have been deprecated. Using these functions is highly discouraged.

Instead, use the file metadata column to retrieve file metadata information.

Behavioral changes

Serverless compute version 2024.15 includes the following behavioral changes:

unhex(hexStr) bug fix: When using the unhex(hexStr) function, hexStr is always padded left to a whole byte. Previously the unhex function ignored the first half-byte. For example: unhex('ABC') now produces x'0ABC' instead of x'BC'.
Auto-generated column aliases are now stable: When the result of an expression is referenced without a user-specified column alias, this auto-generated alias will now be stable. The new algorithm may result in a change to the previously auto-generated names used in features like materialized views.
Table scans with CHAR type fields are now always padded: Delta tables, certain JDBC tables, and external data sources store CHAR data in non-padded form. When reading, Databricks will now pad the data with spaces to the declared length to ensure correct semantics.
Casts from BIGINT/DECIMAL to TIMESTAMP throw an exception for overflowed values: Databricks allows casting from BIGINT and DECIMAL to TIMESTAMP by treating the value as the number of seconds from the Unix epoch. Previously, Databricks would return overflowed values but now throws an exception in cases of overflow. Use try_cast to return NULL instead of an exception.
PySpark UDF execution has been improved to match the exact behavior of UDF execution on dedicated compute: The following changes have been made:
- UDFs with a string return type no longer implicitly convert non-string values into strings. Previously, UDFs with a return type of str would apply a str(..) wrapper to the result regardless of the actual data type of the returned value.
- UDFs with timestamp return types no longer implicitly apply a timezone conversion to timestamps.

Serverless environment versions​

Release notes​

July 6, 2026​

New features​

Behavior changes​

Version 18.2​

New features​

Behavior changes​

Version 18.1​

New features​

Behavior changes​

Version 18.0​

New features​

Behavior changes​

Serverless environment version 5 is now available​

Version 17.3​

New features​

Behavior changes​

Version 17.2​

New features​

Behavior changes​

Version 17.1​

New features​

Behavior changes​

Serverless environment version 4​

Version 17.0​

New features​

Behavior changes​

Serverless performance targets is GA​

Version 16.4​

Behavior changes​

New features​

Performance mode is now configurable on serverless jobs​

Version 16.3​

Behavior changes​

New features​

Version 16.2​

Behavior changes​

New features​

Bug fixes​

High memory setting available on serverless notebooks (Public Preview)​

Version 16.1​

New features​

Behavior changes​

Bug fixes​

Other changes​

Version 15.4​

New features​

Behavior changes​

The JDK is upgraded from JDK 8 to JDK 17​

Version 15.1​

New features​

Changes​

Version 14.3​

Supported Spark configuration parameters​

input_file functions are deprecated​

Behavioral changes​

Serverless environment versions

Release notes

July 6, 2026

New features

Behavior changes

Version 18.2

New features

Behavior changes

Version 18.1

New features

Behavior changes

Version 18.0

New features

Behavior changes

Serverless environment version 5 is now available

Version 17.3

New features

Behavior changes

Version 17.2

New features

Behavior changes

Version 17.1

New features

Behavior changes

Serverless environment version 4

Version 17.0

New features

Behavior changes

Serverless performance targets is GA

Version 16.4

Behavior changes

New features

Performance mode is now configurable on serverless jobs

Version 16.3

Behavior changes

New features

Version 16.2

Behavior changes

New features

Bug fixes

High memory setting available on serverless notebooks (Public Preview)

Version 16.1

New features

Behavior changes

Bug fixes

Other changes

Version 15.4

New features

Behavior changes

The JDK is upgraded from JDK 8 to JDK 17

Version 15.1

New features

Changes

Version 14.3

Supported Spark configuration parameters

input_file functions are deprecated

Behavioral changes