Databricks Runtime maintenance updates

This article lists maintenance updates for supported Databricks Runtime versions. To add a maintenance update to an existing cluster, restart the cluster. For the maintenance updates on unsupported Databricks Runtime versions, see Maintenance updates for Databricks Runtime (archived).

注記

Releases are staged. Your Databricks account might not update for a few days after the initial release date.

Databricks Runtime releases

Maintenance updates by release:

Databricks Runtime 18.0
Databricks Runtime 17.3 LTS
Databricks Runtime 17.2
Databricks Runtime 17.1
Databricks Runtime 16.4 LTS
Databricks Runtime 16.2
Databricks Runtime 15.4 LTS
Databricks Runtime 14.3 LTS
Databricks Runtime 13.3 LTS
Databricks Runtime 12.2 LTS
Databricks Runtime 9.1 LTS

Databricks Runtime 18.0

See Databricks Runtime 18.0.

January 27, 2026
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.13 from 1.3.6 to 1.3.9
- See DOC linked
- Added batchSizeNumFiles, batchSizeNumBytes, and file processing states(`numFilesProcessed`, numFilesSkippedCorrupted, numFilesSkippedMissing, numFilesUnknownState as reported metrics to Auto Loader.
- See above
- [SPARK-54564] [SQL] Make QueryPlanningTracker as HybridAnalyzer field
- [SPARK-54803] Support BY NAME with INSERT ... REPLACE WHERE
- [SPARK-54679][SQL] Rename spark.sql.(xml.legacyXMLParser.enabled -> legacy.useLegacyXMLParser)
- [SDP][SPARK-54562]](https://issues.apache.org/jira/browse/SPARK-54562) Block eager analysis / execution inside flow function from the server side
- [SPARK-54886] Add base session created in SparkConnectService
- [SPARK-54815][CONNECT] Do not close the class loader of the session state if session is still in use
- [SPARK-41916] [ML] Torch distributor: support multiple torchrun processes per task if task.gpu.amount > 1
- [SPARK-54620][SQL] Add safety check in ObservationManager to avoid Observation blocking
- [SPARK-55015][SS][SQL] Fix decodeRemainingKey numFields calculation in PrefixKeyScanStateEncoder
- [SPARK-54708] Optimize ML cache cleanup with lazy directory creation
- [SPARK-54768][SS]Python Stream Data Source should classify error if data returned doesn't match configured schema
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- [SPARK-54581][SQL] Making fetchsize option case-insensitive for Postgres connector
- Operating system security updates.

Databricks Runtime 17.3 LTS

See Databricks Runtime 17.3 LTS.

January 27, 2026
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.13 from 1.3.8 to 1.3.9
- See above
- [SPARK-54768][SS]Python Stream Data Source should classify error if data returned doesn't match configured schema
- [SPARK-54803] Support BY NAME with INSERT ... REPLACE WHERE
- [SPARK-53564][CORE] Avoid DAGScheduler exits due to blockManager RPC timeout in DAGSchedulerEventProcessLoop
- [SPARK-55015][SS][SQL] Fix decodeRemainingKey numFields calculation in PrefixKeyScanStateEncoder
- Operating system security updates.

January 9, 2026
- Updated Python libraries:
  - pmdarima from 2.0.4 to 2.1.1
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.13 from 1.3.6 to 1.3.8
- You can now use SQL window functions as a scalar function in metric view dimensions and measure expressions.
- [SDP][17.3 backport][SPARK-54562] Block eager analysis / execution inside flow function from the server side
- [SPARK-54679][SQL] Rename spark.sql.(xml.legacyXMLParser.enabled -> legacy.useLegacyXMLParser)
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- [SPARK-53127][SQL] Fix LIMIT ALL for unlimted recursion with CTE normalization
- [SPARK-54708] Optimize ML cache cleanup with lazy directory creation
- [SPARK-54581][SQL] Making fetchsize option case-insensitive for Postgres connector
- [SPARK-41916] [ML] Torch distributor: support multiple torchrun processes per task if task.gpu.amount > 1
- [SPARK-54564] [SQL] Make QueryPlanningTracker as HybridAnalyzer field
- [SPARK-54620][SQL] Add safety check in ObservationManager to avoid Observation blocking
- Operating system security updates.

December 9, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.13 from 1.3.5 to 1.3.6
- [SPARK-50906][SQL] Fix Avro nullability check for reordered struct fields
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- [SPARK-54427][SQL] Allow ColumnarRow to call copy with variant types
- Operating system security updates.

November 18, 2025
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- [SPARK-52762][SDP] Add PipelineAnalysisContext message to support pipeline analysis during Spark Connect query execution
- [SPARK-54156][PROTOBUF] Classify errors for ProtobufOptions casting failure
- [SPARK-54078][SS] New test for StateStoreSuite SPARK-40492: maintenance before unload and remove infra from old test
- [SPARK-54015][PYTHON] Relax Py4J requirement to py4j>=0.10.9.7,<0.10.9.10
- [SPARK-54099][SQL] XML variant parser should fall back to string on decimal parsing errors
- [17.3 Backport][spark-54191]](https://issues.apache.org/jira/browse/SPARK-54191)[SDP] Add once to Defineflow Proto
- Operating system security updates.

November 4, 2025
- [SPARK-53729][PYTHON][CONNECT] Fix serialization of pyspark.sql.connect.window.WindowSpec
- [SPARK-46679][SQL] Fix for SparkUnsupportedOperationException Not found an encoder of the type T, when using Parameterized class
- [SPARK-53973][Avro] Classify errors for AvroOptions boolean casting failure
- [SPARK-53794][SS] Add option to limit deletions per maintenance operation associated with rocksdb state provider
- [SPARK-53908][CONNECT] Fix observations on Spark Connect with plan cache
- [SPARK-53972][SS] Fix streaming query recentProgress regression in classic pyspark
- Operating system security updates.

Databricks Runtime 17.2

See Databricks Runtime 17.2.

January 27, 2026
- [SPARK-55015][SS][SQL] Fix decodeRemainingKey numFields calculation in PrefixKeyScanStateEncoder
- [SPARK-54768][SS]Python Stream Data Source should classify error if data returned doesn't match configured schema
- Operating system security updates.

January 9, 2026
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- Operating system security updates.

December 9, 2025
- Partitioned Delta tables will have partition columns materialized in data parquet files going forward. This enables better synergy with how Iceberg and UniForm tables are handled, and increases compatibility with external non-Delta readers.
- [SPARK-54427][SQL] Allow ColumnarRow to call copy with variant types
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- [SPARK-50906][SQL] Fix Avro nullability check for reordered struct fields
- Operating system security updates.

November 18, 2025
- [SPARK-54078][SS] New test for StateStoreSuite SPARK-40492: maintenance before unload and remove infra from old test
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- [SPARK-54099][SQL] XML variant parser should fall back to string on decimal parsing errors
- [SPARK-54015][PYTHON] Relax Py4J requirement to py4j>=0.10.9.7,<0.10.9.10
- [SPARK-52515]Approx_top_k using Apache DataSketches
- Operating system security updates.

November 4, 2025
- [SPARK-53973][Avro] Classify errors for AvroOptions boolean casting failure
- [SPARK-53972][SS] Fix streaming query recentProgress regression in classic pyspark
- [SPARK-53908][CONNECT] Fix observations on Spark Connect with plan cache
- Operating system security updates.

October 21, 2025
- Operating system security updates.

October 8, 2025
- [SPARK-53555] Fix: SparkML-connect can't load SparkML (legacy mode) saved model
- [SPARK-53598][SQL] Check the existence of numParts before reading large table property
- [SPARK-53625][SS] Propagate metadata columns through projections to address ApplyCharTypePadding incompatibility
- [SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
- [SPARK-53574] Fix AnalysisContext being wiped during nested plan resolution
- [SPARK-53623][SQL] improve reading large table prope…
- [SPARK-53729][PYTHON][CONNECT] Fix serialization of pyspark.sql.connect.window.WindowSpec
- [SPARK-53549][SS] Always close the arrow allocator when list state request process is completed
- Operating system security updates.

September 10, 2025
- Fixed an issue that could cause Auto Loader to hang indefinitely.
- [SPARK-53362] [ML] [CONNECT] Fix IDFModel local loader bug
- [SPARK-53382][SQL] Fix rCTE bug with malformed recursion
- Backport flaky test fix for [SPARK-53345]
- [SPARK-49872][CORE] Remove jackson JSON string length limitation
- [SPARK-53423] [SQL] Move all the single-pass resolver related tags to ResolverTag
- [SPARK-53431][PYTHON] Fix Python UDTF with named table arguments in DataFrame API
- [SPARK-53336] [ML] [CONNECT] Reset MLCache.totalMLCacheSizeBytes when MLCache.clear() is called
- [SPARK-53394][CORE] UninterruptibleLock.isInterruptible should avoid duplicated interrupt
- [SPARK-53470][SQL] ExtractValue expressions should always do type checking
- Cherry pick of [SPARK-53389] Improvements for Pandas API on Spark under ANSI
- Operating system security updates.

Databricks Runtime 17.1

See Databricks Runtime 17.1 (EoS).

January 27, 2026
- [SPARK-55015][SS][SQL] Fix decodeRemainingKey numFields calculation in PrefixKeyScanStateEncoder
- [SPARK-54768][SS]Python Stream Data Source should classify error if data returned doesn't match configured schema
- Operating system security updates.

January 9, 2026
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- Operating system security updates.

December 9, 2025
- Partitioned Delta tables will have partition columns materialized in data parquet files going forward. This enables better synergy with how Iceberg and UniForm tables are handled, and increases compatibility with external non-Delta readers.
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- [SPARK-50906][SQL] Fix Avro nullability check for reordered struct fields
- [SPARK-54427][SQL] Allow ColumnarRow to call copy with variant types
- Operating system security updates.

November 18, 2025
- [SPARK-54015][PYTHON] Relax Py4J requirement to py4j>=0.10.9.7,<0.10.9.10
- [SPARK-52515]Approx_top_k using Apache DataSketches
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- [SPARK-54078][SS] New test for StateStoreSuite SPARK-40492: maintenance before unload and remove infra from old test
- [SPARK-54099][SQL] XML variant parser should fall back to string on decimal parsing errors
- Operating system security updates.

November 4, 2025
- [SPARK-53972][SS] Fix streaming query recentProgress regression in classic pyspark
- [SPARK-53908][CONNECT] Fix observations on Spark Connect with plan cache
- [SPARK-53973][Avro] Classify errors for AvroOptions boolean casting failure
- Operating system security updates.

October 21, 2025
- Operating system security updates.

October 7, 2025
- [SPARK-53574] Fix AnalysisContext being wiped during nested plan resolution
- [SPARK-53549][SS] Always close the arrow allocator when list state request process is completed
- [SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
- [SPARK-53625][SS] Propagate metadata columns through projections to address ApplyCharTypePadding incompatibility
- [SPARK-53598][SQL] Check the existence of numParts before reading large table property
- [SPARK-53623][SQL] improve reading large table prope…
- [SPARK-53555] Fix: SparkML-connect can't load SparkML (legacy mode) saved model
- [SPARK-53729][PYTHON][CONNECT] Fix serialization of pyspark.sql.connect.window.WindowSpec
- Operating system security updates.

September 16, 2025
- Operating system security updates.

September 9, 2025
- Fixed an issue that could cause Auto Loader to hang indefinitely.
- [SPARK-53362] [ML] [CONNECT] Fix IDFModel local loader bug
- [SPARK-53394][CORE] UninterruptibleLock.isInterruptible should avoid duplicated interrupt
- [SPARK-53382][SQL] Fix rCTE bug with malformed recursion
- [SPARK-53431][PYTHON] Fix Python UDTF with named table arguments in DataFrame API
- [SPARK-53336] [ML] [CONNECT] Reset MLCache.totalMLCacheSizeBytes when MLCache.clear() is called
- [SPARK-49872][CORE] Remove jackson JSON string length limitation
- Operating system security updates.

August 25, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.13 from 1.3.3 to 1.3.5
- [SPARK-52482][SQL][CORE] Improve exception handling for reading certain corrupt zstd files
- [SPARK-53192][CONNECT] Always cache a DataSource in the Spark Connect Plan Cache
- Operating system security updates.

August 14, 2025
- [SPARK-52833][SQL] Fix VariantBuilder.appendFloat
- [SPARK-52961][PYTHON] Fix Arrow-optimized Python UDTF with 0-arg eval on lateral join
- [SPARK-51505][SQL] Always show empty partition number metrics in AQEShuffleReadExec
- [SPARK-52753][SQL] Make parseDataType binary compatible with previous versions
- [SPARK-52842][SQL] New functionality and bugfixes for single-pass analyzer
- [SPARK-52960][SQL] Show subtree string in LogicalQueryStage toString
- [SPARK-53054][CONNECT] Fix the connect.DataFrameReader default format behavior
- Operating system security updates.

Databricks Runtime 16.4 LTS

See Databricks Runtime 16.4 LTS.

January 27, 2026
- Updated Java libraries:
  - (Scala 2.12 only) io.delta.delta-sharing-client_2.12 from 1.2.9 to 1.2.10
  - (Scala 2.12 only) org.mlflow.mlflow-spark_2.12 from 2.9.1 to 2.19.0
  - (Scala 2.13 only) io.delta.delta-sharing-client_2.13 from 1.2.9 to 1.2.10
  - (Scala 2.13 only) org.mlflow.mlflow-spark_2.13 from 2.9.1 to 2.19.0
- [SPARK-55015][SS][SQL] Fix decodeRemainingKey numFields calculation in PrefixKeyScanStateEncoder
- Operating system security updates.

January 9, 2026
- Updated Java libraries:
  - (Scala 2.12 only) io.delta.delta-sharing-client_2.12 from 1.2.8 to 1.2.9
  - (Scala 2.13 only) io.delta.delta-sharing-client_2.13 from 1.2.8 to 1.2.9
- [SPARK-54620][SQL] Add safety check in ObservationManager to avoid Observation blocking
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- [SPARK-41916] [ML] Torch distributor: support multiple torchrun processes per task if task.gpu.amount > 1
- Operating system security updates.

December 9, 2025
- Partitioned Delta tables will have partition columns materialized in data parquet files going forward. This enables better synergy with how Iceberg and UniForm tables are handled, and increases compatibility with external non-Delta readers.
- For both the Snowflake connector and Snowflake Lakehouse Federation, TIMESTAMP_NTZ (timestamp without time zone) literals are no longer pushed down to Snowflake. This change prevents query failures caused by incompatible timestamp handling and improves reliability for affected queries.
- [SPARK-54427][SQL] Allow ColumnarRow to call copy with variant types
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- Operating system security updates.

November 18, 2025
- [SPARK-54099][SQL] XML variant parser should fall back to string on decimal parsing errors
- [SPARK-54015][PYTHON] Relax Py4J requirement to py4j>=0.10.9.7,<0.10.9.10
- [SPARK-54078][SS] New test for StateStoreSuite SPARK-40492: maintenance before unload and remove infra from old test
- [SPARK-54156][PROTOBUF] Classify errors for ProtobufOptions casting failure
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- Operating system security updates.

November 4, 2025
- Updated R libraries:
  - arrow from 16.1.0 to 21.0.0
- [SPARK-53973][Avro] Classify errors for AvroOptions boolean casting failure
- Operating system security updates.

October 21, 2025
- Operating system security updates.

October 7, 2025
- [SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
- [SPARK-53574] Fix AnalysisContext being wiped during nested plan resolution
- [SPARK-53623][SQL] improve reading large table prope…
- [SPARK-53598][SQL] Check the existence of numParts before reading large table property
- [SPARK-53549][SS] Always close the arrow allocator when list state request process is completed
- Operating system security updates.

September 16, 2025
- The Snowflake connector now uses the INFORMATION_SCHEMA table instead of the SHOW SCHEMAS command to list schemas. This change removes the 10,000-schema limit of the previous approach and improves support for databases with a large number of schemas.
- Operating system security updates.

September 9, 2025
- Fixed an issue that could cause Auto Loader to hang indefinitely.
- Fixes a transient error in Auto Loader that may cause jobs to fail
- [SPARK-49872][CORE] Remove jackson JSON string length limitation
- [SPARK-51821][CORE] Call interrupt() without holding uninterruptibleLock to avoid possible deadlock
- Operating system security updates.

August 26, 2025
- Updated Java libraries:
  - (Scala 2.12 only) io.delta.delta-sharing-client_2.12 from 1.2.7 to 1.2.8
  - (Scala 2.13 only) io.delta.delta-sharing-client_2.13 from 1.2.7 to 1.2.8
- [SPARK-52482][SQL][CORE] Improve exception handling for reading certain corrupt zstd files
- [SPARK-53192][CONNECT] Always cache a DataSource in the Spark Connect Plan Cache
- Operating system security updates.

August 14, 2025
- [SPARK-51011][CORE] Add logging for whether a task is going to be interrupted when killed
- [SPARK-52833][SQL] Fix VariantBuilder.appendFloat
- [SPARK-51505][SQL] Always show empty partition number metrics in AQEShuffleReadExec
- Operating system security updates.

July 29, 2025
- [SPARK-52753][SQL] Make parseDataType binary compatible with previous versions
- Operating system security updates.

July 15, 2025
- Fixed a non-deterministic data loss issue when using Spark Structured Streaming to stream data from Pulsar.
- [SPARK-52579][PYTHON] Set periodical traceback dump for Python workers
- [SPARK-52553][SS] Fix NumberFormatException when reading v1 changelog
- [SPARK-52450] Improve performance of schema deepcopy
- [SPARK-52503][SQL][CONNECT] Fix drop when the input column is not existent
- [SPARK-52599][PYTHON] Support periodical traceback dump in Driver side workers
- Operating system security updates.

July 1, 2025
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- [15.4-16.4][spark-52521]](https://issues.apache.org/jira/browse/SPARK-52521)[SQL] Right#replacement should not access SQLConf dynamically
- [SPARK-52482][SQL][CORE] ZStandard support for file data source reader
- [SPARK-52312][SQL] Ignore V2WriteCommand when caching DataFrame
- Operating system security updates.

June 17, 2025
- Fixed the limitation that the cloud_files_state table-valued function (TVF) can't be used to read the file-level state of streaming tables across pipelines.
- Fixed Unity Catalog authorization issues for queries on temporary views.
- [SPARK-52040][PYTHON][SQL][CONNECT] ResolveLateralColumnAliasReference should retain the plan id
- Operating system security updates.

June 3, 2025
- [SPARK-52195][PYTHON][SS] Fix initial state column dropping issue for Python TWS
- [SPARK-52159][SQL] Properly handle table existence check for jdbc dialects
- Miscellaneous bug fixes.

May 7, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.13 from 1.2.3 to 1.2.7
  - org.apache.avro.avro from 1.11.3 to 1.11.4
  - org.apache.avro.avro-ipc from 1.11.3 to 1.11.4
  - org.apache.avro.avro-mapred from 1.11.3 to 1.11.4
- Streaming cloned session will be used inside the foreachBatch user function in Shared Clusters/Serverless. This aligns with the behavior in classic (Assigned Clusters).
- Streaming cloned session will be used inside the foreachBatch user function in Shared Clusters/Serverless. This aligns with the behavior in classic (Assigned Clusters).
- Prior to this change, leading whitespaces and tabs in paths in the variant_get expression were being ignored with Photon disabled. For example, select variant_get(parse_json('{"key": "value"}'), '$['key']') would not be effective in extracting the value of "key". However, users will be able to extract such keys now.
- [SPARK-51935][SQL] Fix lazy behavior of iterators in interpreted df.collect()
- [SPARK-51921][SS][PYTHON] Use long type for TTL duration in millisecond in transformWithState
- [SPARK-51940][SS] Add interface for managing streaming checkpoint metadata
- [SPARK-52049] Fix the bug that XML attributes can't be parsed as Variant
- [SPARK-51904][SS] Removing async metadata purging for StateSchemaV3 and ignoring non-batch files when listing OperatorMetadata files
- [SPARK-51869][SS] Create classification for user errors within UDFs for Scala TransformWithState
- [SPARK-51889][PYTHON][SS] Fix a bug for MapState clear() in Python TWS
- [SPARK-51922] [SS] Fix UTFDataFormatException thrown from StateStoreChangelogReaderFactory for v1
- [SPARK-51848][SQL] Fix parsing XML records with defined schema of array/structs/map of Variant
- Operating system security updates.

Databricks Runtime 16.2

See Databricks Runtime 16.2 (EoS).

August 14, 2025
- [SPARK-51011][CORE] Add logging for whether a task is going to be interrupted when killed
- Operating system security updates.

July 29, 2025
- Operating system security updates.

July 15, 2025
- Fixed a non-deterministic data loss issue when using Spark Structured Streaming to stream data from Pulsar.
- [SPARK-52553][SS] Fix NumberFormatException when reading v1 changelog
- Operating system security updates.

July 1, 2025
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- [15.4-16.4][spark-52521]](https://issues.apache.org/jira/browse/SPARK-52521)[SQL] Right#replacement should not access SQLConf dynamically
- [SPARK-52312][SQL] Ignore V2WriteCommand when caching DataFrame
- [SPARK-52482][SQL][CORE] ZStandard support for file data source reader
- Operating system security updates.

June 17, 2025
- Fixed the limitation that the cloud_files_state table-valued function (TVF) can't be used to read the file-level state of streaming tables across pipelines.
- [SPARK-52040][PYTHON][SQL][CONNECT] ResolveLateralColumnAliasReference should retain the plan id
- Operating system security updates.

June 3, 2025
- Updated Python libraries:
  - cryptography from 41.0.7, 41.0.7, 42.0.5 to 42.0.5
  - packaging from 24.0, 24.1 to 24.1
  - platformdirs from 3.10.0, 4.2.2 to 3.10.0
  - pyparsing from 3.0.9, 3.1.1 to 3.0.9
  - Added autocommand 2.2.2
  - Added backports.tarfile 1.2.0
  - Added importlib_resources 6.4.0
  - Added inflect 7.3.1
  - Added jaraco.context 5.3.0
  - Added jaraco.functools 4.0.1
  - Added jaraco.text 3.12.1
  - Added more-itertools 10.3.0
  - Added pip 24.2
  - Added setuptools 74.0.0
  - Added tomli 2.0.1
  - Added typeguard 4.3.0
  - Added wcwidth 0.2.5
  - Added wheel 0.43.0
  - Removed distro 1.9.0
  - Removed distro-info 1.7+build1
  - Removed python-apt 2.7.7+ubuntu4
- [SPARK-52159][SQL] Properly handle table existence check for jdbc dialects
- [SPARK-52195][PYTHON][SS] Fix initial state column dropping issue for Python TWS
- Operating system security updates.

May 20, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.2.6 to 1.2.7
  - org.apache.avro.avro from 1.11.3 to 1.11.4
  - org.apache.avro.avro-ipc from 1.11.3 to 1.11.4
  - org.apache.avro.avro-mapred from 1.11.3 to 1.11.4
- Streaming cloned session will be used inside the foreachBatch user function in Shared Clusters/Serverless. This aligns with the behavior in classic (Assigned Clusters).
- Streaming cloned session will be used inside the foreachBatch user function in Shared Clusters/Serverless. This aligns with the behavior in classic (Assigned Clusters).
- Prior to this change, leading whitespaces and tabs in paths in the variant_get expression were being ignored with Photon disabled. For example, select variant_get(parse_json('{"key": "value"}'), '$[' key']') would not be effective in extracting the value of "key". However, users will be able to extract such keys now.
- [SPARK-51935][SQL] Fix lazy behavior of iterators in interpreted df.collect()
- [SPARK-51921][SS][PYTHON] Use long type for TTL duration in millisecond in transformWithState
- Operating system security updates.

April 22, 2025
- [SPARK-51717][SS][RocksDB] Fix SST mismatch corruption that can happen for second snapshot created for a new query
- Revert "[SPARK-47895][SQL] group by alias should be idempotent" in 15.4, 16.0, 16.1, 16.2 and 16.3
- Operating system security updates.

April 9, 2025
- Updated Java libraries:
  - Removed io.starburst.openjson.openjson 1.8-e.12
  - Removed io.starburst.openx.data.json-serde 1.3.9-e.12
  - Removed io.starburst.openx.data.json-serde-generic-shim 1.3.9-e.12
- [SPARK-47895][SQL] group by alias should be idempotent
- [SPARK-51505][SQL] Log empty partition number metrics in AQE coalesce
- [SPARK-51624][SQL] Propagate GetStructField metadata in CreateNamedStruct.dataType
- [SPARK-51589][SQL] Fix small bug failing to check for aggregate functions in |> SELECT
- Operating system security updates.
March 11, 2025
- Databricks Runtime 14.3 LTS and above include a fix for an issue that caused binary incompatibilities with code that instantiated a SparkListenerApplicationEnd class and was compiled against Apache Spark. This incompatibility resulted from merging SPARK-46399 into Apache Spark. This merge included a change that added a default argument to the SparkListenerApplicationEnd constructor. To restore binary compatibility, this fix adds a single argument constructor to the SparkListenerApplicationEnd class.
- Revert "[SPARK-48273][SQL] Fix late rewrite of PlanWithUnresolvedIdentifier"
- [SPARK-50985][SS] Classify Kafka Timestamp Offsets mismatch error instead of assert and throw error for missing server in KafkaTokenProvider
- [SPARK-51065][SQL] Disallowing non-nullable schema when Avro encoding is used for TransformWithState
- [SPARK-51237][SS] Add API details for new transformWithState helper APIs as needed
- [SPARK-51222][SQL] Optimize ReplaceCurrentLike
- [SPARK-51351][SS] Do not materialize the output in Python worker for TWS
- [SPARK-51084][SQL] Assign appropriate error class for negativeScaleNotAllowedError
- [SPARK-51249][SS] Fixing the NoPrefixKeyStateEncoder and Avro encoding to use the correct number of version bytes
- Operating system security updates.
February 5, 2025
- This release includes a fix for an issue affecting the conversion of certain datatypes when serializing rescued XML data columns. The affected datatypes are dates, non-NTZ timestamps, and decimals when prefersDecimal is enabled. To learn more about the rescued data column, see What is the rescued data column?.
- [SPARK-50770][SS] Removing package scope for transformWithState operator APIs
- Operating system security updates.

Databricks Runtime 15.4 LTS

See Databricks Runtime 15.4 LTS.

January 27, 2026
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.1.9 to 1.1.10
- Operating system security updates.

January 9, 2026
- Partitioned Delta tables will have partition columns materialized in data parquet files going forward. This enables better synergy with how Iceberg and UniForm tables are handled, and increases compatibility with external non-Delta readers.
- [SPARK-54620][SQL] Add safety check in ObservationManager to avoid Observation blocking
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- Operating system security updates.

December 9, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.1.7 to 1.1.9
- [SPARK-54427][SQL] Allow ColumnarRow to call copy with variant types
- [SPARK-52579][PYTHON] Set periodical traceback dump for Python workers
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- Operating system security updates.

November 18, 2025
- [SPARK-54078][SS] New test for StateStoreSuite SPARK-40492: maintenance before unload and remove infra from old test
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- Operating system security updates.

November 4, 2025
- Updated R libraries:
  - arrow from 14.0.0.2 to 21.0.0
- Operating system security updates.

October 21, 2025
- The scan photonization criteria is update to allow scan photonization when checksum verification is required.
  Determining whether checksum verification is required is now coming from the hadoop conf instead of the SQLConf.
- Operating system security updates.

October 7, 2025
- [SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
- [SPARK-53574] Fix AnalysisContext being wiped during nested plan resolution
- Miscellaneous bug fixes.

September 16, 2025
- The Snowflake connector now uses the INFORMATION_SCHEMA table instead of the SHOW SCHEMAS command to list schemas. This change removes the 10,000-schema limit of the previous approach and improves support for databases with a large number of schemas.
- [SPARK-50870][SQL] Add the timezone when casting to timestamp in V2ScanRelationPushDown
- Operating system security updates.

September 9, 2025
- Fixed an issue that could cause Auto Loader to hang indefinitely.
- Fixes a transient error in Auto Loader that may cause jobs to fail
- [SPARK-51821][CORE] Call interrupt() without holding uninterruptibleLock to avoid possible deadlock
- [SPARK-49872][CORE] Remove jackson JSON string length limitation
- Operating system security updates.

August 26, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.1.6 to 1.1.7
- [SPARK-52482][SQL][CORE] Improve exception handling for reading certain corrupt zstd files
- [SPARK-53192][CONNECT] Always cache a DataSource in the Spark Connect Plan Cache
- Operating system security updates.

August 14, 2025
- [SPARK-51011][CORE] Add logging for whether a task is going to be interrupted when killed
- Operating system security updates.

July 29, 2025
- Operating system security updates.

July 21, 2025
- For compute that is enabled for Photon or uses Arm64-based CPU, mlflow-skinny is upgraded to 2.19.0, ray is upgraded to 2.37.0, and databricks-feature-engineering is upgraded to 0.8.0.
July 15, 2025
- Fixed a non-deterministic data loss issue when using Spark Structured Streaming to stream data from Pulsar.
- [SPARK-52503][SQL][CONNECT] Fix drop when the input column is not existent

July 1, 2025
- Updated Java libraries:
  - org.mlflow.mlflow-spark_2.12 from 2.9.1 to 2.11.3
  - Removed com.fasterxml.jackson.dataformat.jackson-dataformat-yaml 2.15.2
  - Removed org.slf4j.slf4j-simple 1.7.25
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- [15.4-16.4][spark-52521]](https://issues.apache.org/jira/browse/SPARK-52521)[SQL] Right#replacement should not access SQLConf dynamically
- [SPARK-52482][SQL][CORE] ZStandard support for file data source reader
- [SPARK-52312][SQL] Ignore V2WriteCommand when caching DataFrame
- Operating system security updates.

June 17, 2025
- Fixed the limitation that the cloud_files_state table-valued function (TVF) can't be used to read the file-level state of streaming tables across pipelines.
- [SPARK-49646][SQL] fix subquery decorrelation for union/set operations when parentOuterReferences has references not covered in collectedChildOuterReferences
- [SPARK-52040][PYTHON][SQL][CONNECT] ResolveLateralColumnAliasReference should retain the plan id

June 3, 2025
- Updated Python libraries:
  - cryptography from 3.4.8, 41.0.3 to 41.0.3
  - filelock from 3.13.4, 3.15.4 to 3.13.4
  - importlib-metadata from 4.6.4, 6.0.0 to 6.0.0
  - platformdirs from 3.10.0, 3.11.0 to 3.10.0
  - pyparsing from 2.4.7, 3.0.9 to 3.0.9
  - zipp from 1.0.0, 3.11.0 to 3.11.0
  - Added pip 23.2.1
  - Added setuptools 68.0.0
  - Added wcwidth 0.2.5
  - Added wheel 0.38.4
  - Removed distro 1.7.0
  - Removed distro-info 1.1+ubuntu0.2
  - Removed python-apt 2.4.0+ubuntu4
- Updated Java libraries:
  - com.github.fommil.netlib.native_ref-java from 1.1, 1.1-natives to 1.1, 1.1
  - com.github.fommil.netlib.native_system-java from 1.1, 1.1-natives to 1.1, 1.1
  - com.github.fommil.netlib.netlib-native_ref-linux-x86_64 from 1.1-natives to 1.1
  - com.github.fommil.netlib.netlib-native_system-linux-x86_64 from 1.1-natives to 1.1
  - io.netty.netty-tcnative-boringssl-static from 2.0.61.Final-db-r16-linux-aarch_64, 2.0.61.Final-db-r16-linux-x86_64, 2.0.61.Final-db-r16-osx-aarch_64, 2.0.61.Final-db-r16-osx-x86_64, 2.0.61.Final-db-r16-windows-x86_64 to 2.0.61.Final-db-r16, 2.0.61.Final-db-r16, 2.0.61.Final-db-r16, 2.0.61.Final-db-r16, 2.0.61.Final-db-r16
  - io.netty.netty-transport-native-epoll from 4.1.96.Final, 4.1.96.Final-linux-aarch_64, 4.1.96.Final-linux-x86_64 to 4.1.96.Final, 4.1.96.Final, 4.1.96.Final
  - io.netty.netty-transport-native-kqueue from 4.1.96.Final-osx-aarch_64, 4.1.96.Final-osx-x86_64 to 4.1.96.Final, 4.1.96.Final
  - org.apache.orc.orc-core from 1.9.2-shaded-protobuf to 1.9.2
  - org.apache.orc.orc-mapreduce from 1.9.2-shaded-protobuf to 1.9.2
  - software.amazon.cryptools.AmazonCorrettoCryptoProvider from 1.6.2-linux-x86_64 to 1.6.2
- [SPARK-52159][SQL] Properly handle table existence check for jdbc dialects
- Operating system security updates.

May 20, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.1.5 to 1.1.6
- Streaming cloned session will be used inside the foreachBatch user function in Shared Clusters/Serverless. This aligns with the behavior in classic (Assigned Clusters).
- Prior to this change, leading whitespaces and tabs in paths in the variant_get expression were being ignored with Photon disabled. For example, select variant_get(parse_json('{"key": "value"}'), '$['key']') would not be effective in extracting the value of "key". However, users will be able to extract such keys now.
- [SPARK-51935][SQL] Fix lazy behavior of iterators in interpreted df.collect()
- Operating system security updates.

April 22, 2025
- Updated Java libraries:
  - org.apache.avro.avro from 1.11.3 to 1.11.4
  - org.apache.avro.avro-ipc from 1.11.3 to 1.11.4
  - org.apache.avro.avro-mapred from 1.11.3 to 1.11.4
- Revert "[SPARK-47895][SQL] group by alias should be idempotent" in 15.4, 16.0, 16.1, 16.2 and 16.3
- [SPARK-50682][SQL] Inner Alias should be canonicalized
- Operating system security updates.

April 9, 2025
- (Behavioral change) To apply critical security patches, the default Python version is updated to Python 3.11.11 from Python 3.11.0rc1. This update might impact some workloads running on Databricks Runtime 15.4 LTS, such as workloads that use Python serialization to store and restore state between executions or workloads that pin to the 3.11.0 Python version.
- Updated Java libraries:
  - Removed io.starburst.openjson.openjson 1.8-e.12
  - Removed io.starburst.openx.data.json-serde 1.3.9-e.12
  - Removed io.starburst.openx.data.json-serde-generic-shim 1.3.9-e.12
- [SPARK-47895][SQL] group by alias should be idempotent
- [SPARK-51624][SQL] Propagate GetStructField metadata in CreateNamedStruct.dataType
- Operating system security updates.

March 31, 2025
- For compute not enabled for Photon, databricks-feature-engineering is upgraded to 0.8.0. For Photon-enabled compute, databricks-feature-engineering remains at 0.6.0.

March 11, 2025
- Databricks Runtime 14.3 LTS and above include a fix for an issue that caused binary incompatibilities with code that instantiated a SparkListenerApplicationEnd class and was compiled against Apache Spark. This incompatibility resulted from merging SPARK-46399 into Apache Spark. This merge included a change that added a default argument to the SparkListenerApplicationEnd constructor. To restore binary compatibility, this fix adds a single argument constructor to the SparkListenerApplicationEnd class.
- [SPARK-50985][SS] Classify Kafka Timestamp Offsets mismatch error instead of assert and throw error for missing server in KafkaTokenProvider
- [SPARK-50791][SQL] Fix NPE in State Store error handling
- [SPARK-50310][PYTHON] Improve Column performance when DQC is disabled
- [SPARK-51222][SQL] Optimize ReplaceCurrentLike
- [SPARK-49525][SS][CONNECT] Minor log improvement to Server Side Streaming Query ListenerBus Listener
- [SPARK-51084][SQL] Assign appropriate error class for negativeScaleNotAllowedError
- Operating system security updates.
February 11, 2025
- This release includes a fix for an issue affecting the conversion of certain datatypes when serializing rescued XML data columns. The affected datatypes are dates, non-NTZ timestamps, and decimals when prefersDecimal is enabled. To learn more about the rescued data column, see What is the rescued data column?.
- [SPARK-50492][SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark
- Operating system security updates.
- For compute not enabled for Photon, mlflow-skinny is upgraded to 2.19.0. For Photon-enabled compute, mlflow-skinny remains at 2.13.1.

December 10, 2024
- The USE CATALOG statement now supports the IDENTIFIER clause. With this support, you can parameterize the current catalog based on a string variable or parameter marker.
- This release includes a fix for an issue that might cause the primary key on a Delta table to be dropped under certain edge cases related to background auto-compaction.
- With this release, the cache size used by an SSD in a Databricks compute node dynamically expands to the SSD's initial size and shrinks when necessary, down to the spark.databricks.io.cache.maxDiskUsage limit. See Optimize performance with caching on Databricks.
- The pyodbc package is updated from version 4.0.38 to version 4.0.39. This change is required because a bug was found in version 4.0.38 and that version has been removed from PyPI.
- [SPARK-50329][SQL] fix InSet$toString
- [SPARK-47435][SQL] Fix overflow issue of MySQL UNSIGNED TINYINT
- [SPARK-49757][SQL] Support IDENTIFIER expression in SET CATALOG statement
- [SPARK-50426][PYTHON] Avoid static Python data source lookup when using builtin or Java data sources
- [SPARK-48863][SQL] Fix ClassCastException when parsing JSON with “spark.sql.json.enablePartialResults” enabled
- [SPARK-50310][PYTHON] Add a flag to disable DataFrameQueryContext for PySpark
- [15.3-15.4] [SPARK-50034][CORE] Fix Misreporting of Fatal Errors as Uncaught Exceptions in SparkUncaughtExceptionHandler
- Operating system security updates.
November 26, 2024
- With this release, you can now query the vector_search function using query_text for text input or query_vector for embedding input.
- You can now set a timeout for Spark Connect queries using the Spark configuration property spark.databricks.execution.timeout. For notebooks running on serverless compute, the default value is 9000 (seconds). Jobs running on serverless compute and compute with standard access mode do not have a timeout unless this configuration property is set. An execution that lasts longer than the specified timeout results in a QUERY_EXECUTION_TIMEOUT_EXCEEDED error.
- [SPARK-50322][SQL] Fix parameterized identifier in a sub-query
- [SPARK-49615] [ML] Make all ML feature transformers dataset schema validation conforming “spark.sql.caseSensitive” config.
- [SPARK-50124][SQL] LIMIT/OFFSET should preserve data ordering
- Operating system security updates.

November 5, 2024
- (Breaking change) In Databricks Runtime 15.4 LTS and above, regular expression handling in Photon is updated to match the behavior of Apache Spark regular expression handling. Previously, regular expression functions run by Photon, such as split() and regexp_extract(), accepted some regular expressions rejected by the Spark parser. To maintain consistency with Apache Spark, Photon queries will now fail for regular expressions that Spark considers not valid. Because of this change, you might see errors if your Spark code includes invalid regular expressions. For example, the expression split(str_col, '{'), which contains an unmatched brace and was previously accepted by Photon, now fails. To fix this expression, you can escape the brace character: split(str_col, '\\{'). Photon and Spark behavior also differed for some regular expression matching of non-ASCII characters. This is also updated so Photon matches the Apache Spark behavior.
- [SPARK-49782][SQL] ResolveDataFrameDropColumns rule resolves UnresolvedAttribute with child output
- [SPARK-49867][SQL] Improve the error message when index is out of bounds when calling GetColumnByOrdinal
- [SPARK-49863][SQL] Fix NormalizeFloatingNumbers to preserve nullability of nested structs
- [SPARK-49829] Revise the optimization on adding input to state store in stream-stream join (correctness fix)
- [SPARK-49905] Use dedicated ShuffleOrigin for stateful operator to prevent the shuffle to be modified from AQE
- [SPARK-46632][SQL] Fix subexpression elimination when equivalent ternary expressions have different children
- [SPARK-49443][SQL][PYTHON] Implement to_variant_object expression and make schema_of_variant expressions print OBJECT for Variant Objects
- [SPARK-49615] Bugfix: Make ML column schema validation conforms with spark config spark.sql.caseSensitive.

October 22, 2024
- [SPARK-49782][SQL] ResolveDataFrameDropColumns rule resolves UnresolvedAttribute with child output
- [SPARK-49867][SQL] Improve the error message when index is out of bounds when calling GetColumnByOrdinal
- [SPARK-49863][SQL] Fix NormalizeFloatingNumbers to preserve nullability of nested structs
- [SPARK-49829] Revise the optimization on adding input to state store in stream-stream join (correctness fix)
- [SPARK-49905] Use dedicated ShuffleOrigin for stateful operator to prevent the shuffle to be modified from AQE
- [SPARK-46632][SQL] Fix subexpression elimination when equivalent ternary expressions have different children
- [SPARK-49443][SQL][PYTHON] Implement to_variant_object expression and make schema_of_variant expressions print OBJECT for Variant Objects
- [SPARK-49615] Bugfix: Make ML column schema validation conforms with spark config spark.sql.caseSensitive.
October 10, 2024
- [SPARK-49743][SQL] OptimizeCsvJsonExpr should not change schema fields when pruning GetArrayStructFields
- [SPARK-49688][CONNECT] Fix a data race between interrupt and execute plan
- [BACKPORT] [SPARK-49474][SS] Classify Error class for FlatMapGroupsWithState user function error
- [SPARK-49460][SQL] Followup: fix potential NPE risk
September 25, 2024
- [SPARK-49628][SQL] ConstantFolding should copy stateful expression before evaluating
- [SPARK-49000][SQL] Fix “select count(distinct 1) from t” where t is empty table by expanding RewriteDistinctAggregates
- [SPARK-49492][CONNECT] Reattach attempted on inactive ExecutionHolder
- [SPARK-49458][CONNECT][PYTHON] Supply server-side session id via ReattachExecute
- [SPARK-49017][SQL] Insert statement fails when multiple parameters are being used
- [SPARK-49451] Allow duplicate keys in parse_json.
- Miscellaneous bug fixes.
September 17, 2024
- [SPARK-48463][ML] Make Binarizer, Bucketizer, Vector Assembler, FeatureHasher, QuantizeDiscretizer, OnehotEncoder, StopWordsRemover, Imputer, Interactor supporting nested input columns
- [SPARK-49409][CONNECT] Adjust the default value of CONNECT_SESSION_PLAN_CACHE_SIZE
- [SPARK-49526][CONNECT][HOTFIX-15.4.2] Support Windows-style paths in ArtifactManager
- Revert “[SPARK-48482][PYTHON] dropDuplicates and dropDuplicatesWIthinWatermark should accept variable length args”
- [SPARK-43242][CORE] Fix throw 'Unexpected type of BlockId' in shuffle corruption diagnose
- [SPARK-49366][CONNECT] Treat Union node as leaf in dataframe column resolution
- [SPARK-49018][SQL] Fix approx_count_distinct not working correctly with collation
- [SPARK-49460][SQL] Remove cleanupResource() from EmptyRelationExec
- [SPARK-49056][SQL] ErrorClassesJsonReader cannot handle null properly
- [SPARK-49336][CONNECT] Limit the nesting level when truncating a protobuf message
August 29, 2024
- The output from a SHOW CREATE TABLE statement now includes any row filters or column masks defined on a materialized view or streaming table. See SHOW CREATE TABLE. To learn about row filters and column masks, see Row filters and column masks.
- On compute configured with shared access mode, Kafka batch reads and writes now have the same limitations enforced as those documented for Structured Streaming. See Streaming limitations.
- [SPARK-48941][SPARK-48970] Backport ML writer / reader fixes
- [SPARK-49074][SQL] Fix variant with df.cache()
- [SPARK-49263][CONNECT] Spark Connect python client: Consistently handle boolean Dataframe reader options
- [SPARK-48955][SQL] Include ArrayCompact changes in 15.4
- [SPARK-48937][SQL] Add collation support for StringToMap string expressions
- [SPARK-48929] Fix view internal error and clean up parser exception context
- [SPARK-49125][SQL] Allow duplicated column names in CSV writing
- [SPARK-48934][SS] Python datetime types converted incorrectly for setting timeout in applyInPandasWithState
- [SPARK-48843] Prevent infinite loop with BindParameters
- [SPARK-48981] Fix simpleString method of StringType in pyspark for collations
- [SPARK-49065][SQL] Rebasing in legacy formatters/parsers must support non JVM default time zones
- [SPARK-48896] [SPARK-48909] [SPARK-48883] Backport spark ML writer fixes
- [SPARK-48725][SQL] Integrate CollationAwareUTF8String.lowerCaseCodePoints into string expressions
- [SPARK-48978][SQL] Implement ASCII fast path in collation support for UTF8_LCASE
- [SPARK-49047][PYTHON][CONNECT] Truncate the message for logging
- [SPARK-49146][SS] Move assertion errors related to watermark missing in append mode streaming queries to error framework
- [SPARK-48977][SQL] Optimize string searching under UTF8_LCASE collation
- [SPARK-48889][SS] testStream to unload state stores before finishing
- [SPARK-48463] Make StringIndexer supporting nested input columns
- [SPARK-48954] try_mod() replaces try_remainder()
- Operating system security updates.

Databricks Runtime 14.3 LTS

See Databricks Runtime 14.3 LTS.

January 27, 2026
- Operating system security updates.

January 9, 2026
- Partitioned Delta tables will have partition columns materialized in data parquet files going forward. This enables better synergy with how Iceberg and UniForm tables are handled, and increases compatibility with external non-Delta readers.
- [SPARK-54711][PYTHON] Add a timeout for daemon created worker connection
- Operating system security updates.

December 9, 2025
- [SPARK-52579][PYTHON] Set periodical traceback dump for Python workers
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- Operating system security updates.

November 18, 2025
- [SPARK-54078][SS] New test for StateStoreSuite SPARK-40492: maintenance before unload and remove infra from old test
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- Operating system security updates.

November 4, 2025
- Updated R libraries:
  - arrow from 12.0.1 to 21.0.0
- Operating system security updates.

October 21, 2025
- Operating system security updates.

October 7, 2025
- [SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
- [SPARK-53574] Fix AnalysisContext being wiped during nested plan resolution
- Miscellaneous bug fixes.

September 16, 2025
- Operating system security updates.

September 9, 2025
- Fixed an issue that could cause Auto Loader to hang indefinitely.
- [SPARK-49872][CORE] Remove jackson JSON string length limitation
- Operating system security updates.

August 26, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.1.6 to 1.1.7
- [SPARK-52482][SQL][CORE] Improve exception handling for reading certain corrupt zstd files
- Operating system security updates.

August 14, 2025
- Operating system security updates.

July 29, 2025
- Operating system security updates.

July 15, 2025
- [SPARK-52503][SQL][CONNECT] Fix drop when the input column is not existent
- Miscellaneous bug fixes.

July 1, 2025
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- ZStandard decompression support for file data source readers (json, csv, xml and text.)
- [SPARK-52521][SQL] Right#replacement should not access SQLConf dynamically
- [SPARK-52482][SQL][CORE] ZStandard support for file data source reader
- Operating system security updates.

June 17, 2025
- Fixed the limitation that the cloud_files_state table-valued function (TVF) can't be used to read the file-level state of streaming tables across pipelines.
- [SPARK-49646][SQL] fix subquery decorrelation for union/set operations when parentOuterReferences has references not covered in collectedChildOuterReferences

June 3, 2025
- Updated Python libraries:
  - cryptography from 3.4.8, 39.0.1 to 39.0.1
  - platformdirs from 2.5.2, 2.6.2 to 2.5.2
  - pyparsing from 2.4.7, 3.0.9 to 3.0.9
  - Added pip 22.3.1
  - Added setuptools 65.6.3
  - Added tomli 2.0.1
  - Added wcwidth 0.2.5
  - Added wheel 0.38.4
  - Removed distro 1.7.0
  - Removed distro-info 1.1+ubuntu0.2
  - Removed python-apt 2.4.0+ubuntu4
- Updated Java libraries:
  - com.github.fommil.netlib.native_ref-java from 1.1, 1.1-natives to 1.1, 1.1
  - com.github.fommil.netlib.native_system-java from 1.1, 1.1-natives to 1.1, 1.1
  - com.github.fommil.netlib.netlib-native_ref-linux-x86_64 from 1.1-natives to 1.1
  - com.github.fommil.netlib.netlib-native_system-linux-x86_64 from 1.1-natives to 1.1
  - io.netty.netty-tcnative-boringssl-static from 2.0.61.Final-db-r16-linux-aarch_64, 2.0.61.Final-db-r16-linux-x86_64, 2.0.61.Final-db-r16-osx-aarch_64, 2.0.61.Final-db-r16-osx-x86_64, 2.0.61.Final-db-r16-windows-x86_64 to 2.0.61.Final-db-r16, 2.0.61.Final-db-r16, 2.0.61.Final-db-r16, 2.0.61.Final-db-r16, 2.0.61.Final-db-r16
  - io.netty.netty-transport-native-epoll from 4.1.96.Final, 4.1.96.Final-linux-aarch_64, 4.1.96.Final-linux-x86_64 to 4.1.96.Final, 4.1.96.Final, 4.1.96.Final
  - io.netty.netty-transport-native-kqueue from 4.1.96.Final-osx-aarch_64, 4.1.96.Final-osx-x86_64 to 4.1.96.Final, 4.1.96.Final
  - org.apache.orc.orc-core from 1.9.2-shaded-protobuf to 1.9.2
  - org.apache.orc.orc-mapreduce from 1.9.2-shaded-protobuf to 1.9.2
  - software.amazon.cryptools.AmazonCorrettoCryptoProvider from 1.6.1-linux-x86_64 to 1.6.1
- [SPARK-52040][PYTHON][SQL][CONNECT] ResolveLateralColumnAliasReference should retain the plan id
- [SPARK-52159][SQL] Properly handle table existence check for jdbc dialects
- Operating system security updates.

May 20, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-client_2.12 from 1.1.5 to 1.1.6
- [SPARK-51935][SQL] Fix lazy behavior of iterators in interpreted df.collect()
- Operating system security updates.

April 22, 2025
- Operating system security updates.

April 9, 2025
- [Behavior Change] Vacuum operations now perform Writer protocol checks similar to other operations, preventing unexpected cleanups on tables with newer features when run from incompatible older Databricks Runtime versions.
- [SPARK-51624][SQL] Propagate GetStructField metadata in CreateNamedStruct.dataType
- Operating system security updates.
March 11, 2025
- Databricks Runtime 14.3 LTS and above include a fix for an issue that caused binary incompatibilities with code that instantiated a SparkListenerApplicationEnd class and was compiled against Apache Spark. This incompatibility resulted from merging SPARK-46399 into Apache Spark. This merge included a change that added a default argument to the SparkListenerApplicationEnd constructor. To restore binary compatibility, this fix adds a single argument constructor to the SparkListenerApplicationEnd class.
- [SPARK-50791][SQL] Fix NPE in State Store error handling
- [SPARK-50705][SQL] Make QueryPlan lock-free
- [SPARK-49525][SS][CONNECT] Minor log improvement to Server Side Streaming Query ListenerBus Listener
- Operating system security updates.
February 11, 2025
- This release includes a fix for an issue affecting the conversion of certain datatypes when serializing rescued XML data columns. The affected datatypes are dates, non-NTZ timestamps, and decimals when prefersDecimal is enabled. To learn more about the rescued data column, see What is the rescued data column?.
- [SPARK-50492][SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark
- [SPARK-51084][SQL] Assign appropriate error class for negativeScaleNotAllowedError
- Operating system security updates.

December 10, 2024
- This release includes a fix for an issue that might cause the primary key on a Delta table to be dropped under certain edge cases related to background auto-compaction.
- [SPARK-50329][SQL] fix InSet$toString
- Operating system security updates.
November 26, 2024
- [SPARK-49615] [ML] Make all ML feature transformers dataset schema validation conforming “spark.sql.caseSensitive” config.
- Operating system security updates.
November 5, 2024
- [SPARK-48843] Prevent infinite loop with BindParameters
- [SPARK-49829] Revise the optimization on adding input to state store in stream-stream join (correctness fix)
- [SPARK-49863][SQL] Fix NormalizeFloatingNumbers to preserve nullability of nested structs
- [BACKPORT] [SPARK-49326][SS] Classify Error class for Foreach sink user function error
- [SPARK-49782][SQL] ResolveDataFrameDropColumns rule resolves UnresolvedAttribute with child output
- [SPARK-46632][SQL] Fix subexpression elimination when equivalent ternary expressions have different children
- [SPARK-49905] Use dedicated ShuffleOrigin for stateful operator to prevent the shuffle to be modified from AQE
- Operating system security updates.
October 22, 2024
- [SPARK-48843] Prevent infinite loop with BindParameters
- [SPARK-49863][SQL] Fix NormalizeFloatingNumbers to preserve nullability of nested structs
- [SPARK-49905] Use dedicated ShuffleOrigin for stateful operator to prevent the shuffle to be modified from AQE
- [SPARK-46632][SQL] Fix subexpression elimination when equivalent ternary expressions have different children
- [SPARK-49782][SQL] ResolveDataFrameDropColumns rule resolves UnresolvedAttribute with child output
- [BACKPORT] [SPARK-49326][SS] Classify Error class for Foreach sink user function error
- [SPARK-49829] Revise the optimization on adding input to state store in stream-stream join (correctness fix)
- Operating system security updates.
October 10, 2024
- [BACKPORT] [SPARK-49474][SS] Classify Error class for FlatMapGroupsWithState user function error
- [SPARK-49743][SQL] OptimizeCsvJsonExpr should not change schema fields when pruning GetArrayStructFields
- [SPARK-49688][CONNECT] Fix a data race between interrupt and execute plan
September 25, 2024
- [SPARK-48810][CONNECT] Session stop() API should be idempotent and not fail if the session is already closed by the server
- [SPARK-48719][SQL] Fix the calculation bug of `RegrS…
- [SPARK-49000][SQL] Fix “select count(distinct 1) from t” where t is empty table by expanding RewriteDistinctAggregates
- [SPARK-49628][SQL] ConstantFolding should copy stateful expression before evaluating
- [SPARK-49492][CONNECT] Reattach attempted on inactive ExecutionHolder
- Operating system security updates.
September 17, 2024
- [SPARK-49336][CONNECT] Limit the nesting level when truncating a protobuf message
- [SPARK-43242][CORE] Fix throw 'Unexpected type of BlockId' in shuffle corruption diagnose
- [SPARK-48463][ML] Make Binarizer, Bucketizer, Vector Assembler, FeatureHasher, QuantizeDiscretizer, OnehotEncoder, StopWordsRemover, Imputer, Interactor supporting nested input columns
- [SPARK-49526][CONNECT] Support Windows-style paths in ArtifactManager
- [SPARK-49409][CONNECT] Adjust the default value of CONNECT_SESSION_PLAN_CACHE_SIZE
- [SPARK-49366][CONNECT] Treat Union node as leaf in dataframe column resolution
August 29, 2024
- [SPARK-49146][SS] Move assertion errors related to watermark missing in append mode streaming queries to error framework
- [SPARK-48862][PYTHON][CONNECT] Avoid calling _proto_to_string when INFO level is not enabled
- [SPARK-49263][CONNECT] Spark Connect python client: Consistently handle boolean Dataframe reader options
August 14, 2024
- [SPARK-48941][SPARK-48970] Backport ML writer / reader fixes
- [SPARK-48706][PYTHON] Python UDF in higher order functions should not throw internal error
- [SPARK-49056][SQL] ErrorClassesJsonReader cannot handle null properly
- [SPARK-48597][SQL] Introduce a marker for isStreaming property in text representation of logical plan
- [SPARK-49065][SQL] Rebasing in legacy formatters/parsers must support non JVM default time zones
- [SPARK-48934][SS] Python datetime types converted incorrectly for setting timeout in applyInPandasWithState
August 1, 2024
- This release includes a bug fix for the ColumnVector and ColumnarArray classes in the Spark Java interface. Previous to this fix, an ArrayIndexOutOfBoundsException might be thrown or incorrect data returned when an instance of one of these classes contained null values.
- On serverless compute for notebooks and jobs, ANSI SQL mode is enabled by default. See Supported Spark configuration parameters.
- On compute configured with shared access mode, Kafka batch reads and writes now have the same limitations enforced as those documented for Structured Streaming. See Streaming limitations.
- The output from a SHOW CREATE TABLE statement now includes any row filters or column masks defined on a materialized view or streaming table. See SHOW CREATE TABLE. To learn about row filters and column masks, see Row filters and column masks.
- On compute configured with shared access mode, Kafka batch reads and writes now have the same limitations enforced as those documented for Structured Streaming. See Streaming limitations.
- The output from a SHOW CREATE TABLE statement now includes any row filters or column masks defined on a materialized view or streaming table. See SHOW CREATE TABLE. To learn about row filters and column masks, see Row filters and column masks.
- [SPARK-48896] [SPARK-48909] [SPARK-48883] Backport spark ML writer fixes
- [SPARK-48889][SS] testStream to unload state stores before finishing
- [SPARK-48705][PYTHON] Explicitly use worker_main when it starts with pyspark
- [SPARK-48047][SQL] Reduce memory pressure of empty TreeNode tags
- [SPARK-48544][SQL] Reduce memory pressure of empty TreeNode BitSets
- [SPARK-46957][CORE] Decommission migrated shuffle files should be able to cleanup from executor
- [SPARK-48463] Make StringIndexer supporting nested input columns
- [SPARK-47202][PYTHON] Fix typo breaking datetimes with tzinfo
- [SPARK-47713][SQL][CONNECT] Fix a self-join failure
- Operating system security updates.
July 11, 2024
- (Behavior change) DataFrames cached against Delta table sources are now invalidated if the source table is overwritten. This change means that all state changes to Delta tables now invalidate cached results. Use .checkpoint() to persist a table state throughout the lifetime of a DataFrame.
- The Snowflake JDBC Driver is updated to version 3.16.1.
- This release includes a fix to an issue that prevented the Spark UI Environment tab from displaying correctly when running in Databricks Container Services.
- On serverless compute for notebooks and jobs, ANSI SQL mode is enabled by default. See Supported Spark configuration parameters.
- To ignore invalid partitions when reading data, file-based data sources, such as Parquet, ORC, CSV, or JSON, can set the ignoreInvalidPartitionPaths data source option to true. For example: spark.read.format(“parquet”).option(“ignoreInvalidPartitionPaths”, “true”).load(…). You can also use the SQL configuration spark.sql.files.ignoreInvalidPartitionPaths. However, the data source option takes precedence over the SQL configuration. This setting is false by default.
- [SPARK-48648][PYTHON][CONNECT] Make SparkConnectClient.tags properly threadlocal
- [SPARK-48445][SQL] Don't inline UDFs with expensive children
- [SPARK-48481][SQL][SS] Do not apply OptimizeOneRowPlan against streaming Dataset
- [SPARK-48383][SS] Throw better error for mismatched partitions in startOffset option in Kafka
- [SPARK-48503][SQL] Fix invalid scalar subqueries with group-by on non-equivalent columns that were incorrectly allowed
- [SPARK-48100][SQL] Fix issues in skipping nested structure fields not selected in schema
- [SPARK-48273][SQL] Fix late rewrite of PlanWithUnresolvedIdentifier
- [SPARK-48252][SQL] Update CommonExpressionRef when necessary
- [SPARK-48475][PYTHON] Optimize _get_jvm_function in PySpark.
- [SPARK-48292][CORE] Revert [SPARK-39195][SQL] Spark OutputCommitCoordinator should abort stage when committed file not consistent with task status
- Operating system security updates.
June 17, 2024
- applyInPandasWithState() is available on compute with standard access mode.
- Fixes a bug where the rank-window optimization using Photon TopK incorrectly handled partitions with structs.
- [SPARK-48310][PYTHON][CONNECT] Cached properties must return copies
- [SPARK-48276][PYTHON][CONNECT] Add the missing __repr__ method for SQLExpression
- [SPARK-48294][SQL] Handle lowercase in nestedTypeMissingElementTypeError
- Operating system security updates.
May 21, 2024
- (Behavior change) dbutils.widgets.getAll() is now supported to get all widget values in a notebook.
- Fixed a bug in the try_divide() function where inputs containing decimals resulted in unexpected exceptions.
- [SPARK-48056][CONNECT][PYTHON] Re-execute plan if a SESSION_NOT_FOUND error is raised and no partial response was received
- [SPARK-48146][SQL] Fix aggregate function in With expression child assertion
- [SPARK-47986][CONNECT][PYTHON] Unable to create a new session when the default session is closed by the server
- [SPARK-48180][SQL] Improve error when UDTF call with TABLE arg forgets parentheses around multiple PARTITION/ORDER BY exprs
- [SPARK-48016][SQL] Fix a bug in try_divide function when with decimals
- [SPARK-48197][SQL] Avoid assert error for invalid lambda function
- [SPARK-47994][SQL] Fix bug with CASE WHEN column filter push down in SQLServer
- [SPARK-48173][SQL] CheckAnalysis should see the entire query plan
- [SPARK-48105][SS] Fix the race condition between state store unloading and snapshotting
- Operating system security updates.

May 9, 2024
- (Behavior change) applyInPandas and mapInPandas UDF types are now supported on shared access mode compute running Databricks Runtime 14.3 LTS and above.
- [SPARK-47739][SQL] Register logical avro type
- [SPARK-47941] [SS] [Connect] Propagate ForeachBatch worker initialization errors to users for PySpark
- [SPARK-48010][SQL] Avoid repeated calls to conf.resolver in resolveExpression
- [SPARK-48044][PYTHON][CONNECT] Cache DataFrame.isStreaming
- [SPARK-47956][SQL] Sanity check for unresolved LCA reference
- [SPARK-47543][CONNECT][PYTHON] Inferring dict as Mapype from Pandas DataFrame to allow DataFrame creation
- [SPARK-47819][CONNECT][Cherry-pick-14.3] Use asynchronous callback for execution cleanup
- [SPARK-47764][CORE][SQL] Cleanup shuffle dependencies based on ShuffleCleanupMode
- [SPARK-48018][SS] Fix null groupId causing missing param error when throwing KafkaException.couldNotReadOffsetRange
- [SPARK-47839][SQL] Fix aggregate bug in RewriteWithExpression
- [SPARK-47371] [SQL] XML: Ignore row tags found in CDATA
- [SPARK-47895][SQL] group by all should be idempotent
- [SPARK-47973][CORE] Log call site in SparkContext.stop() and later in SparkContext.assertNotStopped()
- Operating system security updates.

April 25, 2024
- [SPARK-47543][CONNECT][PYTHON] Inferring dict as MapType from Pandas DataFrame to allow DataFrame creation
- [SPARK-47694][CONNECT] Make max message size configurable on the client side
- [SPARK-47664][PYTHON][CONNECT][Cherry-pick-14.3] Validate the column name with cached schema
- [SPARK-47862][PYTHON][CONNECT]Fix generation of proto files
- Revert “[SPARK-47543][CONNECT][PYTHON] Inferring dict as MapType from Pandas DataFrame to allow DataFrame creation”
- [SPARK-47704][SQL] JSON parsing fails with “java.lang.ClassCastException” when spark.sql.json.enablePartialResults is enabled
- [SPARK-47812][CONNECT] Support Serialization of SparkSession for ForEachBatch worker
- [SPARK-47818][CONNECT][Cherry-pick-14.3] Introduce plan cache in SparkConnectPlanner to improve performance of Analyze requests
- [SPARK-47828][CONNECT][PYTHON] DataFrameWriterV2.overwrite fails with invalid plan
- Operating system security updates.
April 11, 2024
- (Behavior change) To ensure consistent behavior across compute types, PySpark UDFs on compute with standard access mode now match the behavior of UDFs on no-isolation and assigned clusters. This update includes the following changes that might break existing code:
  - UDFs with a string return type no longer implicitly convert non-string values into string values. Previously, UDFs with a return type of str would wrap the return value with a str() function regardless of the actual data type of the returned value.
  - UDFs with timestamp return types no longer implicitly apply a conversion to timestamp with timezone.
  - The Spark cluster configurations spark.databricks.sql.externalUDF.* no longer apply to PySpark UDFs on compute with standard access mode.
  - The Spark cluster configuration spark.databricks.safespark.externalUDF.plan.limit no longer affects PySpark UDFs, removing the Public Preview limitation of 5 UDFs per query for PySpark UDFs.
  - The Spark cluster configuration spark.databricks.safespark.sandbox.size.default.mib no longer applies to PySpark UDFs on compute with standard access mode. Instead, available memory on the system is used. To limit the memory of PySpark UDFs, use spark.databricks.pyspark.udf.isolation.memoryLimit with a minimum value of 100m.
- The TimestampNTZ data type is now supported as a clustering column with liquid clustering. See Use liquid clustering for tables.
- [SPARK-47511][SQL] Canonicalize With expressions by re-assigning IDs
- [SPARK-47509][SQL] Block subquery expressions in lambda and higher-order functions
- [SPARK-46990][SQL] Fix loading empty Avro files emitted by event-hubs
- [SPARK-47638][PS][CONNECT] Skip column name validation in PS
- Operating system security updates.
March 14, 2024
- [SPARK-47135][SS] Implement error classes for Kafka data loss exceptions
- [SPARK-47176][SQL] Have a ResolveAllExpressionsUpWithPruning helper function
- [SPARK-47145][SQL] Pass table identifier to row data source scan exec for V2 strategy.
- [SPARK-47044][SQL] Add executed query for JDBC external datasources to explain output
- [SPARK-47167][SQL] Add concrete class for JDBC anonymous relation
- [SPARK-47070] Fix invalid aggregation after subquery rewrite
- [SPARK-47121][CORE] Avoid RejectedExecutionExceptions during StandaloneSchedulerBackend shutdown
- Revert “[SPARK-46861][CORE] Avoid Deadlock in DAGScheduler”
- [SPARK-47125][SQL] Return null if Univocity never triggers parsing
- [SPARK-46999][SQL] ExpressionWithUnresolvedIdentifier should include other expressions in the expression tree
- [SPARK-47129][CONNECT][SQL] Make ResolveRelations cache connect plan properly
- [SPARK-47241][SQL] Fix rule order issues for ExtractGenerator
- [SPARK-47035][SS][CONNECT] Protocol for Client-Side Listener
- Operating system security updates.
February 29, 2024
- Fixed an issue where using a local collection as source in a MERGE command could result in the operation metric numSourceRows reporting double the correct number of rows.
- Creating a schema with a defined location now requires the user to have SELECT and MODIFY privileges on ANY FILE.
- [SPARK-47071][SQL] Inline With expression if it contains special expression
- [SPARK-47059][SQL] Attach error context for ALTER COLUMN v1 command
- [SPARK-46993][SQL] Fix constant folding for session variables
- Operating system security updates.
January 3, 2024
- [SPARK-46933] Add query execution time metric to connectors which use JDBCRDD.
- [SPARK-46763] Fix assertion failure in ReplaceDeduplicateWithAggregate for duplicate attributes.
- [SPARK-46954] XML: Wrap InputStreamReader with BufferedReader.
- [SPARK-46655] Skip query context catching in DataFrame methods.
- [SPARK-44815] Cache df.schema to avoid extra RPC.
- [SPARK-46952] XML: Limit size of corrupt record.
- [SPARK-46794] Remove subqueries from LogicalRDD constraints.
- [SPARK-46736] retain empty message field in protobuf connector.
- [SPARK-45182] Ignore task completion from old stage after retrying parent-indeterminate stage as determined by checksum.
- [SPARK-46414] Use prependBaseUri to render javascript imports.
- [SPARK-46383] Reduce Driver Heap Usage by Reducing the Lifespan of TaskInfo.accumulables().
- [SPARK-46861] Avoid Deadlock in DAGScheduler.
- [SPARK-46954] XML: Optimize schema index lookup.
- [SPARK-46676] dropDuplicatesWithinWatermark should not fail on canonicalization of the plan.
- [SPARK-46644] Change add and merge in SQLMetric to use isZero.
- [SPARK-46731] Manage state store provider instance by state data source - reader.
- [SPARK-46677] Fix dataframe["*"] resolution.
- [SPARK-46610] Create table should throw exception when no value for a key in options.
- [SPARK-46941] Can't insert window group limit node for top-k computation if contains SizeBasedWindowFunction.
- [SPARK-45433] Fix CSV/JSON schema inference when timestamps do not match specified timestampFormat.
- [SPARK-46930] Add support for a custom prefix for Union type fields in Avro.
- [SPARK-46227] Backport to 14.3.
- [SPARK-46822] Respect spark.sql.legacy.charVarcharAsString when casting jdbc type to catalyst type in jdbc.
- Operating system security updates.

Databricks Runtime 13.3 LTS

See Databricks Runtime 13.3 LTS.

January 27, 2026
- Operating system security updates.

January 9, 2026
- Partitioned Delta tables will have partition columns materialized in data parquet files going forward. This enables better synergy with how Iceberg and UniForm tables are handled, and increases compatibility with external non-Delta readers.
- Operating system security updates.

December 9, 2025
- [SPARK-54180][SQL] Override the toString of BinaryFileFormat
- [SPARK-52579][PYTHON] Set periodical traceback dump for Python workers
- Operating system security updates.

November 18, 2025
- [SPARK-54047][PYTHON] Use a difference error when kill-on-idle-timeout
- Operating system security updates.

November 4, 2025
- Updated R libraries:
  - arrow from 10.0.1 to 21.0.0
- Operating system security updates.

October 21, 2025
- Operating system security updates.

October 7, 2025
- Operating system security updates.

September 24, 2025
- Operating system security updates.

September 9, 2025
- Operating system security updates.

August 26, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-spark_2.12 from 0.7.12 to 0.7.13
- Operating system security updates.

August 14, 2025
- Operating system security updates.

July 29, 2025
- Operating system security updates.

July 15, 2025
- Operating system security updates.
July 1, 2025
- Operating system security updates.

June 17, 2025
- Fixed the limitation that the cloud_files_state table-valued function (TVF) can't be used to read the file-level state of streaming tables across pipelines.
- Operating system security updates.

June 3, 2025
- Updated Python libraries:
  - cryptography from 3.4.8, 37.0.1 to 37.0.1
  - platformdirs from 2.5.2, 2.6.2 to 2.5.2
  - pyparsing from 2.4.7, 3.0.9 to 3.0.9
  - Added pip 22.2.2
  - Added setuptools 63.4.1
  - Added tomli 2.0.1
  - Added wcwidth 0.2.5
  - Added wheel 0.37.1
  - Removed distro 1.7.0
  - Removed distro-info 1.1+ubuntu0.2
  - Removed python-apt 2.4.0+ubuntu4
- Updated Java libraries:
  - com.github.fommil.netlib.native_ref-java from 1.1, 1.1-natives to 1.1, 1.1
  - com.github.fommil.netlib.native_system-java from 1.1, 1.1-natives to 1.1, 1.1
  - com.github.fommil.netlib.netlib-native_ref-linux-x86_64 from 1.1-natives to 1.1
  - com.github.fommil.netlib.netlib-native_system-linux-x86_64 from 1.1-natives to 1.1
  - io.netty.netty-transport-native-epoll from 4.1.87.Final, 4.1.87.Final-linux-aarch_64, 4.1.87.Final-linux-x86_64 to 4.1.87.Final, 4.1.87.Final, 4.1.87.Final
  - io.netty.netty-transport-native-kqueue from 4.1.87.Final-osx-aarch_64, 4.1.87.Final-osx-x86_64 to 4.1.87.Final, 4.1.87.Final
  - org.apache.orc.orc-core from 1.8.4-shaded-protobuf to 1.8.4
  - org.apache.orc.orc-mapreduce from 1.8.4-shaded-protobuf to 1.8.4
  - software.amazon.cryptools.AmazonCorrettoCryptoProvider from 1.6.1-linux-x86_64 to 1.6.1
- [SPARK-52159][SQL] Properly handle table existence check for jdbc dialects
- Operating system security updates.

May 20, 2025
- Updated Java libraries:
  - io.delta.delta-sharing-spark_2.12 from 0.7.11 to 0.7.12
- Operating system security updates.

April 22, 2025
- [Behavior Change] Vacuum operations now perform Writer protocol checks similar to other operations, preventing unexpected cleanups on tables with newer features when run from incompatible older Databricks Runtime versions.
- Operating system security updates.

April 9, 2025
- [SPARK-51624][SQL] Propagate GetStructField metadata in CreateNamedStruct.dataType
- Operating system security updates.

March 11, 2025
- Operating system security updates.

February 11, 2025
- [SPARK-50492][SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark
- [SPARK-45915][SQL] Treat decimal(x, 0) the same as IntegralType in PromoteStrings
- Operating system security updates.

December 10, 2024
- Operating system security updates.
November 26, 2024
- [SPARK-49615] [ML] Make all ML feature transformers dataset schema validation conforming “spark.sql.caseSensitive” config.
- Operating system security updates.
November 5, 2024
- [SPARK-48843] Prevent infinite loop with BindParameters
- [BACKPORT] [SPARK-49326][SS] Classify Error class for Foreach sink user function error
- [SPARK-49905] Use dedicated ShuffleOrigin for stateful operator to prevent the shuffle to be modified from AQE
- Operating system security updates.
October 22, 2024
- [SPARK-48843] Prevent infinite loop with BindParameters
- [BACKPORT] [SPARK-49326][SS] Classify Error class for Foreach sink user function error
- [SPARK-49905] Use dedicated ShuffleOrigin for stateful operator to prevent the shuffle to be modified from AQE
- Operating system security updates.
October 10, 2024
- [SPARK-49743][SQL] OptimizeCsvJsonExpr should not change schema fields when pruning GetArrayStructFields
September 25, 2024
- [SPARK-46601] [CORE] Fix log error in handleStatusMessage
- [SPARK-48719][SQL] Fix the calculation bug of RegrSlope & RegrIntercept when the first parameter is null
- [SPARK-43242][CORE] Fix throw 'Unexpected type of BlockId' in shuffle corruption diagnose
- [SPARK-49000][SQL] Fix “select count(distinct 1) from t” where t is empty table by expanding RewriteDistinctAggregates
- Operating system security updates.
September 17, 2024
- [SPARK-49526][CONNECT] Support Windows-style paths in ArtifactManager
- [SPARK-48463][ML] Make Binarizer, Bucketizer, Vector Assembler, FeatureHasher, QuantizeDiscretizer, OnehotEncoder, StopWordsRemover, Imputer, Interactor supporting nested input columns
- Operating system security updates.
August 29, 2024
August 14, 2024
- [SPARK-49056][SQL] ErrorClassesJsonReader cannot handle null properly
- [SPARK-49065][SQL] Rebasing in legacy formatters/parsers must support non JVM default time zones
- [SPARK-48597][SQL] Introduce a marker for isStreaming property in text representation of logical plan
August 1, 2024
- This release includes a bug fix for the ColumnVector and ColumnarArray classes in the Spark Java interface. Previous to this fix, an ArrayIndexOutOfBoundsException might be thrown or incorrect data returned when an instance of one of these classes contained null values.
- [SPARK-47202][PYTHON] Fix typo breaking datetimes with tzinfo
- [SPARK-48896] [SPARK-48909] [SPARK-48883] Backport spark ML writer fixes
- [SPARK-48463] Make StringIndexer supporting nested input columns
- Operating system security updates.
July 11, 2024
- (Behavior change) DataFrames cached against Delta table sources are now invalidated if the source table is overwritten. This change means that all state changes to Delta tables now invalidate cached results. Use .checkpoint() to persist a table state throughout the lifetime of a DataFrame.
- This release includes a fix to an issue that prevented the Spark UI Environment tab from displaying correctly when running in Databricks Container Services.
- [SPARK-48383][SS] Throw better error for mismatched partitions in startOffset option in Kafka
- [SPARK-48292][CORE] Revert [SPARK-39195][SQL] Spark OutputCommitCoordinator should abort stage when committed file not consistent with task status
- [SPARK-48503][SQL] Fix invalid scalar subqueries with group-by on non-equivalent columns that were incorrectly allowed
- [SPARK-48481][SQL][SS] Do not apply OptimizeOneRowPlan against streaming Dataset
- [SPARK-48475][PYTHON] Optimize _get_jvm_function in PySpark.
- [SPARK-48273][SQL] Fix late rewrite of PlanWithUnresolvedIdentifier
- [SPARK-48445][SQL] Don't inline UDFs with expensive children
- Operating system security updates.
June 17, 2024
- [SPARK-48277] Improve error message for ErrorClassesJsonReader.getErrorMessage
- Operating system security updates.
May 21, 2024
- (Behavior change) dbutils.widgets.getAll() is now supported to get all widget values in a notebook.
- [SPARK-48105][SS] Fix the race condition between state store unloading and snapshotting
- [SPARK-47994][SQL] Fix bug with CASE WHEN column filter push down in SQLServer
- Operating system security updates.
May 9, 2024
- [SPARK-47956][SQL] Sanity check for unresolved LCA reference
- [SPARK-46822][SQL] Respect spark.sql.legacy.charVarcharAsString when casting jdbc type to catalyst type in jdbc
- [SPARK-47895][SQL] group by all should be idempotent
- [SPARK-48018][SS] Fix null groupId causing missing param error when throwing KafkaException.couldNotReadOffsetRange
- [SPARK-47973][CORE] Log call site in SparkContext.stop() and later in SparkContext.assertNotStopped()
- Operating system security updates.
April 25, 2024
- [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching
- Miscellaneous bug fixes.
April 11, 2024
- [SPARK-47509][SQL] Block subquery expressions in lambda and higher-order functions
- Operating system security updates.
April 1, 2024
- [SPARK-47385] Fix tuple encoders with Option inputs.
- [SPARK-38708][SQL] Upgrade Hive Metastore Client to the 3.1.3 for Hive 3.1
- [SPARK-47200][SS] Error class for Foreach batch sink user function error
- [SPARK-47368][SQL] Remove inferTimestampNTZ config check in ParquetRowConverter
- [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails
- [SPARK-47135][SS] Implement error classes for Kafka data loss exceptions
- [SPARK-47300][SQL] quoteIfNeeded should quote identifier starts with digits
- [SPARK-47305][SQL] Fix PruneFilters to tag the isStreaming flag of LocalRelation correctly when the plan has both batch and streaming
- [SPARK-47070] Fix invalid aggregation after subquery rewrite
- Operating system security updates.
March 14, 2024
- [SPARK-47145][SQL] Pass table identifier to row data source scan exec for V2 strategy.
- [SPARK-47167][SQL] Add concrete class for JDBC anonymous relation
- [SPARK-47176][SQL] Have a ResolveAllExpressionsUpWithPruning helper function
- [SPARK-47044][SQL] Add executed query for JDBC external datasources to explain output
- [SPARK-47125][SQL] Return null if Univocity never triggers parsing
- Operating system security updates.
February 29, 2024
- Fixed an issue where using a local collection as source in a MERGE command could result in the operation metric numSourceRows reporting double the correct number of rows.
- Creating a schema with a defined location now requires the user to have SELECT and MODIFY privileges on ANY FILE.
- Operating system security updates.
February 8, 2024
- Change data feed (CDF) queries on Unity Catalog materialized views are not supported, and attempting to run a CDF query with a Unity Catalog materialized view returns an error. Unity Catalog streaming tables support CDF queries on non-AUTO CDC tables in Databricks Runtime 14.1 and later. CDF queries are not supported with Unity Catalog streaming tables in Databricks Runtime 14.0 and earlier.
- [SPARK-46794] Remove subqueries from LogicalRDD constraints.
- [SPARK-46933] Add query execution time metric to connectors which use JDBCRDD.
- [SPARK-45582] Ensure that store instance is not used after calling commit within output mode streaming aggregation.
- [SPARK-46396] Timestamp inference should not throw exception.
- [SPARK-46861] Avoid Deadlock in DAGScheduler.
- [SPARK-46941] Can't insert window group limit node for top-k computation if contains SizeBasedWindowFunction.
- Operating system security updates.
January 31, 2024
- [SPARK-46610] Create table should throw exception when no value for a key in options.
- [SPARK-46383] Reduce Driver Heap Usage by Reducing the Lifespan of TaskInfo.accumulables().
- [SPARK-46600] Move shared code between SqlConf and SqlApiConf to SqlApiConfHelper.
- [SPARK-46676] dropDuplicatesWithinWatermark should not fail on canonicalization of the plan.
- [SPARK-46763] Fix assertion failure in ReplaceDeduplicateWithAggregate for duplicate attributes.
- Operating system security updates.
January 17, 2024
- The shuffle node of the explain plan returned by a Photon query is updated to add the causedBroadcastJoinBuildOOM=true flag when an out-of-memory error occurs during a shuffle that is part of a broadcast join.
- To avoid increased latency when communicating over TLSv1.3, this maintenance release includes a patch to the JDK 8 installation to fix JDK bug JDK-8293562.
- [SPARK-46058] Add separate flag for privateKeyPassword.
- [SPARK-46173] Skipping trimAll call during date parsing.
- [SPARK-46370] Fix bug when querying from table after changing column defaults.
- [SPARK-46370] Fix bug when querying from table after changing column defaults.
- [SPARK-46370] Fix bug when querying from table after changing column defaults.
- [SPARK-46609] Avoid exponential explosion in PartitioningPreservingUnaryExecNode.
- [SPARK-46132] Support key password for JKS keys for RPC SSL.
- [SPARK-46602] Propagate allowExisting in view creation when the view/table does not exists.
- [SPARK-46249] Require instance lock for acquiring RocksDB metrics to prevent race with background operations.
- [SPARK-46417] Do not fail when calling hive.getTable and throwException is false.
- [SPARK-46538] Fix the ambiguous column reference issue in ALSModel.transform.
- [SPARK-46478] Revert SPARK-43049 to use oracle varchar(255) for string.
- [SPARK-46250] Deflake test_parity_listener.
- [SPARK-46394] Fix spark.catalog.listDatabases() issues on schemas with special characters when spark.sql.legacy.keepCommandOutputSchema set to true.
- [SPARK-46056] Fix Parquet vectorized read NPE with byteArrayDecimalType default value.
- [SPARK-46145] spark.catalog.listTables does not throw exception when the table or view is not found.
- [SPARK-46466] Vectorized parquet reader should never do rebase for timestamp ntz.
December 14, 2023
- Fixed an issue where escaped underscores in getColumns operations originating from JDBC or ODBC clients were handled incorrectly and interpreted as wildcards.
- [SPARK-45920] group by ordinal should be idempotent.
- [SPARK-44582] Skip iterator on SMJ if it was cleaned up.
- [SPARK-45433] Fix CSV/JSON schema inference when timestamps do not match specified timestampFormat.
- [SPARK-45655] Allow non-deterministic expressions inside AggregateFunctions in CollectMetrics.
- Operating system security updates.
November 29, 2023
- Installed a new package, pyarrow-hotfix to remediate a PyArrow RCE vulnerability.
- Spark-snowflake connector is upgraded to 2.12.0.
- [SPARK-44846] Removed complex grouping expressions after RemoveRedundantAggregates.
- [SPARK-45544] Integrated SSL support into TransportContext.
- [SPARK-45892] Refactor optimizer plan validation to decouple validateSchemaOutput and validateExprIdUniqueness.
- [SPARK-45730] Improved time constraints for ReloadingX509TrustManagerSuite.
- [SPARK-45859] Made UDF objects in ml.functions lazy.
- Operating system security updates.
November 10, 2023
- Partition filters on Delta Lake streaming queries are pushed down before rate limiting to achieve better utilization.
- Changed data feed queries on Unity Catalog streaming tables and materialized views to display error messages.
- [SPARK-45545] SparkTransportConf inherits SSLOptions upon creation.
- [SPARK-45584] Fixed subquery run failure with TakeOrderedAndProjectExec.
- [SPARK-45427] Added RPC SSL settings to SSLOptions and SparkTransportConf.
- [SPARK-45541] Added SSLFactory.
- [SPARK-45430] FramelessOffsetWindowFunction no longer fails when IGNORE NULLS and offset > rowCount.
- [SPARK-45429] Added helper classes for SSL RPC communication.
- [SPARK-44219] Added extra per-rule validations for optimization rewrites.
- [SPARK-45543] Fixed an issue where InferWindowGroupLimit caused an issue if the other window functions didn't have the same window frame as the rank-like functions.
- Operating system security updates.
October 23, 2023
- [SPARK-45256] Fixed an issue where DurationWriter failed when writing more values than initial capacity.
- [SPARK-45419] Avoid reusing rocksdb sst files in a different rocksdb instance by removing file version map entries of larger versions.
- [SPARK-45426] Added support for ReloadingX509TrustManager.
- Miscellaneous fixes.
October 13, 2023
- Snowflake-jdbc dependency upgraded from 3.13.29 to 3.13.33.
- The array_insert function is 1-based for positive and negative indexes, while before, it was 0-based for negative indexes. It now inserts a new element at the end of input arrays for the index -1. To restore the previous behavior, set spark.sql.legacy.negativeIndexInArrayInsert to true.
- Fixed an issue around not ignoring corrupt files when ignoreCorruptFiles is enabled during CSV schema inference with Auto Loader.
- Revert "[SPARK-42946]."
- [SPARK-42205] Updated the JSON protocol to remove Accumulables logging in a task or stage start events.
- [SPARK-45178] Fallback to running a single batch for Trigger.AvailableNow with unsupported sources rather than using the wrapper.
- [SPARK-45316] Add new parameters ignoreCorruptFiles and ignoreMissingFiles to HadoopRDD and NewHadoopRDD.
- [SPARK-44740] Fixed metadata values for Artifacts.
- [SPARK-45360] Initialized Spark session builder configuration from SPARK_REMOTE.
- [SPARK-44551] Edited comments to sync with OSS.
- [SPARK-45346] Parquet schema inference now respects case-sensitive flags when merging schema.
- [SPARK-44658] ShuffleStatus.getMapStatus now returns None instead of Some(null).
- [SPARK-44840] Made array_insert() 1-based for negative indexes.
September 14, 2023
- [SPARK-44873] Added support for alter view with nested columns in Hive client.
- [SPARK-44878] Turned off strict limit for RocksDB write manager to avoid insertion exception on cache complete.
August 30, 2023
- The dbutils cp command (dbutils.fs.cp) has been optimized for faster copying. With this improvement, copy operations can take up to 100 less time, depending on the file size. The feature is available across all Clouds and file systems accessible in Databricks, including for Unity Catalog Volumes and DBFS mounts.
- [SPARK-44455] Quote identifiers with backticks in the SHOW CREATE TABLE result.
- [SPARK-44763] Fixed an issue that showed a string as a double in binary arithmetic with interval.
- [SPARK-44871] Fixed percentile_disc behavior.
- [SPARK-44714] Ease restriction of LCA resolution regarding queries.
- [SPARK-44818] Fixed race for pending task interrupt issued before taskThread is initialized.
- [SPARK-44505] Added override for columnar support in Scan for DSv2.
- [SPARK-44479] Fixed protobuf conversion from an empty struct type.
- [SPARK-44718] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value.
- [SPARK-42941] Added support for StreamingQueryListener in Python.
- [SPARK-44558] Export PySpark's Spark Connect Log Level.
- [SPARK-44464] Fixed applyInPandasWithStatePythonRunner to output rows that have Null as the first column value.
- [SPARK-44643] Fixed Row.__repr__ when the field is an empty row.
- Operating system security updates.

Databricks Runtime 12.2 LTS

See Databricks Runtime 12.2 LTS.

January 27, 2026
- Operating system security updates.

January 9, 2026
- Operating system security updates.

December 9, 2025
- Operating system security updates.

November 18, 2025
- Operating system security updates.

November 4, 2025
- Updated R libraries:
  - arrow from 10.0.0 to 21.0.0
- Operating system security updates.

October 21, 2025
- Updated Python from 2.7.18 to 2.7.18.1
- Operating system security updates.

October 7, 2025
- Operating system security updates.

September 24, 2025
- Operating system security updates.

September 9, 2025
- Operating system security updates.

August 26, 2025
- Operating system security updates.

August 14, 2025
- Operating system security updates.

July 29, 2025
- Operating system security updates.

July 15, 2025
- Operating system security updates.

July 1, 2025
- Operating system security updates.

June 17, 2025
- Operating system security updates.

June 3, 2025
- Updated Python libraries:
  - certifi from 2019.11.28, 2021.10.8 to 2021.10.8
  - chardet from 3.0.4, 4.0.0 to 4.0.0
  - idna from 2.8, 3.3 to 3.3
  - requests from 2.22.0, 2.27.1 to 2.27.1
  - six from 1.14.0, 1.16.0 to 1.16.0
  - urllib3 from 1.25.8, 1.26.9 to 1.26.9
  - Added pip 21.2.4
  - Added setuptools 61.2.0
  - Added tomli 1.2.2
  - Added wcwidth 0.2.5
  - Added wheel 0.37.0
  - Removed distro 1.4.0
  - Removed distro-info 0.23+ubuntu1.1
  - Removed python-apt 2.0.1+ubuntu0.20.4.1
- Updated Java libraries:
  - software.amazon.cryptools.AmazonCorrettoCryptoProvider from 1.6.1-linux-x86_64 to 1.6.1
- Operating system security updates.

May 20, 2025
- [SPARK-42655][SQL] Incorrect ambiguous column reference error
- Operating system security updates.

April 22, 2025
- [Behavior Change] Vacuum operations now perform Writer protocol checks similar to other operations, preventing unexpected cleanups on tables with newer features when run from incompatible older Databricks Runtime versions.
- Operating system security updates.

April 9, 2025
- Operating system security updates.

March 11, 2025
- Operating system security updates.
December 10, 2024
- Operating system security updates.
November 26, 2024
- Miscellaneous bug fixes.
October 10, 2024
- [SPARK-49743][SQL] OptimizeCsvJsonExpr should not change schema fields when pruning GetArrayStructFields
September 25, 2024
- [SPARK-49000][SQL] Fix “select count(distinct 1) from t” where t is empty table by expanding RewriteDistinctAggregates
- [SPARK-46601] [CORE] Fix log error in handleStatusMessage
- Miscellaneous bug fixes.
September 17, 2024
- Operating system security updates.
August 29, 2024
- Miscellaneous bug fixes.
August 14, 2024
- [SPARK-48941][SPARK-48970] Backport ML writer / reader fixes
- [SPARK-49065][SQL] Rebasing in legacy formatters/parsers must support non JVM default time zones
- [SPARK-49056][SQL] ErrorClassesJsonReader cannot handle null properly
- [SPARK-48597][SQL] Introduce a marker for isStreaming property in text representation of logical plan
- [SPARK-48463][ML] Make StringIndexer supporting nested input columns
- Operating system security updates.
August 1, 2024
- [SPARK-48896] [SPARK-48909] [SPARK-48883] Backport spark ML writer fixes
August 1, 2024
- To apply required security patches, the Python version in Databricks Runtime 12.2 LTS is upgraded from 3.9.5 to 3.9.19.
July 11, 2024
- (Behavior change) DataFrames cached against Delta table sources are now invalidated if the source table is overwritten. This change means that all state changes to Delta tables now invalidate cached results. Use .checkpoint() to persist a table state throughout the lifetime of a DataFrame.
- [SPARK-48481][SQL][SS] Do not apply OptimizeOneRowPlan against streaming Dataset
- [SPARK-47070] Fix invalid aggregation after subquery rewrite
- [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null
- [SPARK-48445][SQL] Don't inline UDFs with expensive children
- [SPARK-48503][SQL] Fix invalid scalar subqueries with group-by on non-equivalent columns that were incorrectly allowed
- [SPARK-48383][SS] Throw better error for mismatched partitions in startOffset option in Kafka
- Operating system security updates.
June 17, 2024
- [SPARK-48277] Improve error message for ErrorClassesJsonReader.getErrorMessage
- Miscellaneous bug fixes.
May 21, 2024
- [SPARK-48105][SS] Fix the race condition between state store unloading and snapshotting
- Operating system security updates.
May 9, 2024
- [SPARK-44251][SQL] Set nullable correctly on coalesced join key in full outer USING join
- [SPARK-47973][CORE] Log call site in SparkContext.stop() and later in SparkContext.assertNotStopped()
- [SPARK-47956][SQL] Sanity check for unresolved LCA reference
- [SPARK-48018][SS] Fix null groupId causing missing param error when throwing KafkaException.couldNotReadOffsetRange
- Operating system security updates.
April 25, 2024
- Operating system security updates.
April 11, 2024
- Operating system security updates.
April 1, 2024
- [SPARK-47305][SQL] Fix PruneFilters to tag the isStreaming flag of LocalRelation correctly when the plan has both batch and streaming
- [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails
- [SPARK-47135][SS] Implement error classes for Kafka data loss exceptions
- [SPARK-47200][SS] Error class for Foreach batch sink user function error
- Operating system security updates.
March 14, 2024
- [SPARK-47176][SQL] Have a ResolveAllExpressionsUpWithPruning helper function
- Revert “[SPARK-46861][CORE] Avoid Deadlock in DAGScheduler”
- [SPARK-47125][SQL] Return null if Univocity never triggers parsing
- [SPARK-47167][SQL] Add concrete class for JDBC anonymous relation
- Operating system security updates.
February 29, 2024
- Fixed an issue where using a local collection as source in a MERGE command could result in the operation metric numSourceRows reporting double the correct number of rows.
- Creating a schema with a defined location now requires the user to have SELECT and MODIFY privileges on ANY FILE.
- [SPARK-45582][SS] Ensure that store instance is not used after calling commit within output mode streaming aggregation
- Operating system security updates.
February 13, 2024
- [SPARK-46861] Avoid Deadlock in DAGScheduler.
- [SPARK-46794] Remove subqueries from LogicalRDD constraints.
- Operating system security updates.
January 31, 2024
- [SPARK-46763] Fix assertion failure in ReplaceDeduplicateWithAggregate for duplicate attributes.
- Operating system security updates.
December 25, 2023
- To avoid increased latency when communicating over TLSv1.3, this maintenance release includes a patch to the JDK 8 installation to fix JDK bug JDK-8293562.
- [SPARK-39440] Add a config to disable event timeline.
- [SPARK-46132] Support key password for JKS keys for RPC SSL.
- [SPARK-46394] Fix spark.catalog.listDatabases() issues on schemas with special characters when spark.sql.legacy.keepCommandOutputSchema set to true.
- [SPARK-46417] Do not fail when calling hive.getTable and throwException is false.
- [SPARK-43067] Correct the location of error class resource file in Kafka connector.
- [SPARK-46249] Require instance lock for acquiring RocksDB metrics to prevent race with background operations.
- [SPARK-46602] Propagate allowExisting in view creation when the view/table does not exists.
- [SPARK-46058] Add separate flag for privateKeyPassword.
- [SPARK-46145] spark.catalog.listTables does not throw exception when the table or view is not found.
- [SPARK-46538] Fix the ambiguous column reference issue in ALSModel.transform.
- [SPARK-42852] Revert NamedLambdaVariable related changes from EquivalentExpressions.
December 14, 2023
- Fixed an issue where escaped underscores in getColumns operations originating from JDBC or ODBC clients were handled incorrectly and interpreted as wildcards.
- [SPARK-44582] Skip iterator on SMJ if it was cleaned up.
- [SPARK-45920] group by ordinal should be idempotent.
- [SPARK-45655] Allow non-deterministic expressions inside AggregateFunctions in CollectMetrics.
- Operating system security updates.
November 29, 2023
- Installed a new package, pyarrow-hotfix to remediate a PyArrow RCE vulnerability.
- Fixed an issue where escaped underscores in getColumns operations originating from JDBC or ODBC clients were wrongly interpreted as wildcards.
- [SPARK-42205] Removed logging accumulables in Stage and Task start events.
- [SPARK-44846] Removed complex grouping expressions after RemoveRedundantAggregates.
- [SPARK-43718] Fixed nullability for keys in USING joins.
- [SPARK-45544] Integrated SSL support into TransportContext.
- [SPARK-43973] Structured Streaming UI now displays failed queries correctly.
- [SPARK-45730] Improved time constraints for ReloadingX509TrustManagerSuite.
- [SPARK-45859] Made UDF objects in ml.functions lazy.
- Operating system security updates.
November 14, 2023
- Partition filters on Delta Lake streaming queries are pushed down before rate limiting to achieve better utilization.
- [SPARK-45545] SparkTransportConf inherits SSLOptions upon creation.
- [SPARK-45427] Added RPC SSL settings to SSLOptions and SparkTransportConf.
- [SPARK-45584] Fixed subquery run failure with TakeOrderedAndProjectExec.
- [SPARK-45541] Added SSLFactory.
- [SPARK-45430] FramelessOffsetWindowFunction no longer fails when IGNORE NULLS and offset > rowCount.
- [SPARK-45429] Added helper classes for SSL RPC communication.
- Operating system security updates.
October 24, 2023
- [SPARK-45426] Added support for ReloadingX509TrustManager.
- Miscellaneous fixes.
October 13, 2023
- Snowflake-jdbc dependency upgraded from 3.13.29 to 3.13.33.
- [SPARK-42553] Ensure at least one time unit after interval.
- [SPARK-45346] Parquet schema inference respects case sensitive flag when merging schema.
- [SPARK-45178] Fallback to running a single batch for Trigger.AvailableNow with unsupported sources rather than using the wrapper.
- [SPARK-45084] StateOperatorProgress to use an accurate, adequate shuffle partition number.
September 12, 2023
- [SPARK-44873] Added support for alter view with nested columns in the Hive client.
- [SPARK-44718] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value.
- [SPARK-43799] Added descriptor binary option to PySpark Protobuf API.
- Miscellaneous fixes.
August 30, 2023
- [SPARK-44485] Optimized TreeNode.generateTreeString.
- [SPARK-44818] Fixed race for pending task interrupt issued before taskThread is initialized.
- [SPARK-44871][11.3-13.0] Fixed percentile_disc behavior.
- [SPARK-44714] Eased restriction of LCA resolution regarding queries.
- Operating system security updates.
August 15, 2023
- [SPARK-44504] Maintenance task cleans up loaded providers on stop error.
- [SPARK-44464] Fixed applyInPandasWithStatePythonRunner to output rows that have Null as the first column value.
- Operating system security updates.
July 29, 2023
- Fixed an issue where dbutils.fs.ls() returned INVALID_PARAMETER_VALUE.LOCATION_OVERLAP when called for a storage location path which clashed with other external or managed storage location.
- [SPARK-44199] CacheManager no longer refreshes the fileIndex unnecessarily.
- Operating system security updates.
July 24, 2023
- [SPARK-44337] Fixed an issue where any field set to Any.getDefaultInstance caused parse errors.
- [SPARK-44136] Fixed an issue where StateManager would get materialized in an executor instead of the driver in FlatMapGroupsWithStateExec.
- Operating system security updates.
June 23, 2023
- Operating system security updates.
June 15, 2023
- Photonized approx_count_distinct.
- Snowflake-jdbc library is upgraded to 3.13.29 to address a security issue.
- [SPARK-43779] ParseToDate now loads EvalMode in the main thread.
- [SPARK-43156][SPARK-43098] Extended scalar subquery count error test with decorrelateInnerQuery turned off.
- Operating system security updates.
June 2, 2023
- The JSON parser in failOnUnknownFields mode drops a record in DROPMALFORMED mode and fails directly in FAILFAST mode.
- Improve the performance of incremental updates with SHALLOW CLONE Apache Iceberg and Apache Parquet.
- Fixed an issue in Auto Loader where different source file formats were inconsistent when the provided schema did not include inferred partitions. This issue could cause unexpected failures when reading files with missing columns in the inferred partition schema.
- [SPARK-43404] Skip reusing the sst file for the same version of RocksDB state store to avoid the ID mismatch error.
- [SPARK-43413][11.3-13.0] Fixed IN subquery ListQuery nullability.
- [SPARK-43522] Fixed creating struct column name with index of array.
- [SPARK-43541] Propagate all Project tags in resolving of expressions and missing columns.
- [SPARK-43527] Fixed catalog.listCatalogs in PySpark.
- [SPARK-43123] Internal field metadata no longer leaks to catalogs.
- [SPARK-43340] Fixed missing stack trace field in eventlogs.
- [SPARK-42444] DataFrame.drop now handles duplicated columns correctly.
- [SPARK-42937] PlanSubqueries now sets InSubqueryExec#shouldBroadcast to true.
- [SPARK-43286] Updated aes_encrypt CBC mode to generate random IVs.
- [SPARK-43378] Properly close stream objects in deserializeFromChunkedBuffer.
May 17, 2023
- Parquet scans are now robust against OOMs when scanning exceptionally structured files by dynamically adjusting batch size. File metadata is analyzed to preemptively lower batch size and is lowered again on task retries as a final safety net.
- If an Avro file was read with just the failOnUnknownFields option or with Auto Loader in the failOnNewColumns schema evolution mode, columns that have different data types would be read as null instead of throwing an error stating that the file cannot be read. These reads now fail and recommend users to use the rescuedDataColumn option.
- Auto Loader now does the following.
- - Correctly reads and no longer rescues Integer, Short, and Byte types if one of these data types is provided, but the Avro file suggests one of the other two types.
- - Prevents reading interval types as date or time stamp types to avoid getting corrupt dates.
- - Prevents reading Decimal types with lower precision.
- [SPARK-43172] Exposes host and token from Spark connect client.
- [SPARK-43293] __qualified_access_only is ignored in normal columns.
- [SPARK-43098] Fixed correctness COUNT bug when scalar subquery is grouped by clause.
- [SPARK-43085] Support for column DEFAULT assignment for multi-part table names.
- [SPARK-43190] ListQuery.childOutput is now consistent with secondary output.
- [SPARK-43192] Removed user agent charset validation.
- Operating system security updates.
April 25, 2023
- If a Parquet file was read with just the failOnUnknownFields option or with Auto Loader in the failOnNewColumns schema evolution mode, columns that had different data types would be read as null instead of throwing an error stating that the file cannot be read. These reads now fail and recommend users to use the rescuedDataColumn option.
- Auto Loader now correctly reads and no longer rescues Integer, Short, and Byte types if one of these data types is provided. The Parquet file suggests one of the other two types. When the rescued data column was previously enabled, the data type mismatch would cause columns to be saved even though they were readable.
- [SPARK-43009] Parameterized sql() with Any constants
- [SPARK-42406] Terminate Protobuf recursive fields by dropping the field
- [SPARK-43038] Support the CBC mode by aes_encrypt()/aes_decrypt()
- [SPARK-42971] Change to print workdir if appDirs is null when worker handle WorkDirCleanup event
- [SPARK-43018] Fix bug for INSERT commands with timestamp literals
- Operating system security updates.
April 11, 2023
- Support legacy data source formats in the SYNC command.
- Fixes an issue in the %autoreload behavior in notebooks outside of a repo.
- Fixed an issue where Auto Loader schema evolution can go into an infinite fail loop when a new column is detected in the schema of a nested JSON object.
- [SPARK-42928] Makes resolvePersistentFunction synchronized.
- [SPARK-42936] Fixes LCan issue when the clause can be resolved directly by its child aggregate.
- [SPARK-42967] Fixes SparkListenerTaskStart.stageAttemptId when a task starts after the stage is canceled.
- Operating system security updates.
March 29, 2023
- Databricks SQL now supports specifying default values for columns of Delta Lake tables, either at table creation time or afterward. Subsequent INSERT, UPDATE, DELETE, and MERGE commands can refer to any column's default value using the explicit DEFAULT keyword. In addition, if any INSERT assignment has an explicit list of fewer columns than the target table, corresponding column default values are substituted for the remaining columns (or NULL if no default is specified).
  
  For example:
  SQL
```
CREATE TABLE t (first INT, second DATE DEFAULT CURRENT_DATE());
INSERT INTO t VALUES (0, DEFAULT);
INSERT INTO t VALUES (1, DEFAULT);
SELECT first, second FROM t;
> 0, 2023-03-28
1, 2023-03-28z
```
- Auto Loader now initiates at least one synchronous RocksDB log cleanup for Trigger.AvailableNow streams to check that the checkpoint can get regularly cleaned up for fast-running Auto Loader streams. This can cause some streams to take longer before they shut down, but it will save you storage costs and improve the Auto Loader experience in future runs.
- You can now modify a Delta table to add support to table features using DeltaTable.addFeatureSupport(feature_name).
- [SPARK-42794] Increase the lockAcquireTimeoutMs to 2 minutes for acquiring the RocksDB state store in Structure Streaming
- [SPARK-42521] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table
- [SPARK-42702][SPARK-42623] Support parameterized query in subquery and CTE
- [SPARK-42668] Catch exception while trying to close the compressed stream in HDFSStateStoreProvider stop
- [SPARK-42403] JsonProtocol should handle null JSON strings
March 8, 2023
- The error message “Failure to initialize configuration” has been improved to provide more context for the customer.
- There is a terminology change for adding features to a Delta table using the table property. The preferred syntax is now 'delta.feature.featureName'='supported' instead of 'delta.feature.featureName'='enabled'. For backward compatibility, using 'delta.feature.featureName'='enabled' still works and will continue to work.
- Starting from this release, it is possible to create/replace a table with an additional table property delta.ignoreProtocolDefaults to ignore protocol-related Spark configs, which includes default reader and writer versions and table features supported by default.
- [SPARK-42070] Change the default value of the argument of the Mask function from -1 to NULL
- [SPARK-41793] Incorrect result for window frames defined by a range clause on significant decimals
- [SPARK-42484] UnsafeRowUtils better error message
- [SPARK-42516] Always capture the session time zone config while creating views
- [SPARK-42635] Fix the TimestampAdd expression.
- [SPARK-42622] Turned off substitution in values
- [SPARK-42534] Fix DB2Dialect Limit clause
- [SPARK-42121] Add built-in table-valued functions posexplode, posexplode_outer, json_tuple and stack
- [SPARK-42045] ANSI SQL mode: Round/Bround should return an error on tiny/small/significant integer overflow
- Operating system security updates.

Databricks Runtime 9.1 LTS

See Databricks Runtime 9.1 LTS.

April 9, 2025
- Operating system security updates.

March 11, 2025
- Operating system security updates.

February 11, 2025
- Operating system security updates.

December 10, 2024
- Operating system security updates.
November 26, 2024
- Operating system security updates.
November 5, 2024
- Operating system security updates.
October 22, 2024
- Operating system security updates.
October 10, 2024
- Operating system security updates.
September 25, 2024
- [SPARK-49000][SQL] Fix “select count(distinct 1) from t” where t is empty table by expanding RewriteDistinctAggregates
- Operating system security updates.
September 6, 2024
- Operating system security updates.
August 29, 2024
- [SPARK-49065][SQL] Rebasing in legacy formatters/parsers must support non JVM default time zones
August 14, 2024
August 1, 2024
- Operating system security updates.
July 11, 2024
- Operating system security updates.
June 17, 2024
- Operating system security updates.
May 21, 2024
- [SPARK-48105][SS] Fix the race condition between state store unloading and snapshotting
- Operating system security updates.
May 9, 2024
- [SPARK-47973][CORE] Log call site in SparkContext.stop() and later in SparkContext.assertNotStopped()
- [SPARK-44251][SQL] Set nullable correctly on coalesced join key in full outer USING join
- Operating system security updates.
April 25, 2024
- Miscellaneous bug fixes.
April 11, 2024
- Operating system security updates.
April 1, 2024
- Revert “[SPARK-46861][CORE] Avoid Deadlock in DAGScheduler”
- Operating system security updates.
March 14, 2024
- Operating system security updates.
February 29, 2024
- Fixed an issue where using a local collection as source in a MERGE command could result in the operation metric numSourceRows reporting double the correct number of rows.
- Operating system security updates.
February 13, 2024
- [SPARK-46861] Avoid Deadlock in DAGScheduler.
- Operating system security updates.
January 31, 2024
- Operating system security updates.
December 25, 2023
- To avoid increased latency when communicating over TLSv1.3, this maintenance release includes a patch to the JDK 8 installation to fix JDK bug JDK-8293562.
- [SPARK-46058] Add separate flag for privateKeyPassword.
- [SPARK-39440] Add a config to disable event timeline.
- [SPARK-46132] Support key password for JKS keys for RPC SSL.
December 14, 2023
- Operating system security updates.
November 29, 2023
- Installed a new package, pyarrow-hotfix to remediate a PyArrow RCE vulnerability.
- [SPARK-45859] Made UDF objects in ml.functions lazy.
- [SPARK-45544] Integrated SSL support into TransportContext.
- [SPARK-45730] Improved time constraints for ReloadingX509TrustManagerSuite.
- Operating system security updates.
November 14, 2023
- [SPARK-45545] SparkTransportConf inherits SSLOptions upon creation.
- [SPARK-45429] Added helper classes for SSL RPC communication.
- [SPARK-45427] Added RPC SSL settings to SSLOptions and SparkTransportConf.
- [SPARK-45584] Fixed subquery run failure with TakeOrderedAndProjectExec.
- [SPARK-45541] Added SSLFactory.
- [SPARK-42205] Removed logging accumulables in Stage and Task start events.
- Operating system security updates.
October 24, 2023
- [SPARK-45426] Added support for ReloadingX509TrustManager.
- Operating system security updates.
October 13, 2023
- Operating system security updates.
September 10, 2023
- Miscellaneous fixes.
August 30, 2023
- Operating system security updates.
August 15, 2023
- Operating system security updates.
June 23, 2023
- Snowflake-jdbc library is upgraded to 3.13.29 to address a security issue.
- Operating system security updates.
June 15, 2023
- [SPARK-43098] Fix correctness COUNT bug when scalar subquery has a group by clause.
- [SPARK-43156][SPARK-43098] Extend scalar subquery count bug test with decorrelateInnerQuery turned off.
- [SPARK-40862] Support non-aggregated subqueries in RewriteCorrelatedScalarSubquery.
- Operating system security updates.
June 2, 2023
- The JSON parser in failOnUnknownFields mode drops a record in DROPMALFORMED mode and fails directly in FAILFAST mode.
- Fixed an issue in JSON rescued data parsing to prevent UnknownFieldException.
- Fixed an issue in Auto Loader where different source file formats were inconsistent when the provided schema did not include inferred partitions. This issue could cause unexpected failures when reading files with missing columns in the inferred partition schema.
- [SPARK-37520] Add the startswith() and endswith() string functions
- [SPARK-43413] Fixed IN subquery ListQuery nullability.
- Operating system security updates.
May 17, 2023
- Operating system security updates.
April 25, 2023
- Operating system security updates.
April 11, 2023
- Fixed an issue where Auto Loader schema evolution can go into an infinite fail loop when a new column is detected in the schema of a nested JSON object.
- [SPARK-42967] Fix SparkListenerTaskStart.stageAttemptId when a task is started after the stage is canceled.
March 29, 2023
- Operating system security updates.
March 14, 2023
- [SPARK-42484] Improved error message for UnsafeRowUtils.
- Miscellaneous fixes.
February 28, 2023
- Users can now read and write specific Delta tables requiring Reader version 3 and Writer version 7, using Databricks Runtime 9.1 LTS or later. To succeed, table features listed in the tables' protocol must be supported by the current version of Databricks Runtime.
- Operating system security updates.
February 16, 2023
- Operating system security updates.
January 31, 2023
- Table types of JDBC tables are now EXTERNAL by default.
January 18, 2023
- Operating system security updates.
November 29, 2022
- Fixed an issue with JSON parsing in Auto Loader when all columns were left as strings (cloudFiles.inferColumnTypes was not set or set to false) and the JSON contained nested objects.
- Operating system security updates.
November 15, 2022
- Upgraded Apache commons-text to 1.10.0.
- Operating system security updates.
- Miscellaneous fixes.
November 1, 2022
- Fixed an issue where if a Delta table had a user-defined column named _change_type, but Change data feed was turned off on that table, data in that column would incorrectly fill with NULL values when running MERGE.
- Fixed an issue with Auto Loader where a file can be duplicated in the same micro-batch when allowOverwrites is enabled
- [SPARK-40596] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo
- Operating system security updates.
October 18, 2022
- Operating system security updates.
October 5, 2022
- Miscellaneous fixes.
- Operating system security updates.
September 22, 2022
- Users can set spark.conf.set(“spark.databricks.io.listKeysWithPrefix.azure.enabled”, “true”) to re-enable the built-in listing for Auto Loader on ADLS. Built-in listing was previously turned off due to performance issues but can have led to increased storage costs for customers.
- [SPARK-40315] Add hashCode() for Literal of ArrayBasedMapData
- [SPARK-40089] Fix sorting for some Decimal types
- [SPARK-39887] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique
September 6, 2022
- [SPARK-40235] Use interruptible lock instead of synchronized in Executor.updateDependencies()
- [SPARK-35542] Fix: Bucketizer created for multiple columns with parameters splitsArray, inputCols and outputCols can not be loaded after saving it
- [SPARK-40079] Add Imputer inputCols validation for empty input case
August 24, 2022
- [SPARK-39666] Use UnsafeProjection.create to respect spark.sql.codegen.factoryMode in ExpressionEncoder
- [SPARK-39962] Apply projection when group attributes are empty
- Operating system security updates.
August 9, 2022
- Operating system security updates.
July 27, 2022
- Make Delta MERGE operation results consistent when the source is non-deterministic.
- [SPARK-39689] Support for 2-chars lineSep in CSV data source
- [SPARK-39575] Added ByteBuffer#rewind after ByteBuffer#get in AvroDeserializer.
- [SPARK-37392] Fixed the performance error for catalyst optimizer.
- Operating system security updates.
July 13, 2022
- [SPARK-39419] ArraySort throws an exception when the comparator returns null.
- Turned off Auto Loader's use of built-in cloud APIs for directory listing on Azure.
- Operating system security updates.
July 5, 2022
- Operating system security updates.
- Miscellaneous fixes.
June 15, 2022
- [SPARK-39283] Fix deadlock between TaskMemoryManager and UnsafeExternalSorter.SpillableIterator.
June 2, 2022
- [SPARK-34554] Implement the copy() method in ColumnarMap.
- Operating system security updates.
May 18, 2022
- Fixed a potential built-in memory leak in Auto Loader.
- Upgrade AWS SDK version from 1.11.655 to 1.11.678.
- [SPARK-38918] Nested column pruning should filter out attributes that do not belong to the current relation
- [SPARK-39084] Fix df.rdd.isEmpty() by using TaskContext to stop iterator on task completion
- Operating system security updates.
April 19, 2022
- Operating system security updates.
- Miscellaneous fixes.
April 6, 2022
- [SPARK-38631] Uses Java-based implementation for un-tarring at Utils.unpack.
- Operating system security updates.
March 22, 2022
- Changed the current working directory of notebooks on High Concurrency clusters with either table access control or credential passthrough enabled to the user's home directory. Previously, the active directory was /databricks/driver.
- [SPARK-38437] Lenient serialization of datetime from datasource
- [SPARK-38180] Allow safe up-cast expressions in correlated equality predicates
- [SPARK-38155] Disallow distinct aggregate in lateral subqueries with unsupported predicates
- [SPARK-27442] Removed a check field when reading or writing data in a parquet.
March 14, 2022
- [SPARK-38236] Absolute file paths specified in the create/alter table are treated as relative
- [SPARK-34069] Interrupt task thread if local property SPARK_JOB_INTERRUPT_ON_CANCEL is set to true.
February 23, 2022
- [SPARK-37859] SQL tables created with JDBC with Spark 3.1 are not readable with Spark 3.2.
February 8, 2022
- [SPARK-27442] Removed a check field when reading or writing data in a parquet.
- Operating system security updates.
February 1, 2022
- Operating system security updates.
January 26, 2022
- Fixed an issue where concurrent transactions on Delta tables could commit in a non-serializable order under certain rare conditions.
- Fixed an issue where the OPTIMIZE command could fail when the ANSI SQL dialect was enabled.
January 19, 2022
- Minor fixes and security enhancements.
- Operating system security updates.
November 4, 2021
- Fixed an issue that could cause Structured Streaming streams to fail with an ArrayIndexOutOfBoundsException.
- Fixed a race condition that might cause a query failure with an IOException like java.io.IOException: No FileSystem for scheme or that might cause modifications to sparkContext.hadoopConfiguration to not take effect in queries.
- The Apache Spark Connector for Delta Sharing was upgraded to 0.2.0.
October 20, 2021
- Upgraded BigQuery connector from 0.18.1 to 0.22.2. This adds support for the BigNumeric type.

Databricks Runtime releases​

Databricks Runtime 18.0​

Databricks Runtime 17.3 LTS​

Databricks Runtime 17.2​

Databricks Runtime 17.1​

Databricks Runtime 16.4 LTS​

Databricks Runtime 16.2​

Databricks Runtime 15.4 LTS​

Databricks Runtime 14.3 LTS​

Databricks Runtime 13.3 LTS​

Databricks Runtime 12.2 LTS​

Databricks Runtime 9.1 LTS​