Databricks Runtime 13.1 (EoS)

note

Support for this Databricks Runtime version has ended. For the end-of-support date, see End-of-support history. For all supported Databricks Runtime versions, see Databricks Runtime release notes versions and compatibility.

The following release notes provide information about Databricks Runtime 13.1, powered by Apache Spark 3.4.0.

Databricks released this version in May 2023.

New features and improvements

Cluster support for JDK 17 (Public Preview)
Add, change, or delete data in streaming tables
Read Kafka with SQL
New SQL built-in functions
Unity Catalog support for cluster-scoped Python libraries
Expanded default enablement for optimized writes in Unity Catalog
Advanced support for stateful operators in Structured Streaming workloads
Delta clone for Unity Catalog is in Public Preview
Pub/Sub support for Structured Streaming
Drop duplicates within watermarks in Structured Streaming
Trigger available now is supported for Kinesis data sources
Expanded support for Delta conversions from Apache Iceberg tables with truncated partition columns
Stream schema changes with column mapping in Delta Lake
Remove START VERSION
New H3 expressions available with Python

Cluster support for JDK 17 (Public Preview)

Databricks now provides cluster support for Java Development Kit (JDK) 17. See Databricks SDK for Java.

Add, change, or delete data in streaming tables

You can now use DML statements to modify streaming tables published to Unity Catalog by Lakeflow Declarative Pipelines. See Add, change, or delete data in a streaming table and Add, change, or delete data in a target streaming table. You can also use DML statements to modify streaming tables created in Databricks SQL.

Read Kafka with SQL

You can now use the read_kafka SQL function for reading Kafka data. Streaming with SQL is supported only in DLT or with streaming tables in Databricks SQL. See read_kafka table-valued function.

New SQL built-in functions

The following functions have been added:

array_prepend(array, elem) Returns array prepended by elem.
try_aes_decrypt(expr, key [, mode [, padding]]) Decrypts a binary produced using AES encryption, and returns NULL if there is an error.
sql_keywords() Returns a table of Databricks SQL keywords.

Unity Catalog support for cluster-scoped Python libraries

Unity Catalog has some limitations on library usage. On Databricks Runtime 13.1 and above, cluster-scoped Python libraries are supported, including Python wheel files that are uploaded as workspace files. Libraries that are referenced using DBFS filepaths are not supported, whether in the DBFS root or an external location mounted to DBFS. Non-Python libraries are not supported. See Compute-scoped libraries.

On Databricks Runtime 13.0 and below, cluster-scoped libraries are not supported on clusters that use standard access mode (formerly shared access mode) in a Unity Catalog-enabled workspace.

Expanded default enablement for optimized writes in Unity Catalog

Default optimized write support for Delta tables registered in Unity Catalog has expanded to include CTAS statements and INSERT operations for partitioned tables. This behavior aligns to defaults on SQL warehouses. See Optimized writes for Delta Lake on Databricks.

Advanced support for stateful operators in Structured Streaming workloads

You can now chain multiple stateful operators together, meaning that you can feed the output of an operation such as a windowed aggregation to another stateful operation such as a join. See What is stateful streaming?.

Delta clone for Unity Catalog is in Public Preview

You can now use shallow clone to create new Unity Catalog managed tables from existing Unity Catalog managed tables. See Shallow clone for Unity Catalog tables.

Pub/Sub support for Structured Streaming

You can now use a built-in connector to subscribe to Google Pub/Sub with Structured Streaming. See Subscribe to Google Pub/Sub.

Drop duplicates within watermarks in Structured Streaming

You can now use dropDuplicatesWithinWatermark in combination with a specified watermark threshold to deduplicate records in Structured Streaming. See Drop duplicates within watermark.

Trigger available now is supported for Kinesis data sources

You can now use Trigger.AvailableNow to consume records from Kinesis as an incremental batch with Structured Streaming. See Ingest Kinesis records as an incremental batch.

Expanded support for Delta conversions from Apache Iceberg tables with truncated partition columns

You can now use CLONE and CONVERT TO DELTA with Apache Iceberg tables that have partitions defined on truncated columns of types int, long, and string. Truncated columns of type decimal are not supported.

Stream schema changes with column mapping in Delta Lake

You now can provide a schema tracking location to enable streaming from Delta tables with column mapping enabled. See Streaming with column mapping and schema changes.

Remove START VERSION

START VERSION is now deprecated for ALTER SHARE.

New H3 expressions available with Python

The h3_coverash3 and h3_coverash3string expressions are available with Python.

Bug fixes

Parquet failOnUnknownFields no longer silently drop data on type mismatch

If a Parquet file was read with just the failOnUnknownFields option or with Auto Loader in the failOnNewColumns schema evolution mode, columns that have different data types now fail and recommend using rescuedDataColumn. Auto Loader now correctly reads and no longer rescues Integer, Short, or Byte types if one of these data types is provided. The Parquet file suggests one of the other two types.

Breaking changes

Upgrade sqlite-jdbc version to 3.42.0.0 to address CVE-2023-32697

Upgrade sqlite-jdbc version from 3.8.11.2 to 3.42.0.0. The APIs of version 3.42.0.0 are not fully compatible with 3.8.11.2. If using sqlite-jdbc in your code, check the sqlite-jdbc compatibility report for details. If you migrate to 13.1 and use sqlite, confirm your methods and return type in version 3.42.0.0.

Library upgrades

Upgraded Python libraries:
- facets-overview from 1.0.2 to 1.0.3
- filelock from 3.10.7 to 3.12.0
- pyarrow from 7.0.0 to 8.0.0
- tenacity from 8.0.1 to 8.1.0
Upgraded R libraries:
Upgraded Java libraries:
- com.github.ben-manes.caffeine.caffeine from 2.3.4 to 2.9.3
- io.delta.delta-sharing-spark_2.12 from 0.6.8 to 0.6.4
- net.snowflake.snowflake-jdbc from 3.13.29 to 3.13.22
- org.checkerframework.checker-qual from 3.5.0 to 3.19.0
- org.scalactic.scalactic_2.12 from 3.0.8 to 3.2.15
- org.scalatest.scalatest_2.12 from 3.0.8 to 3.2.15
- org.xerial.sqlite-jdbc from 3.8.11.2 to 3.42.0.0

Apache Spark

Databricks Runtime 13.1 includes Apache Spark 3.4.0. This release includes all Spark fixes and improvements included in Databricks Runtime 13.0 (EoS), as well as the following additional bug fixes and improvements made to Spark:

[SPARK-42719] [DBRRM-199][sc-131578] Revert “[SC-125225] `MapOutputTracker#getMap…
[SPARK-39696] [DBRRM-166][sc-130056][CORE] Revert [SC-127830]/
[SPARK-43331] [SC-130064][connect] Add Spark Connect SparkSession.interruptAll
[SPARK-43332] [SC-130051][connect][PYTHON] Make it possible to extend ChannelBuilder for SparkConnectClient
[SPARK-43323] [SC-129966][sql][PYTHON] Fix DataFrame.toPandas with Arrow enabled to handle exceptions properly
[SPARK-42940] [SC-129896][ss][CONNECT] Improve session management for streaming queries
[SPARK-43032] [SC-125756] [CONNECT][ss] Add Streaming query manager
[SPARK-16484] [SC-129975][sql] Add support for Datasketches HllSketch
[SPARK-43260] [SC-129281][python] Migrate the Spark SQL pandas arrow type errors into error class.
[SPARK-41766] [SC-129964][core] Handle decommission request sent before executor registration
[SPARK-43307] [SC-129971][python] Migrate PandasUDF value errors into error class
[SPARK-43206] [SC-129903] [SS] [CONNECT] StreamingQuery exception() include stack trace
[SPARK-43311] [SC-129905][ss] Add RocksDB state store provider memory management enhancements
[SPARK-43237] [SC-129898][core] Handle null exception message in event log
[SPARK-43320] [SC-129899][sql][HIVE] Directly call Hive 2.3.9 API
[SPARK-43270] [SC-129897][python] Implement __dir__() in pyspark.sql.dataframe.DataFrame to include columns
[SPARK-43183] Revert “[SC-128938][ss] Introduce a new callback “…
[SPARK-43143] [SC-129902] [SS] [CONNECT] Scala StreamingQuery awaitTermination()
[SPARK-43257] [SC-129675][sql] Replace the error class _LEGACY_ERROR_TEMP_2022 by an internal error
[SPARK-43198] [SC-129470][connect] Fix “Could not initialise class ammonite…” error when using filter
[SPARK-43165] [SC-129777][sql] Move canWrite to DataTypeUtils
[SPARK-43298] [SC-129729][python][ML] predict_batch_udf with scalar input fails with batch size of one
[SPARK-43298] [SC-129700]Revert “[PYTHON][ml] predict_batch_udf with scalar input fails with batch size of one”
[SPARK-43052] [SC-129663][core] Handle stacktrace with null file name in event log
[SPARK-43183] [SC-128938][ss] Introduce a new callback “onQueryIdle” to StreamingQueryListener
[SPARK-43209] [SC-129190][connect][PYTHON] Migrate Expression errors into error class
[SPARK-42151] [SC-128754][sql] Align UPDATE assignments with table attributes
[SPARK-43134] [SC-129468] [CONNECT] [SS] JVM client StreamingQuery exception() API
[SPARK-43298] [SC-129699][python][ML] predict_batch_udf with scalar input fails with batch size of one
[SPARK-43248] [SC-129660][sql] Unnecessary serialize/deserialize of Path on parallel gather partition stats
[SPARK-43274] [SC-129464][spark-43275][PYTHON][connect] Introduce PySparkNotImplementedError
[SPARK-43146] [SC-128804][connect][PYTHON] Implement eager evaluation for repr and repr_html
[SPARK-42953] [SC-129469][connect][Followup] Fix maven test build for Scala client UDF tests
[SPARK-43144] [SC-129280] Scala Client DataStreamReader table() API
[SPARK-43136] [SC-129358][connect] Adding groupByKey + mapGroup + coGroup functions
[SPARK-43156] [SC-129672][sc-128532][SQL] Fix COUNT(*) is null bug in correlated scalar subquery
[SPARK-43046] [SC-129110] [SS] [Connect] Implemented Python API dropDuplicatesWithinWatermark for Spark Connect
[SPARK-43199] [SC-129467][sql] Make InlineCTE idempotent
[SPARK-43293] [SC-129657][sql] __qualified_access_only should be ignored in normal columns
[SPARK-43276] [SC-129461][connect][PYTHON] Migrate Spark Connect Window errors into error class
[SPARK-43174] [SC-129109][sql] Fix SparkSQLCLIDriver completer
[SPARK-43084] [SC-128654] [SS] Add applyInPandasWithState support for spark connect
[SPARK-43119] [SC-129040][sql] Support Get SQL Keywords Dynamically Thru JDBC API and TVF
[SPARK-43082] [SC-129112][connect][PYTHON] Arrow-optimized Python UDFs in Spark Connect
[SPARK-43085] [SC-128432][sql] Support column DEFAULT assignment for multi-part table names
[SPARK-43226] [LC-671] Define extractors for file-constant metadata
[SPARK-43210] [SC-129189][connect][PYTHON] Introduce PySparkAssertionError
[SPARK-43214] [SC-129199][sql] Post driver-side metrics for LocalTableScanExec/CommandResultExec
[SPARK-43285] [SC-129347] Fix ReplE2ESuite consistently failing with JDK 17
[SPARK-43268] [SC-129249][sql] Use proper error classes when exceptions are constructed with a message
[SPARK-43142] [SC-129299] Fix DSL expressions on attributes with special characters
[SPARK-43129] [SC-128896] Scala core API for streaming Spark Connect
[SPARK-43233] [SC-129250] [SS] Add logging for Kafka Batch Reading for topic partition, offset range and task ID
[SPARK-43249] [SC-129195][connect] Fix missing stats for SQL Command
[SPARK-42945] [SC-129188][connect] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect
[SPARK-43178] [SC-129197][connect][PYTHON] Migrate UDF errors into PySpark error framework
[SPARK-43123] [SC-128494][sql] Internal field metadata should not be leaked to catalogs
[SPARK-43217] [SC-129205] Correctly recurse in nested maps/arrays in findNestedField
[SPARK-43243] [SC-129294][python][CONNECT] Add level param to printSchema for Python
[SPARK-43230] [SC-129191][connect] Simplify DataFrameNaFunctions.fillna
[SPARK-43088] [SC-128403][sql] Respect RequiresDistributionAndOrdering in CTAS/RTAS
[SPARK-43234] [SC-129192][connect][PYTHON] Migrate ValueError from Conect DataFrame into error class
[SPARK-43212] [SC-129187][ss][PYTHON] Migrate Structured Streaming errors into error class
[SPARK-43239] [SC-129186][ps] Remove null_counts from info()
[SPARK-43190] [SC-128930][sql] ListQuery.childOutput should be consistent with child output
[SPARK-43191] [SC-128924][core] Replace reflection w/ direct calling for Hadoop CallerContext
[SPARK-43193] [SC-129042][ss] Remove workaround for HADOOP-12074
[SPARK-42657] [SC-128621][connect] Support to find and transfer client-side REPL classfiles to server as artifacts
[SPARK-43098] [SC-77059][sql] Fix correctness COUNT bug when scalar subquery has group by clause
[SPARK-43213] [SC-129062][python] Add DataFrame.offset to vanilla PySpark
[SPARK-42982] [SC-128400][connect][PYTHON] Fix createDataFrame to respect the given schema ddl
[SPARK-43124] [SC-129011][sql] Dataset.show projects CommandResults locally
[SPARK-42998] [SC-127422][connect][PYTHON] Fix DataFrame.collect with null struct
[SPARK-41498] [SC-125343]Revert ” Propagate metadata through Union”
[SPARK-42960] [SC-129010] [CONNECT] [SS] Add await_termination() and exception() API for Streaming Query in Python
[SPARK-42552] [SC-128824][sql] Correct the two-stage parsing strategy of antlr parser
[SPARK-43207] [SC-128937][connect] Add helper functions to extract value from literal expression
[SPARK-43186] [SC-128841][sql][HIVE] Remove workaround for FileSinkDesc
[SPARK-43107] [SC-128533][sql] Coalesce buckets in join applied on broadcast join stream side
[SPARK-43195] [SC-128922][core] Remove unnecessary serializable wrapper in HadoopFSUtils
[SPARK-43137] [SC-128828][sql] Improve ArrayInsert if the position is foldable and positive.
[SPARK-37829] [SC-128827][sql] Dataframe.joinWith outer-join should return a null value for unmatched row
[SPARK-43042] [SC-128602] [SS] [Connect] Add table() API support for DataStreamReader
[SPARK-43153] [SC-128753][connect] Skip Spark execution when the dataframe is local
[SPARK-43064] [SC-128496][sql] Spark SQL CLI SQL tab should only show once statement once
[SPARK-43126] [SC-128447][sql] Mark two Hive UDF expressions as stateful
[SPARK-43111] [SC-128750][ps][CONNECT][python] Merge nested if statements into single if statements
[SPARK-43113] [SC-128749][sql] Evaluate stream-side variables when generating code for a bound condition
[SPARK-42895] [SC-127258][connect] Improve error messages for stopped Spark sessions
[SPARK-42884] [SC-126662][connect] Add Ammonite REPL integration
[SPARK-43168] [SC-128674][sql] Remove get PhysicalDataType method from Datatype class
[SPARK-43121] [SC-128455][sql] Use BytesWritable.copyBytes instead of manual copy in `HiveInspectors
[SPARK-42916] [SC-128389][sql] JDBCTableCatalog Keeps Char/Varchar meta on the read-side
[SPARK-43050] [SC-128550][sql] Fix construct aggregate expressions by replacing grouping functions
[SPARK-43095] [SC-128549][sql] Avoid Once strategy's idempotence is broken for batch: Infer Filters
[SPARK-43130] [SC-128597][sql] Move InternalType to PhysicalDataType
[SPARK-43105] [SC-128456][connect] Abbreviate Bytes and Strings in proto message
[SPARK-43099] [SC-128596][sql] Use getName instead of getCanonicalName to get builder class name when registering udf to FunctionRegistry
[SPARK-42994] [SC-128586][ml][CONNECT] PyTorch Distributor support Local Mode
[SPARK-42859] Revert “[SC-127935][connect][PS] Basic support for pandas API on Spark Connect”
[SPARK-43021] [SC-128472][sql] CoalesceBucketsInJoin not work when using AQE
[SPARK-43125] [SC-128477][connect] Fix Connect Server Can't Handle Exception With Null Message
[SPARK-43147] [SC-128594] fix flake8 lint for local check
[SPARK-43031] [SC-128360] [SS] [Connect] Enable unit test and doctest for streaming
[SPARK-43039] [LC-67] Support custom fields in the file source _metadata column.
[SPARK-43120] [SC-128407][ss] Add support for tracking pinned blocks memory usage for RocksDB state store
[SPARK-43110] [SC-128381][sql] Move asIntegral to PhysicalDataType
[SPARK-43118] [SC-128398][ss] Remove unnecessary assert for UninterruptibleThread in KafkaMicroBatchStream
[SPARK-43055] [SC-128331][connect][PYTHON] Support duplicated nested field names
[SPARK-42437] [SC-128339][python][CONNECT] PySpark catalog.cacheTable will allow to specify storage level
[SPARK-42985] [SC-128332][connect][PYTHON] Fix createDataFrame to respect the SQL configs
[SPARK-39696] [SC-127830][core] Fix data race in access to TaskMetrics.externalAccums
[SPARK-43103] [SC-128335][sql] Moving Integral to PhysicalDataType
[SPARK-42741] [SC-125547][sql] Do not unwrap casts in binary comparison when literal is null
[SPARK-43057] [SC-127948][connect][PYTHON] Migrate Spark Connect Column errors into error class
[SPARK-42859] [SC-127935][connect][PS] Basic support for pandas API on Spark Connect
[SPARK-43013] [SC-127773][python] Migrate ValueError from DataFrame into PySparkValueError.
[SPARK-43089] [SC-128051][connect] Redact debug string in UI
[SPARK-43028] [SC-128070][sql] Add error class SQL_CONF_NOT_FOUND
[SPARK-42999] [SC-127842][connect] Dataset#foreach, foreachPartition
[SPARK-43066] [SC-127937][sql] Add test for dropDuplicates in JavaDatasetSuite
[SPARK-43075] [SC-127939][connect] Change gRPC to grpcio when it is not installed.
[SPARK-42953] [SC-127809][connect] Typed filter, map, flatMap, mapPartitions
[SPARK-42597] [SC-125506][sql] Support unwrap date type to timestamp type
[SPARK-42931] [SC-127933][ss] Introduce dropDuplicatesWithinWatermark
[SPARK-43073] [SC-127943][connect] Add proto data types constants
[SPARK-43077] [SC-128050][sql] Improve the error message of UNRECOGNIZED_SQL_TYPE
[SPARK-42951] [SC-128030][ss][Connect] DataStreamReader APIs
[SPARK-43049] [SC-127846][sql] Use CLOB instead of VARCHAR(255) for StringType for Oracle JDBC
[SPARK-43018] [SC-127762][sql] Fix bug for INSERT commands with timestamp literals
[SPARK-42855] [SC-127722][sql] Use runtime null checks in TableOutputResolver
[SPARK-43030] [SC-127847][sql] Deduplicate relations with metadata columns
[SPARK-42993] [SC-127829][ml][CONNECT] Make PyTorch Distributor compatible with Spark Connect
[SPARK-43058] [SC-128072][sql] Move Numeric and Fractional to PhysicalDataType
[SPARK-43056] [SC-127946][ss] RocksDB state store commit should continue background work only if its paused
[SPARK-43059] [SC-127947][connect][PYTHON] Migrate TypeError from DataFrame(Reader|Writer) into error class
[SPARK-43071] [SC-128018][sql] Support SELECT DEFAULT with ORDER BY, LIMIT, OFFSET for INSERT source relation
[SPARK-43061] [SC-127956][core][SQL] Introduce PartitionEvaluator for SQL operator execution
[SPARK-43067] [SC-127938][ss] Correct the location of error class resource file in Kafka connector
[SPARK-43019] [SC-127844][sql] Move Ordering to PhysicalDataType
[SPARK-43010] [SC-127759][python] Migrate Column errors into error class
[SPARK-42840] [SC-127782][sql] Change _LEGACY_ERROR_TEMP_2004 error to internal error
[SPARK-43041] [SC-127765][sql] Restore constructors of exceptions for compatibility in connector API
[SPARK-42939] [SC-127761][ss][CONNECT] Core streaming Python API for Spark Connect
[SPARK-42844] [SC-127766][sql] Update the error class _LEGACY_ERROR_TEMP_2008 to INVALID_URL
[SPARK-42316] [SC-127720][sql] Assign name to _LEGACY_ERROR_TEMP_2044
[SPARK-42995] [SC-127723][connect][PYTHON] Migrate Spark Connect DataFrame errors into error class
[SPARK-42983] [SC-127717][connect][PYTHON] Fix createDataFrame to handle 0-dim numpy array properly
[SPARK-42955] [SC-127476][sql] Skip classifyException and wrap AnalysisException for SparkThrowable
[SPARK-42949] [SC-127255][sql] Simplify code for NAAJ
[SPARK-43011] [SC-127577][sql] array_insert should fail with 0 index
[SPARK-42974] [SC-127487][core] Restore Utils.createTempDir to use the ShutdownHookManager and clean up JavaUtils.createTempDir method.
[SPARK-42964] [SC-127585][sql] PosgresDialect '42P07' also means table already exists
[SPARK-42978] [SC-127351][sql] Derby&PG: RENAME cannot qualify a new-table-Name with a schema-Name
[SPARK-37980] [SC-127668][sql] Access row_index via _metadata if possible in tests
[SPARK-42655] [SC-127591][sql] Incorrect ambiguous column reference error
[SPARK-43009] [SC-127596][sql] Parameterized sql() with Any constants
[SPARK-43026] [SC-127590][sql] Apply AQE with non-exchange table cache
[SPARK-42963] [SC-127576][sql] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer
[SPARK-42918] [SC-127357] Generalize handling of metadata attributes in FileSourceStrategy
[SPARK-42806] [SC-127452][spark-42811][CONNECT] Add Catalog support
[SPARK-42997] [SC-127535][sql] TableOutputResolver must use correct column paths in error messages for arrays and maps
[SPARK-43006] [SC-127486][pyspark] Fix typo in StorageLevel eq()
[SPARK-43005] [SC-127485][pyspark] Fix typo in pyspark/pandas/config.py
[SPARK-43004] [SC-127457][core] Fix typo in ResourceRequest.equals()
[SPARK-42907] [SC-126984][connect][PYTHON] Implement Avro functions
[SPARK-42979] [SC-127272][sql] Define literal constructors as keywords
[SPARK-42946] [SC-127252][sql] Redact sensitive data which is nested by variable substitution
[SPARK-42952] [SC-127260][sql] Simplify the parameter of analyzer rule PreprocessTableCreation and DataSourceAnalysis
[SPARK-42683] [LC-75] Automatically rename conflicting metadata columns
[SPARK-42853] [SC-126101][followup] Fix conflicts
[SPARK-42929] [SC-126748][connect] make mapInPandas / mapInArrow support “is_barrier”
[SPARK-42968] [SC-127271][ss] Add option to skip commit coordinator as part of StreamingWrite API for DSv2 sources/sinks
[SPARK-42954] [SC-127261][python][CONNECT] Add YearMonthIntervalType to PySpark and Spark Connect Python Client
[SPARK-41359] [SC-127256][sql] Use PhysicalDataType instead of DataType in UnsafeRow
[SPARK-42873] [SC-127262][sql] Define Spark SQL types as keywords
[SPARK-42808] [SC-126302][core] Avoid getting availableProcessors every time in MapOutputTrackerMaster#getStatistics
[SPARK-42937] [SC-126880][sql] PlanSubqueries should set InSubqueryExec#shouldBroadcast to true
[SPARK-42896] [SC-126729][sql][PYTHON] Make mapInPandas / mapInArrow support barrier mode execution
[SPARK-42874] [SC-126442][sql] Enable new golden file test framework for analysis for all input files
[SPARK-42922] [SC-126850][sql] Move from Random to SecureRandom
[SPARK-42753] [SC-126369] ReusedExchange refers to non-existent nodes
[SPARK-40822] [SC-126274][sql] Stable derived column aliases
[SPARK-42908] [SC-126856][python] Raise RuntimeError when SparkContext is required but not initialized
[SPARK-42779] [SC-126042][sql] Allow V2 writes to indicate advisory shuffle partition size
[SPARK-42914] [SC-126727][python] Reuse transformUnregisteredFunction for DistributedSequenceID.
[SPARK-42878] [SC-126882][connect] The table API in DataFrameReader could also accept options
[SPARK-42927] [SC-126883][core] Change the access scope of o.a.spark.util.Iterators#size to private[util]
[SPARK-42943] [SC-126879][sql] Use LONGTEXT instead of TEXT for StringType for effective length
[SPARK-37677] [SC-126855][core] Unzip could keep file permissions
[SPARK-42891] [13.x][sc-126458][CONNECT][python] Implement CoGrouped Map API
[SPARK-41876] [SC-126849][connect][PYTHON] Implement DataFrame.toLocalIterator
[SPARK-42930] [SC-126761][core][SQL] Change the access scope of ProtobufSerDe related implementations to private[protobuf]
[SPARK-42819] [SC-125879][ss] Add support for setting max_write_buffer_number and write_buffer_size for RocksDB used in streaming
[SPARK-42924] [SC-126737][sql][CONNECT][python] Clarify the comment of parameterized SQL args
[SPARK-42748] [SC-126455][connect] Server-side Artifact Management
[SPARK-42816] [SC-126365][connect] Support Max Message size up to 128MB
[SPARK-42850] [SC-126109][sql] Remove duplicated rule CombineFilters in Optimizer
[SPARK-42662] [SC-126355][connect][PS] Add proto message for pandas API on Spark default index
[SPARK-42720] [SC-126136][ps][SQL] Uses expression for distributed-sequence default index instead of plan
[SPARK-42790] [SC-126174][sql] Abstract the excluded method for better test for JDBC docker tests.
[SPARK-42900] [SC-126473][connect][PYTHON] Fix createDataFrame to respect inference and column names
[SPARK-42917] [SC-126657][sql] Correct getUpdateColumnNullabilityQuery for DerbyDialect
[SPARK-42684] [SC-125157][sql] v2 catalog should not allow column default value by default
[SPARK-42861] [SC-126635][sql] Use private[sql] instead of protected[sql] to avoid generating API doc
[SPARK-42920] [SC-126728][connect][PYTHON] Enable tests for UDF with UDT
[SPARK-42791] [SC-126617][sql] Create a new golden file test framework for analysis
[SPARK-42911] [SC-126652][python] Introduce more basic exceptions
[SPARK-42904] [SC-126634][sql] Char/Varchar Support for JDBC Catalog
[SPARK-42901] [SC-126459][connect][PYTHON] Move StorageLevel into a separate file to avoid potential file recursively imports
[SPARK-42894] [SC-126451][connect] Support cache/persist/unpersist/storageLevel for Spark connect jvm client
[SPARK-42792] [SC-125852][ss] Add support for WRITE_FLUSH_BYTES for RocksDB used in streaming stateful operators
[SPARK-41233] [SC-126441][connect][PYTHON] Add array_prepend to Spark Connect Python client
[SPARK-42681] [SC-125149][sql] Relax ordering constraint for ALTER TABLE ADD|REPLACE column descriptor
[SPARK-42889] [SC-126367][connect][PYTHON] Implement cache, persist, unpersist, and storageLevel
[SPARK-42824] [SC-125985][connect][PYTHON] Provide a clear error message for unsupported JVM attributes
[SPARK-42340] [SC-126131][connect][PYTHON] Implement Grouped Map API
[SPARK-42892] [SC-126454][sql] Move sameType and relevant methods out of DataType
[SPARK-42827] [SC-126126][connect] Support functions#array_prepend for Scala connect client
[SPARK-42823] [SC-125987][sql] spark-sql shell supports multipart namespaces for initialization
[SPARK-42817] [SC-125960][core] Logging the shuffle service name once in ApplicationMaster
[SPARK-42786] [SC-126438][connect] Typed Select
[SPARK-42800] [SC-125868][connect][PYTHON][ml] Implement ml function {array_to_vector, vector_to_array}
[SPARK-42052] [SC-126439][sql] Codegen Support for HiveSimpleUDF
[SPARK-41233] [SC-126110][sql][PYTHON] Add array_prepend function
[SPARK-42864] [SC-126268][ml][3.4] Make IsotonicRegression.PointsAccumulator private
[SPARK-42876] [SC-126281][sql] DataType's physicalDataType should be private[sql]
[SPARK-42101] [SC-125437][sql] Make AQE support InMemoryTableScanExec
[SPARK-41290] [SC-124030][sql] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements
[SPARK-42870] [SC-126220][connect] Move toCatalystValue to connect-common
[SPARK-42247] [SC-126107][connect][PYTHON] Fix UserDefinedFunction to have returnType
[SPARK-42875] [SC-126258][connect][PYTHON] Fix toPandas to handle timezone and map types properly
[SPARK-42757] [SC-125626][connect] Implement textFile for DataFrameReader
[SPARK-42803] [SC-126081][core][SQL][ml] Use getParameterCount function instead of getParameterTypes.length
[SPARK-42833] [SC-126043][sql] Refactor applyExtensions in SparkSession
[SPARK-41765] Revert “[SC-123550][sql] Pull out v1 write metrics…
[SPARK-42848] [SC-126105][connect][PYTHON] Implement DataFrame.registerTempTable
[SPARK-42020] [SC-126103][connect][PYTHON] Support UserDefinedType in Spark Connect
[SPARK-42818] [SC-125861][connect][PYTHON] Implement DataFrameReader/Writer.jdbc
[SPARK-42812] [SC-125867][connect] Add client_type to AddArtifactsRequest protobuf message
[SPARK-42772] [SC-125860][sql] Change the default value of JDBC options about push down to true
[SPARK-42771] [SC-125855][sql] Refactor HiveGenericUDF
[SPARK-25050] [SC-123839][sql] Avro: writing complex unions
[SPARK-42765] [SC-125850][connect][PYTHON] Enable importing pandas_udf from pyspark.sql.connect.functions
[SPARK-42719] [SC-125225][core] MapOutputTracker#getMapLocation should respect spark.shuffle.reduceLocality.enabled
[SPARK-42480] [SC-125173][sql] Improve the performance of drop partitions
[SPARK-42689] [SC-125195][core][SHUFFLE] Allow ShuffleDriverComponent to declare if shuffle data is reliably stored
[SPARK-42726] [SC-125279][connect][PYTHON] Implement DataFrame.mapInArrow
[SPARK-41765] [SC-123550][sql] Pull out v1 write metrics to WriteFiles
[SPARK-41171] [SC-124191][sql] Infer and push down window limit through window if partitionSpec is empty
[SPARK-42686] [SC-125292][core] Defer formatting for debug messages in TaskMemoryManager
[SPARK-42756] [SC-125443][connect][PYTHON] Helper function to convert proto literal to value in Python Client
[SPARK-42793] [SC-125627][connect] connect module requires build_profile_flags
[SPARK-42701] [SC-125192][sql] Add the try_aes_decrypt() function
[SPARK-42679] [SC-125438][connect][PYTHON] createDataFrame doesn't work with non-nullable schema
[SPARK-42733] [SC-125542][connect][Followup] Write without path or table
[SPARK-42777] [SC-125525][sql] Support converting TimestampNTZ catalog stats to plan stats
[SPARK-42770] [SC-125558][connect] Add truncatedTo(ChronoUnit.MICROS) to make SQLImplicitsTestSuite in Java 17 daily test GA task pass
[SPARK-42752] [SC-125550][pyspark][SQL] Make PySpark exceptions printable during initialization
[SPARK-42732] [SC-125544][pyspark][CONNECT] Support spark connect session getActiveSession method
[SPARK-42755] [SC-125442][connect] Factor literal value conversion out to connect-common
[SPARK-42747] [SC-125399][ml] Fix incorrect internal status of LoR and AFT
[SPARK-42740] [SC-125439][sql] Fix the bug that pushdown offset or paging is invalid for some built-in dialect
[SPARK-42745] [SC-125332][sql] Improved AliasAwareOutputExpression works with DSv2
[SPARK-42743] [SC-125330][sql] Support analyze TimestampNTZ columns
[SPARK-42721] [SC-125371][connect] RPC logging interceptor
[SPARK-42691] [SC-125397][connect][PYTHON] Implement Dataset.semanticHash
[SPARK-42688] [SC-124922][connect] Rename Connect proto Request client_id to session_id
[SPARK-42310] [SC-122792][sql] Assign name to _LEGACY_ERROR_TEMP_1289
[SPARK-42685] [SC-125339][core] Optimize Utils.bytesToString routines
[SPARK-42725] [SC-125296][connect][PYTHON] Make LiteralExpression support array params
[SPARK-42702] [SC-125293][spark-42623][SQL] Support parameterized query in subquery and CTE
[SPARK-42697] [SC-125189][webui] Fix /api/v1/applications to return total uptime instead of 0 for the duration field
[SPARK-42733] [SC-125278][connect][PYTHON] Fix DataFrameWriter.save to work without path parameter
[SPARK-42376] [SC-124928][ss] Introduce watermark propagation among operators
[SPARK-42710] [SC-125205][connect][PYTHON] Rename FrameMap proto to MapPartitions
[SPARK-37099] [SC-123542][sql] Introduce the group limit of Window for rank-based filter to optimize top-k computation
[SPARK-42630] [SC-125207][connect][PYTHON] Introduce UnparsedDataType and delay parsing DDL string until SparkConnectClient is available
[SPARK-42690] [SC-125193][connect] Implement CSV/JSON parsing functions for Scala client
[SPARK-42709] [SC-125172][python] Remove the assumption of __file__ being available
[SPARK-42318] [SC-122648][spark-42319][SQL] Assign name to LEGACY_ERROR_TEMP(2123|2125)
[SPARK-42723] [SC-125183][sql] Support parser data type json “timestamp_ltz” as TimestampType
[SPARK-42722] [SC-125175][connect][PYTHON] Python Connect def schema() should not cache the schema
[SPARK-42643] [SC-125152][connect][PYTHON] Register Java (aggregate) user-defined functions
[SPARK-42656] [SC-125177][connect][Followup] Fix the spark-connect script
[SPARK-41516] [SC-123899] [SQL] Allow jdbc dialects to override the query used to create a table
[SPARK-41725] [SC-124396][connect] Eager Execution of DF.sql()
[SPARK-42687] [SC-124896][ss] Better error message for the unsupport pivot operation in Streaming
[SPARK-42676] [SC-124809][ss] Write temp checkpoints for streaming queries to local filesystem even if default FS is set differently
[SPARK-42303] [SC-122644][sql] Assign name to _LEGACY_ERROR_TEMP_1326
[SPARK-42553] [SC-124560][sql] Ensure at least one time unit after “interval”
[SPARK-42649] [SC-124576][core] Remove the standard Apache License header from the top of third-party source files
[SPARK-42611] [SC-124395][sql] Insert char/varchar length checks for inner fields during resolution
[SPARK-42419] [SC-124019][connect][PYTHON] Migrate into error framework for Spark Connect Column API.
[SPARK-42637] [SC-124522][connect] Add SparkSession.stop()
[SPARK-42647] [SC-124647][python] Change alias for numpy deprecated and removed types
[SPARK-42616] [SC-124389][sql] SparkSQLCLIDriver shall only close started hive sessionState
[SPARK-42593] [SC-124405][ps] Deprecate & remove the APIs that will be removed in pandas 2.0.
[SPARK-41870] [SC-124402][connect][PYTHON] Fix createDataFrame to handle duplicated column names
[SPARK-42569] [SC-124379][connect] Throw exceptions for unsupported session API
[SPARK-42631] [SC-124526][connect] Support custom extensions in Scala client
[SPARK-41868] [SC-124387][connect][PYTHON] Fix createDataFrame to support durations
[SPARK-42572] [SC-124171][sql][SS] Fix behavior for StateStoreProvider.validateStateRowFormat

Maintenance updates

See Databricks Runtime 13.1 maintenance updates.

System environment

Operating System: Ubuntu 22.04.2 LTS
Java: Zulu 8.70.0.23-CA-linux64
Scala: 2.12.15
Python: 3.10.12
R: 4.2.2
Delta Lake: 2.4.0

Installed Python libraries

Library	Version	Library	Version	Library	Version
appdirs	1.4.4	argon2-cffi	21.3.0	argon2-cffi-bindings	21.2.0
asttokens	2.2.1	attrs	21.4.0	backcall	0.2.0
beautifulsoup4	4.11.1	black	22.6.0	bleach	4.1.0
blinker	1.4	boto3	1.24.28	botocore	1.27.28
certifi	2022.9.14	cffi	1.15.1	chardet	4.0.0
charset-normalizer	2.0.4	click	8.0.4	cryptography	37.0.1
cycler	0.11.0	Cython	0.29.32	dbus-python	1.2.18
debugpy	1.5.1	decorator	5.1.1	defusedxml	0.7.1
distlib	0.3.6	docstring-to-markdown	0.12	entrypoints	0.4
executing	1.2.0	facets-overview	1.0.3	fastjsonschema	2.16.3
filelock	3.12.0	fonttools	4.25.0	googleapis-common-protos	1.56.4
grpcio	1.48.1	grpcio-status	1.48.1	httplib2	0.20.2
idna	3.3	importlib-metadata	4.6.4	ipykernel	6.17.1
ipython	8.10.0	ipython-genutils	0.2.0	ipywidgets	7.7.2
jedi	0.18.1	jeepney	0.7.1	Jinja2	2.11.3
jmespath	0.10.0	joblib	1.2.0	jsonschema	4.16.0
jupyter-client	7.3.4	jupyter_core	4.11.2	jupyterlab-pygments	0.1.2
jupyterlab-widgets	1.0.0	keyring	23.5.0	kiwisolver	1.4.2
launchpadlib	1.10.16	lazr.restfulclient	0.14.4	lazr.uri	1.0.6
MarkupSafe	2.0.1	matplotlib	3.5.2	matplotlib-inline	0.1.6
mccabe	0.7.0	mistune	0.8.4	more-itertools	8.10.0
mypy-extensions	0.4.3	nbclient	0.5.13	nbconvert	6.4.4
nbformat	5.5.0	nest-asyncio	1.5.5	nodeenv	1.7.0
notebook	6.4.12	numpy	1.21.5	oauthlib	3.2.0
packaging	21.3	pandas	1.4.4	pandocfilters	1.5.0
parso	0.8.3	pathspec	0.9.0	patsy	0.5.2
pexpect	4.8.0	pickleshare	0.7.5	Pillow	9.2.0
pip	22.2.2	platformdirs	2.5.2	plotly	5.9.0
pluggy	1.0.0	prometheus-client	0.14.1	prompt-toolkit	3.0.36
protobuf	3.19.4	psutil	5.9.0	psycopg2	2.9.3
ptyprocess	0.7.0	pure-eval	0.2.2	pyarrow	8.0.0
pycparser	2.21	pydantic	1.10.6	pyflakes	3.0.1
Pygments	2.11.2	PyGObject	3.42.1	PyJWT	2.3.0
pyodbc	4.0.32	pyparsing	3.0.9	pyright	1.1.294
pyrsistent	0.18.0	python-dateutil	2.8.2	python-lsp-jsonrpc	1.0.0
python-lsp-server	1.7.1	pytoolconfig	1.2.2	pytz	2022.1
pyzmq	23.2.0	requests	2.28.1	rope	1.7.0
s3transfer	0.6.0	scikit-learn	1.1.1	scipy	1.9.1
seaborn	0.11.2	SecretStorage	3.3.1	Send2Trash	1.8.0
setuptools	63.4.1	six	1.16.0	soupsieve	2.3.1
ssh-import-id	5.11	stack-data	0.6.2	statsmodels	0.13.2
tenacity	8.1.0	terminado	0.13.1	testpath	0.6.0
threadpoolctl	2.2.0	tokenize-rt	4.2.1	tomli	2.0.1
tornado	6.1	traitlets	5.1.1	typing_extensions	4.3.0
ujson	5.4.0	unattended-upgrades	0.1	urllib3	1.26.11
virtualenv	20.16.3	wadllib	1.3.6	wcwidth	0.2.5
webencodings	0.5.1	whatthepatch	1.0.2	wheel	0.37.1
widgetsnbextension	3.6.1	yapf	0.31.0	zipp	1.0.0

Installed R libraries

R libraries are installed from the Microsoft CRAN snapshot on 2023-02-10.

Library	Version	Library	Version	Library	Version
arrow	10.0.1	askpass	1.1	assertthat	0.2.1
backports	1.4.1	base	4.2.2	base64enc	0.1-3
bit	4.0.5	bit64	4.0.5	blob	1.2.3
boot	1.3-28	brew	1.0-8	brio	1.1.3
broom	1.0.3	bslib	0.4.2	cachem	1.0.6
callr	3.7.3	caret	6.0-93	cellranger	1.1.0
chron	2.3-59	class	7.3-21	cli	3.6.0
clipr	0.8.0	clock	0.6.1	cluster	2.1.4
codetools	0.2-19	colorspace	2.1-0	commonmark	1.8.1
compiler	4.2.2	config	0.3.1	cpp11	0.4.3
crayon	1.5.2	credentials	1.3.2	curl	5.0.0
data.table	1.14.6	datasets	4.2.2	DBI	1.1.3
dbplyr	2.3.0	desc	1.4.2	devtools	2.4.5
diffobj	0.3.5	digest	0.6.31	downlit	0.4.2
dplyr	1.1.0	dtplyr	1.2.2	e1071	1.7-13
ellipsis	0.3.2	evaluate	0.20	fansi	1.0.4
farver	2.1.1	fastmap	1.1.0	fontawesome	0.5.0
forcats	1.0.0	foreach	1.5.2	foreign	0.8-82
forge	0.2.0	fs	1.6.1	future	1.31.0
future.apply	1.10.0	gargle	1.3.0	generics	0.1.3
gert	1.9.2	ggplot2	3.4.0	gh	1.3.1
gitcreds	0.1.2	glmnet	4.1-6	globals	0.16.2
glue	1.6.2	googledrive	2.0.0	googlesheets4	1.0.1
gower	1.0.1	graphics	4.2.2	grDevices	4.2.2
grid	4.2.2	gridExtra	2.3	gsubfn	0.7
gtable	0.3.1	hardhat	1.2.0	haven	2.5.1
highr	0.10	hms	1.1.2	htmltools	0.5.4
htmlwidgets	1.6.1	httpuv	1.6.8	httr	1.4.4
ids	1.0.1	ini	0.3.1	ipred	0.9-13
isoband	0.2.7	iterators	1.0.14	jquerylib	0.1.4
jsonlite	1.8.4	KernSmooth	2.23-20	knitr	1.42
labeling	0.4.2	later	1.3.0	lattice	0.20-45
lava	1.7.1	lifecycle	1.0.3	listenv	0.9.0
lubridate	1.9.1	magrittr	2.0.3	markdown	1.5
MASS	7.3-58.2	Matrix	1.5-1	memoise	2.0.1
methods	4.2.2	mgcv	1.8-41	mime	0.12
miniUI	0.1.1.1	ModelMetrics	1.2.2.2	modelr	0.1.10
munsell	0.5.0	nlme	3.1-162	nnet	7.3-18
numDeriv	2016.8-1.1	openssl	2.0.5	parallel	4.2.2
parallelly	1.34.0	pillar	1.8.1	pkgbuild	1.4.0
pkgconfig	2.0.3	pkgdown	2.0.7	pkgload	1.3.2
plogr	0.2.0	plyr	1.8.8	praise	1.0.0
prettyunits	1.1.1	pROC	1.18.0	processx	3.8.0
prodlim	2019.11.13	profvis	0.3.7	progress	1.2.2
progressr	0.13.0	promises	1.2.0.1	proto	1.0.0
proxy	0.4-27	ps	1.7.2	purrr	1.0.1
r2d3	0.2.6	R6	2.5.1	ragg	1.2.5
randomForest	4.7-1.1	rappdirs	0.3.3	rcmdcheck	1.4.0
RColorBrewer	1.1-3	Rcpp	1.0.10	RcppEigen	0.3.3.9.3
readr	2.1.3	readxl	1.4.2	recipes	1.0.4
rematch	1.0.1	rematch2	2.1.2	remotes	2.4.2
reprex	2.0.2	reshape2	1.4.4	rlang	1.0.6
rmarkdown	2.20	RODBC	1.3-20	roxygen2	7.2.3
rpart	4.1.19	rprojroot	2.0.3	Rserve	1.8-12
RSQLite	2.2.20	rstudioapi	0.14	rversions	2.1.2
rvest	1.0.3	sass	0.4.5	scales	1.2.1
selectr	0.4-2	sessioninfo	1.2.2	shape	1.4.6
shiny	1.7.4	sourcetools	0.1.7-1	sparklyr	1.7.9
SparkR	3.4.0	spatial	7.3-15	splines	4.2.2
sqldf	0.4-11	SQUAREM	2021.1	stats	4.2.2
stats4	4.2.2	stringi	1.7.12	stringr	1.5.0
survival	3.5-3	sys	3.4.1	systemfonts	1.0.4
tcltk	4.2.2	testthat	3.1.6	textshaping	0.3.6
tibble	3.1.8	tidyr	1.3.0	tidyselect	1.2.0
tidyverse	1.3.2	timechange	0.2.0	timeDate	4022.108
tinytex	0.44	tools	4.2.2	tzdb	0.3.0
urlchecker	1.0.1	usethis	2.1.6	utf8	1.2.3
utils	4.2.2	uuid	1.1-0	vctrs	0.5.2
viridisLite	0.4.1	vroom	1.6.1	waldo	0.4.0
whisker	0.4.1	withr	2.5.0	xfun	0.37
xml2	1.3.3	xopen	1.0.0	xtable	1.8-4
yaml	2.3.7	zip	2.2.2

Installed Java and Scala libraries (Scala 2.12 cluster version)

Group ID	Artifact ID	Version
antlr	antlr	2.7.7
com.amazonaws	amazon-kinesis-client	1.12.0
com.amazonaws	aws-java-sdk-autoscaling	1.12.390
com.amazonaws	aws-java-sdk-cloudformation	1.12.390
com.amazonaws	aws-java-sdk-cloudfront	1.12.390
com.amazonaws	aws-java-sdk-cloudhsm	1.12.390
com.amazonaws	aws-java-sdk-cloudsearch	1.12.390
com.amazonaws	aws-java-sdk-cloudtrail	1.12.390
com.amazonaws	aws-java-sdk-cloudwatch	1.12.390
com.amazonaws	aws-java-sdk-cloudwatchmetrics	1.12.390
com.amazonaws	aws-java-sdk-codedeploy	1.12.390
com.amazonaws	aws-java-sdk-cognitoidentity	1.12.390
com.amazonaws	aws-java-sdk-cognitosync	1.12.390
com.amazonaws	aws-java-sdk-config	1.12.390
com.amazonaws	aws-java-sdk-core	1.12.390
com.amazonaws	aws-java-sdk-datapipeline	1.12.390
com.amazonaws	aws-java-sdk-directconnect	1.12.390
com.amazonaws	aws-java-sdk-directory	1.12.390
com.amazonaws	aws-java-sdk-dynamodb	1.12.390
com.amazonaws	aws-java-sdk-ec2	1.12.390
com.amazonaws	aws-java-sdk-ecs	1.12.390
com.amazonaws	aws-java-sdk-efs	1.12.390
com.amazonaws	aws-java-sdk-elasticache	1.12.390
com.amazonaws	aws-java-sdk-elasticbeanstalk	1.12.390
com.amazonaws	aws-java-sdk-elasticloadbalancing	1.12.390
com.amazonaws	aws-java-sdk-elastictranscoder	1.12.390
com.amazonaws	aws-java-sdk-emr	1.12.390
com.amazonaws	aws-java-sdk-glacier	1.12.390
com.amazonaws	aws-java-sdk-glue	1.12.390
com.amazonaws	aws-java-sdk-iam	1.12.390
com.amazonaws	aws-java-sdk-importexport	1.12.390
com.amazonaws	aws-java-sdk-kinesis	1.12.390
com.amazonaws	aws-java-sdk-kms	1.12.390
com.amazonaws	aws-java-sdk-lambda	1.12.390
com.amazonaws	aws-java-sdk-logs	1.12.390
com.amazonaws	aws-java-sdk-machinelearning	1.12.390
com.amazonaws	aws-java-sdk-opsworks	1.12.390
com.amazonaws	aws-java-sdk-rds	1.12.390
com.amazonaws	aws-java-sdk-redshift	1.12.390
com.amazonaws	aws-java-sdk-route53	1.12.390
com.amazonaws	aws-java-sdk-s3	1.12.390
com.amazonaws	aws-java-sdk-ses	1.12.390
com.amazonaws	aws-java-sdk-simpledb	1.12.390
com.amazonaws	aws-java-sdk-simpleworkflow	1.12.390
com.amazonaws	aws-java-sdk-sns	1.12.390
com.amazonaws	aws-java-sdk-sqs	1.12.390
com.amazonaws	aws-java-sdk-ssm	1.12.390
com.amazonaws	aws-java-sdk-storagegateway	1.12.390
com.amazonaws	aws-java-sdk-sts	1.12.390
com.amazonaws	aws-java-sdk-support	1.12.390
com.amazonaws	aws-java-sdk-swf-libraries	1.11.22
com.amazonaws	aws-java-sdk-workspaces	1.12.390
com.amazonaws	jmespath-java	1.12.390
com.clearspring.analytics	stream	2.9.6
com.databricks	Rserve	1.8-3
com.databricks	jets3t	0.7.1-0
com.databricks.scalapb	compilerplugin_2.12	0.4.15-10
com.databricks.scalapb	scalapb-runtime_2.12	0.4.15-10
com.esotericsoftware	kryo-shaded	4.0.2
com.esotericsoftware	minlog	1.3.0
com.fasterxml	classmate	1.3.4
com.fasterxml.jackson.core	jackson-annotations	2.14.2
com.fasterxml.jackson.core	jackson-core	2.14.2
com.fasterxml.jackson.core	jackson-databind	2.14.2
com.fasterxml.jackson.dataformat	jackson-dataformat-cbor	2.14.2
com.fasterxml.jackson.datatype	jackson-datatype-joda	2.14.2
com.fasterxml.jackson.datatype	jackson-datatype-jsr310	2.13.4
com.fasterxml.jackson.module	jackson-module-paranamer	2.14.2
com.fasterxml.jackson.module	jackson-module-scala_2.12	2.14.2
com.github.ben-manes.caffeine	caffeine	2.9.3
com.github.fommil	jniloader	1.1
com.github.fommil.netlib	native_ref-java	1.1
com.github.fommil.netlib	native_ref-java	1.1-natives
com.github.fommil.netlib	native_system-java	1.1
com.github.fommil.netlib	native_system-java	1.1-natives
com.github.fommil.netlib	netlib-native_ref-linux-x86_64	1.1-natives
com.github.fommil.netlib	netlib-native_system-linux-x86_64	1.1-natives
com.github.luben	zstd-jni	1.5.2-5
com.github.wendykierp	JTransforms	3.1
com.google.code.findbugs	jsr305	3.0.0
com.google.code.gson	gson	2.8.9
com.google.crypto.tink	tink	1.7.0
com.google.errorprone	error_prone_annotations	2.10.0
com.google.flatbuffers	flatbuffers-java	1.12.0
com.google.guava	guava	15.0
com.google.protobuf	protobuf-java	2.6.1
com.h2database	h2	2.1.214
com.helger	profiler	1.1.1
com.jcraft	jsch	0.1.55
com.jolbox	bonecp	0.8.0.RELEASE
com.lihaoyi	sourcecode_2.12	0.1.9
com.microsoft.azure	azure-data-lake-store-sdk	2.3.9
com.microsoft.sqlserver	mssql-jdbc	11.2.2.jre8
com.ning	compress-lzf	1.1.2
com.sun.mail	javax.mail	1.5.2
com.sun.xml.bind	jaxb-core	2.2.11
com.sun.xml.bind	jaxb-impl	2.2.11
com.tdunning	json	1.8
com.thoughtworks.paranamer	paranamer	2.8
com.trueaccord.lenses	lenses_2.12	0.4.12
com.twitter	chill-java	0.10.0
com.twitter	chill_2.12	0.10.0
com.twitter	util-app_2.12	7.1.0
com.twitter	util-core_2.12	7.1.0
com.twitter	util-function_2.12	7.1.0
com.twitter	util-jvm_2.12	7.1.0
com.twitter	util-lint_2.12	7.1.0
com.twitter	util-registry_2.12	7.1.0
com.twitter	util-stats_2.12	7.1.0
com.typesafe	config	1.2.1
com.typesafe.scala-logging	scala-logging_2.12	3.7.2
com.uber	h3	3.7.0
com.univocity	univocity-parsers	2.9.1
com.zaxxer	HikariCP	4.0.3
commons-cli	commons-cli	1.5.0
commons-codec	commons-codec	1.15
commons-collections	commons-collections	3.2.2
commons-dbcp	commons-dbcp	1.4
commons-fileupload	commons-fileupload	1.5
commons-httpclient	commons-httpclient	3.1
commons-io	commons-io	2.11.0
commons-lang	commons-lang	2.6
commons-logging	commons-logging	1.1.3
commons-pool	commons-pool	1.5.4
dev.ludovic.netlib	arpack	3.0.3
dev.ludovic.netlib	blas	3.0.3
dev.ludovic.netlib	lapack	3.0.3
info.ganglia.gmetric4j	gmetric4j	1.0.10
io.airlift	aircompressor	0.21
io.delta	delta-sharing-spark_2.12	0.6.4
io.dropwizard.metrics	metrics-core	4.2.10
io.dropwizard.metrics	metrics-graphite	4.2.10
io.dropwizard.metrics	metrics-healthchecks	4.2.10
io.dropwizard.metrics	metrics-jetty9	4.2.10
io.dropwizard.metrics	metrics-jmx	4.2.10
io.dropwizard.metrics	metrics-json	4.2.10
io.dropwizard.metrics	metrics-jvm	4.2.10
io.dropwizard.metrics	metrics-servlets	4.2.10
io.netty	netty-all	4.1.87.Final
io.netty	netty-buffer	4.1.87.Final
io.netty	netty-codec	4.1.87.Final
io.netty	netty-codec-http	4.1.87.Final
io.netty	netty-codec-http2	4.1.87.Final
io.netty	netty-codec-socks	4.1.87.Final
io.netty	netty-common	4.1.87.Final
io.netty	netty-handler	4.1.87.Final
io.netty	netty-handler-proxy	4.1.87.Final
io.netty	netty-resolver	4.1.87.Final
io.netty	netty-transport	4.1.87.Final
io.netty	netty-transport-classes-epoll	4.1.87.Final
io.netty	netty-transport-classes-kqueue	4.1.87.Final
io.netty	netty-transport-native-epoll	4.1.87.Final
io.netty	netty-transport-native-epoll	4.1.87.Final-linux-aarch_64
io.netty	netty-transport-native-epoll	4.1.87.Final-linux-x86_64
io.netty	netty-transport-native-kqueue	4.1.87.Final-osx-aarch_64
io.netty	netty-transport-native-kqueue	4.1.87.Final-osx-x86_64
io.netty	netty-transport-native-unix-common	4.1.87.Final
io.prometheus	simpleclient	0.7.0
io.prometheus	simpleclient_common	0.7.0
io.prometheus	simpleclient_dropwizard	0.7.0
io.prometheus	simpleclient_pushgateway	0.7.0
io.prometheus	simpleclient_servlet	0.7.0
io.prometheus.jmx	collector	0.12.0
jakarta.annotation	jakarta.annotation-api	1.3.5
jakarta.servlet	jakarta.servlet-api	4.0.3
jakarta.validation	jakarta.validation-api	2.0.2
jakarta.ws.rs	jakarta.ws.rs-api	2.1.6
javax.activation	activation	1.1.1
javax.el	javax.el-api	2.2.4
javax.jdo	jdo-api	3.0.1
javax.transaction	jta	1.1
javax.transaction	transaction-api	1.1
javax.xml.bind	jaxb-api	2.2.11
javolution	javolution	5.5.1
jline	jline	2.14.6
joda-time	joda-time	2.12.1
ml.combust.mleap	mleap-databricks-runtime_2.12	v0.20.0-db2
net.java.dev.jna	jna	5.8.0
net.razorvine	pickle	1.3
net.sf.jpam	jpam	1.1
net.sf.opencsv	opencsv	2.3
net.sf.supercsv	super-csv	2.2.0
net.snowflake	snowflake-ingest-sdk	0.9.6
net.snowflake	snowflake-jdbc	3.13.22
net.sourceforge.f2j	arpack_combined_all	0.1
org.acplt.remotetea	remotetea-oncrpc	1.1.2
org.antlr	ST4	4.0.4
org.antlr	antlr-runtime	3.5.2
org.antlr	antlr4-runtime	4.9.3
org.antlr	stringtemplate	3.2.1
org.apache.ant	ant	1.9.16
org.apache.ant	ant-jsch	1.9.16
org.apache.ant	ant-launcher	1.9.16
org.apache.arrow	arrow-format	11.0.0
org.apache.arrow	arrow-memory-core	11.0.0
org.apache.arrow	arrow-memory-netty	11.0.0
org.apache.arrow	arrow-vector	11.0.0
org.apache.avro	avro	1.11.1
org.apache.avro	avro-ipc	1.11.1
org.apache.avro	avro-mapred	1.11.1
org.apache.commons	commons-collections4	4.4
org.apache.commons	commons-compress	1.21
org.apache.commons	commons-crypto	1.1.0
org.apache.commons	commons-lang3	3.12.0
org.apache.commons	commons-math3	3.6.1
org.apache.commons	commons-text	1.10.0
org.apache.curator	curator-client	2.13.0
org.apache.curator	curator-framework	2.13.0
org.apache.curator	curator-recipes	2.13.0
org.apache.datasketches	datasketches-java	3.1.0
org.apache.datasketches	datasketches-memory	2.0.0
org.apache.derby	derby	10.14.2.0
org.apache.hadoop	hadoop-client-runtime	3.3.4
org.apache.hive	hive-beeline	2.3.9
org.apache.hive	hive-cli	2.3.9
org.apache.hive	hive-jdbc	2.3.9
org.apache.hive	hive-llap-client	2.3.9
org.apache.hive	hive-llap-common	2.3.9
org.apache.hive	hive-serde	2.3.9
org.apache.hive	hive-shims	2.3.9
org.apache.hive	hive-storage-api	2.8.1
org.apache.hive.shims	hive-shims-0.23	2.3.9
org.apache.hive.shims	hive-shims-common	2.3.9
org.apache.hive.shims	hive-shims-scheduler	2.3.9
org.apache.httpcomponents	httpclient	4.5.14
org.apache.httpcomponents	httpcore	4.4.16
org.apache.ivy	ivy	2.5.1
org.apache.logging.log4j	log4j-1.2-api	2.19.0
org.apache.logging.log4j	log4j-api	2.19.0
org.apache.logging.log4j	log4j-core	2.19.0
org.apache.logging.log4j	log4j-slf4j2-impl	2.19.0
org.apache.mesos	mesos	1.11.0-shaded-protobuf
org.apache.orc	orc-core	1.8.3-shaded-protobuf
org.apache.orc	orc-mapreduce	1.8.3-shaded-protobuf
org.apache.orc	orc-shims	1.8.3
org.apache.thrift	libfb303	0.9.3
org.apache.thrift	libthrift	0.12.0
org.apache.xbean	xbean-asm9-shaded	4.22
org.apache.yetus	audience-annotations	0.13.0
org.apache.zookeeper	zookeeper	3.6.3
org.apache.zookeeper	zookeeper-jute	3.6.3
org.checkerframework	checker-qual	3.19.0
org.codehaus.jackson	jackson-core-asl	1.9.13
org.codehaus.jackson	jackson-mapper-asl	1.9.13
org.codehaus.janino	commons-compiler	3.0.16
org.codehaus.janino	janino	3.0.16
org.datanucleus	datanucleus-api-jdo	4.2.4
org.datanucleus	datanucleus-core	4.1.17
org.datanucleus	datanucleus-rdbms	4.1.19
org.datanucleus	javax.jdo	3.2.0-m3
org.eclipse.jetty	jetty-client	9.4.50.v20221201
org.eclipse.jetty	jetty-continuation	9.4.50.v20221201
org.eclipse.jetty	jetty-http	9.4.50.v20221201
org.eclipse.jetty	jetty-io	9.4.50.v20221201
org.eclipse.jetty	jetty-jndi	9.4.50.v20221201
org.eclipse.jetty	jetty-plus	9.4.50.v20221201
org.eclipse.jetty	jetty-proxy	9.4.50.v20221201
org.eclipse.jetty	jetty-security	9.4.50.v20221201
org.eclipse.jetty	jetty-server	9.4.50.v20221201
org.eclipse.jetty	jetty-servlet	9.4.50.v20221201
org.eclipse.jetty	jetty-servlets	9.4.50.v20221201
org.eclipse.jetty	jetty-util	9.4.50.v20221201
org.eclipse.jetty	jetty-util-ajax	9.4.50.v20221201
org.eclipse.jetty	jetty-webapp	9.4.50.v20221201
org.eclipse.jetty	jetty-xml	9.4.50.v20221201
org.eclipse.jetty.websocket	websocket-api	9.4.50.v20221201
org.eclipse.jetty.websocket	websocket-client	9.4.50.v20221201
org.eclipse.jetty.websocket	websocket-common	9.4.50.v20221201
org.eclipse.jetty.websocket	websocket-server	9.4.50.v20221201
org.eclipse.jetty.websocket	websocket-servlet	9.4.50.v20221201
org.fusesource.leveldbjni	leveldbjni-all	1.8
org.glassfish.hk2	hk2-api	2.6.1
org.glassfish.hk2	hk2-locator	2.6.1
org.glassfish.hk2	hk2-utils	2.6.1
org.glassfish.hk2	osgi-resource-locator	1.0.3
org.glassfish.hk2.external	aopalliance-repackaged	2.6.1
org.glassfish.hk2.external	jakarta.inject	2.6.1
org.glassfish.jersey.containers	jersey-container-servlet	2.36
org.glassfish.jersey.containers	jersey-container-servlet-core	2.36
org.glassfish.jersey.core	jersey-client	2.36
org.glassfish.jersey.core	jersey-common	2.36
org.glassfish.jersey.core	jersey-server	2.36
org.glassfish.jersey.inject	jersey-hk2	2.36
org.hibernate.validator	hibernate-validator	6.1.7.Final
org.javassist	javassist	3.25.0-GA
org.jboss.logging	jboss-logging	3.3.2.Final
org.jdbi	jdbi	2.63.1
org.jetbrains	annotations	17.0.0
org.joda	joda-convert	1.7
org.jodd	jodd-core	3.5.2
org.json4s	json4s-ast_2.12	3.7.0-M11
org.json4s	json4s-core_2.12	3.7.0-M11
org.json4s	json4s-jackson_2.12	3.7.0-M11
org.json4s	json4s-scalap_2.12	3.7.0-M11
org.lz4	lz4-java	1.8.0
org.mariadb.jdbc	mariadb-java-client	2.7.4
org.mlflow	mlflow-spark	2.2.0
org.objenesis	objenesis	2.5.1
org.postgresql	postgresql	42.3.8
org.roaringbitmap	RoaringBitmap	0.9.39
org.roaringbitmap	shims	0.9.39
org.rocksdb	rocksdbjni	7.8.3
org.rosuda.REngine	REngine	2.1.0
org.scala-lang	scala-compiler_2.12	2.12.15
org.scala-lang	scala-library_2.12	2.12.15
org.scala-lang	scala-reflect_2.12	2.12.15
org.scala-lang.modules	scala-collection-compat_2.12	2.4.3
org.scala-lang.modules	scala-parser-combinators_2.12	1.1.2
org.scala-lang.modules	scala-xml_2.12	1.2.0
org.scala-sbt	test-interface	1.0
org.scalacheck	scalacheck_2.12	1.14.2
org.scalactic	scalactic_2.12	3.2.15
org.scalanlp	breeze-macros_2.12	2.1.0
org.scalanlp	breeze_2.12	2.1.0
org.scalatest	scalatest-compatible	3.2.15
org.scalatest	scalatest-core_2.12	3.2.15
org.scalatest	scalatest-diagrams_2.12	3.2.15
org.scalatest	scalatest-featurespec_2.12	3.2.15
org.scalatest	scalatest-flatspec_2.12	3.2.15
org.scalatest	scalatest-freespec_2.12	3.2.15
org.scalatest	scalatest-funspec_2.12	3.2.15
org.scalatest	scalatest-funsuite_2.12	3.2.15
org.scalatest	scalatest-matchers-core_2.12	3.2.15
org.scalatest	scalatest-mustmatchers_2.12	3.2.15
org.scalatest	scalatest-propspec_2.12	3.2.15
org.scalatest	scalatest-refspec_2.12	3.2.15
org.scalatest	scalatest-shouldmatchers_2.12	3.2.15
org.scalatest	scalatest-wordspec_2.12	3.2.15
org.scalatest	scalatest_2.12	3.2.15
org.slf4j	jcl-over-slf4j	2.0.6
org.slf4j	jul-to-slf4j	2.0.6
org.slf4j	slf4j-api	2.0.6
org.threeten	threeten-extra	1.7.1
org.tukaani	xz	1.9
org.typelevel	algebra_2.12	2.0.1
org.typelevel	cats-kernel_2.12	2.1.1
org.typelevel	spire-macros_2.12	0.17.0
org.typelevel	spire-platform_2.12	0.17.0
org.typelevel	spire-util_2.12	0.17.0
org.typelevel	spire_2.12	0.17.0
org.wildfly.openssl	wildfly-openssl	1.1.3.Final
org.xerial	sqlite-jdbc	3.42.0.0
org.xerial.snappy	snappy-java	1.1.8.4
org.yaml	snakeyaml	1.33
oro	oro	2.0.8
pl.edu.icm	JLargeArrays	1.5
software.amazon.cryptools	AmazonCorrettoCryptoProvider	1.6.1-linux-x86_64
software.amazon.ion	ion-java	1.0.2
stax	stax-api	1.0.1

New features and improvements​

Cluster support for JDK 17 (Public Preview)​

Add, change, or delete data in streaming tables​

Read Kafka with SQL​

New SQL built-in functions​

Unity Catalog support for cluster-scoped Python libraries​

Expanded default enablement for optimized writes in Unity Catalog​

Advanced support for stateful operators in Structured Streaming workloads​

Delta clone for Unity Catalog is in Public Preview​

Pub/Sub support for Structured Streaming​

Drop duplicates within watermarks in Structured Streaming​

Trigger available now is supported for Kinesis data sources​

Expanded support for Delta conversions from Apache Iceberg tables with truncated partition columns​

Stream schema changes with column mapping in Delta Lake​

Remove START VERSION​

New H3 expressions available with Python​

Bug fixes​

Parquet failOnUnknownFields no longer silently drop data on type mismatch​

Breaking changes​

Upgrade sqlite-jdbc version to 3.42.0.0 to address CVE-2023-32697​

Library upgrades​

Apache Spark​

Maintenance updates​

System environment​

Installed Python libraries​

Installed R libraries​

Installed Java and Scala libraries (Scala 2.12 cluster version)​