Databricks Runtime 18.2 (Beta)

The following release notes provide information about Databricks Runtime 18.2 (Beta).

This version incorporates all features, improvements, and bug fixes from all previous Databricks Runtime releases. Databricks released this version in April 2026.

Beta

Databricks Runtime 18.2 is in Beta. The contents of the supported environments might change during the Beta. Changes can include the list of packages or versions of installed packages.

Behavioral changes

XPath no longer fetches external DTDs

When you evaluate XPath over XML, Databricks no longer loads external Document Type Definitions (DTDs) declared in the document. Previously, XPath could fail when the XML contained an external DTD reference that pointed to a malformed URL or an unreachable endpoint. Because DTD validation is separate from XPath evaluation, queries that already succeeded return the same results as before. Queries that previously failed only during external DTD retrieval can now succeed.
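
The same idea can be shown outside Databricks. The sketch below is only an analogy using Python's standard library, whose `xml.etree.ElementTree` parser also does not retrieve external DTDs. It is not the Databricks implementation. The document, the DTD URL, and the `book_titles` helper are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: the DOCTYPE points at a DTD that cannot be reached.
XML_WITH_EXTERNAL_DTD = """<?xml version="1.0"?>
<!DOCTYPE catalog SYSTEM "http://unreachable.invalid/catalog.dtd">
<catalog>
  <book id="b1"><title>Spark Basics</title></book>
  <book id="b2"><title>Delta Deep Dive</title></book>
</catalog>
"""


def book_titles(xml_text: str) -> list[str]:
    """Evaluate a simple XPath expression; the external DTD is never fetched."""
    root = ET.fromstring(xml_text)
    return [node.text for node in root.findall("./book/title")]


titles = book_titles(XML_WITH_EXTERNAL_DTD)
print(titles)
```

Because the path expression depends only on the element tree, the unreachable DTD reference has no effect on the result.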

NULL struct preservation in INSERT, MERGE, and streaming writes with schema evolution

For INSERT, MERGE, and streaming writes that use schema evolution, a NULL struct in the source is now stored as NULL in the target. Previously, that value was incorrectly materialized as a non-null struct with every field set to NULL, while the same operations without schema evolution preserved NULL structs correctly. If your code relied on receiving a non-null struct whose fields were all NULL, update your code to handle a NULL struct instead.
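
Downstream code that reads such a column may need to accept both shapes while older data is still around. The following is a minimal plain-Python sketch, assuming a struct arrives as a dictionary and a NULL struct arrives as `None`. The field names and the `city_or_default` helper are hypothetical.

```python
from typing import Optional


def city_or_default(address: Optional[dict], default: str = "unknown") -> str:
    """Return the city, treating a NULL struct and a NULL field the same way."""
    if address is None:
        # New behavior: the whole struct is NULL.
        return default
    # Old behavior: the struct exists but every field is NULL.
    city = address.get("city")
    return default if city is None else city


null_struct = None
all_null_fields = {"street": None, "city": None}
populated = {"street": "1 Main St", "city": "Oslo"}

results = [city_or_default(v) for v in (null_struct, all_null_fields, populated)]
print(results)
```

Checking for `None` before reading any field keeps the helper correct for rows written before and after the change.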

NullType (VOID) support in Delta tables

Delta tables now support VOID (Spark's NullType) columns. VOID columns are no longer dropped from the table schema upon reads. Writes are unaffected. See VOID type for restrictions on where VOID columns can appear in the schema.

SHOW CREATE TABLE supports metric views

SHOW CREATE TABLE now supports metric views. Previously, running this command on a metric view raised an error. The output for metric views includes the fully qualified three-part name with catalog (for example, CREATE VIEW catalog.db.my_metric_view ...), making it easier to recreate the metric view in the correct location.

Fix for LEFT OUTER JOIN LATERAL dropping rows

A bug that incorrectly dropped rows from LEFT OUTER JOIN LATERAL queries is now fixed. Queries using this construct now return the correct results. To temporarily revert to the previous behavior, set spark.databricks.sql.optimizer.lateralJoinPreserveOuterSemantic to true.
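
As a reminder of the expected semantics, every row on the left side appears at least once in the result, padded with NULLs when the lateral subquery returns nothing. The sketch below models that rule in plain Python. It does not reproduce the specific bug, and the function name, tables, and columns are hypothetical.

```python
from typing import Callable, Iterable


def left_outer_join_lateral(
    left: Iterable[dict],
    lateral: Callable[[dict], list[dict]],
    right_columns: list[str],
) -> list[dict]:
    """Join each left row with the rows produced by its lateral subquery.

    A left row whose subquery returns no rows is kept and padded with None.
    """
    out = []
    for row in left:
        matches = lateral(row)
        if not matches:
            matches = [{col: None for col in right_columns}]
        for match in matches:
            out.append({**row, **match})
    return out


# Hypothetical data: customer 2 has no orders.
orders = {1: [{"item": "pen"}, {"item": "ink"}], 2: []}
customers = [{"customer_id": 1}, {"customer_id": 2}]

joined = left_outer_join_lateral(
    customers,
    lambda c: orders.get(c["customer_id"], []),
    right_columns=["item"],
)
print(joined)
```

Customer 2 still appears in the output with `item` set to `None`, which is the outer-join guarantee.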

NATURAL JOIN respects case-insensitive column matching

NATURAL JOIN now correctly uses case-insensitive column matching when spark.sql.caseSensitive is set to false (the default). Previously, NATURAL JOIN identified common columns with a case-sensitive comparison, so columns that differed only in case (for example, ID versus id) were not recognized as matching, and the join silently produced cross-join results. This fix aligns NATURAL JOIN with USING joins, which already handled case-insensitivity correctly. Queries affected by this bug now return correctly joined results.
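
The difference comes down to how the set of common columns is computed. The following plain-Python sketch illustrates the matching rule only, not Spark's analyzer code. The `common_columns` helper and the column lists are hypothetical.

```python
def common_columns(left: list[str], right: list[str], case_sensitive: bool) -> list[str]:
    """Return the left-side columns that also exist on the right side."""
    normalize = (lambda name: name) if case_sensitive else str.lower
    right_names = {normalize(name) for name in right}
    return [name for name in left if normalize(name) in right_names]


left_cols = ["ID", "name"]
right_cols = ["id", "amount"]

# Case-sensitive matching finds no shared column, so the join has no keys.
old_match = common_columns(left_cols, right_cols, case_sensitive=True)
# Case-insensitive matching pairs ID with id.
new_match = common_columns(left_cols, right_cols, case_sensitive=False)

print(old_match)
print(new_match)
```

With an empty list of common columns there is no join condition, which is why the earlier behavior degenerated into a cross join.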

SQL UDF dependency validation in Unity Catalog

Unity Catalog now enforces dependency validation for SQL user-defined functions (UDFs) to prevent access control bypass. Previously, SQL functions created through the REST API could reference dependencies the user did not have access to. SQL UDFs with invalid dependency configurations are now blocked from execution.

Optimized writes for partitioned Unity Catalog tables created with CRTAS

Optimized writes are now correctly applied to partitioned Unity Catalog tables created with CREATE OR REPLACE TABLE ... AS SELECT (CRTAS). Previously, CRTAS on new partitioned Unity Catalog tables did not apply optimized writes, resulting in a higher number of small files per partition. This fix can increase write latency. To revert to the previous behavior, set spark.databricks.delta.optimizeWrite.UCTableCRTAS.enabled to false.

AWS SDK v1 dependencies are shaded

AWS SDK v1 dependencies bundled with Databricks Runtime are now shaded and no longer directly available on the classpath. If your code depends on AWS SDK v1 libraries previously provided by Databricks Runtime, add them as explicit dependencies in your project. This change prepares for the migration to AWS SDK v2, following the end of AWS support for SDK v1.

Fix incorrect EPSG authority for ESRI-defined SRID 102100

The Coordinate Reference System (CRS) mapping for SRID 102100 now correctly uses ESRI:102100 instead of the incorrect EPSG:102100. This fix ensures geospatial data is stored with the correct authority for better interoperability with other systems.

New features and improvements

CREATE OR REPLACE support for temporary tables

CREATE OR REPLACE TEMP TABLE syntax is now supported, allowing you to create or replace temporary tables in a single statement. This eliminates the need to explicitly drop and recreate temporary tables.

agg() alias for measure() function

agg() is now available as an alias for the measure() function. This change is fully backward compatible. Existing queries that use measure() continue to work without modification, and agg() produces identical results when used with the same arguments.

Snowflake JDBC driver upgrade

The Snowflake JDBC driver is upgraded from 3.22.0 to 3.28.0.

pyspark.pipelines.testing namespace alias

pyspark.pipelines.testing is now available as a convenience alias for the dlt.testing APIs. You can import the Lakeflow Spark Declarative Pipelines testing utilities through either namespace.

Improved Auto Loader listing performance

Auto Loader now uses a more efficient listing method that improves listing speed for cloud storage sources. If your stream triggers overlap because of long-running listing operations, this optimization can increase cloud listing API costs. If you observe a cost increase, monitor your trigger intervals and adjust scheduling so that listing operations do not overlap.

Delta table history includes write option flags

Delta table history (DESCRIBE HISTORY) now includes write option flags in the operationParameters column for WRITE and REPLACE TABLE operations. When the following options are explicitly enabled, they appear as boolean flags in the history (only included when true):

For WRITE and REPLACE TABLE operations:

  • isDynamicPartitionOverwrite: present when dynamic partition overwrite mode was used
  • canOverwriteSchema: present when schema overwrite (overwriteSchema) was enabled
  • canMergeSchema: present when schema merge (mergeSchema) was enabled

For REPLACE TABLE operations:

  • predicate: present when replaceWhere was used
  • isV1WriterSaveAsTableOverwrite: present when the replace was triggered by a .saveAsTable overwrite
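
Once the history rows are collected, the flags can be read from operationParameters like any other map entry. The sketch below is plain Python over a made-up history entry. It assumes the map values arrive as strings such as "true", which is worth verifying against your own DESCRIBE HISTORY output. The `enabled_write_flags` helper is hypothetical.

```python
# Boolean flags named in the list above (predicate is not a boolean, so it is excluded).
FLAG_KEYS = (
    "isDynamicPartitionOverwrite",
    "canOverwriteSchema",
    "canMergeSchema",
    "isV1WriterSaveAsTableOverwrite",
)


def enabled_write_flags(operation_parameters: dict) -> list[str]:
    """List the write option flags recorded as true for one history entry."""
    return [
        key
        for key in FLAG_KEYS
        # Assumption: values are strings; str() also tolerates real booleans.
        if str(operation_parameters.get(key, "false")).lower() == "true"
    ]


# Hypothetical history entry, shaped like one row of DESCRIBE HISTORY output.
sample_entry = {
    "operation": "WRITE",
    "operationParameters": {"mode": "Overwrite", "canOverwriteSchema": "true"},
}

flags = enabled_write_flags(sample_entry["operationParameters"])
print(flags)
```

Because the flags are only recorded when true, a missing key is treated the same as false.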

Structured Streaming rewind and replay support

Structured Streaming now supports rewind and replay for streaming pipelines. This feature enables reprocessing from an earlier point in the stream to recover from failures such as schema changes, malformed input data, or logic errors, without requiring a full state reset. This doesn't modify the default behavior of existing streaming workloads.

Library upgrades

  • Upgraded Python libraries:

    No libraries were upgraded in this version.

  • Upgraded R libraries:

    No libraries were upgraded in this version.

  • Upgraded Java libraries:

    • io.delta.delta-sharing-client_2.13 from 1.3.9 to 1.3.10

Apache Spark

Databricks Runtime 18.2 includes Apache Spark 4.1.0. This release includes all Spark fixes and improvements included in Databricks Runtime 18.1, as well as the following additional bug fixes and improvements made to Spark:

  • SPARK-56219 Revert "[SC-225028][PS] Align groupby idxmax and idxmin skipna=False behavior with pandas 2/3"
  • SPARK-56204 Strip Alias wrappers from inline table row expressions in parser
  • SPARK-56186 Retire pypy
  • SPARK-56202 Refactor streaming join tests: split Base/Suite hierarchy and simplify mode dispatch
  • SPARK-56221 Feature parity between spark.catalog.* vs DDL commands
  • SPARK-56301 Fix typos in error-conditions.json
  • SPARK-55729 Support state data source reader for new state format v4 on stream-stream join
  • SPARK-56256 Add emptyDataFrame API to SparkSession
  • SPARK-56205 Validate base state store checkpoint ID before committing microbatch
  • SPARK-55827 Fix type hint for datasource workers
  • SPARK-55579 Rename PySpark error classes to be eval-type-agnostic
  • SPARK-56247 Fix the fall back behavior and type hint of inheritable_thread_target
  • SPARK-56244 Refine benchmark class layout in bench_eval_type.py
  • SPARK-56262 Remove the unnecessary mypy check disable for types
  • SPARK-55969 regr_r2 should treat first param as dependent variable
  • SPARK-56179 Consolidate error classes for type mismatch - part 3
  • SPARK-55630 Skip updating matched flag for non-outer side in stream-stream join v4
  • SPARK-56217 Fix bucketBy exceptions in connect
  • SPARK-56225 Improve View WITH SCHEMA EVOLUTION error message
  • SPARK-55865 Rename _LEGACY_ERROR_TEMP_1266 to CANNOT_TRUNCATE_EXTERNAL_TABLE
  • SPARK-55861 Rename _LEGACY_ERROR_TEMP_2045 to UNSUPPORTED_TABLE_CHANGE
  • SPARK-56166 Use ArrowBatchTransformer.enforce_schema to replace column-wise type coercion logic
  • SPARK-56245 Fix DataFrame.eval inplace assignment on pandas 3
  • SPARK-56062 Isolate memory_profiler to improve import time
  • SPARK-55964 system catalog wins over user catalog for BUILTIN, and SESSION schemas.
  • SPARK-56226 Catch analysis errors before InternalFrame.__init__ in .loc
  • SPARK-55723 Generalize enforce_schema error to PySparkTypeError
  • SPARK-54878 Add sortKeys option to to_json function
  • SPARK-56219 Align groupby idxmax and idxmin skipna=False behavior with pandas 2/3
  • SPARK-44065 Optimize BroadcastHashJoin skew in OptimizeSkewedJoin
  • SPARK-56179 Revert "[SC-225014][PYTHON] Consolidate error classes for type mismatch - part 3"
  • SPARK-53399 Merge Python UDFs
  • SPARK-56224 Polish type annotations for accumulators.py
  • SPARK-55448 Fix query events loss when session closes during query execution
  • SPARK-55862 Rename _LEGACY_ERROR_TEMP_2027 to UNEXPECTED_OPERATOR_IN_CORRELATED_SUBQUERY
  • SPARK-56201 Run SPARK-49829 tests with VCF joins now that StateDataSource supports it
  • SPARK-56179 Consolidate error classes for type mismatch - part 3
  • SPARK-56184 Replace assert with proper SparkRuntimeException in partition column parsing
  • SPARK-56206 Fix case-insensitive duplicate CTE name detection
  • SPARK-55866 Rename _LEGACY_ERROR_TEMP_2145 to OPTION_VALUE_EXCEEDS_ONE_CHARACTER
  • SPARK-56067 Lazy import psutil to improve import speed
  • SPARK-56066 Lazy import numpy to improve import speed
  • SPARK-55719 Remove deprecation warning for spark.sql.hive.convertCTAS
  • SPARK-56179 Consolidate error classes for type mismatch - part 2
  • SPARK-55510 Update structured-streaming-state-data-source.md doc to reflect deleteRange
  • SPARK-56050 Eagerly resolve IDENTIFIER() with string literals at parse time
  • SPARK-56151 Improve CreateVariable display string
  • SPARK-55751 Add metrics on state store loads from DFS
  • SPARK-56188 Align Series.map() with pandas 3 empty-dict behavior
  • SPARK-55964 Revert "[SC-223957] system catalog wins over user catalog for BUILTIN, and SESSION schemas."
  • SPARK-55577 Refactor SQL_SCALAR_ARROW_ITER_UDF wrapper, mapper, and serializer logic
  • SPARK-55596 DSV2 Enhanced Partition Stats Filtering
  • SPARK-56179 Revert "[SC-224777][PYTHON] Consolidate error classes for type mismatch - part 2"
  • SPARK-55964 system catalog wins over user catalog for BUILTIN, and SESSION schemas.
  • SPARK-56050 Revert "[SC-224153][SQL] Eagerly resolve IDENTIFIER() with string literals at parse time"
  • SPARK-56179 Consolidate error classes for type mismatch - part 2
  • SPARK-56102 UnionEstimation code cleanup
  • SPARK-51712 Swallow non-fatal Throwables when resolving tables/views in spark.catalog.listTables()
  • SPARK-55881 Add queryId, errorMessage, and rootExecutionId to SQL execution REST API
  • SPARK-56050 Eagerly resolve IDENTIFIER() with string literals at parse time
  • SPARK-55628 Integrate stream-stream join state format V4
  • SPARK-56187 Fix Series.argsort null ordering for pandas 3
  • SPARK-56167 Align astype with pandas 3 default string behavior
  • SPARK-56018 Use ruff as formatter
  • SPARK-56042 Fix swapped external/internal col family count metrics in RocksDBStateStoreProvider
  • SPARK-56179 Consolidate error classes for type mismatch - part 1
  • SPARK-56089 Align asinh/acosh with fdlibm algorithm for cross-engine compatibility
  • SPARK-55453 Fix LIKE pattern matching for supplementary Unicode characters
  • SPARK-52785 Simplifying super() syntax in PySpark
  • SPARK-56169 Fix ClassCastException in error reporting when GetStructField child type is changed by plan transformation
  • SPARK-55557 Hyperbolic functions should not overflow with large inputs
  • SPARK-47997 Add errors parameter to DataFrame.drop and Series.drop
  • SPARK-55008 Display Query ID in SparkUI
  • SPARK-54660 Add RTM trigger to python
  • SPARK-56047 Propagate distinctCount through Union in CBO statistics estimation
  • SPARK-56111 Add SparkContext.isDriver() and use it across the codebase
  • SPARK-55999 Enable forceSnapshotUploadOnLag by default
  • SPARK-55610 Add getExecutorInfos to StatusTracker in Python
  • SPARK-55728 Introduce conf for file checksum threadpool size and support disabling the threadpool
  • SPARK-55686 SizeEstimator takes care of Compact Object Headers
  • SPARK-56044 HistoryServerDiskManager does not delete app store on release when app is not in active map
  • SPARK-55809 HeapHistogram uses DiagnosticCommandMBean instead of jmap subprocess
  • SPARK-56122 Use pandas-aware numeric dtype check in Series.cov
  • SPARK-56113 Improve pandas 3 string restoration in pandas-on-Spark
  • SPARK-56118 Match pandas 3.0 bool handling in GroupBy.quantile
  • SPARK-53823 Implement allow list for real time mode
  • SPARK-55977 Fix isin() to use strict type matching like pandas
  • SPARK-54027 Kafka Source RTM support
  • SPARK-50284 Change docs for parseJson function
  • SPARK-56035 Introduce AggregationValidator for single-pass resolver Aggregate validation
  • SPARK-55557 Revert "[SC-223720][SQL] Hyperbolic functions should not overflow with large inputs"
  • SPARK-56075 Remove a batch of dead python error classes
  • SPARK-55967 Unify column conversion for connect dataframe
  • SPARK-53915 Add RealTimeScanExec and ability to execute long running batches
  • SPARK-55557 Hyperbolic functions should not overflow with large inputs
  • SPARK-55147 Scope timestamp range for time-interval join retrieval in V4 state format
  • SPARK-56056 Support simpler worker profiling with viztracer
  • SPARK-55948 Add DSv2 CDC connector API, analyzer resolution, and SQL CHANGES clause
  • SPARK-54599 Reapply "[SC-219008][PYTHON] Refactor PythonExcept…
  • SPARK-55390 Consolidate SQL_SCALAR_ARROW_UDF wrapper, mapper, and serializer logic
  • SPARK-56023 Better load balance in LowLatencyMemoryStream
  • SPARK-55986 Upgrade black to 26.3.1
  • SPARK-55667 Move check_dependencies to init
  • SPARK-55145 Support Avro for timestamp based RocksDB state key encoders
  • SPARK-53970 Remove incorrect 'optional' tag for messageName…
  • SPARK-55059 Revert "[SC-224058][PYTHON] Remove empty table workaround in toPandas"
  • SPARK-50111 Add subplots support for pie charts in Plotly backend
  • SPARK-56081 Align idxmax and idxmin NA handling with pandas 3
  • SPARK-56080 Align Series.argmax/argmin with pandas 3.0 NA handling
  • SPARK-56060 Handle pandas 3 null string conversion in describe() for empty timestamp frames
  • SPARK-55059 Remove empty table workaround in toPandas
  • SPARK-55995 Support TIMESTAMP WITH LOCAL TIME ZONE in SQL syntax
  • SPARK-55976 Use Set instead of Seq for write privileges
  • SPARK-56073 Simplify the build of PythonRunnerConfMap
  • SPARK-55887 Special handling for CollectLimitExec/CollectTailExec to avoid full table scans
  • SPARK-55980 Always apply _cast_back_float in numeric arithmetic
  • SPARK-55357 Fix docstring for timestamp_add
  • SPARK-55667 Revert "[SC-223289][PYTHON][CONNECT] Move check_dependencies to init"
  • SPARK-54285 Revert "[PYTHON] Cache timezone info to avoid expensive timestamp conversion"
  • SPARK-56021 Increase AutoSnapshotRepair default maxChangeFileReplay threshold from 50 to 500
  • SPARK-55870 Add docs for Geo types
  • SPARK-55962 Use getShort instead of getInt casting in putShortsFromIntsLittleEndian on Little Endian platforms
  • SPARK-55903 Simplify MERGE Schema Evolution and Check Write Privileges
  • SPARK-55326 Release remote session when SPARK_CONNECT_RELEASE_SESSION_ON_EXIT is set
  • SPARK-55667 Move check_dependencies to init
  • SPARK-55884 Add v1StatsToV2Stats to DataSourceV2Relation
  • SPARK-55929 Add missing toString() to TableChange.UpdateColumnDefaultValue
  • SPARK-55851 Clarify types of datasource partition and read
  • SPARK-55828 Add DSV2 TableChange toString and fix missing error class for Merge Into Schema Evolution
  • SPARK-55790 Build a complete SRS registry using PROJ 9.7.1 data
  • SPARK-55645 Add serdeName to CatalogStorageFormat
  • SPARK-54796 Fix NPE caused by race condition between Executor initialization and shuffle migration
  • SPARK-55983 New single-pass analyzer functionality and bugfixes
  • SPARK-55964 Cache coherence: clear function registry on DROP DATABASE
  • SPARK-55868 Fix Predicate Pushdown for InMemoryTable for V2Filters
  • SPARK-55973 LeftSemi optimization for stream-stream join
  • SPARK-54665 Fix boolean vs string comparison to match pandas behavior
  • SPARK-55539 Allow casting from GeographyType to GeometryType
  • SPARK-55695 Avoid double planning in row-level operations
  • SPARK-55904 Utilize _check_same_session to narrow down types
  • SPARK-55965 Add warning when pandas >= 3.0.0 is used with PySpark
  • SPARK-55493 Do not mkdirs in streaming checkpoint offset/commit log directory in StateDataSource
  • SPARK-55851 Revert "[SC-223270][PYTHON] Clarify types of datasource partition and read"
  • SPARK-55645 Revert "[SC-221839][SQL] Add serdeName to CatalogStorageFormat"
  • SPARK-55640 Propagate WKB parsing errors for Geometry and Geography
  • SPARK-55693 Avoid deadlock by making SparkSession.observationManager a non-lazy val
  • SPARK-55528 Add default collation support for SQL UDFs
  • SPARK-55860 Use UNABLE_TO_INFER_SCHEMA instead of UNABLE_TO_INFER_SCHEMA_FOR_DATA_SOURCE
  • SPARK-55275 Add InvalidPlanInput sql states for sql/connect
  • SPARK-55645 Add serdeName to CatalogStorageFormat
  • SPARK-55716 Support NOT NULL constraint enforcement for V1 file source table inserts
  • SPARK-53226 Make ClosureCleaner work with Java22+
  • SPARK-55997 Set upper bound to prefixScan in RocksDB state store provider
  • SPARK-55851 Clarify types of datasource partition and read
  • SPARK-55954 Remove the incorrect overload type hint for fillna
  • SPARK-56016 Preserve named Series columns in concat with ignore_index on pandas 3
  • SPARK-55502 Unify UDF and UDTF Arrow conversion error handling
  • SPARK-55989 Preserve non-int64 index dtypes in restore_index
  • SPARK-55955 Remove overload type hint for drop
  • SPARK-55945 Support structured identifiers for flows in SDP eager analysis protos
  • SPARK-55714 JDK might throw ArithmeticException without message
  • SPARK-55991 Fix unicode related SQL text corruption with parameters
  • SPARK-55696 Add explicit error to Encoders.bean for interface class
  • SPARK-55533 Support IGNORE NULLS / RESPECT NULLS for collect_set
  • SPARK-55987 Fix V4 windowed join timestamp extraction using findJoinKeyOrdinalForWatermark
  • SPARK-55946 Set up pandas_priority so mixed binary ops dispatch correctly to pandas-on-Spark
  • SPARK-55264 Add ExecuteOutput command to Spark Connect pipelines proto
  • SPARK-47672 Avoid double eval from filter pushDown w/ projection pushdown
  • SPARK-55780 Replace PNG logo with SVG in Spark Web UI
  • SPARK-55821 Enforce keyword-only arguments in serializer init methods
  • SPARK-55621 Fix ambiguous and unnecessary unicode usage
  • SPARK-55662 Implementation of idxmin Axis argument
  • SPARK-55631 ALTER TABLE must invalidate cache for DSv2 tables
  • SPARK-55692 Fix SupportsRuntimeFiltering and SupportsRuntimeV2Filtering documentation
  • SPARK-55928 New linter for config effectiveness in views and UDFs
  • SPARK-55440 Types Framework - Phase 1a - Core Type System Foundation
  • SPARK-55631 Revert "[SC-221596][SQL] ALTER TABLE must invalidate cache for DSv2 tables"
  • SPARK-55631 ALTER TABLE must invalidate cache for DSv2 tables
  • SPARK-55683 Optimize VectorizedPlainValuesReader.readUnsignedLongs
  • SPARK-55892 Fix unable to load state store because reused SST file was deleted by maintenance
  • SPARK-55946 Revert "[SC-223027][PS] Set up pandas_priority so mixed binary ops dispatch correctly to pandas-on-Spark"
  • SPARK-55891 Preserve the SQL scripting context inside EXECUTE IMMEDIATE
  • SPARK-55907 Fix incorrect error positions for invalid data types in CREATE FUNCTION
  • SPARK-55946 Set up pandas_priority so mixed binary ops dispatch correctly to pandas-on-Spark
  • SPARK-55694 Block constraints in CTAS/RTAS at parser level
  • SPARK-55682 ServiceLoader returned iterator may throw NoClassDefFoundError on hasNext()
  • SPARK-55155 Fix SET CATALOG to use special chars and backticks in the identifier name
  • SPARK-55932 Fix XML to variant parser hang on negative scale
  • SPARK-55673 Add more tests for nested type encoder
  • SPARK-55679 Fix detecting sun.io.serialization.extendedDebugInfo on Java 25
  • SPARK-55957 Add 'DATA_SOURCE_NOT_FOUND' in Catalog.ERROR_HANDLING_RULES
  • SPARK-55052 Add AQEShuffleRead properties to Physical Plan Tree
  • SPARK-55652 Optimize VectorizedPlainValuesReader.readShorts() with direct array access for heap buffers
  • SPARK-55659 Improve EventLogFileWriter to log stop operation
  • SPARK-54666 Leave numeric types unchanged on to_numeric
  • SPARK-55654 Enable TreePattern pruning for EliminateSubqueryAliases and ResolveInlineTables
  • SPARK-55533 Revert "[SC-220538][SQL] Support IGNORE NULLS / RESPECT NULLS for collect_set"
  • SPARK-55901 Raise an error from Series.replace() with no arguments
  • SPARK-55896 Use numpy functions instead of builtins
  • SPARK-55655 Make CountVectorizer vocabulary deterministic when counts are equal
  • SPARK-55811 Catch NonFatal instead of UnresolvedException when calling nodeWithOutputColumnsString
  • SPARK-55533 Support IGNORE NULLS / RESPECT NULLS for collect_set
  • SPARK-55435 Use StringBuilder instead of StringBuffer
  • SPARK-54807 Allow qualified names for built-in and session functions
  • SPARK-55854 Tag pass-through duplicate attributes in Expand output to prevent AMBIGUOUS_REFERENCE
  • SPARK-55261 Implement Parquet read support for Geo types
  • SPARK-55416 Streaming Python Data Source memory leak when end-offset is not updated
  • SPARK-55465 Support GeometryType in convert_numpy
  • SPARK-55801 Fix type hint of _SimpleStreamReaderWrapper.getCache
  • SPARK-55800 Remove the unused type check for datetime.date
  • SPARK-55663 Unify module for data source functions
  • SPARK-55665 Unify how workers establish connection with the executor
  • SPARK-53446 Optimize BlockManager remove operations with cached block mappings
  • SPARK-55867 Fix StringMethods with pandas 3
  • SPARK-55501 Fix listagg distinct + within group order by bug
  • SPARK-55558 Add support for Tuple/Theta set operations
  • SPARK-55636 Add detailed errors in case of deduplication of invalid columns
  • SPARK-55788 Support ExtensionDType for integers in Pandas UDF
  • SPARK-55464 Support GeographyType in convert_numpy
  • SPARK-55530 Support Geo result sets in Hive and Thrift server
  • SPARK-55525 Fix UDTF_ARROW_TYPE_CONVERSION_ERROR with undefined error message parameter
  • SPARK-55626 Don't load metadata columns on Table unless needed in V2TableUtil
  • SPARK-55533 Revert "[SC-220538][SQL] Support IGNORE NULLS / RESPECT NULLS for collect_set"
  • SPARK-55435 Revert "[SC-219656][CORE][SQL] Use StringBuilder instead of StringBuffer"
  • SPARK-55533 Support IGNORE NULLS / RESPECT NULLS for collect_set
  • SPARK-54452 Fix empty response from SparkConnect server for spark.sql(...) inside FlowFunction
  • SPARK-55638 Refactor WKT serialization in GeometryModel
  • SPARK-55551 Improve BroadcastHashJoinExec output partitioning
  • SPARK-54314 Improve Server-Side debuggability in Spark Connect by capturing client application's file name and line numbers
  • SPARK-55517 Optimize VectorizedPlainValuesReader.readBytes() with direct array access for heap buffers
  • SPARK-55495 Fix EventLogFileWriters.closeWriter to handle checkError
  • SPARK-55279 Add sketch_funcs group for DataSketches SQL functions
  • SPARK-55435 Use StringBuilder instead of StringBuffer
  • SPARK-55064 Support query level indeterminate shuffle retry
  • SPARK-55411 SPJ may throw ArrayIndexOutOfBoundsException when join keys are less than cluster keys
  • SPARK-55451 Cursors must start collecting results on OPEN, not first FETCH
  • SPARK-54687 Add more edge cases with generators
  • SPARK-55691 GetStatus client
  • SPARK-55277 Add protobuf_funcs group for Protobuf SQL functions
  • SPARK-55822 Rename _LEGACY_ERROR_TEMP_0052 to CREATE_VIEW_WITH_IF_NOT_EXISTS_AND_REPLACE
  • SPARK-55236 Address unexpected exception in some CoarseGrainedExecutorBackendSuite test cases
  • SPARK-55275 SQL State Coverage: IllegalStateException
  • SPARK-55462 Reapply "[SC-221123][PYTHON] Support VariantType in convert_numpy"
  • SPARK-55062 Support proto2 extensions in protobuf functions
  • SPARK-55248 Clean up Jackson deprecated API usage in streaming.checkpointing.Checksum
  • SPARK-55250 Reduce Hive client calls on CREATE NAMESPACE
  • SPARK-55247 Clean up deprecated API usage related to o.a.c.io.input.BoundedInputStream
  • SPARK-55198 spark-sql should skip comment line with leading whitespaces
  • SPARK-55826 Rename _LEGACY_ERROR_TEMP_0006 to MERGE_INSERT_VALUE_COUNT_MISMATCH
  • SPARK-55127 Add avro_funcs group for Avro SQL functions
  • SPARK-54914 Fixing DROP operator in pipe syntax to support qualified column names
  • SPARK-55113 EnsureRequirements should copy tags
  • SPARK-55074 Add test for Merge Into ANSI type coercion
  • SPARK-54217 Synchronize PythonRunner's MonitorThread kill decision
  • SPARK-54374 Enlarge the SVG viewBox attribute of SQL plan visualization initialization
  • SPARK-54971 Add WITH SCHEMA EVOLUTION syntax for SQL INSERT
  • SPARK-55065 Avoid making two JDBC API calls
  • SPARK-55033 Fix stringArgs of DSv2 writing commands
  • SPARK-55041 Cleanup some unused private function/val from core module
  • SPARK-55338 Centralize Spark Connect request decompression logic in gRPC interceptor
  • SPARK-55825 Rename _LEGACY_ERROR_TEMP_1309 to PARTITION_BY_NOT_ALLOWED_WITH_INSERT_INTO
  • SPARK-55492 Validate that eventTime in withWatermark is top-level column
  • SPARK-55802 Fix integer overflow when computing Arrow batch bytes
  • SPARK-55694 Block constraints in CTAS/RTAS at parser level
  • SPARK-55843 Handle the unit of datetime64 and timedelta64 dtypes
  • SPARK-55824 Rename _LEGACY_ERROR_TEMP_1034 to WINDOW_FUNCTION_NOT_ALLOWED_IN_CLAUSE
  • SPARK-55819 Refactor ExpandExec to be more succinct
  • SPARK-55341 Add storage level flag for cached local relations
  • SPARK-54599 Revert "[SC-219008][PYTHON] Refactor PythonException so it can take errorClass with sqlstate"
  • SPARK-46167 Add axis implementation to DataFrame.rank
  • SPARK-54599 Refactor PythonException so it can take errorClass with sqlstate
  • SPARK-55529 Reapply "[SC-220716][PYTHON] Restore Arrow-level batch merge for non-iterator applyInPandas"
  • SPARK-55794 Always alias OuterReferences
  • SPARK-55583 Validate Arrow schema types in Python data source
  • SPARK-37711 Reduce pandas describe job count from O(N) to O(1)
  • SPARK-46168 Add axis argument for idxmax
  • SPARK-46162 Implement nunique with axis=1
  • SPARK-55552 Add VariantType support to ColumnarBatchRow.copy() and MutableColumnarRow
  • SPARK-55647 Fix ConstantPropagation incorrectly replacing attributes with non-binary-stable collations
  • SPARK-55747 Fix NPE when accessing elements from an array that is null
  • SPARK-55757 Improve spark.task.cpus validation
  • SPARK-55699 Inconsistent reading of LowLatencyClock when used together with ManualClock
  • SPARK-55702 Support filter predicate in window aggregate functions
  • SPARK-55510 Fix deleteRange of Rocksdb state store to call changelogWriter
  • SPARK-55739 Optimize OnHeapColumnVector.putIntsLittleEndian/putLongsLittleEndian using Platform.copyMemory on little-endian platforms
  • SPARK-55730 Not make timezone lower case
  • SPARK-55701 Fix race condition in CompactibleFileStreamLog.allFiles
  • SPARK-55462 Revert "[SC-221123][PYTHON] Support VariantType in convert_numpy"
  • SPARK-55144 Introduce new state format version for performant stream-stream join
  • SPARK-55606 Server-side implementation of GetStatus API
  • SPARK-55462 Support VariantType in convert_numpy
  • SPARK-55600 Fix pandas to arrow loses row count when schema has 0 columns on classic
  • SPARK-55700 Fix handling integer keys on Series with non-integer index
  • SPARK-55349 Consolidate pandas-to-Arrow conversion utilities in serializers
  • SPARK-55681 Fix singleton DataType equality after deserialization (reattempt)
  • SPARK-55681 Revert "[SC-221427][SC-214079][SQL] Fix singleton DataType equality after deserialization"
  • SPARK-55681 Fix singleton DataType equality after deserialization
  • SPARK-55674 Optimize 0-column table conversion in Spark Connect
  • SPARK-55323 Reapply "[SC-218885][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol"
  • SPARK-55322 Reapply "[SC-221062][SQL] MaxBy and MinBy Overload with K Elements"
  • SPARK-55323 Revert "[SC-218885][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol"
  • SPARK-55615 Move SparkContext import into class branch
  • SPARK-55323 Move UDF metadata to EvalConf to simplify worker protocol
  • SPARK-55648 Handle an unexpected keyword argument error groupby(axis) with pandas 3
  • SPARK-55647 Revert "[SC-221274][SQL] Fix ConstantPropagation incorrectly replacing attributes with non-binary-stable collations"
  • SPARK-55646 Refactored SQLExecution.withThreadLocalCaptured to separate thread-local capture from execution
  • SPARK-54854 Add a UUIDv7 queryId to SQLExecution Events
  • SPARK-55619 Fix custom metrics in case of coalesced partitions
  • SPARK-55647 Fix ConstantPropagation incorrectly replacing attributes with non-binary-stable collations
  • SPARK-55322 Revert "[SC-221062][SQL] MaxBy and MinBy Overload with K Elements"
  • SPARK-54740 Start faulthandler early in daemon mode
  • SPARK-55493 Do not mkdirs in streaming checkpoint state directory in StateDataSource
  • SPARK-55322 MaxBy and MinBy Overload with K Elements
  • SPARK-55625 Fix StringOps to make str dtype work properly
  • SPARK-55161 Reapply "[SC-218867][PYTHON] Support profilers on python data source"
  • SPARK-55505 Fix NPE on reading EXECUTION_ROOT_ID_KEY in concurrent scenarios
  • SPARK-55111 Recheckin Unfinished repartitioning detection on query restart
  • SPARK-55593 Unify aggregation state for vector_avg/vector_sum
  • SPARK-55500 Fix analyzer cycle between ApplyDefaultCollation, ExtractWindowExpressions and CollationTypeCasts
  • SPARK-55494 Introduce iterator/prefixScan with multi-values in StateStore API
  • SPARK-55561 Add retries for all Kafka admin client methods
  • SPARK-55296 Support CoW mode with pandas 3
  • SPARK-55479 Fix style issues in SparkShreddingUtils
  • SPARK-55372 Fix SHOW CREATE TABLE for tables / views with default collation
  • SPARK-55333 Enable DateType and TimeType in convert_numpy
  • SPARK-55129 Introduce new key encoders for timestamp as a first class (UnsafeRow)
  • SPARK-46163 DataFrame.update parameters filter_func and errors
  • SPARK-55372 Revert "[SC-220571][SQL] Fix SHOW CREATE TABLE for tables / views with default collation"
  • SPARK-55480 Remove all unused noqa for ruff
  • SPARK-55471 Add optimizer support for SequentialStreamingUnion
  • SPARK-55584 Produce better error on scalar subquery to EXEC IMMEDIATE
  • SPARK-55161 Revert "[SC-218867][PYTHON] Support profilers on python data source"
  • SPARK-55506 Pass explicit input schema to to_pandas in CogroupPandasUDFSerializer
  • SPARK-55586 Add jdbc.py Example
  • SPARK-55161 Support profilers on python data source
  • SPARK-55529 Revert "[SC-220716][PYTHON] Restore Arrow-level batch merge for non-iterator applyInPandas"
  • SPARK-55385 Mitigate the recomputation in zipWithIndex
  • SPARK-55529 Restore Arrow-level batch merge for non-iterator applyInPandas
  • SPARK-55389 Consolidate SQL_MAP_ARROW_ITER_UDF wrapper, mapper, and serializer logic
  • SPARK-55406 Reimplement the thread pool for ExecutePlanResponseReattachableIterator
  • SPARK-55372 Fix SHOW CREATE TABLE for tables / views with default collation
  • SPARK-55367 Use venv for run-pip-tests
  • SPARK-55355 Upgrade mypy version to the latest
  • SPARK-55460 Remove E203 from ruff's ignore list
  • SPARK-55541 Support Geometry and Geography in catalyst type converters
  • SPARK-55449 Enable WKB parsing and writing for Geography
  • SPARK-55339 Implement WKT writer support for Geo objects
  • SPARK-54122 Implement TwsTester in Scala
  • SPARK-54805 Implement TwsTester in PySpark
  • SPARK-55256 Reapply "[SC-218596][SQL] Support IGNORE NULLS / RESPECT NULLS for array_agg and collect_list"
  • SPARK-55156 Deal with include_groups for groupby.apply
  • SPARK-55401 Add retry logic and timeout handling to pyspark install download
  • SPARK-55229 Implement DataFrame.zipWithIndex in PySpark
  • SPARK-55462 Support UserDefinedType in convert_numpy
  • SPARK-55483 Fix NPE in PivotFirst when pivot column is a non-atomic type with null values
  • SPARK-55490 Make groupby(as_index=False) include a grouping that is not in the DataFrame with pandas 3
  • SPARK-55473 Replace itertools.tee with chain in applyInPandasWithState
  • SPARK-55404 Always raise KeyboardInterrupt from SIGINT handler
  • SPARK-55407 Replace logger.warn with logger.warning
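
SPARK-54854 in the list above adds a UUIDv7 `queryId` to SQLExecution events. The practical appeal of UUIDv7 is that the identifier begins with a millisecond timestamp, so IDs sort in creation order. The following is a minimal sketch of the RFC 9562 bit layout using only the Python standard library. It is not Spark's implementation, and the helper names `uuid7_sketch` and `timestamp_ms` are hypothetical, introduced only for illustration.

```python
import os
import time
import uuid


def uuid7_sketch(unix_ms=None, random_bytes=None):
    """Build a UUIDv7-shaped value following the RFC 9562 field layout.

    Hypothetical helper for illustration; arguments default to the current
    time and fresh random bytes.
    """
    if unix_ms is None:
        unix_ms = time.time_ns() // 1_000_000
    if random_bytes is None:
        random_bytes = os.urandom(10)
    if len(random_bytes) != 10:
        raise ValueError("random_bytes must be exactly 10 bytes")

    rand = int.from_bytes(random_bytes, "big")   # 80 random bits
    rand_a = rand >> 68                          # top 12 bits
    rand_b = rand & ((1 << 62) - 1)              # low 62 bits

    value = (unix_ms & ((1 << 48) - 1)) << 80    # bits 127..80: Unix time in ms
    value |= 0x7 << 76                           # bits 79..76: version 7
    value |= rand_a << 64                        # bits 75..64: rand_a
    value |= 0b10 << 62                          # bits 63..62: RFC variant
    value |= rand_b                              # bits 61..0: rand_b
    return uuid.UUID(int=value)


def timestamp_ms(u):
    """Recover the embedded millisecond timestamp from a UUIDv7."""
    return u.int >> 80
```

Because the timestamp occupies the most significant bits, two IDs created in different milliseconds compare in time order, both as integers and as their canonical hex strings. That ordering is what makes such an ID convenient for sorting or correlating events.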

Databricks ODBC/JDBC driver support

Databricks supports ODBC/JDBC drivers released in the past 2 years. Download the most recently released drivers and upgrade (download ODBC, download JDBC).

System environment

  • Operating System: Ubuntu 24.04.4 LTS
  • Java: Zulu21.48+15-CA
  • Scala: 2.13.16
  • Python: 3.12.3
  • R: 4.5.1
  • Delta Lake: 4.1.0

Installed Python libraries

Library

Version

aiohappyeyeballs

2.4.4

aiohttp

3.11.10

aiosignal

1.2.0

annotated-doc

0.0.4

annotated-types

0.7.0

anyio

4.7.0

argon2-cffi

21.3.0

argon2-cffi-bindings

21.2.0

arro3-core

0.6.5

arrow

1.3.0

asttokens

3.0.0

astunparse

1.6.3

async-lru

2.0.4

attrs

24.3.0

autocommand

2.2.2

azure-common

1.1.28

azure-core

1.37.0

azure-identity

1.20.0

azure-mgmt-core

1.6.0

azure-mgmt-web

8.0.0

azure-storage-blob

12.28.0

azure-storage-file-datalake

12.22.0

babel

2.16.0

backports.tarfile

1.2.0

beautifulsoup4

4.12.3

black

24.10.0

bleach

6.2.0

blinker

1.7.0

boto3

1.40.45

botocore

1.40.45

cachetools

5.5.1

certifi

2025.4.26

cffi

1.17.1

chardet

4.0.0

charset-normalizer

3.3.2

click

8.1.8

cloudpickle

3.0.0

comm

0.2.1

contourpy

1.3.1

cryptography

44.0.1

cycler

0.11.0

Cython

3.1.5

databricks-agents

1.9.1

databricks-sdk

0.67.0

dataclasses-json

0.6.7

dbus-python

1.3.2

debugpy

1.8.11

decorator

5.1.1

defusedxml

0.7.1

deltalake

1.1.4

Deprecated

1.2.18

distlib

0.3.9

docstring-to-markdown

0.11

executing

1.2.0

facets-overview

1.1.1

fastapi

0.128.0

fastjsonschema

2.21.1

filelock

3.17.0

fonttools

4.55.3

fqdn

1.5.1

frozenlist

1.5.0

fsspec

2023.5.0

gitdb

4.0.11

GitPython

3.1.43

google-api-core

2.28.1

google-auth

2.47.0

google-cloud-core

2.5.0

google-cloud-storage

3.7.0

google-crc32c

1.8.0

google-resumable-media

2.8.0

googleapis-common-protos

1.65.0

grpcio

1.67.0

grpcio-status

1.67.0

h11

0.16.0

hf-xet

1.2.0

httpcore

1.0.9

httplib2

0.20.4

httpx

0.28.1

huggingface_hub

1.2.4

idna

3.7

importlib_metadata

8.5.0

inflect

7.3.1

iniconfig

1.1.1

ipyflow-core

0.0.209

ipykernel

6.29.5

ipython

8.30.0

ipython-genutils

0.2.0

ipywidgets

7.8.1

isodate

0.7.2

isoduration

20.11.0

jaraco.collections

5.1.0

jaraco.context

5.3.0

jaraco.functools

4.0.1

jaraco.text

3.12.1

jedi

0.19.2

Jinja2

3.1.6

jiter

0.12.0

jmespath

1.0.1

joblib

1.4.2

json5

0.9.25

jsonpatch

1.33

jsonpointer

3.0.0

jsonschema

4.23.0

jsonschema-specifications

2023.7.1

jupyter-events

0.12.0

jupyter-lsp

2.2.5

jupyter_client

8.6.3

jupyter_core

5.7.2

jupyter_server

2.15.0

jupyter_server_terminals

0.5.3

jupyterlab

4.3.4

jupyterlab_pygments

0.3.0

jupyterlab_server

2.27.3

jupyterlab_widgets

1.1.11

kiwisolver

1.4.8

langchain-core

1.2.6

langchain-openai

1.1.6

langsmith

0.6.1

launchpadlib

1.11.0

lazr.restfulclient

0.14.6

lazr.uri

1.0.6

litellm

1.75.9

markdown-it-py

2.2.0

MarkupSafe

3.0.2

marshmallow

3.26.2

matplotlib

3.10.0

matplotlib-inline

0.1.7

mccabe

0.7.0

mdurl

0.1.0

mistune

3.1.2

mlflow-skinny

3.8.1

mmh3

5.2.0

more-itertools

10.3.0

msal

1.34.0

msal-extensions

1.3.1

multidict

6.1.0

mypy-extensions

1.0.0

nbclient

0.10.2

nbconvert

7.16.6

nbformat

5.10.4

nest-asyncio

1.6.0

nodeenv

1.10.0

notebook

7.3.2

notebook_shim

0.2.4

numpy

2.1.3

oauthlib

3.2.2

openai

2.14.0

opentelemetry-api

1.39.1

opentelemetry-proto

1.39.1

opentelemetry-sdk

1.39.1

opentelemetry-semantic-conventions

0.60b1

orjson

3.11.5

overrides

7.4.0

packaging

24.2

pandas

2.2.3

pandocfilters

1.5.0

parso

0.8.4

pathspec

0.10.3

patsy

1.0.1

pexpect

4.8.0

pillow

11.1.0

pip

25.0.1

platformdirs

4.3.7

plotly

5.24.1

pluggy

1.5.0

prometheus_client

0.21.1

prompt-toolkit

3.0.43

propcache

0.3.1

proto-plus

1.27.0

protobuf

5.29.4

psutil

5.9.0

psycopg2

2.9.11

ptyprocess

0.7.0

pure-eval

0.2.2

pyarrow

21.0.0

pyasn1

0.4.8

pyasn1-modules

0.2.8

pyccolo

0.0.71

pycparser

2.21

pydantic

2.10.6

pydantic_core

2.27.2

pyflakes

3.2.0

Pygments

2.19.1

PyGObject

3.48.2

pyiceberg

0.10.0

PyJWT

2.10.1

pyodbc

5.2.0

pyparsing

3.2.0

pyright

1.1.394

pyroaring

1.0.3

pytest

8.3.5

python-dateutil

2.9.0.post0

python-dotenv

1.2.1

python-json-logger

3.2.1

python-lsp-jsonrpc

1.1.2

python-lsp-server

1.12.2

pytoolconfig

1.2.6

pytz

2024.1

PyYAML

6.0.2

pyzmq

26.2.0

referencing

0.30.2

regex

2024.11.6

requests

2.32.3

requests-toolbelt

1.0.0

rfc3339-validator

0.1.4

rfc3986-validator

0.1.1

rich

13.9.4

rope

1.13.0

rpds-py

0.22.3

rsa

4.9.1

s3transfer

0.14.0

scikit-learn

1.6.1

scipy

1.15.3

seaborn

0.13.2

Send2Trash

1.8.2

setuptools

78.1.1

shellingham

1.5.4

six

1.17.0

smmap

5.0.0

sniffio

1.3.0

sortedcontainers

2.4.0

soupsieve

2.5

sqlparse

0.5.5

ssh-import-id

5.11

stack-data

0.6.3

starlette

0.50.0

strictyaml

1.7.3

tenacity

9.0.0

terminado

0.17.1

threadpoolctl

3.5.0

tiktoken

0.12.0

tinycss2

1.4.0

tokenize_rt

6.1.0

tokenizers

0.22.2

tomli

2.0.1

tornado

6.5.1

tqdm

4.67.1

traitlets

5.14.3

typeguard

4.3.0

typer-slim

0.21.1

types-python-dateutil

2.9.0.20251115

typing-inspect

0.9.0

typing_extensions

4.12.2

tzdata

2024.1

ujson

5.10.0

unattended-upgrades

0.1

uri-template

1.3.0

urllib3

2.3.0

uuid_utils

0.12.0

uvicorn

0.40.0

virtualenv

20.29.3

wadllib

1.3.6

wcwidth

0.2.5

webcolors

25.10.0

webencodings

0.5.1

websocket-client

1.8.0

whatthepatch

1.0.2

wheel

0.45.1

whenever

0.7.3

widgetsnbextension

3.6.6

wrapt

1.17.0

yapf

0.40.2

yarl

1.18.0

zipp

3.21.0

zstandard

0.23.0
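
Because package versions can change during the Beta, it may help to check a running environment against the versions listed above. The following is a minimal sketch using the standard library's `importlib.metadata`. The `EXPECTED` dictionary holds three pins copied from the table, and the helper name `find_version_drift` is hypothetical, introduced only for illustration.

```python
from importlib import metadata

# A few pins copied from the table above; extend with the packages you rely on.
EXPECTED = {
    "numpy": "2.1.3",
    "pandas": "2.2.3",
    "pyarrow": "21.0.0",
}


def find_version_drift(expected, lookup=metadata.version):
    """Return {package: (expected, installed_or_None)} for every mismatch.

    `lookup` defaults to importlib.metadata.version and can be swapped out
    for testing. A package that is not installed is reported with None.
    """
    drift = {}
    for name, want in expected.items():
        try:
            have = lookup(name)
        except metadata.PackageNotFoundError:
            have = None
        if have != want:
            drift[name] = (want, have)
    return drift
```

Calling `find_version_drift(EXPECTED)` in a notebook returns an empty dictionary when every listed package matches. Otherwise it maps each differing package to its expected and installed versions.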

Installed R libraries

R libraries are installed from the Posit Package Manager CRAN snapshot on 2025-11-20.

Library

Version

arrow

22.0.0

askpass

1.2.1

assertthat

0.2.1

backports

1.5.0

base

4.5.1

base64enc

0.1-3

bigD

0.3.1

bit

4.6.0

bit64

4.6.0-1

bitops

1.0-9

blob

1.2.4

boot

1.3-30

brew

1.0-10

brio

1.1.5

broom

1.0.10

bslib

0.9.0

cachem

1.1.0

callr

3.7.6

caret

7.0-1

cellranger

1.1.0

chron

2.3-62

class

7.3-22

cli

3.6.5

clipr

0.8.0

clock

0.7.3

cluster

2.1.6

codetools

0.2-20

commonmark

2.0.0

compiler

4.5.1

config

0.3.2

conflicted

1.2.0

cpp11

0.5.2

crayon

1.5.3

credentials

2.0.3

curl

7.0.0

data.table

1.17.8

datasets

4.5.1

DBI

1.2.3

dbplyr

2.5.1

desc

1.4.3

devtools

2.4.6

diagram

1.6.5

diffobj

0.3.6

digest

0.6.39

downlit

0.4.5

dplyr

1.1.4

dtplyr

1.3.2

e1071

1.7-16

ellipsis

0.3.2

evaluate

1.0.5

fansi

1.0.7

farver

2.1.2

fastmap

1.2.0

fontawesome

0.5.3

forcats

1.0.1

foreach

1.5.2

foreign

0.8-86

fs

1.6.6

future

1.68.0

future.apply

1.20.0

gargle

1.6.0

generics

0.1.4

gert

2.2.0

ggplot2

4.0.1

gh

1.5.0

git2r

0.36.2

gitcreds

0.1.2

glmnet

4.1-10

globals

0.18.0

glue

1.8.0

googledrive

2.1.2

googlesheets4

1.1.2

gower

1.0.2

graphics

4.5.1

grDevices

4.5.1

grid

4.5.1

gridExtra

2.3

gsubfn

0.7

gt

1.1.0

gtable

0.3.6

hardhat

1.4.2

haven

2.5.5

highr

0.11

hms

1.1.4

htmltools

0.5.8.1

htmlwidgets

1.6.4

httpuv

1.6.16

httr

1.4.7

httr2

1.2.1

ids

1.0.1

ini

0.3.1

ipred

0.9-15

isoband

0.2.7

iterators

1.0.14

jquerylib

0.1.4

jsonlite

2.0.0

juicyjuice

0.1.0

KernSmooth

2.23-22

knitr

1.50

labeling

0.4.3

later

1.4.4

lattice

0.22-5

lava

1.8.2

lifecycle

1.0.4

listenv

0.10.0

litedown

0.8

lubridate

1.9.4

magrittr

2.0.4

markdown

2.0

MASS

7.3-60.0.1

Matrix

1.6-5

memoise

2.0.1

methods

4.5.1

mgcv

1.9-1

mime

0.13

miniUI

0.1.2

mlflow

3.6.0

ModelMetrics

1.2.2.2

modelr

0.1.11

nlme

3.1-164

nnet

7.3-19

numDeriv

2016.8-1.1

openssl

2.3.4

otel

0.2.0

parallel

4.5.1

parallelly

1.45.1

pillar

1.11.1

pkgbuild

1.4.8

pkgconfig

2.0.3

pkgdown

2.2.0

pkgload

1.4.1

plogr

0.2.0

plyr

1.8.9

praise

1.0.0

prettyunits

1.2.0

pROC

1.19.0.1

processx

3.8.6

prodlim

2025.04.28

profvis

0.4.0

progress

1.2.3

progressr

0.18.0

promises

1.5.0

proto

1.0.0

proxy

0.4-27

ps

1.9.1

purrr

1.2.0

R6

2.6.1

ragg

1.5.0

randomForest

4.7-1.2

rappdirs

0.3.3

rcmdcheck

1.4.0

RColorBrewer

1.1-3

Rcpp

1.1.0

RcppEigen

0.3.4.0.2

reactable

0.4.4

reactR

0.6.1

readr

2.1.6

readxl

1.4.5

recipes

1.3.1

rematch

2.0.0

rematch2

2.1.2

remotes

2.5.0

reprex

2.1.1

reshape2

1.4.5

rlang

1.1.6

rmarkdown

2.30

RODBC

1.3-26

roxygen2

7.3.3

rpart

4.1.23

rprojroot

2.1.1

Rserve

1.8-15

RSQLite

2.4.4

rstudioapi

0.17.1

rversions

3.0.0

rvest

1.0.5

S7

0.2.1

sass

0.4.10

scales

1.4.0

selectr

0.4-2

sessioninfo

1.2.3

shape

1.4.6.1

shiny

1.11.1

sourcetools

0.1.7-1

sparklyr

1.9.3

SparkR

4.1.0

sparsevctrs

0.3.4

spatial

7.3-17

splines

4.5.1

sqldf

0.4-11

SQUAREM

2021.1

stats

4.5.1

stats4

4.5.1

stringi

1.8.7

stringr

1.6.0

survival

3.5-8

swagger

5.17.14.1

sys

3.4.3

systemfonts

1.3.1

tcltk

4.5.1

testthat

3.3.0

textshaping

1.0.4

tibble

3.3.0

tidyr

1.3.1

tidyselect

1.2.1

tidyverse

2.0.0

timechange

0.3.0

timeDate

4051.111

tinytex

0.58

tools

4.5.1

tzdb

0.5.0

urlchecker

1.0.1

usethis

3.2.1

utf8

1.2.6

utils

4.5.1

uuid

1.2-1

V8

8.0.1

vctrs

0.6.5

viridisLite

0.4.2

vroom

1.6.6

waldo

0.6.2

whisker

0.4.1

withr

3.0.2

xfun

0.54

xml2

1.5.0

xopen

1.0.1

xtable

1.8-4

yaml

2.3.10

zeallot

0.2.0

zip

2.3.3

Installed Java and Scala libraries (Scala 2.13 cluster version)

Group ID

Artifact ID

Version

antlr

antlr

2.7.7

com.clearspring.analytics

stream

2.9.8

com.databricks

Rserve

1.8-3

com.databricks

databricks-sdk-java

0.53.0

com.databricks

jets3t

0.7.1-0

com.databricks.scalapb

scalapb-runtime_2.13

0.4.15-11

com.esotericsoftware

kryo-shaded

4.0.3

com.esotericsoftware

minlog

1.3.0

com.fasterxml

classmate

1.5.1

com.fasterxml.jackson.core

jackson-annotations

2.18.2

com.fasterxml.jackson.core

jackson-core

2.18.2

com.fasterxml.jackson.core

jackson-databind

2.18.2

com.fasterxml.jackson.dataformat

jackson-dataformat-yaml

2.15.2

com.fasterxml.jackson.datatype

jackson-datatype-joda

2.18.2

com.fasterxml.jackson.datatype

jackson-datatype-jsr310

2.18.3

com.fasterxml.jackson.module

jackson-module-paranamer

2.18.2

com.fasterxml.jackson.module

jackson-module-scala_2.13

2.18.2

com.github.ben-manes.caffeine

caffeine

2.9.3

com.github.blemale

scaffeine_2.13

4.1.0

com.github.fommil

jniloader

1.1

com.github.fommil.netlib

native_ref-java

1.1

com.github.fommil.netlib

native_ref-java

1.1-natives

com.github.fommil.netlib

native_system-java

1.1

com.github.fommil.netlib

native_system-java

1.1-natives

com.github.fommil.netlib

netlib-native_ref-linux-x86_64

1.1-natives

com.github.fommil.netlib

netlib-native_system-linux-x86_64

1.1-natives

com.github.luben

zstd-jni

1.5.7-6

com.github.virtuald

curvesapi

1.08

com.github.wendykierp

JTransforms

3.1

com.google.api.grpc

proto-google-common-protos

2.5.1

com.google.auth

google-auth-library-credentials

1.20.0

com.google.auth

google-auth-library-oauth2-http

1.20.0

com.google.auto.value

auto-value-annotations

1.10.4

com.google.code.findbugs

jsr305

3.0.0

com.google.code.gson

gson

2.11.0

com.google.crypto.tink

tink

1.16.0

com.google.errorprone

error_prone_annotations

2.36.0

com.google.flatbuffers

flatbuffers-java

25.2.10

com.google.guava

failureaccess

1.0.3

com.google.guava

guava

33.4.8-jre

com.google.http-client

google-http-client

1.43.3

com.google.http-client

google-http-client-gson

1.43.3

com.google.j2objc

j2objc-annotations

3.0.0

com.google.protobuf

protobuf-java

3.25.5

com.google.protobuf

protobuf-java-util

3.25.5

com.helger

profiler

1.1.1

com.ibm.icu

icu4j

75.1

com.jcraft

jsch

0.1.55

com.lihaoyi

sourcecode_2.13

0.1.9

com.microsoft.azure

azure-data-lake-store-sdk

2.3.10

com.microsoft.sqlserver

mssql-jdbc

12.8.0.jre11

com.microsoft.sqlserver

mssql-jdbc

12.8.0.jre8

com.ning

compress-lzf

1.1.2

com.sun.mail

javax.mail

1.5.2

com.sun.xml.bind

jaxb-core

2.2.11

com.sun.xml.bind

jaxb-impl

2.2.11

com.tdunning

json

1.8

com.thoughtworks.paranamer

paranamer

2.8

com.trueaccord.lenses

lenses_2.13

0.4.13

com.twitter

chill-java

0.10.0

com.twitter

chill_2.13

0.10.0

com.twitter

util-app_2.13

19.8.1

com.twitter

util-core_2.13

19.8.1

com.twitter

util-function_2.13

19.8.1

com.twitter

util-jvm_2.13

19.8.1

com.twitter

util-lint_2.13

19.8.1

com.twitter

util-registry_2.13

19.8.1

com.twitter

util-stats_2.13

19.8.1

com.typesafe

config

1.4.3

com.typesafe.scala-logging

scala-logging_2.13

3.9.2

com.uber

h3

3.7.3

com.univocity

univocity-parsers

2.9.1

com.zaxxer

HikariCP

4.0.3

com.zaxxer

SparseBitSet

1.3

commons-cli

commons-cli

1.10.0

commons-codec

commons-codec

1.19.0

commons-collections

commons-collections

3.2.2

commons-dbcp

commons-dbcp

1.4

commons-fileupload

commons-fileupload

1.6.0

commons-httpclient

commons-httpclient

3.1

commons-io

commons-io

2.21.0

commons-lang

commons-lang

2.6

commons-logging

commons-logging

1.1.3

commons-pool

commons-pool

1.5.4

dev.ludovic.netlib

arpack

3.0.4

dev.ludovic.netlib

blas

3.0.4

dev.ludovic.netlib

lapack

3.0.4

info.ganglia.gmetric4j

gmetric4j

1.0.10

io.airlift

aircompressor

2.0.2

io.delta

delta-sharing-client_2.13

1.3.10

io.dropwizard.metrics

metrics-annotation

4.2.37

io.dropwizard.metrics

metrics-core

4.2.37

io.dropwizard.metrics

metrics-graphite

4.2.37

io.dropwizard.metrics

metrics-healthchecks

4.2.37

io.dropwizard.metrics

metrics-jetty10

4.2.37

io.dropwizard.metrics

metrics-jmx

4.2.37

io.dropwizard.metrics

metrics-json

4.2.37

io.dropwizard.metrics

metrics-jvm

4.2.37

io.dropwizard.metrics

metrics-servlets

4.2.37

io.github.java-diff-utils

java-diff-utils

4.15

io.netty

netty-all

4.2.7.Final

io.netty

netty-buffer

4.2.7.Final

io.netty

netty-codec

4.2.7.Final

io.netty

netty-codec-base

4.2.7.Final

io.netty

netty-codec-classes-quic

4.2.7.Final

io.netty

netty-codec-compression

4.2.7.Final

io.netty

netty-codec-http

4.2.7.Final

io.netty

netty-codec-http2

4.2.7.Final

io.netty

netty-codec-http3

4.2.7.Final

io.netty

netty-codec-marshalling

4.2.7.Final

io.netty

netty-codec-native-quic

4.2.7.Final-linux-aarch_64

io.netty

netty-codec-native-quic

4.2.7.Final-linux-x86_64

io.netty

netty-codec-native-quic

4.2.7.Final-osx-aarch_64

io.netty

netty-codec-native-quic

4.2.7.Final-osx-x86_64

io.netty

netty-codec-native-quic

4.2.7.Final-windows-x86_64

io.netty

netty-codec-protobuf

4.2.7.Final

io.netty

netty-codec-socks

4.2.7.Final

io.netty

netty-common

4.2.7.Final

io.netty

netty-handler

4.2.7.Final

io.netty

netty-handler-proxy

4.2.7.Final

io.netty

netty-resolver

4.2.7.Final

io.netty

netty-tcnative-boringssl-static

2.0.74.Final-db-r0-linux-aarch_64

io.netty

netty-tcnative-boringssl-static

2.0.74.Final-db-r0-linux-x86_64

io.netty

netty-tcnative-boringssl-static

2.0.74.Final-db-r0-osx-aarch_64

io.netty

netty-tcnative-boringssl-static

2.0.74.Final-db-r0-osx-x86_64

io.netty

netty-tcnative-boringssl-static

2.0.74.Final-db-r0-windows-x86_64

io.netty

netty-tcnative-classes

2.0.74.Final

io.netty

netty-transport

4.2.7.Final

io.netty

netty-transport-classes-epoll

4.2.7.Final

io.netty

netty-transport-classes-io_uring

4.2.7.Final

io.netty

netty-transport-classes-kqueue

4.2.7.Final

io.netty

netty-transport-native-epoll

4.2.7.Final

io.netty

netty-transport-native-epoll

4.2.7.Final-linux-aarch_64

io.netty

netty-transport-native-epoll

4.2.7.Final-linux-riscv64

io.netty

netty-transport-native-epoll

4.2.7.Final-linux-x86_64

io.netty

netty-transport-native-io_uring

4.2.7.Final-linux-aarch_64

io.netty

netty-transport-native-io_uring

4.2.7.Final-linux-riscv64

io.netty

netty-transport-native-io_uring

4.2.7.Final-linux-x86_64

io.netty

netty-transport-native-kqueue

4.2.7.Final-osx-aarch_64

io.netty

netty-transport-native-kqueue

4.2.7.Final-osx-x86_64

io.netty

netty-transport-native-unix-common

4.2.7.Final

io.opencensus

opencensus-api

0.31.1

io.opencensus

opencensus-contrib-http-util

0.31.1

io.prometheus

simpleclient

0.16.1-databricks

io.prometheus

simpleclient_common

0.16.1-databricks

io.prometheus

simpleclient_dropwizard

0.16.1-databricks

io.prometheus

simpleclient_pushgateway

0.16.1-databricks

io.prometheus

simpleclient_servlet

0.16.1-databricks

io.prometheus

simpleclient_servlet_common

0.16.1-databricks

io.prometheus

simpleclient_tracer_common

0.16.1-databricks

io.prometheus

simpleclient_tracer_otel

0.16.1-databricks

io.prometheus

simpleclient_tracer_otel_agent

0.16.1-databricks

io.prometheus.jmx

collector

0.18.0

jakarta.annotation

jakarta.annotation-api

1.3.5

jakarta.servlet

jakarta.servlet-api

4.0.3

jakarta.validation

jakarta.validation-api

2.0.2

jakarta.ws.rs

jakarta.ws.rs-api

2.1.6

javax.activation

activation

1.1.1

javax.annotation

javax.annotation-api

1.3.2

javax.el

javax.el-api

2.2.4

javax.jdo

jdo-api

3.0.1

javax.media

jai_core

jai_core_dummy

javax.transaction

jta

1.1

javax.transaction

transaction-api

1.1

javax.xml.bind

jaxb-api

2.2.11

javolution

javolution

5.5.1

jline

jline

2.14.6

joda-time

joda-time

2.14.0

net.java.dev.jna

jna

5.8.0

net.razorvine

pickle

1.5

net.sf.jpam

jpam

1.1

net.sf.opencsv

opencsv

2.3

net.sf.supercsv

super-csv

2.2.0

net.snowflake

snowflake-ingest-sdk

0.9.6

net.sourceforge.f2j

arpack_combined_all

0.1

org.acplt.remotetea

remotetea-oncrpc

1.1.2

org.antlr

ST4

4.0.4

org.antlr

antlr-runtime

3.5.2

org.antlr

antlr4-runtime

4.13.1

org.antlr

stringtemplate

3.2.1

org.apache.ant

ant

1.10.11

org.apache.ant

ant-jsch

1.10.11

org.apache.ant

ant-launcher

1.10.11

org.apache.arrow

arrow-compression

18.3.0

org.apache.arrow

arrow-format

18.3.0

org.apache.arrow

arrow-memory-core

18.3.0

org.apache.arrow

arrow-memory-netty

18.3.0

org.apache.arrow

arrow-memory-netty-buffer-patch

18.3.0

org.apache.arrow

arrow-vector

18.3.0

org.apache.avro

avro

1.12.1

org.apache.avro

avro-ipc

1.12.1

org.apache.avro

avro-mapred

1.12.1

org.apache.commons

commons-collections4

4.5.0

org.apache.commons

commons-compress

1.28.0

org.apache.commons

commons-configuration2

2.11.0

org.apache.commons

commons-crypto

1.1.0

org.apache.commons

commons-lang3

3.19.0

org.apache.commons

commons-math3

3.6.1

org.apache.commons

commons-text

1.14.0

org.apache.curator

curator-client

5.9.0

org.apache.curator

curator-framework

5.9.0

org.apache.curator

curator-recipes

5.9.0

org.apache.datasketches

datasketches-java

6.2.0

org.apache.datasketches

datasketches-memory

3.0.2

org.apache.derby

derby

10.14.2.0

org.apache.hadoop

hadoop-client-runtime

3.4.2

org.apache.hive

hive-beeline

2.3.10

org.apache.hive

hive-cli

2.3.10

org.apache.hive

hive-jdbc

2.3.10

org.apache.hive

hive-llap-client

2.3.10

org.apache.hive

hive-llap-common

2.3.10

org.apache.hive

hive-serde

2.3.10

org.apache.hive

hive-shims

2.3.10

org.apache.hive

hive-storage-api

2.8.1

org.apache.hive.shims

hive-shims-0.23

2.3.10

org.apache.hive.shims

hive-shims-common

2.3.10

org.apache.hive.shims

hive-shims-scheduler

2.3.10

org.apache.httpcomponents

httpclient

4.5.14

org.apache.httpcomponents

httpcore

4.4.16

org.apache.ivy

ivy

2.5.3

org.apache.logging.log4j

log4j-1.2-api

2.24.3

org.apache.logging.log4j

log4j-api

2.24.3

org.apache.logging.log4j

log4j-core

2.24.3

org.apache.logging.log4j

log4j-layout-template-json

2.24.3

org.apache.logging.log4j

log4j-slf4j2-impl

2.24.3

org.apache.orc

orc-core

2.2.0-shaded-protobuf

org.apache.orc

orc-format

1.1.1-shaded-protobuf

org.apache.orc

orc-mapreduce

2.2.0-shaded-protobuf

org.apache.orc

orc-shims

2.2.0

org.apache.poi

poi

5.4.1

org.apache.poi

poi-ooxml

5.4.1

org.apache.poi

poi-ooxml-full

5.4.1

org.apache.poi

poi-ooxml-lite

5.4.1

org.apache.thrift

libfb303

0.9.3

org.apache.thrift

libthrift

0.16.0

org.apache.ws.xmlschema

xmlschema-core

2.3.1

org.apache.xbean

xbean-asm9-shaded

4.28

org.apache.xmlbeans

xmlbeans

5.3.0

org.apache.yetus

audience-annotations

0.13.0

org.apache.zookeeper

zookeeper

3.9.4

org.apache.zookeeper

zookeeper-jute

3.9.4

org.checkerframework

checker-qual

3.43.0

org.codehaus.janino

commons-compiler

3.0.16

org.codehaus.janino

janino

3.0.16

org.datanucleus

datanucleus-api-jdo

4.2.4

org.datanucleus

datanucleus-core

4.1.17

org.datanucleus

datanucleus-rdbms

4.1.19

org.datanucleus

javax.jdo

3.2.0-m3

org.eclipse.jetty

jetty-alpn-client

10.0.26

org.eclipse.jetty

jetty-client

10.0.26

org.eclipse.jetty

jetty-http

10.0.26

org.eclipse.jetty

jetty-io

10.0.26

org.eclipse.jetty

jetty-jndi

10.0.26

org.eclipse.jetty

jetty-plus

10.0.26

org.eclipse.jetty

jetty-proxy

10.0.26

org.eclipse.jetty

jetty-security

10.0.26

org.eclipse.jetty

jetty-server

10.0.26

org.eclipse.jetty

jetty-servlet

10.0.26

org.eclipse.jetty

jetty-servlets

10.0.26

org.eclipse.jetty

jetty-util

10.0.26

org.eclipse.jetty

jetty-webapp

10.0.26

org.eclipse.jetty

jetty-xml

10.0.26

org.fusesource.leveldbjni

leveldbjni-all

1.8

org.glassfish.hk2

hk2-api

2.6.1

org.glassfish.hk2

hk2-locator

2.6.1

org.glassfish.hk2

hk2-utils

2.6.1

org.glassfish.hk2

osgi-resource-locator

1.0.3

org.glassfish.hk2.external

aopalliance-repackaged

2.6.1

org.glassfish.hk2.external

jakarta.inject

2.6.1

org.glassfish.jersey.containers

jersey-container-servlet

2.41

org.glassfish.jersey.containers

jersey-container-servlet-core

2.41

org.glassfish.jersey.core

jersey-client

2.41

org.glassfish.jersey.core

jersey-common

2.41

org.glassfish.jersey.core

jersey-server

2.41

org.glassfish.jersey.inject

jersey-hk2

2.41

org.hibernate.validator

hibernate-validator

6.2.5.Final

org.ini4j

ini4j

0.5.4

org.javassist

javassist

3.29.2-GA

org.jboss.logging

jboss-logging

3.4.1.Final

org.jdbi

jdbi

2.63.1

org.jline

jline

3.27.1-jdk8

org.joda

joda-convert

1.7

org.json4s

json4s-ast_2.13

4.0.7

org.json4s

json4s-core_2.13

4.0.7

org.json4s

json4s-jackson-core_2.13

4.0.7

org.json4s

json4s-jackson_2.13

4.0.7

org.json4s

json4s-scalap_2.13

4.0.7

org.locationtech.jts

jts-core

1.20.0

org.lz4

lz4-java

1.8.0-databricks-1

org.mlflow

mlflow-spark_2.13

2.22.1

org.objenesis

objenesis

3.4

org.postgresql

postgresql

42.6.1

org.roaringbitmap

RoaringBitmap

1.2.1

org.rosuda.REngine

REngine

2.1.0

org.scala-lang

scala-compiler_2.13

2.13.16

org.scala-lang

scala-library_2.13

2.13.16

org.scala-lang

scala-reflect_2.13

2.13.16

org.scala-lang.modules

scala-collection-compat_2.13

2.11.0

org.scala-lang.modules

scala-java8-compat_2.13

0.9.1

org.scala-lang.modules

scala-parallel-collections_2.13

1.2.0

org.scala-lang.modules

scala-parser-combinators_2.13

2.4.0

org.scala-lang.modules

scala-xml_2.13

2.4.0

org.scala-sbt

test-interface

1.0

org.scalacheck

scalacheck_2.13

1.18.0

org.scalactic

scalactic_2.13

3.2.19

org.scalanlp

breeze-macros_2.13

2.1.0

org.scalanlp

breeze_2.13

2.1.0

org.scalatest

scalatest-compatible

3.2.19

org.scalatest

scalatest-core_2.13

3.2.19

org.scalatest

scalatest-diagrams_2.13

3.2.19

org.scalatest

scalatest-featurespec_2.13

3.2.19

org.scalatest

scalatest-flatspec_2.13

3.2.19

org.scalatest

scalatest-freespec_2.13

3.2.19

org.scalatest

scalatest-funspec_2.13

3.2.19

org.scalatest

scalatest-funsuite_2.13

3.2.19

org.scalatest

scalatest-matchers-core_2.13

3.2.19

org.scalatest

scalatest-mustmatchers_2.13

3.2.19

org.scalatest

scalatest-propspec_2.13

3.2.19

org.scalatest

scalatest-refspec_2.13

3.2.19

org.scalatest

scalatest-shouldmatchers_2.13

3.2.19

org.scalatest

scalatest-wordspec_2.13

3.2.19

org.scalatest

scalatest_2.13

3.2.19

org.slf4j

jcl-over-slf4j

2.0.16

org.slf4j

jul-to-slf4j

2.0.16

org.slf4j

slf4j-api

2.0.16

org.slf4j

slf4j-simple

1.7.25

org.threeten

threeten-extra

1.8.0

org.tukaani

xz

1.10

org.typelevel

algebra_2.13

2.8.0

org.typelevel

cats-kernel_2.13

2.8.0

org.typelevel

spire-macros_2.13

0.18.0

org.typelevel

spire-platform_2.13

0.18.0

org.typelevel

spire-util_2.13

0.18.0

org.typelevel

spire_2.13

0.18.0

org.wildfly.openssl

wildfly-openssl

1.1.3.Final

org.xerial

sqlite-jdbc

3.42.0.0

org.xerial.snappy

snappy-java

1.1.10.3

org.yaml

snakeyaml

2.0

oro

oro

2.0.8

pl.edu.icm

JLargeArrays

1.5

software.amazon.cryptools

AmazonCorrettoCryptoProvider

2.5.0-linux-x86_64

stax

stax-api

1.0.1