Databricks Runtime 12.2 LTS

The following release notes provide information about Databricks Runtime 12.2 LTS, powered by Apache Spark 3.3.2.

Databricks released these images in March, 2023.

New features and improvements

Delta Lake schema evolution supports specifying source columns in merge statements

You can now specify columns present only in the source table in insert or update actions for merge statements when schema evolution is enabled. In Databricks Runtime 12.1 and below, only INSERT * or UPDATE SET * actions can be used for schema evolution with merge. See Automatic schema evolution for Delta Lake merge.

Structured Streaming workloads are supported on clusters with shared access mode

You can now use Structured Streaming to interact with Unity Catalog on shared clusters. Some limitations apply. See What Structured Streaming functionality does Unity Catalog support?.

New features for Predictive I/O

Photon support for Foreachbatch sink is now available. Workloads that stream from a source and merge into Delta Tables or write to multiple sinks can now benefit from Photonized Foreachbatch sink.

Implicit lateral column aliasing support

Databricks now supports implicit lateral column aliasing by default. You can now reuse an expression specified earlier in the same SELECT list. For example, given SELECT 1 AS a, a + 1 AS b, the a in a + 1 can be resolved as the previously defined 1 AS a. Check Name resolution for more details in resolution order. To turn off this feature, you can set spark.sql.lateralColumnAlias.enableImplicitResolution to false.

New forEachBatch feature

Photon is now supported when using foreachBatch to write to a data sink.

Standardized Connection Options for Query Federation

You can now use a unified set of options (host, port, database, user, password)to connect to data sources supported in Query Federation. Port is optional and uses the default port number for each data source, unless specified.

Extended SQL function library for array management

You can now remove all NULL elements from an array using array_compact. To append elements to an array, use array_append.

New mask function to anonymize strings

Invoke the mask function to anonymize sensitive string values.

Common error conditions now return SQLSTATEs

Most error conditions that Databricks Runtime now include documented SQLSTATE values which can be used to test for errors in a SQL standard compliant way.

Invoke generator functions in the FROM clause

You can now invoke table-valued generator functions such as explode in the regular FROM clause of a query. This aligns generator function invocation with other built-in and user defined table functions.

Support for protocol buffers is generally available

You can use the from_protobuf and to_protobuf functions to exchange data between binary and struct types. See Read and write protocol buffers.

Go-to-definition for notebook variables and functions

In notebooks, you can quickly go to the definition of a variable, function, or the code behind a %run statement by right-clicking the variable or function name.

Bug fixes

  • Improved consistency for Delta commit behavior for empty transactions relating to update, delete, and merge commands. In WriteSerializable isolation level, commands that result in no changes now create an empty commit. In a Serializable isolation level, such empty transactions now do not create a commit.

Behavior changes

Behavior changes with the new lateral column alias feature

The new lateral column alias feature introduces behavior changes for the following cases during name resolution:

  • Lateral column alias now takes precedence over correlated references with the same name. For example, for this query SELECT (SELECT c2 FROM (SELECT 1 AS c1, c1 AS c2) WHERE c2 > 5) FROM VALUES(6) AS t(c1), the c1 in the inner c1 AS c2 was resolved to the correlated reference t.c1, but now changes to lateral column alias 1 AS c1. The query now returns NULL.

  • Lateral column alias now takes precedence over function parameters with the same name. For example, for the function, CREATE OR REPLACE TEMPORARY FUNCTION func(x INT) RETURNS TABLE (a INT, b INT, c DOUBLE) RETURN SELECT x + 1 AS x, x, the x in the function body was resolved to the function parameter x, but changes to lateral column alias x + 1 in the function body. The query, SELECT * FROM func(1) now returns 2, 2.

  • To turn off the lateral column alias feature, set spark.sql.lateralColumnAlias.enableImplicitResolution to false. For more information, see Name resolution.

Library upgrades

  • Upgraded Python libraries:

    • filelock from 3.8.2 to 3.9.0

    • joblib from 1.1.0 to 1.1.1

    • platformdirs from 2.6.0 to 2.6.2

    • whatthepatch from 1.0.3 to 1.0.4

  • Upgraded R libraries:

    • class from 7.3-20 to 7.3-21

    • codetools from 0.2-18 to 0.2-19

    • MASS from 7.3-58 to 7.3-58.2

    • nlme from 3.1-160 to 3.1-162

    • Rserve from 1.8-11 to 1.8-12

    • SparkR from 3.3.1 to 3.3.2

Behavior changes

  • Users are now required to have SELECT and MODIFY privileges on any file when creating a schema with a defined location.

Apache Spark

Databricks Runtime 12.2 includes Apache Spark 3.3.2. This release includes all Spark fixes and improvements included in Databricks Runtime 12.1 (unsupported), as well as the following additional bug fixes and improvements made to Spark:

  • [SPARK-42416] [SC-123205][SC-122851][SQL] Dateset operations should not resolve the analyzed logical plan again

  • [SPARK-41848] Revert “[CHERRY-PICK][12.x][12.1][12.0][SC-120037][CORE] Fixing task over-scheduled with TaskResourceProfile”

  • [SPARK-42162] [SC-122711][ES-556261] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions

  • [SPARK-42406] [SC-122998][PROTOBUF][Cherry-pick] Fix recursive depth setting for Protobuf functions

  • [SPARK-42002] [SC-122476][CONNECT][PYTHON] Implement DataFrameWriterV2

  • [SPARK-41716] [SC-122545][CONNECT] Rename catalogto_pandas to executeand_fetch in Catalog

  • [SPARK-41490] [SC-121774][SQL] Assign name to LEGACYERROR_TEMP_2441

  • [SPARK-41600] [SC-122538][SPARK-41623][SPARK-41612][CONNECT] Implement Catalog.cacheTable, isCached and uncache

  • [SPARK-42191] [SC-121990][SQL] Support udf ‘luhn_check’

  • [SPARK-42253] [SC-121976][PYTHON] Add test for detecting duplicated error class

  • [SPARK-42268] [SC-122251][CONNECT][PYTHON] Add UserDefinedType in protos

  • [SPARK-42231] [SC-121841][SQL] Turn MISSING_STATIC_PARTITION_COLUMN into internalError

  • [SPARK-42136] [SC-122554] Refactor BroadcastHashJoinExec output partitioning calculation

  • [SPARK-42158] [SC-121610][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND

  • [SPARK-42192] [12.x][SC-121820][PYTHON] Migrate the TypeError from pyspark/sql/dataframe.py into PySparkTypeError

  • [SPARK-35240] Revert “[SC-118242][SS] Use CheckpointFileManager …

  • [SPARK-41488] [SC-121858][SQL] Assign name to LEGACYERROR_TEMP_1176 (and 1177)

  • [SPARK-42232] [SC-122267][SQL] Rename error class: UNSUPPORTED_FEATURE.JDBC_TRANSACTION

  • [SPARK-42346] [SC-122480][SQL] Rewrite distinct aggregates after subquery merge

  • [SPARK-42306] [SC-122539][SQL] Integrate _LEGACY_ERROR_TEMP_1317 into UNRESOLVED_COLUMN.WITH_SUGGESTION

  • [SPARK-42234] [SC-122354][SQL] Rename error class: UNSUPPORTED_FEATURE.REPEATED_PIVOT

  • [SPARK-42343] [SC-122437][CORE] Ignore IOException in handleBlockRemovalFailure if SparkContext is stopped

  • [SPARK-41295] [SC-122442][SPARK-41296][SQL] Rename the error classes

  • [SPARK-42320] [SC-122478][SQL] Assign name to LEGACYERROR_TEMP_2188

  • [SPARK-42255] [SC-122483][SQL] Assign name to LEGACYERROR_TEMP_2430

  • [SPARK-42156] [SC-121851][CONNECT] SparkConnectClient supports RetryPolicies now

  • [SPARK-38728] [SC-116723][SQL] Test the error class: FAILED_RENAME_PATH

  • [SPARK-40005] [12.X] Self contained examples in PySpark

  • [SPARK-39347] [SC-122457][SS] Bug fix for time window calculation when event time < 0

  • [SPARK-42336] [SC-122458][CORE] Use getOrElse() instead of contains() in ResourceAllocator

  • [SPARK-42125] [SC-121827][CONNECT][PYTHON] Pandas UDF in Spark Connect

  • [SPARK-42217] [SC-122263][SQL] Support implicit lateral column alias in queries with Window

  • [SPARK-35240] [SC-118242][SS] Use CheckpointFileManager for checkpoint file manipulation

  • [SPARK-42294] [SC-122337][SQL] Include column default values in DESCRIBE output for V2 tables

  • [SPARK-41979] Revert “Revert “[12.x][SC-121190][SQL] Add missing dots for error messages in error classes.””

  • [SPARK-42286] [SC-122336][SQL] Fallback to previous codegen code path for complex expr with CAST

  • [SPARK-42275] [SC-122249][CONNECT][PYTHON] Avoid using built-in list, dict in static typing

  • [SPARK-41985] [SC-122172][SQL] Centralize more column resolution rules

  • [SPARK-42126] [SC-122330][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect

  • [SPARK-42197] [SC-122328][SC-121514][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode

  • [SPARK-41575] [SC-120118][SQL] Assign name to LEGACYERROR_TEMP_2054

  • [SPARK-41985] Revert “[SC-122172][SQL] Centralize more column resolution rules”

  • [SPARK-42123] [SC-122234][SC-121453][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output

  • [SPARK-41985] [SC-122172][SQL] Centralize more column resolution rules

  • [SPARK-42284] [SC-122233][CONNECT] Make sure connect server assembly is built before running client tests - SBT

  • [SPARK-42239] [SC-121790][SQL] Integrate MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY

  • [SPARK-42278] [SC-122170][SQL] DS V2 pushdown supports supports JDBC dialects compile SortOrder by themselves

  • [SPARK-42259] [SC-122168][SQL] ResolveGroupingAnalytics should take care of Python UDAF

  • [SPARK-41979] Revert “[12.x][SC-121190][SQL] Add missing dots for error messages in error classes.”

  • [SPARK-42224] [12.x][SC-121708][CONNECT] Migrate TypeError into error framework for Spark Connect functions

  • [SPARK-41712] [12.x][SC-121189][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework.

  • [SPARK-42119] [SC-121913][SC-121342][SQL] Add built-in table-valued functions inline and inline_outer

  • [SPARK-41489] [SC-121713][SQL] Assign name to LEGACYERROR_TEMP_2415

  • [SPARK-42082] [12.x][SC-121163][SPARK-41598][PYTHON][CONNECT] Introduce PySparkValueError and PySparkTypeError

  • [SPARK-42081] [SC-121723][SQL] Improve the plan change validation

  • [SPARK-42225] [12.x][SC-121714][CONNECT] Add SparkConnectIllegalArgumentException to handle Spark Connect error precisely.

  • [SPARK-42044] [12.x][SC-121280][SQL] Fix incorrect error message for MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY

  • [SPARK-42194] [12.x][SC-121712][PS] Allow columns parameter when creating DataFrame with Series.

  • [SPARK-42078] [12.x][SC-120761][PYTHON] Migrate errors thrown by JVM into PySparkException.

  • [SPARK-42133] [12.x][SC-121250] Add basic Dataset API methods to SparkConnect Scala Client

  • [SPARK-41979] [12.x][SC-121190][SQL] Add missing dots for error messages in error classes.

  • [SPARK-42124] [12.x][SC-121420][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect

  • [SPARK-42051] [SC-121994][SQL] Codegen Support for HiveGenericUDF

  • [SPARK-42257] [SC-121948][CORE] Remove unused variable external sorter

  • [SPARK-41735] [SC-121771][SQL] Use MINIMAL instead of STANDARD for SparkListenerSQLExecutionEnd

  • [SPARK-42236] [SC-121882][SQL] Refine NULLABLE_ARRAY_OR_MAP_ELEMENT

  • [SPARK-42233] [SC-121775][SQL] Improve error message for PIVOT_AFTER_GROUP_BY

  • [SPARK-42229] [SC-121856][CORE] Migrate SparkCoreErrors into error classes

  • [SPARK-42163] [SC-121839][SQL] Fix schema pruning for non-foldable array index or map key

  • [SPARK-40711] [SC-119990][SQL] Add spill size metrics for window

  • [SPARK-42023] [SC-121847][SPARK-42024][CONNECT][PYTHON] Make createDataFrame support AtomicType -> StringType coercion

  • [SPARK-42202] [SC-121837][Connect][Test] Improve the E2E test server stop logic

  • [SPARK-41167] [SC-117425][SQL] Improve multi like performance by creating a balanced expression tree predicate

  • [SPARK-41931] [SC-121618][SQL] Better error message for incomplete complex type definition

  • [SPARK-36124] [SC-121339][SC-110446][SQL] Support subqueries with correlation through UNION

  • [SPARK-42090] [SC-121290][3.3] Introduce sasl retry count in RetryingBlockTransferor

  • [SPARK-42157] [SC-121264][CORE] spark.scheduler.mode=FAIR should provide FAIR scheduler

  • [SPARK-41572] [SC-120772][SQL] Assign name to LEGACYERROR_TEMP_2149

  • [SPARK-41983] [SC-121224][SQL] Rename & improve error message for NULL_COMPARISON_RESULT

  • [SPARK-41976] [SC-121024][SQL] Improve error message for INDEX_NOT_FOUND

  • [SPARK-41994] [SC-121210][SC-120573] Assign SQLSTATE’s (1/2)

  • [SPARK-41415] [SC-121117][3.3] SASL Request Retries

  • [SPARK-38591] [SC-121018][SQL] Add flatMapSortedGroups and cogroupSorted

  • [SPARK-41975] [SC-120767][SQL] Improve error message for INDEX_ALREADY_EXISTS

  • [SPARK-42056] [SC-121158][SQL][PROTOBUF] Add missing options for Protobuf functions

  • [SPARK-41984] [SC-120769][SQL] Rename & improve error message for RESET_PERMISSION_TO_ORIGINAL

  • [SPARK-41948] [SC-121196][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD

  • [SPARK-41772] [SC-121176][CONNECT][PYTHON] Fix incorrect column name in withField’s doctest

  • [SPARK-41283] [SC-121175][CONNECT][PYTHON] Add array_append to Connect

  • [SPARK-41960] [SC-120773][SQL] Assign name to LEGACYERROR_TEMP_1056

  • [SPARK-42134] [SC-121116][SQL] Fix getPartitionFiltersAndDataFilters() to handle filters without referenced attributes

  • [SPARK-42096] [SC-121012][CONNECT] Some code cleanup for connect module

  • [SPARK-42099] [SC-121114][SPARK-41845][CONNECT][PYTHON] Fix count(*) and count(col(*))

  • [SPARK-42045] [SC-120958][SC-120450][SQL] ANSI SQL mode: Round/Bround should return an error on integer overflow

  • [SPARK-42043] [SC-120968][CONNECT] Scala Client Result with E2E Tests

  • [SPARK-41884] [SC-121022][CONNECT] Support naive tuple as a nested row

  • [SPARK-42112] [SC-121011][SQL][SS] Add null check before ContinuousWriteRDD#compute function close dataWriter

  • [SPARK-42077] [SC-120553][CONNECT][PYTHON] Literal should throw TypeError for unsupported DataType

  • [SPARK-42108] [SC-120898][SQL] Make Analyzer transform Count(*) into Count(1)

  • [SPARK-41666] [SC-120928][SC-119009][PYTHON] Support parameterized SQL by sql()

  • [SPARK-40599] [SC-120930][SQL] Relax multiTransform rule type to allow alternatives to be any kinds of Seq

  • [SPARK-41574] [SC-120771][SQL] Update _LEGACY_ERROR_TEMP_2009 as INTERNAL_ERROR.

  • [SPARK-41579] [SC-120770][SQL] Assign name to LEGACYERROR_TEMP_1249

  • [SPARK-41974] [SC-120766][SQL] Turn INCORRECT_END_OFFSET into INTERNAL_ERROR

  • [SPARK-41530] [SC-120916][SC-118513][CORE] Rename MedianHeap to PercentileMap and support percentile

  • [SPARK-41757] [SC-120608][SPARK-41901][CONNECT] Fix string representation for Column class

  • [SPARK-42084] [SC-120775][SQL] Avoid leaking the qualified-access-only restriction

  • [SPARK-41973] [SC-120765][SQL] Assign name to LEGACYERROR_TEMP_1311

  • [SPARK-42039] [SC-120655][SQL] SPJ: Remove Option in KeyGroupedPartitioning#partitionValuesOpt

  • [SPARK-42079] [SC-120712][CONNECT][PYTHON] Rename proto messages for toDF and withColumnsRenamed

  • [SPARK-42089] [SC-120605][CONNECT][PYTHON] Fix variable name issues in nested lambda functions

  • [SPARK-41982] [SC-120604][SQL] Partitions of type string should not be treated as numeric types

  • [SPARK-40599] [SC-120620][SQL] Add multiTransform methods to TreeNode to generate alternatives

  • [SPARK-42085] [SC-120556][CONNECT][PYTHON] Make from_arrow_schema support nested types

  • [SPARK-42057] [SC-120507][SQL][PROTOBUF] Fix how exception is handled in error reporting.

  • [SPARK-41586] [12.x][ALL TESTS][SC-120544][PYTHON] Introduce pyspark.errors and error classes for PySpark.

  • [SPARK-41903] [SC-120543][CONNECT][PYTHON] Literal should support 1-dim ndarray

  • [SPARK-42021] [SC-120584][CONNECT][PYTHON] Make createDataFrame support array.array

  • [SPARK-41896] [SC-120506][SQL] Filtering by row index returns empty results

  • [SPARK-41162] [SC-119742][SQL] Fix anti- and semi-join for self-join with aggregations

  • [SPARK-41961] [SC-120501][SQL] Support table-valued functions with LATERAL

  • [SPARK-41752] [SC-120550][SQL][UI] Group nested executions under the root execution

  • [SPARK-42047] [SC-120586][SPARK-41900][CONNECT][PYTHON][12.X] Literal should support Numpy datatypes

  • [SPARK-42028] [SC-120344][CONNECT][PYTHON] Truncating nanoseconds timestampsl

  • [SPARK-42011] [SC-120534][CONNECT][PYTHON] Implement DataFrameReader.csv

  • [SPARK-41990] [SC-120532][SQL] Use FieldReference.column instead of apply in V1 to V2 filter conversion

  • [SPARK-39217] [SC-120446][SQL] Makes DPP support the pruning side has Union

  • [SPARK-42076] [SC-120551][CONNECT][PYTHON] Factor data conversion arrow -> rows out to conversion.py

  • [SPARK-42074] [SC-120540][SQL] Enable KryoSerializer in TPCDSQueryBenchmark to enforce SQL class registration

  • [SPARK-42012] [SC-120517][CONNECT][PYTHON] Implement DataFrameReader.orc

  • [SPARK-41832] [SC-120513][CONNECT][PYTHON] Fix DataFrame.unionByName, add allow_missing_columns

  • [SPARK-38651] [SC-120514] [SQL] Add spark.sql.legacy.allowEmptySchemaWrite

  • [SPARK-41991] [SC-120406][SQL] CheckOverflowInTableInsert should accept ExpressionProxy as child

  • [SPARK-41232] [SC-120073][SQL][PYTHON] Adding array_append function

  • [SPARK-42041] [SC-120512][SPARK-42013][CONNECT][PYTHON] DataFrameReader should support list of paths

  • [SPARK-42071] [SC-120533][CORE] Register scala.math.Ordering$Reverse to KyroSerializer

  • [SPARK-41986] [SC-120429][SQL] Introduce shuffle on SinglePartition

  • [SPARK-42016] [SC-120428][CONNECT][PYTHON] Enable tests related to the nested column

  • [SPARK-42042] [SC-120427][CONNECT][PYTHON] DataFrameReader should support StructType schema

  • [SPARK-42031] [SC-120389][CORE][SQL] Clean up remove methods that do not need override

  • [SPARK-41746] [SC-120463][SPARK-41838][SPARK-41837][SPARK-41835][SPARK-41836][SPARK-41847][CONNECT][PYTHON] Make createDataFrame(rows/lists/tuples/dicts) support nested types

  • [SPARK-41437] [SC-117601][SQL][ALL TESTS] Do not optimize the input query twice for v1 write fallback

  • [SPARK-41840] [SC-119719][CONNECT][PYTHON] Add the missing alias groupby

  • [SPARK-41846] [SC-119717][CONNECT][PYTHON] Enable doctests for window functions

  • [SPARK-41914] [SC-120094][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering

  • [SPARK-41805] [SC-119992][SQL] Reuse expressions in WindowSpecDefinition

  • [SPARK-41977] [SC-120269][SPARK-41978][CONNECT] SparkSession.range to take float as arguments

  • [SPARK-42029] [SC-120336][CONNECT] Add Guava Shading rules to connect-common to avoid startup failure

  • [SPARK-41989] [SC-120334][PYTHON] Avoid breaking logging config from pyspark.pandas

  • [SPARK-42003] [SC-120331][SQL] Reduce duplicate code in ResolveGroupByAll

  • [SPARK-41635] [SC-120313][SQL] Fix group by all error reporting

  • [SPARK-41047] [SC-120291][SQL] Improve docs for round

  • [SPARK-41822] [SC-120122][CONNECT] Setup gRPC connection for Scala/JVM client

  • [SPARK-41879] [SC-120264][CONNECT][PYTHON] Make DataFrame.collect support nested types

  • [SPARK-41887] [SC-120268][CONNECT][PYTHON] Make DataFrame.hint accept list typed parameter

  • [SPARK-41964] [SC-120210][CONNECT][PYTHON] Add the list of unsupported IO functions

  • [SPARK-41595] [SC-120097][SQL] Support generator function explode/explode_outer in the FROM clause

  • [SPARK-41957] [SC-120121][CONNECT][PYTHON] Enable the doctest for DataFrame.hint

  • [SPARK-41886] [SC-120141][CONNECT][PYTHON] DataFrame.intersect doctest output has different order

  • [SPARK-41442] [SC-117795][SQL][ALL TESTS] Only update SQLMetric value if merging with valid metric

  • [SPARK-41944] [SC-120046][CONNECT] Pass configurations when local remote mode is on

  • [SPARK-41708] [SC-119838][SQL] Pull v1write information to WriteFiles

  • [SPARK-41780] [SC-120000][SQL] Should throw INVALID_PARAMETER_VALUE.PATTERN when the parameters regexp is invalid

  • [SPARK-41889] [SC-119975][SQL] Attach root cause to invalidPatternError & refactor error classes INVALID_PARAMETER_VALUE

  • [SPARK-41860] [SC-120028][SQL] Make AvroScanBuilder and JsonScanBuilder case classes

  • [SPARK-41945] [SC-120010][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist

  • [SPARK-41690] [SC-119102][SC-119087][SQL][CONNECT] Agnostic Encoders

  • [SPARK-41354] [SC-119995][CONNECT][PYTHON] Implement RepartitionByExpression

  • [SPARK-41581] [SC-119997][SQL] Update _LEGACY_ERROR_TEMP_1230 as INTERNAL_ERROR

  • [SPARK-41928] [SC-119972][CONNECT][PYTHON] Add the unsupported list for functions

  • [SPARK-41933] [SC-119980][CONNECT] Provide local mode that automatically starts the server

  • [SPARK-41899] [SC-119971][CONNECT][PYTHON] createDataFrame` should respect user provided DDL schema

  • [SPARK-41936] [SC-119978][CONNECT][PYTHON] Make withMetadata reuse the withColumns proto

  • [SPARK-41898] [SC-119931][CONNECT][PYTHON] Window.rowsBetween, Window.rangeBetween parameters typechecking parity with pyspark

  • [SPARK-41939] [SC-119977][CONNECT][PYTHON] Add the unsupported list for catalog functions

  • [SPARK-41924] [SC-119946][CONNECT][PYTHON] Make StructType support metadata and Implement DataFrame.withMetadata

  • [SPARK-41934] [SC-119967][CONNECT][PYTHON] Add the unsupported function list for session

  • [SPARK-41875] [SC-119969][CONNECT][PYTHON] Add test cases for Dataset.to()

  • [SPARK-41824] [SC-119970][CONNECT][PYTHON] Ingore the doctest for explain of connect

  • [SPARK-41880] [SC-119959][CONNECT][PYTHON] Make function from_json accept non-literal schema

  • [SPARK-41927] [SC-119952][CONNECT][PYTHON] Add the unsupported list for GroupedData

  • [SPARK-41929] [SC-119949][CONNECT][PYTHON] Add function array_compact

  • [SPARK-41827] [SC-119841][CONNECT][PYTHON] Make GroupBy accept column list

  • [SPARK-41925] [SC-119905][SQL] Enable spark.sql.orc.enableNestedColumnVectorizedReader by default

  • [SPARK-41831] [SC-119853][CONNECT][PYTHON] Make DataFrame.select accept column list

  • [SPARK-41455] [SC-119858][CONNECT][PYTHON] Make DataFrame.collect discard the timezone info

  • [SPARK-41923] [SC-119861][CONNECT][PYTHON] Add DataFrame.writeTo to the unsupported list

  • [SPARK-41912] [SC-119837][SQL] Subquery should not validate CTE

  • [SPARK-41828] [SC-119832][CONNECT][PYTHON][12.X] Make createDataFrame support empty dataframe

  • [SPARK-41905] [SC-119848][CONNECT] Support name as strings in slice

  • [SPARK-41869] [SC-119845][CONNECT] Reject single string in dropDuplicates

  • [SPARK-41830] [SC-119840][CONNECT][PYTHON] Make DataFrame.sample accept the same parameters as PySpark

  • [SPARK-41849] [SC-119835][CONNECT] Implement DataFrameReader.text

  • [SPARK-41861] [SC-119834][SQL] Make v2 ScanBuilders’ build() return typed scan

  • [SPARK-41825] [SC-119710][CONNECT][PYTHON] Enable doctests related to DataFrame.show

  • [SPARK-41855] [SC-119804][SC-119410][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON][12.X] Make createDataFrame handle None/NaN properly

  • [SPARK-41833] [SC-119685][SPARK-41881][SPARK-41815][CONNECT][PYTHON] Make DataFrame.collect handle None/NaN/Array/Binary porperly

  • [SPARK-39318] [SC-119713][SQL] Remove tpch-plan-stability WithStats golden files

  • [SPARK-41791] [SC-119745] Add new file source metadata column types

  • [SPARK-41790] [SC-119729][SQL] Set TRANSFORM reader and writer’s format correctly

  • [SPARK-41829] [SC-119725][CONNECT][PYTHON] Add the missing ordering parameter in Sort and sortWithinPartitions

  • [SPARK-41576] [SC-119718][SQL] Assign name to LEGACYERROR_TEMP_2051

  • [SPARK-41821] [SC-119716][CONNECT][PYTHON] Fix doc test for DataFrame.describe

  • [SPARK-41871] [SC-119714][CONNECT] DataFrame hint parameter can be str, float or int

  • [SPARK-41720] [SC-119076][SQL] Rename UnresolvedFunc to UnresolvedFunctionName

  • [SPARK-41573] [SC-119567][SQL] Assign name to LEGACYERROR_TEMP_2136

  • [SPARK-41862] [SC-119492][SQL] Fix correctness bug related to DEFAULT values in Orc reader

  • [SPARK-41582] [SC-119482][SC-118701][CORE][SQL] Reuse INVALID_TYPED_LITERAL instead of _LEGACY_ERROR_TEMP_0022

System environment

  • Operating System: Ubuntu 20.04.5 LTS

  • Java: Zulu 8.68.0.21-CA-linux64

  • Scala: 2.12.15

  • Python: 3.9.5

  • R: 4.2.2

  • Delta Lake: 2.2.0

Installed Python libraries

Library

Version

Library

Version

Library

Version

argon2-cffi

21.3.0

argon2-cffi-bindings

21.2.0

asttokens

2.0.5

attrs

21.4.0

backcall

0.2.0

backports.entry-points-selectable

1.2.0

beautifulsoup4

4.11.1

black

22.3.0

bleach

4.1.0

boto3

1.21.32

botocore

1.24.32

certifi

2021.10.8

cffi

1.15.0

chardet

4.0.0

charset-normalizer

2.0.4

click

8.0.4

cryptography

3.4.8

cycler

0.11.0

Cython

0.29.28

dbus-python

1.2.16

debugpy

1.5.1

decorator

5.1.1

defusedxml

0.7.1

distlib

0.3.6

docstring-to-markdown

0.11

entrypoints

0.4

executing

0.8.3

facets-overview

1.0.0

fastjsonschema

2.16.2

filelock

3.9.0

fonttools

4.25.0

idna

3.3

ipykernel

6.15.3

ipython

8.5.0

ipython-genutils

0.2.0

ipywidgets

7.7.2

jedi

0.18.1

Jinja2

2.11.3

jmespath

0.10.0

joblib

1.1.1

jsonschema

4.4.0

jupyter-client

6.1.12

jupyter_core

4.11.2

jupyterlab-pygments

0.1.2

jupyterlab-widgets

1.0.0

kiwisolver

1.3.2

MarkupSafe

2.0.1

matplotlib

3.5.1

matplotlib-inline

0.1.2

mccabe

0.7.0

mistune

0.8.4

mypy-extensions

0.4.3

nbclient

0.5.13

nbconvert

6.4.4

nbformat

5.3.0

nest-asyncio

1.5.5

nodeenv

1.7.0

notebook

6.4.8

numpy

1.21.5

packaging

21.3

pandas

1.4.2

pandocfilters

1.5.0

parso

0.8.3

pathspec

0.9.0

patsy

0.5.2

pexpect

4.8.0

pickleshare

0.7.5

Pillow

9.0.1

pip

21.2.4

platformdirs

2.6.2

plotly

5.6.0

pluggy

1.0.0

prometheus-client

0.13.1

prompt-toolkit

3.0.20

protobuf

3.19.4

psutil

5.8.0

psycopg2

2.9.3

ptyprocess

0.7.0

pure-eval

0.2.2

pyarrow

7.0.0

pycparser

2.21

pyflakes

2.5.0

Pygments

2.11.2

PyGObject

3.36.0

pyodbc

4.0.32

pyparsing

3.0.4

pyright

1.1.283

pyrsistent

0.18.0

python-dateutil

2.8.2

python-lsp-jsonrpc

1.0.0

python-lsp-server

1.6.0

pytz

2021.3

pyzmq

22.3.0

requests

2.27.1

requests-unixsocket

0.2.0

rope

0.22.0

s3transfer

0.5.0

scikit-learn

1.0.2

scipy

1.7.3

seaborn

0.11.2

Send2Trash

1.8.0

setuptools

61.2.0

six

1.16.0

soupsieve

2.3.1

ssh-import-id

5.10

stack-data

0.2.0

statsmodels

0.13.2

tenacity

8.0.1

terminado

0.13.1

testpath

0.5.0

threadpoolctl

2.2.0

tokenize-rt

4.2.1

tomli

1.2.2

tornado

6.1

traitlets

5.1.1

typing_extensions

4.1.1

ujson

5.1.0

unattended-upgrades

0.1

urllib3

1.26.9

virtualenv

20.8.0

wcwidth

0.2.5

webencodings

0.5.1

whatthepatch

1.0.4

wheel

0.37.0

widgetsnbextension

3.6.1

yapf

0.31.0

Installed R libraries

R libraries are installed from the Microsoft CRAN snapshot on 2022-11-11.

Library

Version

Library

Version

Library

Version

arrow

10.0.0

askpass

1.1

assertthat

0.2.1

backports

1.4.1

base

4.2.2

base64enc

0.1-3

bit

4.0.4

bit64

4.0.5

blob

1.2.3

boot

1.3-28

brew

1.0-8

brio

1.1.3

broom

1.0.1

bslib

0.4.1

cachem

1.0.6

callr

3.7.3

caret

6.0-93

cellranger

1.1.0

chron

2.3-58

class

7.3-21

cli

3.4.1

clipr

0.8.0

clock

0.6.1

cluster

2.1.4

codetools

0.2-19

colorspace

2.0-3

commonmark

1.8.1

compiler

4.2.2

config

0.3.1

cpp11

0.4.3

crayon

1.5.2

credentials

1.3.2

curl

4.3.3

data.table

1.14.4

datasets

4.2.2

DBI

1.1.3

dbplyr

2.2.1

desc

1.4.2

devtools

2.4.5

diffobj

0.3.5

digest

0.6.30

downlit

0.4.2

dplyr

1.0.10

dtplyr

1.2.2

e1071

1.7-12

ellipsis

0.3.2

evaluate

0.18

fansi

1.0.3

farver

2.1.1

fastmap

1.1.0

fontawesome

0.4.0

forcats

0.5.2

foreach

1.5.2

foreign

0.8-82

forge

0.2.0

fs

1.5.2

future

1.29.0

future.apply

1.10.0

gargle

1.2.1

generics

0.1.3

gert

1.9.1

ggplot2

3.4.0

gh

1.3.1

gitcreds

0.1.2

glmnet

4.1-4

globals

0.16.1

glue

1.6.2

googledrive

2.0.0

googlesheets4

1.0.1

gower

1.0.0

graphics

4.2.2

grDevices

4.2.2

grid

4.2.2

gridExtra

2.3

gsubfn

0.7

gtable

0.3.1

hardhat

1.2.0

haven

2.5.1

highr

0.9

hms

1.1.2

htmltools

0.5.3

htmlwidgets

1.5.4

httpuv

1.6.6

httr

1.4.4

ids

1.0.1

ini

0.3.1

ipred

0.9-13

isoband

0.2.6

iterators

1.0.14

jquerylib

0.1.4

jsonlite

1.8.3

KernSmooth

2.23-20

knitr

1.40

labeling

0.4.2

later

1.3.0

lattice

0.20-45

lava

1.7.0

lifecycle

1.0.3

listenv

0.8.0

lubridate

1.9.0

magrittr

2.0.3

markdown

1.3

MASS

7.3-58.2

Matrix

1.5-1

memoise

2.0.1

methods

4.2.2

mgcv

1.8-41

mime

0.12

miniUI

0.1.1.1

ModelMetrics

1.2.2.2

modelr

0.1.9

munsell

0.5.0

nlme

3.1-162

nnet

7.3-18

numDeriv

2016.8-1.1

openssl

2.0.4

parallel

4.2.2

parallelly

1.32.1

pillar

1.8.1

pkgbuild

1.3.1

pkgconfig

2.0.3

pkgdown

2.0.6

pkgload

1.3.1

plogr

0.2.0

plyr

1.8.7

praise

1.0.0

prettyunits

1.1.1

pROC

1.18.0

processx

3.8.0

prodlim

2019.11.13

profvis

0.3.7

progress

1.2.2

progressr

0.11.0

promises

1.2.0.1

proto

1.0.0

proxy

0.4-27

ps

1.7.2

purrr

0.3.5

r2d3

0.2.6

R6

2.5.1

ragg

1.2.4

randomForest

4.7-1.1

rappdirs

0.3.3

rcmdcheck

1.4.0

RColorBrewer

1.1-3

Rcpp

1.0.9

RcppEigen

0.3.3.9.3

readr

2.1.3

readxl

1.4.1

recipes

1.0.3

rematch

1.0.1

rematch2

2.1.2

remotes

2.4.2

reprex

2.0.2

reshape2

1.4.4

rlang

1.0.6

rmarkdown

2.18

RODBC

1.3-19

roxygen2

7.2.1

rpart

4.1.19

rprojroot

2.0.3

Rserve

1.8-12

RSQLite

2.2.18

rstudioapi

0.14

rversions

2.1.2

rvest

1.0.3

sass

0.4.2

scales

1.2.1

selectr

0.4-2

sessioninfo

1.2.2

shape

1.4.6

shiny

1.7.3

sourcetools

0.1.7

sparklyr

1.7.8

SparkR

3.3.2

spatial

7.3-11

splines

4.2.2

sqldf

0.4-11

SQUAREM

2021.1

stats

4.2.2

stats4

4.2.2

stringi

1.7.8

stringr

1.4.1

survival

3.4-0

sys

3.4.1

systemfonts

1.0.4

tcltk

4.2.2

testthat

3.1.5

textshaping

0.3.6

tibble

3.1.8

tidyr

1.2.1

tidyselect

1.2.0

tidyverse

1.3.2

timechange

0.1.1

timeDate

4021.106

tinytex

0.42

tools

4.2.2

tzdb

0.3.0

urlchecker

1.0.1

usethis

2.1.6

utf8

1.2.2

utils

4.2.2

uuid

1.1-0

vctrs

0.5.0

viridisLite

0.4.1

vroom

1.6.0

waldo

0.4.0

whisker

0.4

withr

2.5.0

xfun

0.34

xml2

1.3.3

xopen

1.0.0

xtable

1.8-4

yaml

2.3.6

zip

2.2.2

Installed Java and Scala libraries (Scala 2.12 cluster version)

Group ID

Artifact ID

Version

antlr

antlr

2.7.7

com.amazonaws

amazon-kinesis-client

1.12.0

com.amazonaws

aws-java-sdk-autoscaling

1.12.189

com.amazonaws

aws-java-sdk-cloudformation

1.12.189

com.amazonaws

aws-java-sdk-cloudfront

1.12.189

com.amazonaws

aws-java-sdk-cloudhsm

1.12.189

com.amazonaws

aws-java-sdk-cloudsearch

1.12.189

com.amazonaws

aws-java-sdk-cloudtrail

1.12.189

com.amazonaws

aws-java-sdk-cloudwatch

1.12.189

com.amazonaws

aws-java-sdk-cloudwatchmetrics

1.12.189

com.amazonaws

aws-java-sdk-codedeploy

1.12.189

com.amazonaws

aws-java-sdk-cognitoidentity

1.12.189

com.amazonaws

aws-java-sdk-cognitosync

1.12.189

com.amazonaws

aws-java-sdk-config

1.12.189

com.amazonaws

aws-java-sdk-core

1.12.189

com.amazonaws

aws-java-sdk-datapipeline

1.12.189

com.amazonaws

aws-java-sdk-directconnect

1.12.189

com.amazonaws

aws-java-sdk-directory

1.12.189

com.amazonaws

aws-java-sdk-dynamodb

1.12.189

com.amazonaws

aws-java-sdk-ec2

1.12.189

com.amazonaws

aws-java-sdk-ecs

1.12.189

com.amazonaws

aws-java-sdk-efs

1.12.189

com.amazonaws

aws-java-sdk-elasticache

1.12.189

com.amazonaws

aws-java-sdk-elasticbeanstalk

1.12.189

com.amazonaws

aws-java-sdk-elasticloadbalancing

1.12.189

com.amazonaws

aws-java-sdk-elastictranscoder

1.12.189

com.amazonaws

aws-java-sdk-emr

1.12.189

com.amazonaws

aws-java-sdk-glacier

1.12.189

com.amazonaws

aws-java-sdk-glue

1.12.189

com.amazonaws

aws-java-sdk-iam

1.12.189

com.amazonaws

aws-java-sdk-importexport

1.12.189

com.amazonaws

aws-java-sdk-kinesis

1.12.189

com.amazonaws

aws-java-sdk-kms

1.12.189

com.amazonaws

aws-java-sdk-lambda

1.12.189

com.amazonaws

aws-java-sdk-logs

1.12.189

com.amazonaws

aws-java-sdk-machinelearning

1.12.189

com.amazonaws

aws-java-sdk-opsworks

1.12.189

com.amazonaws

aws-java-sdk-rds

1.12.189

com.amazonaws

aws-java-sdk-redshift

1.12.189

com.amazonaws

aws-java-sdk-route53

1.12.189

com.amazonaws

aws-java-sdk-s3

1.12.189

com.amazonaws

aws-java-sdk-ses

1.12.189

com.amazonaws

aws-java-sdk-simpledb

1.12.189

com.amazonaws

aws-java-sdk-simpleworkflow

1.12.189

com.amazonaws

aws-java-sdk-sns

1.12.189

com.amazonaws

aws-java-sdk-sqs

1.12.189

com.amazonaws

aws-java-sdk-ssm

1.12.189

com.amazonaws

aws-java-sdk-storagegateway

1.12.189

com.amazonaws

aws-java-sdk-sts

1.12.189

com.amazonaws

aws-java-sdk-support

1.12.189

com.amazonaws

aws-java-sdk-swf-libraries

1.11.22

com.amazonaws

aws-java-sdk-workspaces

1.12.189

com.amazonaws

jmespath-java

1.12.189

com.chuusai

shapeless_2.12

2.3.3

com.clearspring.analytics

stream

2.9.6

com.databricks

Rserve

1.8-3

com.databricks

jets3t

0.7.1-0

com.databricks.scalapb

compilerplugin_2.12

0.4.15-10

com.databricks.scalapb

scalapb-runtime_2.12

0.4.15-10

com.esotericsoftware

kryo-shaded

4.0.2

com.esotericsoftware

minlog

1.3.0

com.fasterxml

classmate

1.3.4

com.fasterxml.jackson.core

jackson-annotations

2.13.4

com.fasterxml.jackson.core

jackson-core

2.13.4

com.fasterxml.jackson.core

jackson-databind

2.13.4.2

com.fasterxml.jackson.dataformat

jackson-dataformat-cbor

2.13.4

com.fasterxml.jackson.datatype

jackson-datatype-joda

2.13.4

com.fasterxml.jackson.datatype

jackson-datatype-jsr310

2.13.4

com.fasterxml.jackson.module

jackson-module-paranamer

2.13.4

com.fasterxml.jackson.module

jackson-module-scala_2.12

2.13.4

com.github.ben-manes.caffeine

caffeine

2.3.4

com.github.fommil

jniloader

1.1

com.github.fommil.netlib

core

1.1.2

com.github.fommil.netlib

native_ref-java

1.1

com.github.fommil.netlib

native_ref-java-natives

1.1

com.github.fommil.netlib

native_system-java

1.1

com.github.fommil.netlib

native_system-java-natives

1.1

com.github.fommil.netlib

netlib-native_ref-linux-x86_64-natives

1.1

com.github.fommil.netlib

netlib-native_system-linux-x86_64-natives

1.1

com.github.luben

zstd-jni

1.5.2-1

com.github.wendykierp

JTransforms

3.1

com.google.code.findbugs

jsr305

3.0.0

com.google.code.gson

gson

2.8.6

com.google.crypto.tink

tink

1.6.1

com.google.flatbuffers

flatbuffers-java

1.12.0

com.google.guava

guava

15.0

com.google.protobuf

protobuf-java

2.6.1

com.h2database

h2

2.0.204

com.helger

profiler

1.1.1

com.jcraft

jsch

0.1.50

com.jolbox

bonecp

0.8.0.RELEASE

com.lihaoyi

sourcecode_2.12

0.1.9

com.microsoft.azure

azure-data-lake-store-sdk

2.3.9

com.microsoft.sqlserver

mssql-jdbc

11.2.2.jre8

com.ning

compress-lzf

1.1

com.sun.mail

javax.mail

1.5.2

com.tdunning

json

1.8

com.thoughtworks.paranamer

paranamer

2.8

com.trueaccord.lenses

lenses_2.12

0.4.12

com.twitter

chill-java

0.10.0

com.twitter

chill_2.12

0.10.0

com.twitter

util-app_2.12

7.1.0

com.twitter

util-core_2.12

7.1.0

com.twitter

util-function_2.12

7.1.0

com.twitter

util-jvm_2.12

7.1.0

com.twitter

util-lint_2.12

7.1.0

com.twitter

util-registry_2.12

7.1.0

com.twitter

util-stats_2.12

7.1.0

com.typesafe

config

1.2.1

com.typesafe.scala-logging

scala-logging_2.12

3.7.2

com.uber

h3

3.7.0

com.univocity

univocity-parsers

2.9.1

com.zaxxer

HikariCP

4.0.3

commons-cli

commons-cli

1.5.0

commons-codec

commons-codec

1.15

commons-collections

commons-collections

3.2.2

commons-dbcp

commons-dbcp

1.4

commons-fileupload

commons-fileupload

1.3.3

commons-httpclient

commons-httpclient

3.1

commons-io

commons-io

2.11.0

commons-lang

commons-lang

2.6

commons-logging

commons-logging

1.1.3

commons-pool

commons-pool

1.5.4

dev.ludovic.netlib

arpack

2.2.1

dev.ludovic.netlib

blas

2.2.1

dev.ludovic.netlib

lapack

2.2.1

info.ganglia.gmetric4j

gmetric4j

1.0.10

io.airlift

aircompressor

0.21

io.delta

delta-sharing-spark_2.12

0.6.3

io.dropwizard.metrics

metrics-core

4.1.1

io.dropwizard.metrics

metrics-graphite

4.1.1

io.dropwizard.metrics

metrics-healthchecks

4.1.1

io.dropwizard.metrics

metrics-jetty9

4.1.1

io.dropwizard.metrics

metrics-jmx

4.1.1

io.dropwizard.metrics

metrics-json

4.1.1

io.dropwizard.metrics

metrics-jvm

4.1.1

io.dropwizard.metrics

metrics-servlets

4.1.1

io.netty

netty-all

4.1.74.Final

io.netty

netty-buffer

4.1.74.Final

io.netty

netty-codec

4.1.74.Final

io.netty

netty-common

4.1.74.Final

io.netty

netty-handler

4.1.74.Final

io.netty

netty-resolver

4.1.74.Final

io.netty

netty-tcnative-classes

2.0.48.Final

io.netty

netty-transport

4.1.74.Final

io.netty

netty-transport-classes-epoll

4.1.74.Final

io.netty

netty-transport-classes-kqueue

4.1.74.Final

io.netty

netty-transport-native-epoll-linux-aarch_64

4.1.74.Final

io.netty

netty-transport-native-epoll-linux-x86_64

4.1.74.Final

io.netty

netty-transport-native-kqueue-osx-aarch_64

4.1.74.Final

io.netty

netty-transport-native-kqueue-osx-x86_64

4.1.74.Final

io.netty

netty-transport-native-unix-common

4.1.74.Final

io.prometheus

simpleclient

0.7.0

io.prometheus

simpleclient_common

0.7.0

io.prometheus

simpleclient_dropwizard

0.7.0

io.prometheus

simpleclient_pushgateway

0.7.0

io.prometheus

simpleclient_servlet

0.7.0

io.prometheus.jmx

collector

0.12.0

jakarta.annotation

jakarta.annotation-api

1.3.5

jakarta.servlet

jakarta.servlet-api

4.0.3

jakarta.validation

jakarta.validation-api

2.0.2

jakarta.ws.rs

jakarta.ws.rs-api

2.1.6

javax.activation

activation

1.1.1

javax.el

javax.el-api

2.2.4

javax.jdo

jdo-api

3.0.1

javax.transaction

jta

1.1

javax.transaction

transaction-api

1.1

javax.xml.bind

jaxb-api

2.2.11

javolution

javolution

5.5.1

jline

jline

2.14.6

joda-time

joda-time

2.10.13

net.java.dev.jna

jna

5.8.0

net.razorvine

pickle

1.2

net.sf.jpam

jpam

1.1

net.sf.opencsv

opencsv

2.3

net.sf.supercsv

super-csv

2.2.0

net.snowflake

snowflake-ingest-sdk

0.9.6

net.snowflake

snowflake-jdbc

3.13.22

net.sourceforge.f2j

arpack_combined_all

0.1

org.acplt.remotetea

remotetea-oncrpc

1.1.2

org.antlr

ST4

4.0.4

org.antlr

antlr-runtime

3.5.2

org.antlr

antlr4-runtime

4.8

org.antlr

stringtemplate

3.2.1

org.apache.ant

ant

1.9.2

org.apache.ant

ant-jsch

1.9.2

org.apache.ant

ant-launcher

1.9.2

org.apache.arrow

arrow-format

7.0.0

org.apache.arrow

arrow-memory-core

7.0.0

org.apache.arrow

arrow-memory-netty

7.0.0

org.apache.arrow

arrow-vector

7.0.0

org.apache.avro

avro

1.11.0

org.apache.avro

avro-ipc

1.11.0

org.apache.avro

avro-mapred

1.11.0

org.apache.commons

commons-collections4

4.4

org.apache.commons

commons-compress

1.21

org.apache.commons

commons-crypto

1.1.0

org.apache.commons

commons-lang3

3.12.0

org.apache.commons

commons-math3

3.6.1

org.apache.commons

commons-text

1.10.0

org.apache.curator

curator-client

2.13.0

org.apache.curator

curator-framework

2.13.0

org.apache.curator

curator-recipes

2.13.0

org.apache.derby

derby

10.14.2.0

org.apache.hadoop

hadoop-client-api

3.3.4-databricks

org.apache.hadoop

hadoop-client-runtime

3.3.4

org.apache.hive

hive-beeline

2.3.9

org.apache.hive

hive-cli

2.3.9

org.apache.hive

hive-jdbc

2.3.9

org.apache.hive

hive-llap-client

2.3.9

org.apache.hive

hive-llap-common

2.3.9

org.apache.hive

hive-serde

2.3.9

org.apache.hive

hive-shims

2.3.9

org.apache.hive

hive-storage-api

2.8.1

org.apache.hive.shims

hive-shims-0.23

2.3.9

org.apache.hive.shims

hive-shims-common

2.3.9

org.apache.hive.shims

hive-shims-scheduler

2.3.9

org.apache.httpcomponents

httpclient

4.5.13

org.apache.httpcomponents

httpcore

4.4.14

org.apache.ivy

ivy

2.5.0

org.apache.logging.log4j

log4j-1.2-api

2.18.0

org.apache.logging.log4j

log4j-api

2.18.0

org.apache.logging.log4j

log4j-core

2.18.0

org.apache.logging.log4j

log4j-slf4j-impl

2.18.0

org.apache.mesos

mesos-shaded-protobuf

1.4.0

org.apache.orc

orc-core

1.7.6

org.apache.orc

orc-mapreduce

1.7.6

org.apache.orc

orc-shims

1.7.6

org.apache.parquet

parquet-column

1.12.3-databricks-0002

org.apache.parquet

parquet-common

1.12.3-databricks-0002

org.apache.parquet

parquet-encoding

1.12.3-databricks-0002

org.apache.parquet

parquet-format-structures

1.12.3-databricks-0002

org.apache.parquet

parquet-hadoop

1.12.3-databricks-0002

org.apache.parquet

parquet-jackson

1.12.3-databricks-0002

org.apache.thrift

libfb303

0.9.3

org.apache.thrift

libthrift

0.12.0

org.apache.xbean

xbean-asm9-shaded

4.20

org.apache.yetus

audience-annotations

0.13.0

org.apache.zookeeper

zookeeper

3.6.2

org.apache.zookeeper

zookeeper-jute

3.6.2

org.checkerframework

checker-qual

3.5.0

org.codehaus.jackson

jackson-core-asl

1.9.13

org.codehaus.jackson

jackson-mapper-asl

1.9.13

org.codehaus.janino

commons-compiler

3.0.16

org.codehaus.janino

janino

3.0.16

org.datanucleus

datanucleus-api-jdo

4.2.4

org.datanucleus

datanucleus-core

4.1.17

org.datanucleus

datanucleus-rdbms

4.1.19

org.datanucleus

javax.jdo

3.2.0-m3

org.eclipse.jetty

jetty-client

9.4.46.v20220331

org.eclipse.jetty

jetty-continuation

9.4.46.v20220331

org.eclipse.jetty

jetty-http

9.4.46.v20220331

org.eclipse.jetty

jetty-io

9.4.46.v20220331

org.eclipse.jetty

jetty-jndi

9.4.46.v20220331

org.eclipse.jetty

jetty-plus

9.4.46.v20220331

org.eclipse.jetty

jetty-proxy

9.4.46.v20220331

org.eclipse.jetty

jetty-security

9.4.46.v20220331

org.eclipse.jetty

jetty-server

9.4.46.v20220331

org.eclipse.jetty

jetty-servlet

9.4.46.v20220331

org.eclipse.jetty

jetty-servlets

9.4.46.v20220331

org.eclipse.jetty

jetty-util

9.4.46.v20220331

org.eclipse.jetty

jetty-util-ajax

9.4.46.v20220331

org.eclipse.jetty

jetty-webapp

9.4.46.v20220331

org.eclipse.jetty

jetty-xml

9.4.46.v20220331

org.eclipse.jetty.websocket

websocket-api

9.4.46.v20220331

org.eclipse.jetty.websocket

websocket-client

9.4.46.v20220331

org.eclipse.jetty.websocket

websocket-common

9.4.46.v20220331

org.eclipse.jetty.websocket

websocket-server

9.4.46.v20220331

org.eclipse.jetty.websocket

websocket-servlet

9.4.46.v20220331

org.fusesource.leveldbjni

leveldbjni-all

1.8

org.glassfish.hk2

hk2-api

2.6.1

org.glassfish.hk2

hk2-locator

2.6.1

org.glassfish.hk2

hk2-utils

2.6.1

org.glassfish.hk2

osgi-resource-locator

1.0.3

org.glassfish.hk2.external

aopalliance-repackaged

2.6.1

org.glassfish.hk2.external

jakarta.inject

2.6.1

org.glassfish.jersey.containers

jersey-container-servlet

2.36

org.glassfish.jersey.containers

jersey-container-servlet-core

2.36

org.glassfish.jersey.core

jersey-client

2.36

org.glassfish.jersey.core

jersey-common

2.36

org.glassfish.jersey.core

jersey-server

2.36

org.glassfish.jersey.inject

jersey-hk2

2.36

org.hibernate.validator

hibernate-validator

6.1.0.Final

org.javassist

javassist

3.25.0-GA

org.jboss.logging

jboss-logging

3.3.2.Final

org.jdbi

jdbi

2.63.1

org.jetbrains

annotations

17.0.0

org.joda

joda-convert

1.7

org.jodd

jodd-core

3.5.2

org.json4s

json4s-ast_2.12

3.7.0-M11

org.json4s

json4s-core_2.12

3.7.0-M11

org.json4s

json4s-jackson_2.12

3.7.0-M11

org.json4s

json4s-scalap_2.12

3.7.0-M11

org.lz4

lz4-java

1.8.0

org.mariadb.jdbc

mariadb-java-client

2.7.4

org.mlflow

mlflow-spark

2.1.1

org.objenesis

objenesis

2.5.1

org.postgresql

postgresql

42.3.3

org.roaringbitmap

RoaringBitmap

0.9.25

org.roaringbitmap

shims

0.9.25

org.rocksdb

rocksdbjni

6.28.2

org.rosuda.REngine

REngine

2.1.0

org.scala-lang

scala-compiler_2.12

2.12.14

org.scala-lang

scala-library_2.12

2.12.14

org.scala-lang

scala-reflect_2.12

2.12.14

org.scala-lang.modules

scala-collection-compat_2.12

2.4.3

org.scala-lang.modules

scala-parser-combinators_2.12

1.1.2

org.scala-lang.modules

scala-xml_2.12

1.2.0

org.scala-sbt

test-interface

1.0

org.scalacheck

scalacheck_2.12

1.14.2

org.scalactic

scalactic_2.12

3.0.8

org.scalanlp

breeze-macros_2.12

1.2

org.scalanlp

breeze_2.12

1.2

org.scalatest

scalatest_2.12

3.0.8

org.slf4j

jcl-over-slf4j

1.7.36

org.slf4j

jul-to-slf4j

1.7.36

org.slf4j

slf4j-api

1.7.36

org.spark-project.spark

unused

1.0.0

org.threeten

threeten-extra

1.5.0

org.tukaani

xz

1.9

org.typelevel

algebra_2.12

2.0.1

org.typelevel

cats-kernel_2.12

2.1.1

org.typelevel

macro-compat_2.12

1.1.1

org.typelevel

spire-macros_2.12

0.17.0

org.typelevel

spire-platform_2.12

0.17.0

org.typelevel

spire-util_2.12

0.17.0

org.typelevel

spire_2.12

0.17.0

org.wildfly.openssl

wildfly-openssl

1.0.7.Final

org.xerial

sqlite-jdbc

3.8.11.2

org.xerial.snappy

snappy-java

1.1.8.4

org.yaml

snakeyaml

1.24

oro

oro

2.0.8

pl.edu.icm

JLargeArrays

1.5

software.amazon.cryptools

AmazonCorrettoCryptoProvider

1.6.1-linux-x86_64

software.amazon.ion

ion-java

1.0.2

stax

stax-api

1.0.1