2.1.0-db3 Cluster Image

Databricks released this image in late March, 2017.

Important

This release has been deprecated. For more information about the Databricks Runtime deprecation policy and schedule, see Databricks Runtime Versioning and Deprecation Policy.

The following release notes provide information about the Spark 2.1.0-db3 cluster image powered by Apache Spark.

Changes and Improvements

  • Query Watchdog

The goal of this feature is creating an automatic mechanism for detecting and terminating queries, which produce excessive amount of rows at some point during their execution. This sometimes happens when clients unintentionally include joins create an explosion of data size (e.g., Cartesian products) in their queries. Please refer to the Handling Large Queries in Interactive Workflows page for more details.

  • Improved query push-down capability for the Redshift connector

The Redshift connector can now push more operations like filter, project, sort, and limit down to Redshift in many cases for better performance. See documentation for query pushdown in Query pushdown into Redshift.

  • Automatic JDBC SSL configuration for the Redshift connector

The Redshift connector now automatically configures SSL encryption for its JDBC connections. To disable automatic SSL configuration, use .option("autoenablessl", "false") on your DataFrameReader/Writer or put ?ssl=false in your JDBC connection string. For more information, see the Encryption section of the Redshift connector documentation.

  • Removed virtual memory limit on R processes in Databricks.

Previously R processes were launched with 10GB virtual memory constraint. With much larger instances and more need for single node analysis in R notebooks, we removed that limit.

  • Bug fixes and stability improvements to Spark.

Apache Spark

The 2.1.0-db3 cluster image includes Apache Spark 2.1.0. In addition to 2.1.0-db2 Cluster Image, the 2.1.0-db3 cluster image also includes the following extra bug fixes and improvements made to Spark:

  • [SPARK-19218] SET -v should not cause a NPE when a configuration uses NULL as its default value
  • [SPARK-19650] Commands should not trigger a Spark job
  • [SPARK-19682][SPARKR] Issue warning (or error) when subset method “[[” takes vector index
  • [SPARK-19652][UI] Do auth checks for REST API access (branch-2.1).
  • [SPARK-19617][SS] Fix the race condition when starting and stopping a query quickly (branch-2.1)
  • Open up visibility. This is a subset of upstream SPARK-19669.
  • [SPARK-19646][CORE][STREAMING] binaryRecords replicates records in scala API
  • [SPARK-18975][CORE] Add an API to remove SparkListener
  • [SPARK-19447] Make Range operator generate “recordsRead” metric
  • [SPARK-19517][SS] KafkaSource fails to initialize partition offsets
  • [SPARK-19500] [SQL] Fix off-by-one bug in BytesToBytesMap
  • [SPARK-19622][WEBUI] Fix a http error in a paged table when using a Go button to search.
  • [SPARK-15214][SQL] Code-generation for Generate
  • [SPARK-19603][SS] Fix StreamingQuery explain command
  • [SPARK-19599][SS] Clean up HDFSMetadataLog
  • [SPARK-18555][SQL] DataFrameNaFunctions.fill miss up original values …
  • [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column
  • [SPARK-19607] Finding QueryExecution that matches provided executionId
  • [SPARK-19584][SS][DOCS] update structured streaming documentation around batch mode
  • [SPARK-19387][SPARKR] Tests do not run with SparkR source package in CRAN check
  • [SPARK-19459][SQL] Add Hive datatype (char/varchar) to StructField metadata
  • [SPARK-19548][SQL] Support Hive UDFs which return typed Lists/Maps
  • [SPARK-19585][DOC][SQL] Fix the cacheTable and uncacheTable api call in the doc
  • [SPARK-19520][STREAMING] Do not encrypt data written to the WAL.
  • [SPARK-19529] TransportClientFactory.createClient() shouldn’t call awaitUninterruptibly()
  • [HOTFIX]`[SPARK-19542] <https://issues.apache.org/jira/browse/SPARK-19542>`_[SS]Fix the missing import in DataStreamReaderWriterSuite
  • [SPARK-17714][CORE][TEST-MAVEN][TEST-HADOOP2.6] Avoid using ExecutorClassLoader to load Netty generated classes
  • [SPARK-19542][SS] Delete the temp checkpoint if a query is stopped without errors
  • [SPARK-19514] Enhancing the test for Range interruption.
  • [SPARK-19506][ML][PYTHON] Import warnings in pyspark.ml.util
  • [SPARK-19574][ML][DOCUMENTATION] Fix Liquid Exception: Start indices amount is not equal to end indices amount
  • [SPARK-19564][SPARK-19559][SS][KAFKA] KafkaOffsetReader’s consumers should not be in the same group
  • [SPARK-19319][BACKPORT-2.1][SPARKR] SparkR Kmeans summary returns error when the cluster size doesn’t equal to k
  • [SPARK-19342][SPARKR] bug fixed in collect method for collecting timestamp column
  • [SPARK-18717][SQL] Make code generation for Scala Map work with immutable.Map also
  • [SPARK-19549] Allow providing reason for stage/job cancelling
  • [SPARK-19543] from_json fails when the input row is empty
  • [SPARK-19512][BACKPORT-2.1][SQL] codegen for compare structs fails #16852
  • [SPARK-19509][SQL] Grouping Sets do not respect nullable grouping columns
  • [SPARK-19481] [REPL] [MAVEN] Avoid to leak SparkContext in Signaling.cancelOnInterrupt
  • [SPARK-19514] Making range interruptible.
  • [SPARK-19495][SQL] Make SQLConf slightly more extensible
  • [SPARK-18609][SPARK-18841][SQL][BACKPORT-2.1] Fix redundant Alias removal in the optimizer
  • [SPARK-19499][SS] Add more notes in the comments of Sink.addBatch()
  • [SPARK-18682][SS] Batch Source for Kafka
  • Revert “[SPARK-18609][SPARK-18841][SQL] Fix redundant Alias removal in the optimizer”
  • [SPARK-18609][SPARK-18841][SQL] Fix redundant Alias removal in the optimizer
  • [SPARK-18362][SQL] Use TextFileFormat in implementation of CSVFileFormat
  • [SPARK-19444][ML][DOCUMENTATION] Fix imports not being present in documentation
  • [SPARK-19407][SS] defaultFS is used FileSystem.get instead of getting it from uri scheme
  • [SPARK-19472][SQL] Parser should not mistake CASE WHEN(...) for a function call
  • [SPARK-19432][CORE] Fix an unexpected failure when connecting timeout
  • [SPARK-19377][WEBUI][CORE] Killed tasks should have the status as KILLED
  • [SPARK-19410][DOC] Fix brokens links in ml-pipeline and ml-tuning
  • [SPARK-19378][SS] Ensure continuity of stateOperator and eventTime metrics even if there is no new data in trigger
  • [SPARK-19406][SQL] Fix function to_json to respect user-provided options
  • [SPARK-19396][DOC] JDBC Options are Case In-sensitive
  • [SPARK-19324][SPARKR] Spark VJM stdout output is getting dropped in SparkR
  • [SPARK-19333][SPARKR] Add Apache License headers to R files
  • [SPARK-18788][SPARKR] Add API for getNumPartitions
  • [SPARK-19220][UI] Make redirection to HTTPS apply to all URIs. (branch-2.1)
  • [SPARK-19338][SQL] Add UDF names in explain
  • [SPARK-14804][SPARK][GRAPHX] Fix checkpointing of VertexRDD/EdgeRDD
  • [SPARK-19064][PYSPARK] Fix pip installing of sub components
  • [SPARK-19307][PYSPARK] Make sure user conf is propagated to SparkContext.
  • [SPARK-18863][SQL] Output non-aggregate expressions without GROUP BY in a subquery does not yield an error
  • [SPARK-16046][DOCS] Aggregations in the Spark SQL programming guide
  • [SPARK-19330][DSTREAMS] Also show tooltip for successful batches
  • [SPARK-19017][SQL] NOT IN subquery with more than one column may return incorrect results
  • [SPARK-16473][MLLIB] Fix BisectingKMeans Algorithm failing in edge case
  • [SPARK-18823][SPARKR] add support for assigning to column
  • [SPARK-19268][SS] Disallow adaptive query execution for streaming queries
  • [SPARK-9435][SQL] Reuse function in Java UDF to correctly support expressions that require equality comparison between ScalaUDF
  • [SPARK-19306][CORE] Fix inconsistent state in DiskBlockObject when expection occurred
  • [SPARK-19155][ML] Make family case insensitive in GLM
  • [SPARK-19155][ML] MLlib GeneralizedLinearRegression family and link should case insensitive
  • [SPARK-19267][SS] Fix a race condition when stopping StateStore
  • [SPARK-18589][SQL] Fix Python UDF accessing attributes from both side of join

Known Issues

  • Log links on the executor page are not set correctly. Please use the worker page to access stdout and stderr links of an executor for now.

System Environment

  • Operating System: Ubuntu 16.04.1 LTS
  • Java: 1.8.0_111
  • Scala: 2.10.6 (Scala 2.10 cluster version)/2.11.8 (Scala 2.11 cluster version)
  • Python: 2.7.12 (or 3.5.2 if Python 3 support is enabled)
  • R: R version 3.2.3 (2015-12-10)

Pre-installed Python Libraries

Library Version Library Version Library Version
ansi2html 1.1.1 argparse 1.2.1 boto 2.42.0
boto3 1.4.1 botocore 1.4.70 brewer2mpl 1.4.1
certifi 2016.2.28 cffi 1.7.0 chardet 2.3.0
colorama 0.3.7 configobj 5.0.6 cryptography 1.5
cycler 0.10.0 Cython 0.24.1 decorator 4.0.10
docutils 0.13.1 enum34 1.1.6 et-xmlfile 1.0.1
freetype-py 1.0.2 funcsigs 1.0.2 fusepy 2.0.4
futures 3.0.5 ggplot 0.6.8 html5lib 0.999
idna 2.1 ipaddress 1.0.16 ipython 2.2.0
ipython-genutils 0.1.0 jdcal 1.2 Jinja2 2.8
jmespath 0.9.0 llvmlite 0.13.0 lxml 3.6.4
MarkupSafe 0.23 matplotlib 1.5.3 mpld3 0.2
msgpack-python 0.4.7 ndg-httpsclient 0.3.3 numba 0.28.1
numpy 1.11.1 openpyxl 2.3.2 pandas 0.18.1
pathlib2 2.1.0 patsy 0.4.1 pexpect 4.0.1
pickleshare 0.7.4 Pillow 3.3.1 pip 9.0.1
pkg_resources 0.0.0 ply 3.9 prompt-toolkit 1.0.7
psycopg2 2.6.2 ptyprocess 0.5.1 py4j 0.10.3
pyasn1 0.1.9 pycparser 2.14 Pygments 2.1.3
PyGObject 3.20.0 pyOpenSSL 16.0.0 pyparsing 2.1.4
pypng 0.0.18 Python 2.7.12 python-dateutil 2.5.3
python-geohash 0.8.5 pytz 2016.6.1 requests 2.11.1
s3transfer 0.1.9 scikit-learn 0.17.1 scipy 0.18.1
scour 0.32 seaborn 0.7.1 setuptools 32.3.1
simplejson 3.8.2 simples3 1.0 singledispatch 3.4.0.3
six 1.10.0 statsmodels 0.6.1 traitlets 4.3.0
urllib3 1.19.1 virtualenv 15.0.1 wcwidth 0.1.7
wheel 0.30.0a0 wsgiref 0.1.2    

Pre-installed R Libraries

Library Version Library Version Library Version
abind 1.4-3 assertthat 0.1 base 3.2.3
BH 1.60.0-2 bitops 1.0-6 boot 1.3-17
brew 1.0-6 car 2.1-3 caret 6.0-71
chron 2.3-47 class 7.3-14 cluster 2.0.5
codetools 0.2-14 colorspace 1.2-4 compiler 3.2.3
crayon 1.3.1 curl 2.2 data.table 1.9.6
datasets 3.2.3 DBI 0.5-1 devtools 1.12.0
dichromat 2.0-0 digest 0.6.9 doMC 1.3.4
dplyr 0.5.0 foreach 1.4.3 foreign 0.8-66
gbm 2.1.1 ggplot2 2.1.0 git2r 0.15.0
glmnet 2.0-5 graphics 3.2.3 grDevices 3.2.3
grid 3.2.3 gsubfn 0.6-6 gtable 0.1.2
h2o 3.10.0.8 httr 1.2.1 hwriter 1.3.2
hwriterPlus 1.0-3 iterators 1.0.8 jsonlite 1.1
KernSmooth 2.23-15 labeling 0.3 lattice 0.20-34
lazyeval 0.2.0 littler 0.3.0 lme4 1.1-12
lubridate 1.6.0 magrittr 1.5 mapproj 1.2-4
maps 3.0.2 MASS 7.3-45 Matrix 1.2-7.1
MatrixModels 0.4-1 memoise 1.0.0 methods 3.2.3
mgcv 1.8-11 mime 0.5 minqa 1.2.4
multicore 0.2 munsell 0.4.2 mvtnorm 1.0-5
nlme 3.1-124 nloptr 1.0.4 nnet 7.3-12
openssl 0.9.4 parallel 3.2.3 pbkrtest 0.4-6
pkgKitten 0.1.3 plyr 1.8.4 praise 1.0.0
pROC 1.8 proto 0.3-10 quantreg 5.29
R.methodsS3 1.7.1 R.oo 1.20.0 R.utils 2.4.0
R6 2.2.0 randomForest 4.6-12 RColorBrewer 1.1-2
Rcpp 0.12.7 RcppEigen 0.3.2.9.0 RCurl 1.95-4.8
reshape2 1.4.2 RODBC 1.3-12 roxygen2 5.0.1
rpart 4.1-10 Rserve 1.7-3 RSQLite 1.0.0
rstudioapi 0.6 scales 0.3.0 sp 1.0-15
SparkR 2.1.0 SparseM 1.72 spatial 7.3-11
splines 3.2.3 sqldf 0.4-10 statmod 1.4.26
stats 3.2.3 stats4 3.2.3 stringi 1.0-1
stringr 1.0.0 survival 2.38-3 tcltk 3.2.3
TeachingDemos 2.10 testthat 1.0.2 tibble 1.2
tools 3.2.3 utils 3.2.3 whisker 0.3-2
withr 1.0.2        

Pre-installed Java and Scala libraries (Scala 2.10 cluster version)

Group ID Artifact ID Version
antlr antlr 2.7.7
com.amazonaws aws-java-sdk 1.9.40
com.amazonaws aws-java-sdk-autoscaling 1.9.40
com.amazonaws aws-java-sdk-cloudformation 1.9.40
com.amazonaws aws-java-sdk-cloudfront 1.9.40
com.amazonaws aws-java-sdk-cloudhsm 1.9.40
com.amazonaws aws-java-sdk-cloudsearch 1.9.40
com.amazonaws aws-java-sdk-cloudtrail 1.9.40
com.amazonaws aws-java-sdk-cloudwatch 1.9.40
com.amazonaws aws-java-sdk-cloudwatchmetrics 1.9.40
com.amazonaws aws-java-sdk-codedeploy 1.9.40
com.amazonaws aws-java-sdk-cognitoidentity 1.9.40
com.amazonaws aws-java-sdk-cognitosync 1.9.40
com.amazonaws aws-java-sdk-config 1.9.40
com.amazonaws aws-java-sdk-core 1.9.40
com.amazonaws aws-java-sdk-datapipeline 1.9.40
com.amazonaws aws-java-sdk-directconnect 1.9.40
com.amazonaws aws-java-sdk-directory 1.9.40
com.amazonaws aws-java-sdk-dynamodb 1.9.40
com.amazonaws aws-java-sdk-ec2 1.9.40
com.amazonaws aws-java-sdk-ecs 1.9.40
com.amazonaws aws-java-sdk-efs 1.9.40
com.amazonaws aws-java-sdk-elasticache 1.9.40
com.amazonaws aws-java-sdk-elasticbeanstalk 1.9.40
com.amazonaws aws-java-sdk-elasticloadbalancing 1.9.40
com.amazonaws aws-java-sdk-elastictranscoder 1.9.40
com.amazonaws aws-java-sdk-emr 1.9.40
com.amazonaws aws-java-sdk-glacier 1.9.40
com.amazonaws aws-java-sdk-iam 1.9.40
com.amazonaws aws-java-sdk-importexport 1.9.40
com.amazonaws aws-java-sdk-kinesis 1.9.40
com.amazonaws aws-java-sdk-kms 1.9.40
com.amazonaws aws-java-sdk-lambda 1.9.40
com.amazonaws aws-java-sdk-logs 1.9.40
com.amazonaws aws-java-sdk-machinelearning 1.9.40
com.amazonaws aws-java-sdk-opsworks 1.9.40
com.amazonaws aws-java-sdk-rds 1.9.40
com.amazonaws aws-java-sdk-redshift 1.9.40
com.amazonaws aws-java-sdk-route53 1.9.40
com.amazonaws aws-java-sdk-s3 1.9.40
com.amazonaws aws-java-sdk-ses 1.9.40
com.amazonaws aws-java-sdk-simpledb 1.9.40
com.amazonaws aws-java-sdk-simpleworkflow 1.9.40
com.amazonaws aws-java-sdk-sns 1.9.40
com.amazonaws aws-java-sdk-sqs 1.9.40
com.amazonaws aws-java-sdk-ssm 1.9.40
com.amazonaws aws-java-sdk-storagegateway 1.9.40
com.amazonaws aws-java-sdk-sts 1.9.40
com.amazonaws aws-java-sdk-support 1.9.40
com.amazonaws aws-java-sdk-swf-libraries 1.9.40
com.amazonaws aws-java-sdk-workspaces 1.9.40
com.chuusai shapeless_2.10.4 2.0.0
com.clearspring.analytics stream 2.7.0
com.databricks Rserve 1.8-3
com.databricks dbml-local_2.10 0.1.1-spark2.1-SNAPSHOT
com.databricks dbml-local_2.10-tests 0.1.1-spark2.1-SNAPSHOT
com.databricks jets3t 0.7.1-0
com.databricks.scalapb compilerplugin_2.10 0.4.15-9
com.databricks.scalapb scalapb-runtime_2.10 0.4.15-9
com.esotericsoftware kryo-shaded 3.0.3
com.esotericsoftware minlog 1.3.0
com.fasterxml classmate 1.0.0
com.fasterxml.jackson.core jackson-annotations 2.4.5
com.fasterxml.jackson.core jackson-core 2.4.5
com.fasterxml.jackson.core jackson-databind 2.4.5
com.fasterxml.jackson.datatype jackson-datatype-joda 2.4.5
com.fasterxml.jackson.module jackson-module-paranamer 2.4.5
com.fasterxml.jackson.module jackson-module-scala_2.10 2.4.5
com.github.fommil jniloader 1.1
com.github.fommil.netlib core 1.1.2
com.github.fommil.netlib native_ref-java 1.1
com.github.fommil.netlib native_ref-java-natives 1.1
com.github.fommil.netlib native_system-java 1.1
com.github.fommil.netlib native_system-java-natives 1.1
com.github.fommil.netlib netlib-native_ref-linux-x86_64-natives 1.1
com.github.fommil.netlib netlib-native_system-linux-x86_64-natives 1.1
com.github.rwl jtransforms 2.4.0
com.google.code.findbugs jsr305 2.0.1
com.google.code.gson gson 2.2.4
com.google.guava guava 15.0
com.google.protobuf protobuf-java 2.6.1
com.googlecode.javaewah JavaEWAH 0.3.2
com.h2database h2 1.3.174
com.jcraft jsch 0.1.50
com.jolbox bonecp 0.8.0.RELEASE
com.mchange c3p0 0.9.5.1
com.mchange mchange-commons-java 0.2.10
com.ning compress-lzf 1.0.3
com.sun.mail javax.mail 1.5.2
com.thoughtworks.paranamer paranamer 2.6
com.trueaccord.lenses lenses_2.10 0.3
com.twitter chill-java 0.8.0
com.twitter chill_2.10 0.8.0
com.twitter parquet-hadoop-bundle 1.6.0
com.twitter util-app_2.10 6.23.0
com.twitter util-core_2.10 6.23.0
com.twitter util-jvm_2.10 6.23.0
com.typesafe config 1.2.1
com.typesafe scalalogging-slf4j_2.10 1.1.0
com.univocity univocity-parsers 2.2.1
com.zaxxer HikariCP 2.4.1
commons-beanutils commons-beanutils 1.7.0
commons-beanutils commons-beanutils-core 1.8.0
commons-cli commons-cli 1.2
commons-codec commons-codec 1.10
commons-collections commons-collections 3.2.2
commons-configuration commons-configuration 1.6
commons-dbcp commons-dbcp 1.4
commons-digester commons-digester 1.8
commons-httpclient commons-httpclient 3.1
commons-io commons-io 2.4
commons-lang commons-lang 2.6
commons-logging commons-logging 1.1.3
commons-net commons-net 2.2
commons-pool commons-pool 1.5.4
info.ganglia.gmetric4j gmetric4j 1.0.7
io.dropwizard.metrics metrics-core 3.1.2
io.dropwizard.metrics metrics-ganglia 3.1.2
io.dropwizard.metrics metrics-graphite 3.1.2
io.dropwizard.metrics metrics-healthchecks 3.1.2
io.dropwizard.metrics metrics-jetty9 3.1.2
io.dropwizard.metrics metrics-json 3.1.2
io.dropwizard.metrics metrics-jvm 3.1.2
io.dropwizard.metrics metrics-log4j 3.1.2
io.dropwizard.metrics metrics-servlets 3.1.2
io.netty netty 3.8.0.Final
io.netty netty-all 4.0.42.Final
io.prometheus simpleclient 0.0.16
io.prometheus simpleclient_common 0.0.16
io.prometheus simpleclient_dropwizard 0.0.16
io.prometheus simpleclient_servlet 0.0.16
io.prometheus.jmx collector 0.7
javax.activation activation 1.1
javax.annotation javax.annotation-api 1.2
javax.el javax.el-api 2.2.4
javax.jdo jdo-api 3.0.1
javax.servlet javax.servlet-api 3.1.0
javax.servlet.jsp jsp-api 2.1
javax.transaction jta 1.1
javax.validation validation-api 1.1.0.Final
javax.ws.rs javax.ws.rs-api 2.0.1
javax.xml.bind jaxb-api 2.2.2
javax.xml.stream stax-api 1.0-2
javolution javolution 5.5.1
jline jline 2.11
joda-time joda-time 2.9.3
log4j apache-log4j-extras 1.2.17
log4j log4j 1.2.17
mysql mysql-connector-java 5.1.27
net.hydromatic eigenbase-properties 1.1.5
net.java.dev.jets3t jets3t 0.7.1
net.jpountz.lz4 lz4 1.3.0
net.razorvine pyrolite 4.13
net.sf.jpam jpam 1.1
net.sf.opencsv opencsv 2.3
net.sf.py4j py4j 0.10.4
net.sf.supercsv super-csv 2.2.0
net.sourceforge.f2j arpack_combined_all 0.1
org.acplt oncrpc 1.0.7
org.antlr ST4 4.0.4
org.antlr antlr-runtime 3.4
org.antlr antlr4-runtime 4.5.3
org.antlr stringtemplate 3.2.1
org.apache.ant ant 1.9.2
org.apache.ant ant-jsch 1.9.2
org.apache.ant ant-launcher 1.9.2
org.apache.avro avro 1.7.7
org.apache.avro avro-ipc 1.7.7
org.apache.avro avro-ipc-tests 1.7.7
org.apache.avro avro-mapred-hadoop2 1.7.7
org.apache.calcite calcite-avatica 1.2.0-incubating
org.apache.calcite calcite-core 1.2.0-incubating
org.apache.calcite calcite-linq4j 1.2.0-incubating
org.apache.commons commons-compress 1.4.1
org.apache.commons commons-crypto 1.0.0
org.apache.commons commons-lang3 3.5
org.apache.commons commons-math3 3.4.1
org.apache.curator curator-client 2.6.0
org.apache.curator curator-framework 2.6.0
org.apache.curator curator-recipes 2.6.0
org.apache.derby derby 10.10.2.0
org.apache.directory.api api-asn1-api 1.0.0-M20
org.apache.directory.api api-util 1.0.0-M20
org.apache.directory.server apacheds-i18n 2.0.0-M15
org.apache.directory.server apacheds-kerberos-codec 2.0.0-M15
org.apache.hadoop hadoop-annotations 2.7.3
org.apache.hadoop hadoop-auth 2.7.3
org.apache.hadoop hadoop-client 2.7.3
org.apache.hadoop hadoop-common 2.7.3
org.apache.hadoop hadoop-hdfs 2.7.3
org.apache.hadoop hadoop-mapreduce-client-app 2.7.3
org.apache.hadoop hadoop-mapreduce-client-common 2.7.3
org.apache.hadoop hadoop-mapreduce-client-core 2.7.3
org.apache.hadoop hadoop-mapreduce-client-jobclient 2.7.3
org.apache.hadoop hadoop-mapreduce-client-shuffle 2.7.3
org.apache.hadoop hadoop-yarn-api 2.7.3
org.apache.hadoop hadoop-yarn-client 2.7.3
org.apache.hadoop hadoop-yarn-common 2.7.3
org.apache.hadoop hadoop-yarn-server-common 2.7.3
org.apache.htrace htrace-core 3.1.0-incubating
org.apache.httpcomponents httpclient 4.5.2
org.apache.httpcomponents httpcore 4.4.4
org.apache.ivy ivy 2.4.0
org.apache.parquet parquet-column 1.8.1
org.apache.parquet parquet-common 1.8.1
org.apache.parquet parquet-encoding 1.8.1
org.apache.parquet parquet-format 2.3.0-incubating
org.apache.parquet parquet-hadoop 1.8.1
org.apache.parquet parquet-jackson 1.8.1
org.apache.thrift libfb303 0.9.2
org.apache.thrift libthrift 0.9.2
org.apache.xbean xbean-asm5-shaded 4.4
org.apache.zookeeper zookeeper 3.4.6
org.codehaus.jackson jackson-core-asl 1.9.13
org.codehaus.jackson jackson-jaxrs 1.9.13
org.codehaus.jackson jackson-mapper-asl 1.9.13
org.codehaus.jackson jackson-xc 1.9.13
org.codehaus.janino commons-compiler 3.0.0
org.codehaus.janino janino 3.0.0
org.datanucleus datanucleus-api-jdo 3.2.6
org.datanucleus datanucleus-core 3.2.10
org.datanucleus datanucleus-rdbms 3.2.9
org.eclipse.jetty jetty-client 9.3.3.v20150827
org.eclipse.jetty jetty-continuation 9.3.3.v20150827
org.eclipse.jetty jetty-http 9.3.3.v20150827
org.eclipse.jetty jetty-io 9.3.3.v20150827
org.eclipse.jetty jetty-jndi 9.3.3.v20150827
org.eclipse.jetty jetty-plus 9.3.3.v20150827
org.eclipse.jetty jetty-proxy 9.3.3.v20150827
org.eclipse.jetty jetty-security 9.3.3.v20150827
org.eclipse.jetty jetty-server 9.3.3.v20150827
org.eclipse.jetty jetty-servlet 9.3.3.v20150827
org.eclipse.jetty jetty-servlets 9.3.3.v20150827
org.eclipse.jetty jetty-util 9.3.3.v20150827
org.eclipse.jetty jetty-webapp 9.3.3.v20150827
org.eclipse.jetty jetty-xml 9.3.3.v20150827
org.fusesource.jansi jansi 1.4
org.fusesource.leveldbjni leveldbjni-all 1.8
org.glassfish.hk2 hk2-api 2.4.0-b34
org.glassfish.hk2 hk2-locator 2.4.0-b34
org.glassfish.hk2 hk2-utils 2.4.0-b34
org.glassfish.hk2 osgi-resource-locator 1.0.1
org.glassfish.hk2.external aopalliance-repackaged 2.4.0-b34
org.glassfish.hk2.external javax.inject 2.4.0-b34
org.glassfish.jersey.bundles.repackaged jersey-guava 2.22.2
org.glassfish.jersey.containers jersey-container-servlet 2.22.2
org.glassfish.jersey.containers jersey-container-servlet-core 2.22.2
org.glassfish.jersey.core jersey-client 2.22.2
org.glassfish.jersey.core jersey-common 2.22.2
org.glassfish.jersey.core jersey-server 2.22.2
org.glassfish.jersey.media jersey-media-jaxb 2.22.2
org.hibernate hibernate-validator 5.1.1.Final
org.iq80.snappy snappy 0.2
org.javassist javassist 3.18.1-GA
org.jboss.logging jboss-logging 3.1.3.GA
org.jdbi jdbi 2.63.1
org.joda joda-convert 1.7
org.jodd jodd-core 3.5.2
org.jpmml pmml-model 1.2.15
org.jpmml pmml-schema 1.2.15
org.json4s json4s-ast_2.10 3.2.11
org.json4s json4s-core_2.10 3.2.11
org.json4s json4s-jackson_2.10 3.2.11
org.mockito mockito-all 1.9.5
org.objenesis objenesis 2.1
org.postgresql postgresql 9.4-1204-jdbc41
org.roaringbitmap RoaringBitmap 0.5.11
org.rosuda.REngine REngine 2.1.0
org.scala-lang jline 2.10.6
org.scala-lang scala-compiler_2.10 2.10.6
org.scala-lang scala-library_2.10 2.10.6
org.scala-lang scala-reflect_2.10 2.10.6
org.scala-lang scalap_2.10 2.10.6
org.scala-sbt test-interface 1.0
org.scalacheck scalacheck_2.10 1.12.5
org.scalamacros quasiquotes_2.10 2.0.0
org.scalanlp breeze-macros_2.10 0.12
org.scalanlp breeze_2.10 0.12
org.scalatest scalatest_2.10 2.2.6
org.slf4j jcl-over-slf4j 1.7.16
org.slf4j jul-to-slf4j 1.7.16
org.slf4j slf4j-api 1.7.16
org.slf4j slf4j-log4j12 1.7.16
org.spark-project.hive hive-beeline 1.2.1.spark2
org.spark-project.hive hive-cli 1.2.1.spark2
org.spark-project.hive hive-exec 1.2.1.spark2
org.spark-project.hive hive-jdbc 1.2.1.spark2
org.spark-project.hive hive-metastore 1.2.1.spark2
org.spark-project.spark unused 1.0.0
org.spire-math spire-macros_2.10 0.7.4
org.spire-math spire_2.10 0.7.4
org.springframework spring-core 4.1.4.RELEASE
org.springframework spring-test 4.1.4.RELEASE
org.tukaani xz 1.0
org.xerial sqlite-jdbc 3.8.11.2
org.xerial.snappy snappy-java 1.1.2.6
org.yaml snakeyaml 1.16
oro oro 2.0.8
stax stax-api 1.0.1
xerces xercesImpl 2.9.1
xml-apis xml-apis 1.3.04
xmlenc xmlenc 0.52

Pre-installed Java and Scala libraries (Scala 2.11 cluster version)

Group ID Artifact ID Version
antlr antlr 2.7.7
com.amazonaws aws-java-sdk 1.9.40
com.amazonaws aws-java-sdk-autoscaling 1.9.40
com.amazonaws aws-java-sdk-cloudformation 1.9.40
com.amazonaws aws-java-sdk-cloudfront 1.9.40
com.amazonaws aws-java-sdk-cloudhsm 1.9.40
com.amazonaws aws-java-sdk-cloudsearch 1.9.40
com.amazonaws aws-java-sdk-cloudtrail 1.9.40
com.amazonaws aws-java-sdk-cloudwatch 1.9.40
com.amazonaws aws-java-sdk-cloudwatchmetrics 1.9.40
com.amazonaws aws-java-sdk-codedeploy 1.9.40
com.amazonaws aws-java-sdk-cognitoidentity 1.9.40
com.amazonaws aws-java-sdk-cognitosync 1.9.40
com.amazonaws aws-java-sdk-config 1.9.40
com.amazonaws aws-java-sdk-core 1.9.40
com.amazonaws aws-java-sdk-datapipeline 1.9.40
com.amazonaws aws-java-sdk-directconnect 1.9.40
com.amazonaws aws-java-sdk-directory 1.9.40
com.amazonaws aws-java-sdk-dynamodb 1.9.40
com.amazonaws aws-java-sdk-ec2 1.9.40
com.amazonaws aws-java-sdk-ecs 1.9.40
com.amazonaws aws-java-sdk-efs 1.9.40
com.amazonaws aws-java-sdk-elasticache 1.9.40
com.amazonaws aws-java-sdk-elasticbeanstalk 1.9.40
com.amazonaws aws-java-sdk-elasticloadbalancing 1.9.40
com.amazonaws aws-java-sdk-elastictranscoder 1.9.40
com.amazonaws aws-java-sdk-emr 1.9.40
com.amazonaws aws-java-sdk-glacier 1.9.40
com.amazonaws aws-java-sdk-iam 1.9.40
com.amazonaws aws-java-sdk-importexport 1.9.40
com.amazonaws aws-java-sdk-kinesis 1.9.40
com.amazonaws aws-java-sdk-kms 1.9.40
com.amazonaws aws-java-sdk-lambda 1.9.40
com.amazonaws aws-java-sdk-logs 1.9.40
com.amazonaws aws-java-sdk-machinelearning 1.9.40
com.amazonaws aws-java-sdk-opsworks 1.9.40
com.amazonaws aws-java-sdk-rds 1.9.40
com.amazonaws aws-java-sdk-redshift 1.9.40
com.amazonaws aws-java-sdk-route53 1.9.40
com.amazonaws aws-java-sdk-s3 1.9.40
com.amazonaws aws-java-sdk-ses 1.9.40
com.amazonaws aws-java-sdk-simpledb 1.9.40
com.amazonaws aws-java-sdk-simpleworkflow 1.9.40
com.amazonaws aws-java-sdk-sns 1.9.40
com.amazonaws aws-java-sdk-sqs 1.9.40
com.amazonaws aws-java-sdk-ssm 1.9.40
com.amazonaws aws-java-sdk-storagegateway 1.9.40
com.amazonaws aws-java-sdk-sts 1.9.40
com.amazonaws aws-java-sdk-support 1.9.40
com.amazonaws aws-java-sdk-swf-libraries 1.9.40
com.amazonaws aws-java-sdk-workspaces 1.9.40
com.chuusai shapeless_2.11 2.0.0
com.clearspring.analytics stream 2.7.0
com.databricks Rserve 1.8-3
com.databricks dbml-local_2.11 0.1.1-spark2.1-SNAPSHOT
com.databricks dbml-local_2.11-tests 0.1.1-spark2.1-SNAPSHOT
com.databricks jets3t 0.7.1-0
com.databricks.scalapb compilerplugin_2.11 0.4.15-9
com.databricks.scalapb scalapb-runtime_2.11 0.4.15-9
com.esotericsoftware kryo-shaded 3.0.3
com.esotericsoftware minlog 1.3.0
com.fasterxml classmate 1.0.0
com.fasterxml.jackson.core jackson-annotations 2.4.5
com.fasterxml.jackson.core jackson-core 2.4.5
com.fasterxml.jackson.core jackson-databind 2.4.5
com.fasterxml.jackson.datatype jackson-datatype-joda 2.4.5
com.fasterxml.jackson.module jackson-module-paranamer 2.4.5
com.fasterxml.jackson.module jackson-module-scala_2.11 2.4.5
com.github.fommil jniloader 1.1
com.github.fommil.netlib core 1.1.2
com.github.fommil.netlib native_ref-java 1.1
com.github.fommil.netlib native_ref-java-natives 1.1
com.github.fommil.netlib native_system-java 1.1
com.github.fommil.netlib native_system-java-natives 1.1
com.github.fommil.netlib netlib-native_ref-linux-x86_64-natives 1.1
com.github.fommil.netlib netlib-native_system-linux-x86_64-natives 1.1
com.github.rwl jtransforms 2.4.0
com.google.code.findbugs jsr305 2.0.1
com.google.code.gson gson 2.2.4
com.google.guava guava 15.0
com.google.protobuf protobuf-java 2.6.1
com.googlecode.javaewah JavaEWAH 0.3.2
com.h2database h2 1.3.174
com.jcraft jsch 0.1.50
com.jolbox bonecp 0.8.0.RELEASE
com.mchange c3p0 0.9.5.1
com.mchange mchange-commons-java 0.2.10
com.ning compress-lzf 1.0.3
com.sun.mail javax.mail 1.5.2
com.thoughtworks.paranamer paranamer 2.6
com.trueaccord.lenses lenses_2.11 0.3
com.twitter chill-java 0.8.0
com.twitter chill_2.11 0.8.0
com.twitter parquet-hadoop-bundle 1.6.0
com.twitter util-app_2.11 6.23.0
com.twitter util-core_2.11 6.23.0
com.twitter util-jvm_2.11 6.23.0
com.typesafe config 1.2.1
com.typesafe.scala-logging scala-logging-api_2.11 2.1.2
com.typesafe.scala-logging scala-logging-slf4j_2.11 2.1.2
com.univocity univocity-parsers 2.2.1
com.zaxxer HikariCP 2.4.1
commons-beanutils commons-beanutils 1.7.0
commons-beanutils commons-beanutils-core 1.8.0
commons-cli commons-cli 1.2
commons-codec commons-codec 1.10
commons-collections commons-collections 3.2.2
commons-configuration commons-configuration 1.6
commons-dbcp commons-dbcp 1.4
commons-digester commons-digester 1.8
commons-httpclient commons-httpclient 3.1
commons-io commons-io 2.4
commons-lang commons-lang 2.6
commons-logging commons-logging 1.1.3
commons-net commons-net 2.2
commons-pool commons-pool 1.5.4
info.ganglia.gmetric4j gmetric4j 1.0.7
io.dropwizard.metrics metrics-core 3.1.2
io.dropwizard.metrics metrics-ganglia 3.1.2
io.dropwizard.metrics metrics-graphite 3.1.2
io.dropwizard.metrics metrics-healthchecks 3.1.2
io.dropwizard.metrics metrics-jetty9 3.1.2
io.dropwizard.metrics metrics-json 3.1.2
io.dropwizard.metrics metrics-jvm 3.1.2
io.dropwizard.metrics metrics-log4j 3.1.2
io.dropwizard.metrics metrics-servlets 3.1.2
io.netty netty 3.8.0.Final
io.netty netty-all 4.0.42.Final
io.prometheus simpleclient 0.0.16
io.prometheus simpleclient_common 0.0.16
io.prometheus simpleclient_dropwizard 0.0.16
io.prometheus simpleclient_servlet 0.0.16
io.prometheus.jmx collector 0.7
javax.activation activation 1.1
javax.annotation javax.annotation-api 1.2
javax.el javax.el-api 2.2.4
javax.jdo jdo-api 3.0.1
javax.servlet javax.servlet-api 3.1.0
javax.servlet.jsp jsp-api 2.1
javax.transaction jta 1.1
javax.validation validation-api 1.1.0.Final
javax.ws.rs javax.ws.rs-api 2.0.1
javax.xml.bind jaxb-api 2.2.2
javax.xml.stream stax-api 1.0-2
javolution javolution 5.5.1
jline jline 2.11
joda-time joda-time 2.9.3
log4j apache-log4j-extras 1.2.17
log4j log4j 1.2.17
mysql mysql-connector-java 5.1.27
net.hydromatic eigenbase-properties 1.1.5
net.java.dev.jets3t jets3t 0.7.1
net.jpountz.lz4 lz4 1.3.0
net.razorvine pyrolite 4.13
net.sf.jpam jpam 1.1
net.sf.opencsv opencsv 2.3
net.sf.py4j py4j 0.10.4
net.sf.supercsv super-csv 2.2.0
net.sourceforge.f2j arpack_combined_all 0.1
org.acplt oncrpc 1.0.7
org.antlr ST4 4.0.4
org.antlr antlr-runtime 3.4
org.antlr antlr4-runtime 4.5.3
org.antlr stringtemplate 3.2.1
org.apache.ant ant 1.9.2
org.apache.ant ant-jsch 1.9.2
org.apache.ant ant-launcher 1.9.2
org.apache.avro avro 1.7.7
org.apache.avro avro-ipc 1.7.7
org.apache.avro avro-ipc-tests 1.7.7
org.apache.avro avro-mapred-hadoop2 1.7.7
org.apache.calcite calcite-avatica 1.2.0-incubating
org.apache.calcite calcite-core 1.2.0-incubating
org.apache.calcite calcite-linq4j 1.2.0-incubating
org.apache.commons commons-compress 1.4.1
org.apache.commons commons-crypto 1.0.0
org.apache.commons commons-lang3 3.5
org.apache.commons commons-math3 3.4.1
org.apache.curator curator-client 2.6.0
org.apache.curator curator-framework 2.6.0
org.apache.curator curator-recipes 2.6.0
org.apache.derby derby 10.10.2.0
org.apache.directory.api api-asn1-api 1.0.0-M20
org.apache.directory.api api-util 1.0.0-M20
org.apache.directory.server apacheds-i18n 2.0.0-M15
org.apache.directory.server apacheds-kerberos-codec 2.0.0-M15
org.apache.hadoop hadoop-annotations 2.7.3
org.apache.hadoop hadoop-auth 2.7.3
org.apache.hadoop hadoop-client 2.7.3
org.apache.hadoop hadoop-common 2.7.3
org.apache.hadoop hadoop-hdfs 2.7.3
org.apache.hadoop hadoop-mapreduce-client-app 2.7.3
org.apache.hadoop hadoop-mapreduce-client-common 2.7.3
org.apache.hadoop hadoop-mapreduce-client-core 2.7.3
org.apache.hadoop hadoop-mapreduce-client-jobclient 2.7.3
org.apache.hadoop hadoop-mapreduce-client-shuffle 2.7.3
org.apache.hadoop hadoop-yarn-api 2.7.3
org.apache.hadoop hadoop-yarn-client 2.7.3
org.apache.hadoop hadoop-yarn-common 2.7.3
org.apache.hadoop hadoop-yarn-server-common 2.7.3
org.apache.htrace htrace-core 3.1.0-incubating
org.apache.httpcomponents httpclient 4.5.2
org.apache.httpcomponents httpcore 4.4.4
org.apache.ivy ivy 2.4.0
org.apache.parquet parquet-column 1.8.1
org.apache.parquet parquet-common 1.8.1
org.apache.parquet parquet-encoding 1.8.1
org.apache.parquet parquet-format 2.3.0-incubating
org.apache.parquet parquet-hadoop 1.8.1
org.apache.parquet parquet-jackson 1.8.1
org.apache.thrift libfb303 0.9.2
org.apache.thrift libthrift 0.9.2
org.apache.xbean xbean-asm5-shaded 4.4
org.apache.zookeeper zookeeper 3.4.6
org.codehaus.jackson jackson-core-asl 1.9.13
org.codehaus.jackson jackson-jaxrs 1.9.13
org.codehaus.jackson jackson-mapper-asl 1.9.13
org.codehaus.jackson jackson-xc 1.9.13
org.codehaus.janino commons-compiler 3.0.0
org.codehaus.janino janino 3.0.0
org.datanucleus datanucleus-api-jdo 3.2.6
org.datanucleus datanucleus-core 3.2.10
org.datanucleus datanucleus-rdbms 3.2.9
org.eclipse.jetty jetty-client 9.3.3.v20150827
org.eclipse.jetty jetty-continuation 9.3.3.v20150827
org.eclipse.jetty jetty-http 9.3.3.v20150827
org.eclipse.jetty jetty-io 9.3.3.v20150827
org.eclipse.jetty jetty-jndi 9.3.3.v20150827
org.eclipse.jetty jetty-plus 9.3.3.v20150827
org.eclipse.jetty jetty-proxy 9.3.3.v20150827
org.eclipse.jetty jetty-security 9.3.3.v20150827
org.eclipse.jetty jetty-server 9.3.3.v20150827
org.eclipse.jetty jetty-servlet 9.3.3.v20150827
org.eclipse.jetty jetty-servlets 9.3.3.v20150827
org.eclipse.jetty jetty-util 9.3.3.v20150827
org.eclipse.jetty jetty-webapp 9.3.3.v20150827
org.eclipse.jetty jetty-xml 9.3.3.v20150827
org.fusesource.leveldbjni leveldbjni-all 1.8
org.glassfish.hk2 hk2-api 2.4.0-b34
org.glassfish.hk2 hk2-locator 2.4.0-b34
org.glassfish.hk2 hk2-utils 2.4.0-b34
org.glassfish.hk2 osgi-resource-locator 1.0.1
org.glassfish.hk2.external aopalliance-repackaged 2.4.0-b34
org.glassfish.hk2.external javax.inject 2.4.0-b34
org.glassfish.jersey.bundles.repackaged jersey-guava 2.22.2
org.glassfish.jersey.containers jersey-container-servlet 2.22.2
org.glassfish.jersey.containers jersey-container-servlet-core 2.22.2
org.glassfish.jersey.core jersey-client 2.22.2
org.glassfish.jersey.core jersey-common 2.22.2
org.glassfish.jersey.core jersey-server 2.22.2
org.glassfish.jersey.media jersey-media-jaxb 2.22.2
org.hibernate hibernate-validator 5.1.1.Final
org.iq80.snappy snappy 0.2
org.javassist javassist 3.18.1-GA
org.jboss.logging jboss-logging 3.1.3.GA
org.jdbi jdbi 2.63.1
org.joda joda-convert 1.7
org.jodd jodd-core 3.5.2
org.jpmml pmml-model 1.2.15
org.jpmml pmml-schema 1.2.15
org.json4s json4s-ast_2.11 3.2.11
org.json4s json4s-core_2.11 3.2.11
org.json4s json4s-jackson_2.11 3.2.11
org.mockito mockito-all 1.9.5
org.objenesis objenesis 2.1
org.postgresql postgresql 9.4-1204-jdbc41
org.roaringbitmap RoaringBitmap 0.5.11
org.rosuda.REngine REngine 2.1.0
org.scala-lang scala-compiler_2.11 2.11.8
org.scala-lang scala-library_2.11 2.11.8
org.scala-lang scala-reflect_2.11 2.11.8
org.scala-lang scalap_2.11 2.11.8
org.scala-lang.modules scala-parser-combinators_2.11 1.0.2
org.scala-lang.modules scala-xml_2.11 1.0.2
org.scala-sbt test-interface 1.0
org.scalacheck scalacheck_2.11 1.12.5
org.scalanlp breeze-macros_2.11 0.12
org.scalanlp breeze_2.11 0.12
org.scalatest scalatest_2.11 2.2.6
org.slf4j jcl-over-slf4j 1.7.16
org.slf4j jul-to-slf4j 1.7.16
org.slf4j slf4j-api 1.7.16
org.slf4j slf4j-log4j12 1.7.16
org.spark-project.hive hive-beeline 1.2.1.spark2
org.spark-project.hive hive-cli 1.2.1.spark2
org.spark-project.hive hive-exec 1.2.1.spark2
org.spark-project.hive hive-jdbc 1.2.1.spark2
org.spark-project.hive hive-metastore 1.2.1.spark2
org.spark-project.spark unused 1.0.0
org.spire-math spire-macros_2.11 0.7.4
org.spire-math spire_2.11 0.7.4
org.springframework spring-core 4.1.4.RELEASE
org.springframework spring-test 4.1.4.RELEASE
org.tukaani xz 1.0
org.xerial sqlite-jdbc 3.8.11.2
org.xerial.snappy snappy-java 1.1.2.6
org.yaml snakeyaml 1.16
oro oro 2.0.8
stax stax-api 1.0.1
xerces xercesImpl 2.9.1
xml-apis xml-apis 1.3.04
xmlenc xmlenc 0.52