Bioinformatics libraries

The following sections show how to install libraries on Databricks clusters that let you parallelize genomic data analysis with Apache Spark.


These libraries are also included in Databricks Runtime for Genomics, which is deprecated. Databricks no longer builds new Databricks Runtime for Genomics releases and will remove support for it on September 24, 2022, when support for Databricks Runtime for Genomics 7.3 LTS ends. After that date, Databricks Runtime for Genomics will no longer be available for selection when you create a cluster. For more information about the Databricks Runtime deprecation policy and schedule, see Supported Databricks runtime releases and support schedule.

To learn how to install and manage single-node bioinformatics tools and libraries, see Libraries and Customize containers with Databricks Container Services.