Genomics guide

To learn how to get started with genomics on Databricks, see:

Databricks Runtime for Genomics (Deprecated) offers secondary analysis pipelines parallelized with Apache Spark.

Note

Databricks Runtime for Genomics is deprecated. Databricks is no longer building new Databricks Runtime for Genomics releases and will remove support for Databricks Runtime for Genomics on September 24, 2022, when support for Databricks Runtime for Genomics 7.3 LTS ends. After that date, Databricks Runtime for Genomics will no longer be available for selection when you create a cluster. For more information about the Databricks Runtime deprecation policy and schedule, see Supported Databricks runtime releases and support schedule. Bioinformatics libraries that were part of the runtime have been released as Docker containers, which you can find on the ProjectGlow Docker Hub page.

Features of Databricks Runtime for Genomics have been open-sourced as part of Glow, a joint Databricks-Regeneron project. For information on Glow, see the Glow documentation.