Databricks Runtime 7.1 for Genomics (Unsupported)
Databricks released this image in July 2020.
Databricks Runtime 7.1 for Genomics is a version of Databricks Runtime 7.1 (Unsupported) optimized for working with genomic and biomedical data. It is a component of the Databricks Unified Analytics Platform for Genomics.
For more information, including instructions for creating a Databricks Runtime for Genomics cluster, see Genomics guide. For more information on developing genomics applications, see Genomics guide.
Databricks Runtime 7.1 for Genomics is built on top of Databricks Runtime 7.1. For information on what’s new in Databricks Runtime 7.1, see the Databricks Runtime 7.1 (Unsupported) release notes.
Glow now provides a function transform_loco to perform the GloWGR ridge regression transformation with a leave one chromosome out (LOCO) strategy. Partitioning the predicted phenotype values avoids proximal contamination during downstream association testing. The GloWGR documentation demonstrates the new usage.
GloWGR output reshaping function
Glow now provides a function reshape_for_gwas to convert the phenotype estimates output by GloWGR from a Pandas DataFrame to a Spark DataFrame compatible with the Glow genome-wide association study (GWAS) regression functions. The GloWGR documentation reflects the new usage.
RNASeq outputs unpaired alignments
The RNASeq pipeline now outputs unpaired alignments from STAR. These previously were dropped in favor of only paired alignments.