Databricks released this image in July 2020.
Databricks Runtime 7.1 for Genomics is a version of Databricks Runtime 7.1 (Unsupported) optimized for working with genomic and biomedical data. It is a component of the Databricks Unified Analytics Platform for Genomics.
Databricks Runtime 7.1 for Genomics is built on top of Databricks Runtime 7.1. For information on what’s new in Databricks Runtime 7.1, see the Databricks Runtime 7.1 (Unsupported) release notes.
Glow now provides a function transform_loco to perform the GloWGR ridge regression transformation with a leave one chromosome out (LOCO) strategy. Partitioning the predicted phenotype values avoids proximal contamination during downstream association testing. The GloWGR documentation demonstrates the new usage.
Glow now provides a function reshape_for_gwas to convert the phenotype estimates output by GloWGR from a Pandas DataFrame to a Spark DataFrame compatible with the Glow genome-wide association study (GWAS) regression functions. The GloWGR documentation reflects the new usage.
The RNASeq pipeline now outputs unpaired alignments from STAR. These previously were dropped in favor of only paired alignments.