In the sections below, we provide example notebooks to demonstrate how to install Theano on Databricks using Cluster Node Initialization Scripts and how to run example Theano programs. See Integrating Deep Learning Libraries with Apache Spark for an example of integrating a deep learning library with Spark.
This guide is not a comprehensive guide on Theano. Refer to the Theano website.
Databricks recommends using Cluster Node Initialization Scripts to install Theano to make it available on all cluster nodes. The example notebook below installs an init script that installs Theano and enables GPU features.
To test and migrate single-machine Theano workflows, you can start with a driver-only cluster on Databricks by setting the number of workers to zero. Though Apache Spark is not functional under this setting, it is a cost-effective way to run single-machine Theano workflows.