Get started with scikit-learn in Databricks

This 10-minute tutorial is designed as an introduction to machine learning in Databricks. It uses algorithms from the popular machine learning package scikit-learn along with MLflow for tracking the model development process and Hyperopt to automate hyperparameter tuning.

Requirements

Databricks Runtime 7.0 ML or above.

Example notebooks

If you are using Databricks Runtime 7.3 LTS ML or above, Databricks recommends using MLflow autologging, illustrated in this notebook.

Get started with scikit-learn and MLflow autologging notebook

Open notebook in new tab

You can use the following notebook with Databricks Runtime 7.0 ML or above. This notebook uses manual MLflow logging to track model development.

Get started with scikit-learn notebook

Open notebook in new tab