Labelbox is a training data platform used to create training data from images, video, audio, text, and tiled imagery. Using Labelbox, AI teams can customize a workflow to operate, manage and improve data labeling, data cataloging, and model debugging in a single, unified platform. Labelbox is designed to help AI teams build and operate production-grade machine learning systems.
You can connect your Databricks clusters that have the Machine Learning version of the Databricks Runtime to Labelbox.
If you already have a Labelbox account, Databricks recommends that you skip ahead to Connect to Labelbox manually instead. This is because the new connection experience in Partner Connect is optimized for new Labelbox accounts.
Make sure your Databricks account, workspace, and the signed-in user all meet the requirements for Partner Connect.
In the sidebar, click Partner Connect.
Click the Labelbox tile.
If the Labelbox tile has a check mark icon inside of it, this means that someone else in your workspace has already created an ML cluster in this workspace along with a related Databricks service principal, and a notebook named
labelbox_databricks_exampleshould have already been added to the Workspace/Shared/labelbox_demo folder in your workspace. Skip ahead to Connect to Labelbox manually.
In the Connect to partner dialog, click Next. Partner Connect creates the following resources in your workspace:
An ML cluster named LABELBOX_CLUSTER by default. (You can change this default name before you click Next.)
A Databricks service principal named LABELBOX_USER.
For Email, enter the email address that you want to use for your new Labelbox account.
Click Connect to Labelbox.
Follow the on-screen instructions to Sign Up for your new Labelbox account.
After you sign in to your new Labelbox account, Partner Connect creates the following resources in your workspace, if they do not already exist:
Create a Labelbox API key for your Labelbox account, if you do not have one. Copy the API key and save it in a secure location, as the key will eventually be hidden from view.
Skip ahead to Set up the ML cluster and Labelbox starter notebook.
The following instructions describe how to connect Labelbox to a Databricks cluster.
To connect faster, use Partner Connect.
You must have an available cluster that has the Machine Learning version of the Databricks Runtime. To check this for an existing cluster, look for ML in the Runtime column when you display the cluster in your workspace. If you do not have an available ML cluster, create a cluster and for Databricks Runtime Version, choose a version from the ML list.
Go to the Labelbox page to Sign Up for a new Labelbox account or to Log In to your existing Labelbox account.
Create a Labelbox API key for your Labelbox account, if you do not have one. Copy the API key and save it in a secure location, as the key will eventually be hidden from view, and you will need this key later.
Check for a Labelbox starter notebook in your workspace:
In your Databricks workspace, ensure that you are in the Data Science & Engineering or Databricks Machine Learning environment. Use the sidebar persona-switcher if necessary.
In the sidebar, click Workspace > Shared.
If a folder named labelbox_demo does not already exist, create it:
Click the down arrow next to Shared.
Click Create > Folder.
Click Create Folder.
Click the labelbox_demo folder. If a starter notebook named labelbox_databricks_example.ipynb does not exist in the folder, import it:
Click the down arrow next to labelbox_demo.
https://github.com/Labelbox/labelbox-python/blob/develop/examples/integrations/databricks/labelbox_databricks_example.ipynband click Import.
Continue with Set up the ML cluster and Labelbox starter notebook.
In your Databricks workspace, ensure that you are still in the Data Science & Engineering or Databricks Machine Learning environment. Use the sidebar persona-switcher if necessary.
Check that the required Labelbox libraries are installed in your ML cluster:
In the sidebar, click Compute.
Click your ML cluster. Use the Filter box to find it, if necessary.
If you used Partner Connect to connect to Labelbox, the ML cluster’s name should be LABELBOX_CLUSTER.
Click the Libraries tab.
If the labelbox package is not listed, install it:
Click Install New.
For Package, enter labelbox.
If the labelspark package is not listed, install it:
Click Install New.
For Package, enter labelspark.
Attach your ML cluster to the starter notebook:
In the sidebar, click Workspace > Shared > labelbox_demo > labelbox_databricks_example.ipynb.
Attach your ML cluster to the notebook.
Browse through the notebook to learn how to automate Labelbox.