Looker

You can connect Looker to a Databricks cluster using JDBC.

Step 1: Connect Looker to a Databricks cluster

  1. Get your cluster’s server hostname, port, and HTTP path, using the instructions in Connecting BI Tools.

  2. In Looker, go to Admin > Connections > New Database Connection.

  3. Enter the cluster connection parameters that you retrieved in step 1.

    • For token-based authentication, enter “token” in the Username field and a personal access token in the Password field.
    • Do not enable Persistent Derived Tables.
    • Keep the Max Connections and Connection Pool Timeout defaults.
    • Leave Database Time Zone blank (assuming that you are storing everything in UTC).
    • Adjust Query Time Zone if you want to translate queries into other time zones.
    • In Additional Params, enter the following settings as a single string with no spaces, replacing the placeholder with the HTTP path from step 1: ;transportMode=http;ssl=true;httpPath=</the/http/path/from/step/1>

For more information, see the Looker documentation.
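
The Additional Params value is a semicolon-delimited list of key=value pairs, so a stray space or missing semicolon is easy to introduce when typing it by hand. The following minimal sketch assembles the string from an HTTP path. The function name build_additional_params and the example path are hypothetical and only illustrate the format shown above; substitute the HTTP path of your own cluster.

```python
def build_additional_params(http_path: str) -> str:
    """Return the Additional Params string for a given cluster HTTP path.

    Hypothetical helper: it only reproduces the format shown in the
    connection steps (;transportMode=http;ssl=true;httpPath=<path>).
    """
    path = http_path.strip()
    if not path:
        raise ValueError("http_path must not be empty")
    # A semicolon or whitespace inside the path would corrupt the parameter list.
    if ";" in path or any(ch.isspace() for ch in path):
        raise ValueError("http_path must not contain ';' or whitespace")

    settings = [
        ("transportMode", "http"),
        ("ssl", "true"),
        ("httpPath", path),
    ]
    # Each setting is prefixed with ';' so the string can be appended as-is.
    return "".join(f";{key}={value}" for key, value in settings)


if __name__ == "__main__":
    # Placeholder path for illustration only; use the value from step 1.
    example_path = "sql/protocolv1/o/0/0000-000000-example"
    print(build_additional_params(example_path))
```

Paste the printed string into the Additional Params field exactly as produced.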

Step 2: Begin modeling your database in Looker by creating a project and running the generator

This step assumes that there are permanent tables stored in the default database of your cluster.

  1. If necessary, turn on Developer Mode by switching the Dev toggle from OFF to ON.
  2. Go to LookML > Manage Projects.
  3. Click New LookML Project.
  4. Configure the new project.
    • Give the project a name.
    • Select Generate Model & Views.
    • Select the Connection name that you provided when you created the database connection.
    • Select All Tables.
    • Set Schemas to default, unless you have other databases to model in the cluster.
  5. Click Create Project.

After you create the project and the generator runs, Looker displays one model file and multiple view files. The model file shows the tables in the schema and any join relationships discovered between them. Each view file lists the dimensions (columns) available for one table in the schema.