Connect to Anomalo

Anomalo is a data quality validation platform that helps ensure your data is accurate, complete, consistent, and in line with your expectations. By connecting to Databricks, Anomalo adds a unifying layer so that you can trust the quality of your data before it is consumed by business intelligence and analytics tools or by modeling and machine learning frameworks.

You can integrate your Databricks clusters and Databricks SQL warehouses (formerly Databricks SQL endpoints) with Anomalo.

Connect to Anomalo using Partner Connect

To connect your Databricks workspace to Anomalo using Partner Connect, see Connect to data governance partners using Partner Connect.

Note

Partner Connect only supports Databricks SQL warehouses for Anomalo. To connect a cluster in your Databricks workspace to Anomalo, connect to Anomalo manually.

Connect to Anomalo manually

This section describes how to connect an existing SQL warehouse or cluster to Anomalo manually.

Requirements

Before you connect to Anomalo manually, you must have the following:

  • A cluster or SQL warehouse in your Databricks workspace.

  • The connection details for your cluster or SQL warehouse, specifically the Server Hostname, Port, and HTTP Path values.

  • A Databricks personal access token. To create a personal access token, do the following:

    1. In your Databricks workspace, click your Databricks username in the top bar, and then select Settings from the drop-down menu.

    2. Click Developer.

    3. Next to Access tokens, click Manage.

    4. Click Generate new token.

    5. (Optional) Enter a comment that helps you identify this token in the future, and change the token’s default lifetime of 90 days. To create a token with no lifetime (not recommended), leave the Lifetime (days) box empty.

    6. Click Generate.

    7. Copy the displayed token to a secure location, and then click Done.

    Note

    Be sure to save the copied token in a secure location, and do not share it with others. If you lose the copied token, you cannot regenerate that exact same token; you must repeat this procedure to create a new one. If you lose the copied token, or you believe that it has been compromised, Databricks strongly recommends that you immediately delete that token from your workspace by clicking the trash can (Revoke) icon next to the token on the Access tokens page.

    If you are not able to create or use tokens in your workspace, this might be because your workspace administrator has disabled tokens or has not given you permission to create or use tokens. Contact your workspace administrator.

    Note

    As a security best practice when you authenticate with automated tools, systems, scripts, and apps, Databricks recommends that you use OAuth tokens.

    If you use personal access token authentication, Databricks recommends using personal access tokens belonging to service principals instead of workspace users. To create tokens for service principals, see Manage tokens for a service principal.
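Before entering these values in Anomalo, it can help to sanity-check them locally. The following is a minimal sketch, not part of Databricks or Anomalo: the `ConnectionDetails` class and both functions are hypothetical helpers, the environment variable names are assumptions, and the default port of 443 is an assumption that you should replace with the Port value shown for your cluster or SQL warehouse. It only checks the shape of the values and does not contact your workspace.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class ConnectionDetails:
    """The four values listed in the requirements above (hypothetical container)."""
    server_hostname: str
    port: int
    http_path: str
    access_token: str


def check_connection_details(details: ConnectionDetails) -> list[str]:
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    host = details.server_hostname
    # The Server Hostname is expected to be a bare host name, not a full URL.
    if not host or "://" in host or "/" in host:
        problems.append("Server Hostname should be a bare host name without a scheme or path")
    if not 1 <= details.port <= 65535:
        problems.append("Port must be between 1 and 65535")
    # An HTTP Path consisting only of slashes carries no information.
    if not details.http_path.strip("/"):
        problems.append("HTTP Path is empty")
    if not details.access_token:
        problems.append("Personal access token is missing")
    return problems


def load_from_env(env=os.environ) -> ConnectionDetails:
    """Read the values from environment variables (the names are assumptions)."""
    return ConnectionDetails(
        server_hostname=env.get("DATABRICKS_SERVER_HOSTNAME", ""),
        port=int(env.get("DATABRICKS_PORT", "443")),  # assumed default port
        http_path=env.get("DATABRICKS_HTTP_PATH", ""),
        access_token=env.get("DATABRICKS_TOKEN", ""),
    )
```

Calling `check_connection_details(load_from_env())` returns an empty list when all four values are present and well formed. Reading the token from the environment rather than hard-coding it keeps it out of scripts and version control, in line with the guidance above about storing the token securely.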

Steps to connect

To connect to Anomalo manually, do the following:

  1. Sign up for a new Anomalo account, or sign in to your existing Anomalo account.

  2. In the sidebar of your Anomalo home page, click the Support icon, then click Anomalo Documentation.

  3. Follow the steps in the article titled Connecting your data.

Next steps