`ai_extract` function

Applies to: Databricks SQL Databricks Runtime

Preview

During the preview, the underlying language model can handle several languages, but this AI Function is tuned for English.

The ai_extract() function allows you to invoke a state-of-the-art generative AI model to extract entities specified by labels from a given text using SQL. This function uses a chat model serving endpoint made available by Databricks Foundation Model APIs.

Requirements

important

The underlying models that might be used at this time are licensed under the Apache 2.0 License, Copyright © The Apache Software Foundation or the LLAMA 3.3 Community License Copyright © Meta Platforms, Inc. All rights reserved. Customers are responsible for ensuring compliance with applicable model licenses.

Databricks recommends reviewing these licenses to ensure compliance with any applicable terms. If models emerge in the future that perform better according to Databricks's internal benchmarks, Databricks might change the model (and the list of applicable licenses provided on this page).

This function is only available on workspaces in AI Functions using Foundation Model APIs supported regions.
This function is not available on Databricks SQL Classic.
Check the Databricks SQL pricing page.
In Databricks Runtime 15.1 and above, this function is supported in Databricks notebooks, including notebooks that are run as a task in a Databricks workflow.
Batch inference workloads require Databricks Runtime 15.4 ML LTS for improved performance.

note

In Databricks Runtime 15.1 and above, this function is supported in Databricks notebooks, including notebooks that are run as a task in a Databricks workflow.

Syntax

ai_extract(content, labels)

Arguments

content: A STRING expression.
labels: An ARRAY<STRING> literal. Each element is a type of entity to be extracted.

Returns

A STRUCT where each field corresponds to an entity type specified in labels. Each field contains a string representing the extracted entity. If more than one candidate for any entity type is found, only one is returned.

If content is NULL, the result is NULL.

Examples

SQL
> SELECT ai_extract(
    'John Doe lives in New York and works for Acme Corp.',
    array('person', 'location', 'organization')
  );
 {"person": "John Doe", "location": "New York", "organization": "Acme Corp."}

> SELECT ai_extract(
    'Send an email to jane.doe@example.com about the meeting at 10am.',
    array('email', 'time')
  );
 {"email": "jane.doe@example.com", "time": "10am"}

ai_mask function

Requirements​

Syntax​

Arguments​

Returns​

Examples​

Related functions​

Requirements

Syntax

Arguments

Returns

Examples

Related functions