Use Agent Bricks: Custom LLM to create a gen AI agent for text

Beta

This feature is in Beta.

This article describes how to create a generative AI agent for custom text-based tasks using Agent Bricks: Custom LLM.

Agent Bricks provides a simple, no-code approach to build and optimize domain-specific, high-quality AI agent systems for common AI use cases.

What can you do with Custom LLM?

Use Agent Bricks: Custom LLM to generate high-quality results for any domain-specific task, such as summarization, classification, text transformation, and content generation.

Agent Bricks: Custom LLM is ideal for the following use cases:

Summarizing the issue and resolution of customer calls.
Analyzing the sentiment of customer reviews.
Classifying research papers by topic.
Generating press releases for new features.

Given high-level instruction and examples, Agent Bricks: Custom LLM optimizes prompts on behalf of users, automatically infers evaluation criteria, evaluates the system from provided data, and deploys the model as a productionizable endpoint.

Agent Bricks: Custom LLM leverages automated evaluation capabilities, including MLflow and Agent Evaluation, to enable rapid assessment of the cost-quality tradeoff for your specific extraction task. This assessment allows you to make informed decisions about the balance between accuracy and resource investment.

Requirements

A workspace that includes the following:
- Mosaic AI Agent Bricks Preview (Beta) enabled. See Manage Databricks Previews.
- Serverless compute enabled. See Enable serverless compute.
- Unity Catalog enabled. See Enable a workspace for Unity Catalog.
- Partner-powered AI assistive features enabled.
- A workspace in one of the supported regions: us-east-1 or us-west-2.
- Access to Mosaic AI Model Serving.
- Access to foundation models in Unity Catalog through the system.ai schema.
- Access to a serverless budget policy with a nonzero budget.
Ability to use the ai_query SQL function.
You must have input data ready to use. You can choose to provide either:
- A Unity Catalog table. The table name cannot contain any special characters (such as -).
  - If you want to use PDFs, convert them to a Unity Catalog table. See Use PDFs in Agent Bricks.
- At least 3 example inputs and outputs. If you choose this option, you'll need to specify a Unity Catalog schema destination path for the agent, and you must have CREATE REGISTERED MODEL and CREATE TABLE permissions to this schema.
If you want to optimize your agent, you need at least 100 inputs (either 100 rows in a Unity Catalog table or 100 manually-provided examples).

Create a custom LLM agent

Go to Agents in the left navigation pane of your workspace and click Custom LLM.

ABCL task.

Step 1: Configure your agent

On the Configure tab, click Show an example > to expand an example input and model response for a Custom LLM agent.

In the pane below, configure your agent:

Under Describe your task, enter a clear and detailed description of your specialization task, including its purpose and desired outcome.
Provide a labeled dataset, an unlabeled dataset, or a few examples to use to create your agent.

If you want to use PDFs, convert them to a Unity Catalog table first. See Use PDFs in Agent Bricks.

The following data types are supported: string, int, and double.
- Labeled dataset
- Unlabeled dataset
- A few examples
If you select Labeled dataset:
1. Under Select dataset as UC table, click Browse to select the table in Unity Catalog you want to use. The table name cannot contain any special characters (such as -).
  
  The following is an example:
  
  main.model_specialization.customer_call_transcripts
2. In the Input column field, select the column you want to use as your input text. The dropdown menu is automatically populated with columns from your selected table.
3. In the Output column (optional), select the column you want to provide as an example output for the expected transformation. Providing this data helps configure your agent to more accurately adapt to your domain-specific needs.
If you select Unlabeled dataset:
1. Under Select dataset as UC table, click Browse to select the table in Unity Catalog you want to use. The table name cannot contain any special characters (such as -).
2. In the Input column field, select the column you want to use as your input text. The dropdown menu is automatically populated with columns from your selected table.
If you select A few examples:
1. Provide at least 3 examples of inputs and expected outputs for your specialization task. Providing high-quality examples helps configure your specialization agent to better understand your requirements.
2. To add more examples, click + Add.
3. Under Agent destination, select the Unity Catalog schema where you'd like Agent Bricks to help you create a table with evaluation data. You must have CREATE REGISTERED MODEL and CREATE TABLE permissions to this schema.
Name your agent.
Click Create agent.

Step 2: Build and improve your agent

In the Build tab, you can review recommendations to improve your agent, review sample model outputs, and adjust your task instructions and evaluation criteria.

In the Recommendation pane, Databricks provides recommendations to help you define evaluation metrics for your agent and evaluate sample responses as good or bad.

Review the Databricks recommendations for optimizing agent performance.
Review suggested evaluation criteria. These recommended evaluation criteria are automatically inferred to help you optimize your agent.

For each recommendation:
- To accept the recommendation, select Yes. This adds the evaluation criteria in the Agent configuration pane.
- To reject the criteria, select No.
- You can also choose to Dismiss the recommendation.
Under Review results, review sample model inputs and outputs and provide optional human feedback. This evaluation helps improve the model's reponses.

For each sample, select whether or not it was a good response. If No, provide optional feedback on the response and click Save to move onto the next one.
After you've finished reviewing recommendations, review the Agent configuration pane.
1. You can adjust the task instructions to be more specific to improve the model's performance.
2. Review the evaluation criteria you added from the recommendations. You can remove criteria by clicking X.
3. If you'd like to add more evaluation criteria, click + Add to add your own.
Click Update agent to save those changes to your agent. The examples under Review results update to show new example model outputs.

Step 3: Try out and optimize your agent

Try out your agent in workflows across Databricks.

On the Use tab,

Click Try in SQL to open the SQL editor and use ai_query to send requests to your new Custom LLM agent.
(Optional) Click Optimize if you want to optimize your agent for cost.
- Optimization requires at least 100 inputs. If you provided a Unity Catalog dataset, the table must contain at least 100 rows. If you did not provide a dataset, you need to provide at least 100 examples.
- Optimization can take about an hour.
- Making changes to your currently active agent is blocked when optimization is in progress.

When optimization completes, you are directed to the Review tab to view a comparison of your currently active agent and an agent optimized for cost. See (Optional) Step 4: Review and deploy an optimized agent.

(Optional) Select Create pipeline to deploy a pipeline that runs at scheduled intervals to use your agent on new data. See Lakeflow Declarative Pipelines for more information about pipelines.

Use or optimize the CL agent, or create a pipeline.

(Optional) Step 4: Review and deploy an optimized agent

Databricks recommends at least 100 inputs (either 100 rows in your Unity Catalog table or 100 manually-provided examples) to optimize your agent. When you add more inputs, the knowledge base that the agent can learn from increases, which improves agent quality and its response accuracy.

When you select Optimize on the Use tab, Databricks compares multiple different optimization strategies to build and deploy an optimized agent. These strategies include Foundation Model Fine-tuning which uses Databricks Geos.

On the Review tab,

In Evaluation results, you can review the evaluation metrics for the optimized agent. To perform evaluation, Databricks uses metrics based on the evaluation criteria you defined in the Build tab
Click on a request to open more details. Here, you can see a detailed assessment of each evaluation metric, including the rationale for why it passed or failed. This uses Databricks built-in AI judges. You can also inspect the input and response.
After you review these results, select the best model under Deploy best model to an endpoint and click Deploy.

Use PDFs in Agent Bricks

PDFs are not yet supported natively in Agent Bricks: Information Extraction and Custom LLM. However, you can use Agent Brick's UI workflow to convert a folder of PDF files into markdown, then use the resulting Unity Catalog table as input when building your agent. This workflow uses ai_parse_document for the conversion. Follow these steps:

Click Agents in the left navigation pane to open Agent Bricks in Databricks.
In the top right corner, click Use PDFs in Agent Bricks.
In the panel that opens, enter the following fields to create a new workflow to convert your PDFs:
1. Select folder with PDFs: Select the Unity Catalog folder containing the PDFs you want to use.
2. Select destination table: Select the destination schema for the converted markdown table and, optionally, adjust the table name in the field below.
3. Select active SQL warehouse: Select the SQL warehouse to run the workflow.
Click Start import.
You will be redirected to the All workflows tab, which lists all of your PDF workflows. Use this tab to monitor the status of your jobs.

If your workflow fails, click on the job name to open it and view error messages to help you debug.
When your workflow has completed successfully, click on the job name to open the table in Catalog Explorer to explore and understand the columns.
Use the Unity Catalog table as input data in Agent Bricks when configuring your agent.

Limitations

Databricks recommends at least 100 inputs (either 100 rows in your Unity Catalog table or 100 manually-provided samples) to optimize your agent. When you add more inputs, the knowledge base that the agent can learn from increases, which improves agent quality and its response accuracy.
If you provide a Unity Catalog table, the table name cannot contain any special characters (such as -).
Only the following data types are supported as inputs: string, int, and double.
Usage capacity is currently limited to 100k input and output tokens per minute.
Workspaces that use PrivateLink, including storage behind PrivateLink, are not supported.

What can you do with Custom LLM?​

Requirements​

Create a custom LLM agent​​

Step 1: Configure your agent​

Step 2: Build and improve your agent​

Step 3: Try out and optimize your agent​

(Optional) Step 4: Review and deploy an optimized agent​

Use PDFs in Agent Bricks​

Limitations​