Deploy a version of a RAG Application

Preview

This feature is in Private Preview. To try it, reach out to your Databricks contact.

Looking for a different RAG Studio doc? Go to the RAG documentation index

This guide walks you through deploying a version of the application to the development Environment so that you can chat with it through the 💬 Review UI.

Note

The default RAG Studio template ships with a fully functioning application, so you can deploy the code as-is. To learn how to create a new Version, see Create versions of your RAG application to iterate on the app’s quality.

Step 1: Build and deploy the sample application

Note

This step runs the 🗃️ Data Processor, packages the 🔗 Chain into a Unity Catalog model, and then deploys the 🔗 Chain to Model Serving.

  1. Deploy the application to your workspace by running the following command in your console. This step will take approximately 15-30 minutes.

    ./rag create-rag-version -e dev
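
Because the deployment runs for 15-30 minutes and its console output is needed again in Step 2, it can help to keep a copy of that output in a file. The following is a minimal sketch and not part of RAG Studio: `run_and_log` and `deploy.log` are hypothetical names chosen for illustration, and the only RAG Studio command involved is the one shown above.

```shell
# run_and_log LOGFILE CMD [ARGS...]
# Hypothetical helper (not a RAG Studio command).
# Runs CMD, streams its combined stdout/stderr to the terminal and to LOGFILE,
# and returns CMD's own exit status instead of tee's.
run_and_log() {
  logfile="$1"
  shift
  # tee would otherwise mask the command's exit status, so the status is
  # written to a side file inside the pipeline and read back afterwards.
  { "$@" 2>&1; echo "$?" > "$logfile.status"; } | tee "$logfile"
  status="$(cat "$logfile.status")"
  rm -f "$logfile.status"
  return "$status"
}

# Example usage (run from the directory that contains the ./rag script):
#   run_and_log deploy.log ./rag create-rag-version -e dev
```

If the deployment fails partway through, the helper returns a non-zero status and `deploy.log` still holds everything printed up to that point.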
    

Step 2: View your RAG Application in the 💬 Review UI

  1. Congrats! You have deployed a fully functioning RAG application, complete with logging, the ability to collect feedback from users and LLM-Judges, and automated quality/cost/latency metric computation.

    Note

    Although this tutorial only explores the application as a way to get started with RAG Studio, the application is ready to be deployed to your production environment.

  2. In the console, you will see output similar to the following. Open the URL in your web browser to open the 💬 Review UI.

    ...truncated for clarity of docs...
    =======
    Task deploy_chain_task:
    Your Review UI is now available. Open the Review UI here: https://<workspace-url>/ml/review/model/catalog.schema.rag_studio_databricks-docs-bot/version/1/environment/dev
    
  3. You can now interact with the RAG application!
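
If the console output was saved to a file, the Review UI URL from step 2 can be pulled out of it with standard tools. This is a sketch under stated assumptions: it assumes the output contains the phrase `Open the Review UI here:` followed by the URL, and that the URL path has the `model/.../version/.../environment/...` shape shown in the sample output. The function names `extract_review_url` and `parse_review_url` are hypothetical and are not part of RAG Studio.

```shell
# extract_review_url: reads console output on stdin and prints the first
# Review UI URL found, or nothing if no matching line exists.
# Assumes the wording "Open the Review UI here: <url>" from the sample output.
extract_review_url() {
  sed -n 's|.*Open the Review UI here: \(https://[^[:space:]]*\).*|\1|p' | head -n 1
}

# parse_review_url URL: prints the model name, version, and environment
# encoded in the URL path, assuming the path shape from the sample output.
parse_review_url() {
  printf '%s\n' "$1" | sed -n \
    's|.*/ml/review/model/\([^/]*\)/version/\([^/]*\)/environment/\([^/]*\)$|model=\1 version=\2 environment=\3|p'
}

# Example usage, assuming the output was saved to a file named deploy.log:
#   url="$(extract_review_url < deploy.log)"
#   parse_review_url "$url"
```

Both functions print nothing when the input does not match the assumed format, so an empty result is a sign that the real output differs from the sample.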

    [Image: RAG application]

Data flow

[Image: RAG review app]

Follow the next tutorial!

View logs & assessments