Part 2: Create and optimize a DSPy program for RAG

This notebook is the second of two notebooks for creating a DSPy program for RAG. It shows how to create and optimize that program.

Requirements

This notebook assumes:

You have completed and run the notebook "Part 1: Prepare data and vector search index for a RAG DSPy program".
You have specified the following information in the notebook widgets:
- vs_index: Databricks Vector Search index to be used in the RAG program.
- source_catalog: Unity Catalog (UC) catalog containing the schema where the index is located.
- source_schema: UC schema containing the Vector Search index.
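
The three widget values above can be combined into a fully qualified index name, because Unity Catalog addresses objects as catalog.schema.object. The following is a minimal sketch only: the helper full_index_name and the sample values are hypothetical, and it assumes vs_index holds the bare index name rather than an already qualified one.

```python
def full_index_name(source_catalog: str, source_schema: str, vs_index: str) -> str:
    """Join the widget values into a three-level Unity Catalog name.

    Hypothetical helper for illustration; assumes vs_index is the bare
    index name, not an already qualified catalog.schema.index string.
    """
    parts = [source_catalog.strip(), source_schema.strip(), vs_index.strip()]
    # Fail early if a widget was left empty.
    if any(not part for part in parts):
        raise ValueError("source_catalog, source_schema and vs_index must all be set")
    return ".".join(parts)


# Sample values; in the notebook these would come from the widgets.
index_name = full_index_name("main", "rag_demo", "docs_index")
print(index_name)
```

Validating the values before joining them surfaces an empty widget immediately instead of as a lookup failure later in the notebook.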

A DSPy program consists of a Python class that inherits from dspy.Module and implements the forward() method, which runs the following steps:

Query a Databricks Vector Search index to retrieve document chunks (context) related to the request.
Generate a response by sending the request and the context containing the document chunks to an LLM.

The __init__ method initializes the resources that the forward() method uses. In this example, the resources are:

retrieve: Databricks Vector Search retriever
lm: Databricks Foundation Model pay-per-token Llama3-1-70B-instruct
response_generator: The prediction technique, in this case dspy.Predict, which uses an LLM to process the retrieved documents and instructions and generate a response. Other prediction techniques include dspy.ChainOfThought and dspy.ReAct.
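
The shape of such a program can be sketched without any dependencies. The example below is not the notebook's code: it mirrors the __init__/forward() structure in plain Python so it runs without a workspace, and the Prediction dataclass, the stub retriever and the stub generator are all hypothetical stand-ins. In the actual program the class would inherit from dspy.Module, retrieve would be the Databricks Vector Search retriever, and response_generator would be a dspy.Predict predictor backed by the configured LLM.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Prediction:
    """Hypothetical stand-in for the result object a predictor returns."""
    response: str


class RAG:
    """Dependency-free sketch; the real class would inherit from dspy.Module."""

    def __init__(
        self,
        retrieve: Callable[[str], List[str]],
        response_generator: Callable[[List[str], str], Prediction],
    ):
        # Resources that forward() relies on.
        self.retrieve = retrieve
        self.response_generator = response_generator

    def forward(self, request: str) -> Prediction:
        # Step 1: retrieve document chunks (context) related to the request.
        context = self.retrieve(request)
        # Step 2: generate a response from the context and the request.
        return self.response_generator(context, request)


# Stub resources (assumptions) so the sketch runs without an index or an LLM.
DOCS = [
    "DSPy programs subclass dspy.Module.",
    "Vector Search returns document chunks.",
]


def stub_retrieve(request: str) -> List[str]:
    # Naive keyword match standing in for a vector similarity search.
    words = request.lower().split()
    return [doc for doc in DOCS if any(word in doc.lower() for word in words)]


def stub_generate(context: List[str], request: str) -> Prediction:
    # Stands in for the LLM call; only reports how much context it received.
    return Prediction(response=f"{len(context)} chunk(s) used for: {request}")


rag = RAG(stub_retrieve, stub_generate)
result = rag.forward("What does Vector Search return?")
print(result.response)
```

Passing the retriever and the generator into __init__ keeps forward() limited to the two steps, which is also what makes it straightforward to swap the prediction technique, for example to dspy.ChainOfThought, without touching the retrieval step.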
