
Optimize your migrated DSPy program

After you migrate your LangChain code to DSPy, you can improve your application's quality with DSPy's automatic prompt optimization. To demonstrate, this notebook walks you through the training phase and compares the optimized RAG program's performance against the original LangChain LCEL chain.

To optimize a DSPy program, you need:

  1. A scoring function or metric that measures how your program performs.
  2. A small training set (trainset) of examples that supervises the automatic prompt engineering.
  3. A small validation set (valset) of examples that validates the optimization result.

This notebook assumes you have migrated your LangChain code to DSPy.

Prepare training dataset

This RAG example is built on top of the article https://lilianweng.github.io/posts/2023-06-23-agent/. You can manually select a few questions that the article can answer and write the answers yourself. Or, if your goal is to save cost by using a small language model (LM) to match the performance of a large LM, you can have the large LM generate the question-label pairs for you.

The notebook prepares six question-answer pairs and converts them to a DSPy dataset. To build a DSPy dataset, wrap each question-answer pair in a dspy.Example and specify the name of the input field: question.
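
As a minimal sketch of the conversion pattern (the two question-answer pairs below are illustrative placeholders, not the notebook's actual six pairs):

```python
import dspy

# Illustrative question-answer pairs about the article (hypothetical; replace
# with the pairs you prepared).
qa_pairs = [
    {
        "question": "What are the three main components of an LLM-powered autonomous agent?",
        "answer": "Planning, memory, and tool use.",
    },
    {
        "question": "What is task decomposition used for?",
        "answer": "Breaking a complex task into smaller, manageable subtasks.",
    },
]

# Wrap each pair in a dspy.Example and mark `question` as the input field.
trainset = [
    dspy.Example(question=pair["question"], answer=pair["answer"]).with_inputs("question")
    for pair in qa_pairs
]
```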

Define metrics

Now that you have a training set, you can define how to score your program's performance.

A common way to evaluate a RAG application is to use an LLM to judge its responses against specific criteria. This example uses gpt-4o-mini as the judge:

  • If the answer is faithful to the retrieved document context (that is, it contains no hallucinations), faithfulness is 1; otherwise it is 0.
  • If the answer is correct, correctness is 1; otherwise it is 0.

The total score is the sum of faithfulness and correctness, which can be 0, 1, or 2.

Next, define an LLM-judge evaluation function, metric(), that scores your RAG application.
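
The sketch below shows one way to write such a judge. It assumes the migrated RAG program's prediction exposes answer and context fields and that each example carries a reference answer; the signature and field names are illustrative, not the notebook's exact code:

```python
import dspy

# Separate judge LM so scoring does not depend on the program's own LM.
judge_lm = dspy.LM("openai/gpt-4o-mini")

class Assess(dspy.Signature):
    """Judge a predicted answer for faithfulness and correctness."""

    context = dspy.InputField(desc="Retrieved document context")
    question = dspy.InputField()
    answer = dspy.InputField(desc="Predicted answer")
    gold_answer = dspy.InputField(desc="Reference answer")
    faithful = dspy.OutputField(desc="True if the answer is grounded in the context, else False")
    correct = dspy.OutputField(desc="True if the answer matches the reference answer, else False")

def metric(example, pred, trace=None):
    # Run the judge under the judge LM instead of the default LM.
    with dspy.settings.context(lm=judge_lm):
        assessment = dspy.Predict(Assess)(
            context=pred.context,
            question=example.question,
            answer=pred.answer,
            gold_answer=example.answer,
        )
    faithfulness = int(str(assessment.faithful).strip().lower().startswith("true"))
    correctness = int(str(assessment.correct).strip().lower().startswith("true"))
    # Total score: 0, 1, or 2.
    return faithfulness + correctness
```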

Optimize your DSPy program with DSPy Optimizers

Now that you have your training dataset and metric, the last step is to put the pieces together. Similar to PyTorch training, you create an optimizer that manages the optimization process. This example uses dspy.teleprompt.BootstrapFewShotWithRandomSearch as the optimizer. See the DSPy optimizer guide for more information.
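
A minimal configuration might look like the following; the demo and candidate counts are illustrative, not the notebook's exact values:

```python
from dspy.teleprompt import BootstrapFewShotWithRandomSearch

# The optimizer bootstraps few-shot demonstrations from the trainset and
# randomly searches over candidate programs, scoring each with metric().
optimizer = BootstrapFewShotWithRandomSearch(
    metric=metric,
    max_bootstrapped_demos=2,   # demos generated by running the program itself
    max_labeled_demos=2,        # demos copied directly from the trainset
    num_candidate_programs=4,   # number of candidate programs to evaluate
)
```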

Call the compile() method to kick off training.
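
Assuming the migrated program lives in a variable named rag (a hypothetical name) and your examples are split into trainset and valset, the call looks roughly like this:

```python
# Compile (optimize) the program: bootstrap demos on the trainset and select
# the best candidate using metric() scores on the valset.
optimized_rag = optimizer.compile(rag, trainset=trainset, valset=valset)
```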

Finally, test the optimized RAG application and compare it with the original LCEL chain. Because text generation is randomized, the output can vary from call to call, but the optimized RAG application's output is generally more concise and informative.
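
For example, assuming the original chain was kept in a variable named lcel_chain (a hypothetical name), a side-by-side check might look like:

```python
question = "What is task decomposition?"

# Optimized DSPy RAG program.
print(optimized_rag(question=question).answer)

# Original LangChain LCEL chain, for comparison.
print(lcel_chain.invoke(question))
```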

Congratulations! Now you have an optimized RAG application. For more DSPy examples and tutorials, see the DSPy documentation.