Migrate LangChain model code to DSPy

In this notebook, you learn how to convert your LangChain model code into DSPy using a LangChain RAG example.

2

3

5

6

Convert your LangChain code to DSPY

The following is the code to convert to DSPy:

rag_chain = (
    {"context": retriever | format_docs, "question": RunnablePassthrough()}
    | [prompt](url)
    | llm
    | StrOutputParser()
)

This standard RAG chain consists of 3 parts:

A prompt template that organizes the LLM query.
A retriever module that fetches the relevant context.
An LLM to answer the query.

The following sections show how you can convert each of these parts into DSPy, where the

Prompt template is converted to a dspy.Signature.
Retriever can be as simple as a callable, or using DSPy global language together with dspy.Retrieve module.
LLM call can be wrapped by a dspy.Module. You can choose to use a basic dspy.Predict or more advanced modules like dspy.ChainOfThought.

Convert prompt template to DSPy signature

A DSPy signature is an abstraction of a prompt that consists of 3 components:

Input fields to define the input.
Output fields to define the output.
Instruction to help the language model understand expected behavior.

These components are combined into the actual prompt sent to the language model. The DSPy signature does not contain every piece of information included in the actual prompt sent to the language model. For example, signatures do not include information regarding "training" (optimizer.compile() calls). See the DSPy signature documentation for more information.

The easiest way to construct the input and output fields in a dspy.Signature is using the syntax "{input_field_name_1}, {input_field_name_2}, ... -> {output_field_name_1}, {output_field_name_2}..." and wrapping it in the dspy.signatures.make_signature utility function. instructions is optional, but can help the LLM function better.

The following creates your signature and instructions for LLM calls. For this RAG use case, you can simply write instructions="Answer the question based on context".

9

10

Define the DSPy program

Next, you can convert the LangChain chain into a DSPy program. These are callables expressed in different contexts. Similar to writing a PyTorch model, you can write a DSPy program in two parts:

Define all submodules of your program inside the __init__() method.
Implement the forward() method to define prediction / inference logic.

Let's take a look at your code!

The following directly passes the vector store-based LangChain retriever to DSPy as the retriever and wraps the language model calls in dspy.ChainOfThought along with the signature you previously created.

12

14

16

Summary

You have successfully converted our Langchain LCEL chain into a DSPy program. To recap,

You created a class that subclasses dspy.Module to represent your program.
You defined a dspy.ChainOfThought module to generate responses using a configured LLM.
You translated other parts of the LCEL LangChain into Python function calls within the DSPy forward() method. For example, self.retriever() is directly called inside the forward() method, and the retrieved documents are formatted. This is equivalent to the "context": retriever | format_docs part of the LCEL chain.

These 3 steps apply to other LCEL chains, not just RAG use cases.

If you are only interested in the conversion part, then that's it! To use DSPy to improve the performance of your application, see how to optimize a migrated RAG chain.

migrate-langchain-dspy(Python)

Migrate LangChain model code to DSPy

LangChain RAG example

Convert your LangChain code to DSPY

Convert prompt template to DSPy signature

Define the DSPy program

Summary