Mosaic AI Agent Evaluation(Python)

Loading...

Mosaic AI Agent Evaluation example

The following code shows how to call and test Agent Evaluation on previously generated outputs. It returns a dataframe with evaluation scores calculated by LLM judges that are part of Agent Evaluation.

Evaluating: 0%| | 0/2 [Elapsed: 00:00, Remaining: ?]