MLflow now integrates TruLens scorers, bringing the Agent GPA (Goal-Plan-Action) framework to agent trace evaluation via mlflow.genai.evaluate(). The integration adds 10 scorers that analyze the full span tree of an agent's execution—covering plan quality, tool selection, plan adherence, tool calling validity, logical

5m read timeFrom mlflow.org
Post cover image
Table of contents
The Agent GPA Framework ​How Trace Evaluation Catches What Output Evaluation Misses ​Combining Agent and RAG Evaluation ​Getting Started ​Resources ​Provenance ​
1 Comment

Sort: