-
Notifications
You must be signed in to change notification settings - Fork 139
docs: add Agent eval notebook #1812
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
}, | ||
"expected_output": { | ||
"response_facts": [ | ||
"By default, traces are retained indefinetly", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo: 'indefinetly' should be 'indefinitely'.
"By default, traces are retained indefinetly", | |
"By default, traces are retained indefinitely", |
|
||
## 3. Create an agent that can search the Langfuse docs | ||
|
||
We use the Lagfuse Docs MCP Server to provide tools to the agent: https://langfuse.com/docs/docs-mcp |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo: 'Lagfuse Docs MCP Server' should be 'Langfuse Docs MCP Server'.
We use the Lagfuse Docs MCP Server to provide tools to the agent: https://langfuse.com/docs/docs-mcp | |
We use the Langfuse Docs MCP Server to provide tools to the agent: https://langfuse.com/docs/docs-mcp |
|
||
|
||
```python | ||
DATSET_NAME = "pydantic-ai-mcp-agent-evaluation" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo: The variable name 'DATSET_NAME' seems to be misspelled. Consider renaming it to 'DATASET_NAME'.
Reasoning: The output contains "watchTv" which was not expected. | ||
Score: 0 | ||
|
||
## This excercise |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo: 'excercise' should be 'exercise'.
## This excercise | |
## This exercise |
📦 Next.js Bundle Analysis for langfuse-docsThis analysis was generated by the Next.js Bundle Analysis action. 🤖 New Page AddedThe following page was added to the bundle from the code in this PR:
|
Important
Adds a new notebook for evaluating Pydantic AI MCP Agents with Langfuse, updates to Python 3.12, and modifies dependencies and scripts accordingly.
example_pydantic_ai_mcp_agent_evaluation.ipynb
tocookbook/_routes.json
andpages/guides/cookbook/example_pydantic_ai_mcp_agent_evaluation.mdx
..python-version
andpyproject.toml
.scripts/update_cookbook_docs.sh
to use Python 3.12.ipykernel
anduv
topyproject.toml
under dev dependencies.uv.lock
to reflect new dependencies.This description was created by
for 4b00fe4. You can customize this summary. It will automatically update as commits are pushed.