Line-of-Business Agent with Evaluation

This project demonstrates a line-of-business (LOB) chatbot implementation using a Support Ticket Management System as the sample application. It showcases both a functional workflow for managing support tickets and a methodology for evaluating an LOB agent's performance in business contexts.

Key Features

Support Ticket Management Chatbot

The Support Ticket Management chatbot is built with the Microsoft Agent Framework and lets users:

  • Create and update support tickets
  • Manage action items within tickets
  • Search historical tickets for reference

Refer to the architecture documentation for more details.
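
To illustrate the pattern used in tools/support_ticket_system/, here is a minimal sketch of a function-calling tool written as a plain annotated Python function; the name, parameters, and return shape are hypothetical, not the project's actual API:

from typing import TypedDict

class TicketSummary(TypedDict):
    ticket_id: str
    title: str
    status: str

def search_tickets(query: str, status: str = "any") -> list[TicketSummary]:
    """Search historical support tickets matching a free-text query.

    Function-calling frameworks generally derive a JSON schema from the
    signature and docstring so the LLM can decide when to invoke the tool.
    """
    # Hypothetical stand-in for the project's ticket-store lookup.
    return [TicketSummary(ticket_id="T-1001", title="VPN outage", status="closed")]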

Evaluation Framework

The project includes an evaluation framework designed to address the challenges of assessing non-deterministic, LLM-powered agents in business applications. Key features:

  • LLM-based user agent for simulating user-chatbot interactions
  • Test case factory with scenario templating and injection of business data to run evaluations at scale
  • Azure AI Evaluation SDK integration for calculating metrics and for tracking and comparing evaluation runs in Azure AI Foundry
  • LLM-powered error analysis with actionable summaries

Refer to the evaluation documentation for more information.
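
To give a flavor of the simulated-user approach, the sketch below drives a chatbot with an LLM that role-plays a user from a scenario description. It assumes the openai package, standard Azure OpenAI environment variables, and a chatbot_reply callable standing in for the real agent; none of these names come from the project itself:

import os
from openai import AzureOpenAI

client = AzureOpenAI(
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_version="2024-06-01",
)

SCENARIO = (
    "Role-play an employee reporting a broken laptop to an IT support "
    "chatbot. Write only the user's next message."
)

def simulated_user_turn(transcript: list[dict]) -> str:
    # From the simulated user's point of view the roles are flipped:
    # its own past messages are "assistant" turns, the chatbot's are "user".
    flipped = [
        {"role": "assistant" if m["role"] == "user" else "user",
         "content": m["content"]}
        for m in transcript
    ]
    response = client.chat.completions.create(
        model=os.environ["AZURE_OPENAI_DEPLOYMENT"],  # hypothetical variable name
        messages=[{"role": "system", "content": SCENARIO}, *flipped],
    )
    return response.choices[0].message.content

def run_simulation(chatbot_reply, max_turns: int = 6) -> list[dict]:
    # Alternate simulated-user and chatbot turns, collecting a transcript.
    transcript: list[dict] = []
    for _ in range(max_turns):
        user_msg = simulated_user_turn(transcript)
        transcript.append({"role": "user", "content": user_msg})
        transcript.append({"role": "assistant", "content": chatbot_reply(user_msg)})
    return transcript

The resulting transcript is the kind of artifact the evaluators then score against the ground-truth datasets.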

Initial Setup

  1. Deploy an OpenAI chat model in Azure (preferably GPT-4o or better) - see documentation.

  2. Once your model is ready, create a .env file by copying .env.template and replacing the values with your configuration (an illustrative example follows this list).

  3. Open this project in Visual Studio Code using the Dev Containers extension, which ensures all dependencies are installed in an isolated environment. (Alternatively, to run the project on your local machine with a manually created Python virtual environment, set PYTHONPATH=. in your .env file and run make install.)
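
The exact variables live in .env.template; as a rough illustration only, an Azure OpenAI setup typically needs values along these lines (the names below are assumptions, not necessarily those in the template):

AZURE_OPENAI_ENDPOINT=https://<your-resource>.openai.azure.com/
AZURE_OPENAI_API_KEY=<your-api-key>
AZURE_OPENAI_DEPLOYMENT=gpt-4o
PYTHONPATH=.  # per step 3, only for a local (non-container) setup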

Running the Sample

make chatbot  # Runs the chatbot application
make chatbot-eval  # Runs evaluation against ground truth datasets

Project Structure

  • app/chatbot/ - Support Ticket Management implementation
    • tools/support_ticket_system/ - Agent Framework tools for function calling
    • data_models/ - Data structures for tickets and action items (see the sketch after this list)
    • workflow-definitions/ - Workflow definitions that guide conversations
  • evaluation/ - Evaluation framework components
    • evaluation_service.py - Core evaluation service
    • chatbot/evaluate.py - Chatbot evaluation entry point
    • chatbot/evaluators/ - Specialized evaluators for different metrics
    • chatbot/ground-truth/ - Ground truth datasets and related code used for evaluation
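
For orientation, the structures in data_models/ can be pictured roughly as follows; this is a minimal sketch with hypothetical field names, not the project's exact schema:

from dataclasses import dataclass, field

@dataclass
class ActionItem:
    description: str
    completed: bool = False

@dataclass
class SupportTicket:
    ticket_id: str
    title: str
    description: str
    status: str = "open"
    action_items: list[ActionItem] = field(default_factory=list)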

Migrating the Sample

This sample can be used as a template to create chatbots for other line-of-business applications. To migrate this sample to your specific use case:

  1. In Visual Studio Code, run the Chat: Run Prompt command from the Command Palette.
  2. Choose the migrate prompt to attach it to the Copilot chat.
  3. Clearly describe your target use case and business requirements.
  4. Review the generated migration plan and adapt as required.
  5. Implement the plan phase-by-phase, testing thoroughly at each stage.
