Exploratory data analysis and interactive model-understanding and evaluation tool for chatbot training data and feedback
Updated Feb 2, 2024 - Jupyter Notebook
A Jupyter-based tool for calculating the Greedy Matching, Vector Extrema, and Average Embedding evaluation metrics for generative AI chatbots.
Evaluation results and experimental data for TRACER, demonstrating its effectiveness in discovering chatbot functionalities and detecting errors with coverage analysis and mutation testing.