Skip to content
#

model-performance

Here are 5 public repositories matching this topic...

Agentic Workflow Evaluation: Text Summarization Agent. This project includes an AI agent evaluation workflow using a text summarization model with OpenAI API and Transformers library. It follows an iterative approach: generate summaries, analyze metrics, adjust parameters, and retest to refine AI agents for accuracy, readability, and performance.

  • Updated Feb 23, 2025
  • Python

Causal analysis framework using Double Machine Learning to quantitatively isolate the effect of model size on deep learning performance while controlling for confounders such as dataset size, training time, and hyperparameters.

  • Updated Feb 14, 2026
  • Python

Improve this page

Add a description, image, and links to the model-performance topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the model-performance topic, visit your repo's landing page and select "manage topics."

Learn more