The official evaluation suite and dynamic data release for MixEval.
Python Multi-Process Execution Pool: a concurrent, asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores, and caching), load balancing, and profiling of external apps on NUMA architectures.
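For a sense of the general pattern such a pool implements, here is a minimal sketch using only Python's standard library; per-task memory limits, CPU affinity, NUMA placement, and caching are omitted, and none of the names below come from the project itself:

    # Minimal sketch: asynchronous multi-process execution with a per-task
    # timeout, via the standard library. The project described above adds
    # memory limits, CPU affinity/NUMA placement, caching, and profiling.
    from concurrent.futures import ProcessPoolExecutor, TimeoutError

    def work(n: int) -> int:
        # Stand-in for an external application or heavy computation.
        return sum(i * i for i in range(n))

    if __name__ == "__main__":
        with ProcessPoolExecutor(max_workers=4) as pool:
            futures = {pool.submit(work, n): n for n in (10_000, 1_000_000)}
            for fut, n in futures.items():
                try:
                    print(n, fut.result(timeout=5))  # 5-second timeout per task
                except TimeoutError:
                    print(n, "timed out")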
MLOS is a project to enable autotuning for systems.
NPBench - A Benchmarking Suite for High-Performance NumPy
A toolkit for auto-generation of OpenAI Gym environments from RDDL description files.
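As an illustration, a generated environment is driven through the standard Gym interaction loop; the environment id below is hypothetical, and the snippet assumes the classic (pre-0.26) Gym API:

    import gym

    # "GeneratedRDDLEnv-v0" is a hypothetical id; the real id depends on
    # the RDDL description file the toolkit was given.
    env = gym.make("GeneratedRDDLEnv-v0")
    obs = env.reset()
    done = False
    while not done:
        action = env.action_space.sample()  # random policy, illustration only
        obs, reward, done, info = env.step(action)  # classic 4-tuple API
    env.close()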
The Arline Benchmarks platform lets you benchmark various quantum circuit mapping/compression algorithms against each other on a list of predefined hardware types and target circuit classes.
Benchmarking machine learning inferencing on embedded hardware.
Telco pIPeline benchmarking SYstem
Benchmarking framework for Feature Selection and Feature Ranking algorithms 🚀
Framework for benchmarking deep learning operators for Apache MXNet
Deterministic runtime for agent evaluation
A framework for benchmarking in Python
Crossbar Parasitics Simulator – A tool for benchmarking parasitic resistance models in RRAM crossbars and evaluating neural networks under realistic hardware constraints.
STELLAR: A Search-Based Testing Framework for Large Language Model Applications (SANER 2026)
PARROT (Performance Assessment of Reasoning and Responses On Trivia) is a novel benchmarking framework designed to evaluate Large Language Models (LLMs) on real-world, complex, and ambiguous QA tasks.
How To Measure And Improve Code Efficiency with Pytest Benchmark (The Ultimate Guide)
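As a taste of what that guide covers, pytest-benchmark exposes a benchmark fixture that times repeated calls and reports statistics (min/max/mean/stddev); a minimal, runnable example (the fib function here is ours, not the guide's):

    # test_fib.py -- run with: pip install pytest-benchmark && pytest
    def fib(n: int) -> int:
        # Deliberately slow recursive implementation, as a timing target.
        return n if n < 2 else fib(n - 1) + fib(n - 2)

    def test_fib(benchmark):
        # pytest-benchmark's `benchmark` fixture calls fib(10) repeatedly
        # and collects timing statistics in the test report.
        result = benchmark(fib, 10)
        assert result == 55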
A modular research framework engineered to benchmark CNN models across multiple sign language datasets. Featuring a scalable architecture (Factory Pattern), optimized HSV-based hand segmentation, and real-time inference capabilities for edge deployment.
A lightweight benchmarking and visualization framework to analyze long-context failures in large language models (LLMs) using synthetic datasets, retrieval-augmented methods, and evaluation metrics.
🌐 Evaluate LVLMs' ability to reconstruct dynamic, interactive webpages from user interaction videos with the IWR-Bench benchmark.