[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
Updated Jun 16, 2024 · Python
Success and Failure Linguistic Simplification Annotation 💃
Multidimensional Evaluation for Text Style Transfer Using ChatGPT. Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022)
MONSERRATE is a dataset created specifically to evaluate Question Generation systems. It has, on average, 26 questions associated with each source sentence, aiming to serve as an "exhaustive" reference.
Requirements-to-Running-Code benchmark for AI/LLM systems and frameworks—builds, runs, and auto-scores apps across functional and non-functional metrics.