[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
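ICE-Score works by prompting an LLM with an evaluation criterion and asking it to return a numeric rating for a code snippet. A minimal sketch of that idea using the OpenAI client follows; the prompt wording, the 0 to 4 scale, and the model name are illustrative assumptions, not the paper's exact template:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Illustrative prompt; ICE-Score's actual templates differ.
EVAL_TEMPLATE = """You will be given a programming problem and a candidate solution.
Rate the functional correctness of the code on a scale of 0 (completely wrong)
to 4 (fully correct). Reply with the integer rating only.

Problem:
{problem}

Code:
{code}
"""

def llm_code_score(problem: str, code: str, model: str = "gpt-4o-mini") -> int:
    """Ask the model for a 0-4 rating and parse the integer from its reply."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user",
                   "content": EVAL_TEMPLATE.format(problem=problem, code=code)}],
        temperature=0,
    )
    return int(response.choices[0].message.content.strip())
```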
MONSERRATE is a dataset created specifically to evaluate Question Generation systems. It contains, on average, 26 questions associated with each source sentence, aiming to serve as an "exhaustive" reference.
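With that many references per sentence, a generated question can be scored against all of them at once, rewarding a match with any acceptable phrasing. A minimal sketch using NLTK's multi-reference sentence-level BLEU; the metric choice and the example references are assumptions, not MONSERRATE's prescribed protocol:

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

# Hypothetical references for one source sentence; MONSERRATE provides ~26 per sentence.
references = [
    "who wrote the novel ?".split(),
    "who is the author of the novel ?".split(),
    "which writer produced the novel ?".split(),
]
hypothesis = "who is the author of the novel ?".split()

# sentence_bleu accepts multiple references and credits the closest match.
score = sentence_bleu(references, hypothesis,
                      smoothing_function=SmoothingFunction().method1)
print(f"BLEU against all references: {score:.3f}")
```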
Multidimensional Evaluation for Text Style Transfer Using ChatGPT; Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022)
Automatic evaluation of textual answers on the Kaggle Automated Essay Scoring (AES) dataset.
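The Kaggle AES competition scored submissions with quadratic weighted kappa, which measures agreement between predicted and human scores while penalizing large disagreements more heavily. A minimal self-contained implementation (variable names are illustrative):

```python
import numpy as np

def quadratic_weighted_kappa(y_true, y_pred):
    """Quadratic weighted kappa between two integer rating vectors."""
    y_true = np.asarray(y_true, dtype=int)
    y_pred = np.asarray(y_pred, dtype=int)
    min_r = min(y_true.min(), y_pred.min())
    max_r = max(y_true.max(), y_pred.max())
    n = max_r - min_r + 1
    if n == 1:
        return 1.0  # all ratings identical: perfect agreement by convention

    # Observed co-occurrence matrix of (true, predicted) ratings.
    observed = np.zeros((n, n))
    for t, p in zip(y_true, y_pred):
        observed[t - min_r, p - min_r] += 1

    # Expected matrix under independence of the two raters.
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / observed.sum()

    # Quadratic disagreement weights: zero on the diagonal, growing with distance.
    i, j = np.indices((n, n))
    weights = ((i - j) ** 2) / ((n - 1) ** 2)

    return 1.0 - (weights * observed).sum() / (weights * expected).sum()
```

In practice, `sklearn.metrics.cohen_kappa_score(y_true, y_pred, weights="quadratic")` computes the same quantity.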
Success and Failure Linguistic Simplification Annotation 💃
An AI expert system to automatically evaluate subjective answers submitted in online assessments.
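One common way to grade free-text answers automatically is to compare each submission against a reference answer with sentence embeddings. A minimal sketch of that approach; the model choice and marking scale are assumptions, not necessarily what this repo does:

```python
from sentence_transformers import SentenceTransformer, util

# "all-MiniLM-L6-v2" is an illustrative small general-purpose embedding model.
model = SentenceTransformer("all-MiniLM-L6-v2")

def score_answer(reference: str, student: str, max_marks: float = 5.0) -> float:
    """Scale the cosine similarity between reference and student answers to marks."""
    embeddings = model.encode([reference, student], convert_to_tensor=True)
    similarity = util.cos_sim(embeddings[0], embeddings[1]).item()
    return round(max(similarity, 0.0) * max_marks, 2)

print(score_answer(
    "Photosynthesis converts light energy into chemical energy in plants.",
    "Plants use sunlight to make chemical energy through photosynthesis.",
))
```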