
Pull requests: huggingface/evaluate

Pull requests list

add keras callback
#182 opened Jul 11, 2022 by lvwerra Draft updated Aug 5, 2022
Customize Gradio Interface that is Launched
#298 opened Sep 22, 2022 by abidlabs updated Sep 27, 2022
Allow EvaluationSuite to receive a Preprocessor
#366 opened Nov 18, 2022 by NimaBoscarino updated Nov 25, 2022
fix outputs of WER/CER
#381 opened Dec 8, 2022 by lvwerra updated Dec 9, 2022
Scikit-learn integration
#383 opened Dec 8, 2022 by lvwerra updated Dec 19, 2022
Normalize exact match and F1 Score
#409 opened Feb 1, 2023 by avinashsai updated Feb 8, 2023
Add pyarrow
#429 opened Mar 9, 2023 by fabaff updated Mar 14, 2023
[WIP] Add Panoptic Quality (PQ)
#408 opened Jan 30, 2023 by NielsRogge Draft updated Mar 15, 2023
2 tasks
Allow EvaluationSuite to receive a custom Evaluator
#367 opened Nov 18, 2022 by NimaBoscarino updated Mar 27, 2023
Correct a typo in sari.py
#430 opened Mar 16, 2023 by felixonmars updated Mar 30, 2023
Fix a bit typo in docs README
#439 opened Mar 31, 2023 by guspan-tanadi updated Apr 13, 2023
Fix incorrect error instructing to "pip install sklearn"
#436 opened Mar 24, 2023 by dongreenberg updated Apr 13, 2023
[feature] use comet model from a local path
#390 opened Dec 21, 2022 by k-blo updated Apr 15, 2023
Add the support to pass a pre-loaded model and a pre-loaded tokenizer
#448 opened Apr 19, 2023 by JulesGM updated Apr 19, 2023
feat: custom evaluator and metric name to support llm evaluation #433
#459 opened May 15, 2023 by fecet updated May 15, 2023
Add kendall tau distance (not as a submodule)
#446 opened Apr 13, 2023 by unna97 updated May 24, 2023
ANLS metric
#413 opened Feb 4, 2023 by QuadV updated Jun 1, 2023
[doc build] Use secrets
#468 opened Jun 9, 2023 by mishig25 updated Jun 9, 2023
Fix mean computation in Mahalanobis distance
#471 opened Jul 12, 2023 by j0ma updated Jul 12, 2023
Fix SQuAD v2 metric docs on references format
#474 opened Jul 18, 2023 by bryant1410 updated Jul 18, 2023
Fix formatting of model names in radar plot
#479 opened Jul 29, 2023 by potamides updated Jul 29, 2023
Adding tokenizer_id to perplexity evaluation
#482 opened Aug 4, 2023 by Albertoimpl updated Aug 9, 2023
when using custom models for perplexity metrix, support trust_remote_code
#496 opened Sep 13, 2023 by winglian updated Sep 13, 2023
Add support for glue metric in EvaluationSuite
#488 opened Aug 25, 2023 by tybrs updated Sep 14, 2023
removed unnecessary module_type attribute
#499 opened Oct 3, 2023 by leo-gan updated Oct 3, 2023