Winogrande_ChatGPT

The evaluation of Winogrande Debiased Validation set on ChatGPT

I finished the whole 1267 samples.

The accuracy over 1267 samples: %62.75

Sometimes the model answers like Option1: Answer, and sometimes it only responds with a single number.

I decided to ask model to explain the answers. You can find the explanations in the dataframe.

Generic text:

Choose the option that is a better fit for the gap in the sentence. The answer should be in following form:
'''
 OptionX
 Explanation: ...
'''
Sentence : {}
    
Option1 : {}
Option2 : {}

Old generic text:

Choose the option that is a better fit for the gap in the sentence. Do not explain. Only say 1 or 2
Sentence : {}
    
Option1 : {}
Option2 : {}

I will update this page regularly.

Citation to the author's paper

@InProceedings{ai2:winogrande,
title = {WinoGrande: An Adversarial Winograd Schema Challenge at Scale},
authors={Keisuke, Sakaguchi and Ronan, Le Bras and Chandra, Bhagavatula and Yejin, Choi
},
year={2019}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Winogrande_ChatGPT

The accuracy over 1267 samples: %62.75

Generic text:

Old generic text:

Files

README.md

Latest commit

History

README.md

File metadata and controls

Winogrande_ChatGPT

The accuracy over 1267 samples: %62.75

Generic text:

Old generic text: