@rbawden rbawden commented May 22, 2022

Automatically created prompts for MT using the Flores-101 dataset.

Contains prompts for all language directions using 31 BigScience languages (1 English prompt per language direction = 930 templates).

Question: is this format OK, or should I separate out the templates intended for the upcoming evaluation (i.e. only into and from English)? This depends on how the notion of subtask is going to be defined and whether it will be possible to select only certain templates.
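
For reference, the template count above can be checked with a minimal sketch. The language codes here are hypothetical placeholders rather than the actual BigScience language list, and the sketch assumes English is one of the 31 languages:

```python
from itertools import permutations

# Hypothetical placeholder codes: "en" plus 30 stand-ins,
# not the real BigScience language list.
languages = ["en"] + [f"lang{i:02d}" for i in range(1, 31)]

# Each ordered (source, target) pair with source != target
# is one translation direction, i.e. one template.
directions = list(permutations(languages, 2))

# Directions into or from English only
# (assumes English is among the 31 languages).
english_directions = [(src, tgt) for src, tgt in directions if "en" in (src, tgt)]

print(len(directions))          # 31 * 30 directions in total
print(len(english_directions))  # 30 into English + 30 from English
```

Under these assumptions, all directions give 930 templates, while restricting to directions into and from English would leave 60.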

Rachel Bawden and others added 30 commits April 29, 2022 01:16
… to be done separately later. Also some slight modifs such as removing excess words and quotes

rbawden commented Jun 16, 2022

Adding @fyvo for info

rbawden commented Jun 20, 2022

@jzf2101

@VictorSanh VictorSanh self-assigned this Jun 25, 2022

@VictorSanh VictorSanh left a comment

looks good to me, thank you @rbawden !
(nice usage of anchors in yaml btw! :) )

I don't have a strong opinion on the format so will defer to the evaluation folks!

rbawden commented Jun 27, 2022

(Actually, Python seems to have handled that automatically, since I generated the templates automatically, so I won't take too much credit!)

@VictorSanh

thank you @rbawden for the fixes!
merging now

@VictorSanh VictorSanh merged commit 48ade7e into bigscience-workshop:eval-hackathon Jun 29, 2022