
Add prompts for LinCE dataset Sentiment Analysis Task #746

Closed · wants to merge 19 commits
Conversation

@RosenZhang commented:

Added 5 prompts for the Sentiment Analysis (SA) task from LinCE. The templates are close to those of the imdb dataset. The LinCE dataset is on Hugging Face Datasets, but it's unavailable from the promptsource interface, so the filter_english_datasets method in utils.py is modified to add LinCE to the list before filtering. If there's a more correct way to do this, or if there are other problems with the prompts, please comment below. Thanks!
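For context, here is a minimal sketch of the kind of change described above, assuming filter_english_datasets receives dataset metadata (e.g. from huggingface_hub's list_datasets()) and keeps only English-tagged entries. The INCLUDED_DATASETS name and the tag format are assumptions; the actual code in promptsource/utils.py may differ:

```python
# Hypothetical sketch, not the exact promptsource/utils.py code.
# LinCE is code-switched (e.g. Spanish-English), so it is not tagged as
# English-only and would otherwise be dropped by the language filter.
INCLUDED_DATASETS = {"lince"}

def filter_english_datasets(datasets):
    """Return ids of English-tagged datasets, plus allow-listed ones.

    `datasets` is assumed to be an iterable of metadata objects with
    `.id` and `.tags` attributes.
    """
    english_ids = []
    for dataset in datasets:
        if dataset.id in INCLUDED_DATASETS:
            english_ids.append(dataset.id)
            continue
        tags = getattr(dataset, "tags", None) or []
        if "language:en" in tags:
            english_ids.append(dataset.id)
    return sorted(english_ids)
```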

@RosenZhang RosenZhang changed the base branch from main to eval-hackathon April 26, 2022 20:51
@awebson awebson self-assigned this Apr 26, 2022
@RosenZhang RosenZhang changed the title Add prompts to LinCE dataset Sentiment Analysis Task Add prompts for LinCE dataset Sentiment Analysis Task Apr 27, 2022
RosenZhang and others added 2 commits April 27, 2022 12:07
* Add GEM/xsum prompts

* uncommit this hack

* Add GEM in INCLUDED_USERS

Co-authored-by: Albert Webson <awebson@cs.brown.edu>
@awebson awebson self-requested a review April 27, 2022 21:46
@awebson (Contributor) left a comment:


Thanks Ruochen!

  1. Does this same prompt apply to all other subsets of LinCE? Or are we only asked to evaluate the sa_spaeng subset?

  2. Some prompts are missing the Answer Choices field: positive ||| negative ||| neutral

  3. This prompt's wording could be more natural:

     The following post expresses what sentiment?

     For example:

     What sentiment does the following post express? Positive, negative, or neutral?

     (In that case, you should also mark the "Choices in template" flag. That is, models are explicitly told the choices "Positive, negative, or neutral?" in the input; see the template sketch after this list.)

  4. We're looking for at least 5 original-task prompts. You're missing one.
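For concreteness, a sketch of what the revised template could look like in promptsource's Jinja format. The field names words (the tokenized post) and sa (the string label) are taken from the sa_spaeng subset; the exact phrasing is only a suggestion:

```
{{ words | join(" ") }}

What sentiment does the post above express? Positive, negative, or neutral?
|||
{{ sa }}
```

The Answer Choices field would then be set to positive ||| negative ||| neutral, with the "Choices in template" flag marked, since the choices also appear verbatim in the input.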

@RosenZhang (Author) replied:

Hi Albert, thanks so much for reviewing this!
Re your comments:

  1. The templates in this PR apply only to the sa_spaeng subset; the other subsets are for different tasks, namely language identification, POS tagging, and NER. I'm about to finish the templates for the NER task and will probably add them to this PR later today. Should we develop prompts for all the tasks in this PR, or go ahead to the eval harness first to test the whole pipeline? Prompts for the language identification and POS tasks will need a bit more time, as there are fewer examples to draw on. (Any pointers to similar tasks are welcome, thanks!)
  2. Apologies for the inconsistency in Answer Choices. I was wondering how the answer choices are passed to the model. In this dataset, the target is given directly as a string, e.g. 'sa': 'positive', so the template references it directly as sa, unlike tasks such as NER where the target is a class index and we write it as choices[label] (see the sketch after this list). Would Answer Choices still be required in this case?
  3. Noted, I'll improve the wording of the prompts!
  4. Also noted, I'll try to rephrase and add an extra one! (If I understand correctly, negation wouldn't count as an original task, right?)
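To illustrate the distinction in point 2, here is a sketch of the two target styles. The first uses the sa_spaeng fields directly; the second uses illustrative text and label fields (as in a dataset like imdb), with answer_choices being the Jinja variable promptsource exposes for the Answer Choices field:

```
{# Target stored directly as a string, as in sa_spaeng: #}
What sentiment does the following post express?
{{ words | join(" ") }}
|||
{{ sa }}
```

```
{# Target stored as a class index, mapped through Answer Choices: #}
What sentiment does the following post express?
{{ text }}
|||
{{ answer_choices[label] }}
```

If I understand the evaluation setup correctly, the Answer Choices field is also what rank-based evaluation uses to enumerate candidate targets, so it may be worth filling in even when the target is already a string.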

@RosenZhang (Author) commented:

Closing this PR due to issues when rebasing onto the newly updated eval-hackathon branch.

@RosenZhang RosenZhang closed this Apr 28, 2022