Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

around as an ADP? #81

Closed
AngledLuffa opened this issue Feb 23, 2024 · 2 comments
Closed

around as an ADP? #81

AngledLuffa opened this issue Feb 23, 2024 · 2 comments

Comments

@AngledLuffa
Copy link

Is it a mistag? I would think this instance of around is an ADP.

# sent_id = GUM_vlog_covid-28
16      results result  NOUN    NNS     Number=Plur     14      obj     14:obj  Entity=49)|MSeg=result-s
17      which   which   PRON    WDT     PronType=Rel    21      nsubj   10:ref  Discourse=elaboration-attribute:76->74:1:syn-relcl-521,525
18      was     be      AUX     VBD     Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   21      cop     21:cop  _
19      around  around  ADV     RB      Degree=Pos      21      advmod  21:advmod       Entity=(48-time-giv:act-cf3-3-coref|MSeg=a-round
20      like    like    INTJ    UH      _       21      discourse       21:discourse    _
21      6:00    6:00    NUM     CD      NumForm=Digit|NumType=Card      10      acl:relcl       10:acl:relcl    XML=<time when:::"18:00:00">
22      pm      p.m.    NOUN    NN      Number=Sing     21      nmod:tmod       21:nmod:tmod    Entity=48)48)|SpaceAfter=No|XML=</time>
23      .       .       PUNCT   .       _       7       punct   7:punct _

Also, in general I find myself wondering why around is never an RP in the XPOS this treebank, such as open up_RP vs come around_RB or close around_RB. Although it's not very consistent in PTB to label around as RP vs RB in turn around, get around, etc, it looks like RP is more common. However, this time around_RP, last time around_RP is consistent. Even this time around isn't 100% consistent, though

                 (NP
                    (NP
                      (ADJP (RB even) (JJR more) )
                      (NN caution) )
                    (PP (IN than)
                      (NP (DT the) (JJ first) (NN time) (RB around) ))))))))
@nschneid
Copy link

"which was around like 6:00 pm" should be ADP, yes.

The idiom "this time around" (more generally "X time around") is interesting. My gut feeling is that "around" can be omitted so it is a modifier, presumably RB/ADV/advmod.

If memory serves RP should be aligned with compound:prt, which (unfortunately IMO) is defined in terms of idiomaticity. "come around" would presumably be RB/ADV/advmod if it is purely spatial and RP/ADP/compound:prt in the idiomatic meaning of 'change to my preferred opinion'.

@amir-zeldes
Copy link
Contributor

which was around like 6:00 pm

Yes, this is an error, will fix

in general I find myself wondering why around is never an RP ... it's not very consistent in PTB to label around as RP vs RB in turn around, get around, etc, it looks like RP is more common

Indeed, it's not consistent in PTB, but while RP is more common in PTB proper, RB is more common overall in OntoNotes 5 (which also includes WSJ but much more other data as well). So in sum we went with the OntoNotes majority and decided to consistently tag it RB in these contexts.

amir-zeldes added a commit to amir-zeldes/gum that referenced this issue Feb 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants