Skip to content

Commit 7aaebe8

Browse files
committed
guidelines: more specific MWE criteria; quasi-modals; nonrestrictive RCs
1 parent 28db09d commit 7aaebe8

File tree

1 file changed

+28
-5
lines changed

1 file changed

+28
-5
lines changed

guidelines/guidelines.md

+28-5
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# FUDG-GFL Annotation Guidelines 1.0
1+
# FUDG-GFL Annotation Guidelines 1.3
22

33
Chris Dyer
44
Brendan O’Connor
@@ -7,6 +7,7 @@ David Bamman
77
Noah A. Smith
88

99
*Document history:*
10+
2013-11-09: version 1.3: updated Multiwords; note about nonrestrictive relative clauses; Verb complexes: Quasi-modals
1011
2013-10-26: version 1.2: Contractions
1112
2013-09-30: version 1.1: Nesting and the Substitution Principle
1213
2013-09-22: version 1.0
@@ -160,12 +161,21 @@ The formalism makes no commitment to the surface ordering within a square bracke
160161
161162
![Brendan O’Connor helped write this guide up.](writeup.0.png)
162163

163-
Multiwords allow an annotator to punt on expressions that are best understood as idiosyncratic phrases or that have forbiddingly complicated compositional analyses, while still describing their relationship with the rest of the sentence (e.g. [putnam_catenae_examples](https://github.com/brendano/gfl_syntax/blob/master/anno/putnam_catenae_examples.anno) from Osborne et al 2011). Another example:
164+
Multiwords allow an annotator to punt on expressions that are best understood as idiosyncratic phrases or that have forbiddingly complicated compositional analyses, while still describing their relationship with the rest of the sentence (e.g. [putnam_catenae_examples](https://github.com/brendano/gfl_syntax/blob/master/anno/putnam_catenae_examples.anno) from Osborne et al 2011).
164165

165-
> sometimes I'm up there waiting for BART
166+
We generally reserve multiwords for the following cases that are semantically coherent but not easy to analyze syntactically:
166167

167-
sometimes > i'm < [up there]
168-
i'm < [waiting for] < BART
168+
a. multiword proper names: `[Brendan O’Connor]`
169+
b. verb-particle constructions: `[wake up]`
170+
c. multiple input tokens conventionally written as one word: `[over priced]`
171+
d. highly noncompositional compounds and foreign expressions: `[class act]`, `the > [lost and found]`, `[post hoc]`
172+
e. syntactically difficult idioms: `[let alone]`, `[had better]` (see [quasi-modals](#verb-complexes))
173+
174+
But we try to decompose idioms that have a plausible (if atypical) syntactic analysis:
175+
176+
kick < (the > bucket)
177+
I > (kid < you) < not
178+
be < on < (the > verge < of < victory)
169179

170180
## Fudge Expressions (FEs)
171181

@@ -378,6 +388,17 @@ For long and tricky-to-analyze verb chains, consider FEs:
378388

379389
I > will < try < to < (love < you) < more
380390

391+
**Quasi-modals** are usually decomposed:
392+
393+
have < to < announce
394+
ought < to < announce
395+
would < like < to < announce
396+
(would < rather) < announce
397+
398+
An exception is *had better*, which is treated as a multiword because *had* does not function morphosyntactically as a perfect auxiliary or main verb:
399+
400+
you > [had better] < believe < it
401+
381402
## Existentials
382403

383404
Existential *there* counts as a subject:
@@ -399,6 +420,8 @@ GFL supports special **undirected node-node relations** for explicit anaphora. C
399420

400421
Semantically, the *man* is both the object of *arrest* **and** the subject of *robbed*. But, syntactically, the complementizer *who* occupies the subject position of the RC. We therefore make the semantics of the semantic link clear by writing: `who = man`. The head of the embedded clause (the verb *robbed*) also serves as the dependent of the nominal head (*man*).
401422

423+
Nonrestrictive relative clauses (*The police arrested the man, who robbed our bank*) are analyzed like their restrictive counterparts.
424+
402425
Sometimes the relative pronoun is the object of a preposition, which may be stranded or fronted:
403426

404427
> He is the guy **who** I worked **with**.

0 commit comments

Comments
 (0)