You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: guidelines/guidelines.md
+28-5
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,4 @@
1
-
# FUDG-GFL Annotation Guidelines 1.0
1
+
# FUDG-GFL Annotation Guidelines 1.3
2
2
3
3
Chris Dyer
4
4
Brendan O’Connor
@@ -7,6 +7,7 @@ David Bamman
7
7
Noah A. Smith
8
8
9
9
*Document history:*
10
+
2013-11-09: version 1.3: updated Multiwords; note about nonrestrictive relative clauses; Verb complexes: Quasi-modals
10
11
2013-10-26: version 1.2: Contractions
11
12
2013-09-30: version 1.1: Nesting and the Substitution Principle
12
13
2013-09-22: version 1.0
@@ -160,12 +161,21 @@ The formalism makes no commitment to the surface ordering within a square bracke
160
161
161
162

162
163
163
-
Multiwords allow an annotator to punt on expressions that are best understood as idiosyncratic phrases or that have forbiddingly complicated compositional analyses, while still describing their relationship with the rest of the sentence (e.g. [putnam_catenae_examples](https://github.com/brendano/gfl_syntax/blob/master/anno/putnam_catenae_examples.anno) from Osborne et al 2011). Another example:
164
+
Multiwords allow an annotator to punt on expressions that are best understood as idiosyncratic phrases or that have forbiddingly complicated compositional analyses, while still describing their relationship with the rest of the sentence (e.g. [putnam_catenae_examples](https://github.com/brendano/gfl_syntax/blob/master/anno/putnam_catenae_examples.anno) from Osborne et al 2011).
164
165
165
-
> sometimes I'm up there waiting for BART
166
+
We generally reserve multiwords for the following cases that are semantically coherent but not easy to analyze syntactically:
166
167
167
-
sometimes > i'm < [up there]
168
-
i'm < [waiting for] < BART
168
+
a. multiword proper names: `[Brendan O’Connor]`
169
+
b. verb-particle constructions: `[wake up]`
170
+
c. multiple input tokens conventionally written as one word: `[over priced]`
171
+
d. highly noncompositional compounds and foreign expressions: `[class act]`, `the > [lost and found]`, `[post hoc]`
172
+
e. syntactically difficult idioms: `[let alone]`, `[had better]` (see [quasi-modals](#verb-complexes))
173
+
174
+
But we try to decompose idioms that have a plausible (if atypical) syntactic analysis:
175
+
176
+
kick < (the > bucket)
177
+
I > (kid < you) < not
178
+
be < on < (the > verge < of < victory)
169
179
170
180
## Fudge Expressions (FEs)
171
181
@@ -378,6 +388,17 @@ For long and tricky-to-analyze verb chains, consider FEs:
378
388
379
389
I > will < try < to < (love < you) < more
380
390
391
+
**Quasi-modals** are usually decomposed:
392
+
393
+
have < to < announce
394
+
ought < to < announce
395
+
would < like < to < announce
396
+
(would < rather) < announce
397
+
398
+
An exception is *had better*, which is treated as a multiword because *had* does not function morphosyntactically as a perfect auxiliary or main verb:
399
+
400
+
you > [had better] < believe < it
401
+
381
402
## Existentials
382
403
383
404
Existential *there* counts as a subject:
@@ -399,6 +420,8 @@ GFL supports special **undirected node-node relations** for explicit anaphora. C
399
420
400
421
Semantically, the *man* is both the object of *arrest***and** the subject of *robbed*. But, syntactically, the complementizer *who* occupies the subject position of the RC. We therefore make the semantics of the semantic link clear by writing: `who = man`. The head of the embedded clause (the verb *robbed*) also serves as the dependent of the nominal head (*man*).
401
422
423
+
Nonrestrictive relative clauses (*The police arrested the man, who robbed our bank*) are analyzed like their restrictive counterparts.
424
+
402
425
Sometimes the relative pronoun is the object of a preposition, which may be stranded or fronted:
0 commit comments