forked from UniversalDependencies/docs
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathstests.yaml
261 lines (225 loc) · 12.8 KB
/
stests.yaml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
-
name: Aux chain
required_empty: true
expr: "_ <aux|<aux:pass|<cop (_ <aux|<aux:pass|<cop _)"
desc: Auxiliary dependencies should not form a chain.
pos_ex: "Do you think that he will have left when we come ?\naux(think, Do)\naux(left, will)\naux(left, have)"
neg_ex: "Do you think that he will have left when we come ?\naux(think, Do)\naux(have, will)\naux(left, have)"
link: http://universaldependencies.org/u/dep/aux_.html
-
name: Flat is right-headed
required_empty: true
expr: _ <flat@R _
desc: Flat relations should be left-headed, not right.
pos_ex: "Carl XVI Gustaf\nflat(Carl, XVI)\nflat(Carl, Gustaf)"
neg_ex: "Carl XVI Gustaf\nflat(Gustaf, XVI)\nflat(Gustaf, Carl)"
link: http://universaldependencies.org/u/dep/flat.html
-
name: Fixed is right-headed
required_empty: true
expr: _ <fixed@R _
desc: Fixed expressions should be left-headed, not right.
pos_ex: "He cried because of you\nfixed(because, of)"
neg_ex: "He cried because of you\nfixed(of, because)"
link: http://universaldependencies.org/u/dep/fixed.html
-
name: Fixed chain
required_empty: true
expr: "_ <fixed (_ <fixed _)"
desc: Fixed dependencies should not be chained. All dependents should be attached directly to the first one.
link: http://universaldependencies.org/u/dep/fixed.html
-
name: Conj is right-headed
required_empty: true
expr: _ <conj@R _
pos_ex: "Bill is big and honest\nconj(big, honest)"
neg_ex: "Bill is big and honest\nconj(honest, big)"
desc: Coordination dependencies should be left-headed, not right.
link: http://universaldependencies.org/u/dep/conj.html
#-
# name: Conj is left-headed
# required_empty: true
# expr: _ <conj@L _
# desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> In contrast, this test shows examples where coordination is left-headed.
# link: http://universaldependencies.org/u/dep/conj.html
-
name: Appos is right-headed
required_empty: true
expr: _ <appos@R _
pos_ex: "Sam , my brother , arrived\nappos(Sam, brother)"
neg_ex: "Sam , my brother , arrived\nappos(brother, Sam)"
desc: Apposition dependencies should be left-headed, not right.
link: http://universaldependencies.org/u/dep/appos.html
#-
# name: Non-passive subject with passive auxiliary
# required_empty: true
# expr: "_ >nsubj|>csubj _ >aux:pass|>expl:pass _"
# desc: A passive auxiliary implies that the subject is passive, too.
# pos_ex: "The reconstruction was finished yesterday .\nnsubj:pass(finished, reconstruction)\naux:pass(finished, was)"
# neg_ex: "The reconstruction was finished yesterday .\nnsubj(finished, reconstruction)\naux:pass(finished, was)"
# link: http://universaldependencies.org/docs/u/dep/nsubj-pass.html
-
name: Copula is not AUX
required_empty: true
expr: (!(AUX|PRON)) <cop _
expr-pos: _ <cop _
desc: Verbal copulas should be tagged AUX, not VERB. Pronouns can also serve as copulas. Punctuation (dashes) and nouns are not copulas.
link: http://universaldependencies.org/docs/u/dep/cop.html
-
name: PRON or DET is mark
required_empty: true
expr: (PRON|DET) <mark _
desc: Pronouns must not be attached using the mark relation. Relative pronouns must not be confused with subordinating conjunctions, even if the word is ambiguous.
pos_ex: "This is a fact that we cannot ignore . You know that we cannot ignore it .\ndobj(ignore-8, that-5)\nmark(ignore-15, that-12)"
neg_ex: "This is a fact that we cannot ignore .\nmark(ignore-8, that-5)"
link: http://universaldependencies.org/docs/u/dep/mark.html
-
name: Relation det is used for node that is neither DET nor PRON
required_empty: true
expr: "(!DET&!PRON) <det _"
desc: The det relation is primarily intended for determiners, i.e. words tagged DET. Pronouns are tolerated at least until the borderline between the two classes is better investigated and defined.
link: http://universaldependencies.org/docs/u/dep/det.html
-
name: Relation punct is used for node that is not PUNCT
required_empty: true
expr: "!PUNCT <punct _"
desc: Only nodes tagged PUNCT can be attached using the punct relation.
link: http://universaldependencies.org/u/dep/punct.html
-
name: PUNCT is attached as neither punct nor root
required_empty: true
expr: "PUNCT (<!punct&<!root) _"
desc: Nodes tagged PUNCT can only be attached using the punct relation (and exceptionally, if it is the only node in the sentence, also root).
link: http://universaldependencies.org/u/dep/punct.html
-
name: PRON or DET lacks the PronType feature
expr: "(PRON|DET)&!PronType"
desc: All pronouns and determiners should be further categorized using the PronType feature. Other POS may or may not have this feature.
link: http://universaldependencies.org/docs/u/feat/PronType.html
-
name: NUM lacks the NumType feature
expr: "NUM&!NumType"
desc: All numerals should be further categorized using the NumType feature. Other POS may or may not have this feature.
link: http://universaldependencies.org/docs/u/feat/NumType.html
-
name: VERB lacks the VerbForm feature
expr: "(VERB)&!VerbForm"
desc: All verbs should be further categorized using the VerbForm feature. Other POS may or may not have this feature. (Note that AUX may include non-verbs, thus we cannot generally extend this test to auxiliaries.)
link: http://universaldependencies.org/docs/u/feat/VerbForm.html
-
name: Finite verb is not a verb
required_empty: true
expr: "(VerbForm=Fin)&!(VERB|AUX)"
desc: Only non-finite VerbForms are expected to appear with non-verb parts of speech (NOUN, ADJ, ADV).
link: http://universaldependencies.org/docs/u/feat/VerbForm.html
-
name: Finite verb lacks the Mood feature
required_empty: true
expr: "(VerbForm=Fin)&!Mood"
desc: All finite verb forms should be further categorized using the Mood feature.
link: http://universaldependencies.org/docs/u/feat/Mood.html
-
name: Degree feature used with a word that is neither adjective nor adverb
required_empty: true
expr: "Degree&!ADJ&!ADV"
desc: The Degree feature is normally associated with (a subset of) adjectives and adverbs. Other tags than ADJ and ADV are probably wrong.
link: http://universaldependencies.org/u/feat/Degree.html
-
name: Maximum one subject
expr: "_ >nsubj|>csubj|>nsubj:pass|>csubj:pass _ >nsubj|>csubj|>nsubj:pass|>csubj:pass _"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> No predicate can have more than one subject. Note that subordinate clauses do not head copulas for this reason.
link: http://universaldependencies.org/u/dep/nsubj.html
-
name: Maximum one object
expr: "_ >obj|>ccomp _ >obj|>ccomp _"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> No predicate can have more than one direct object. Ccomp counts as direct object. (To certain extent xcomp does too, but dobj can co-occur with xcomp in cases of secondary predication, thus this test does not look at xcomp.)
link: http://universaldependencies.org/u/dep/obj.html
-
name: NOUN and case
expr: NOUN <case _
expr-pos: _ <case _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> A word marked as a NOUN should not be a case dependency.
link: http://universaldependencies.org/docs/u/dep/case.html
-
name: ADP is not leaf
expr: "ADP >!conj&>!cc&>!punct&>!fixed _"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> By default adposition is a leaf node attached to a nominal as case. Exceptions where adpositions are not leaves involve technical relations such as mwe and conj.
link: http://universaldependencies.org/docs/u/dep/case.html
-
name: Appos chain
required_empty: true
expr: "_ <appos (_ <appos _)"
pos_ex: "Sam , my brother , John 's cousin , arrived\nappos(Sam, brother)\nappos(Sam, cousin)"
neg_ex: "Sam , my brother , John 's cousin , arrived\nappos(Sam, brother)\nappos(brother, cousin)"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> Apposition dependencies should not be chained. Multiple appositions should all be attached to the head, or they should be coordinated (with the first apposition as the head). Legitimate exceptions do occur, however, in the rare case when an apposition has an apposition of its own, which does not apply to head of the first apposition.
link: http://universaldependencies.org/u/dep/appos.html
-
name: Relation advmod used for node that is not ADV
expr: "(!ADV) <advmod _"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> Advmod is intended only for adverbs. Modifiers that are called adverbial in some traditional grammars, but are in fact prepositional or noun phrases, should be attached as nmod.
link: http://universaldependencies.org/docs/u/dep/advmod.html
-
name: Heads of an advmod are nominal
expr: "_ <advmod (NOUN|PROPN|NUM|PRON)"
desc: "<b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> The heads of an adverbial modifier should normally not be nominal (noun, proper noun, numeral, or pronoun). Exceptions occur in nominal clauses, where a nominal is the main predicate and can therefore take clause adverbials. Example: This is probably an exception."
link: http://universaldependencies.org/docs/u/dep/advmod.html
-
name: Acl not dependent on NOUN/PROPN/PRON/DET
expr: "!NOUN&!PROPN&!PRON&!DET >acl _"
expr-pos: _ >acl _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> Clausal modifiers of nouns should depend on NOUN/PROPN only; those in the following table depend on other parts of speech.
link: http://universaldependencies.org/u/dep/acl.html
-
name: Marked as NUM but not nummod, nmod or compound
expr: "NUM (<!nummod&<!nmod&<!compound) _"
expr-pos: "_ (<!nummod&<!nmod&<!compound) _"
desc: "<b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> If a word is marked as a numeral (POS), then it should be marked as being a nummod, compound or nmod dependency. Exceptions occur when the numeral is promoted to a higher function through ellipis. Example: Take five!"
link: http://universaldependencies.org/docs/u/pos/NUM.html
-
name: Marked as nummod but not NUM
expr: "!NUM <nummod _"
expr-pos: _ <nummod _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> If a word is marked as a numeric modifier, it should be marked as a numeral (POS).
link: http://universaldependencies.org/docs/u/dep/nummod.html
-
name: Marked as AUX but not aux or aux:pass
expr: "AUX (<!aux&<!aux:pass) _"
expr-pos: "_ (<!aux&<!aux:pass) _"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> If a word is marked as the auxiliary POS, it should be marked as either aux or aux:pass dependency.
link: http://universaldependencies.org/docs/u/pos/AUX_.html
-
name: Marked as aux or aux:pass but not AUX
expr: "!AUX (<aux|<aux:pass) _"
expr-pos: _ (<aux|<aux:pass) _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> If a word is marked as a (passive) auxiliary dependency, it should be marked as the auxiliary POS.
link: http://universaldependencies.org/docs/u/dep/aux_.html
-
name: Marked as cc but not CCONJ, SYM or ADV
expr: "(!CCONJ&!SYM&!ADV) <cc _"
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> The relation cc is for coordinating conjunctions, hence the most expected POS tag is CCONJ. Symbols such as + may occasionally replace conjunctions, and in some languages some adverbs may take this syntactic function too. However, most other tags are suspicious at least.
link: http://universaldependencies.org/docs/u/dep/cc.html
-
name: Cc is not leaf
expr: _ < (_ <cc _)
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> Coordinating conjunction should be attached to the first conjunct and it should not have its own dependents.
link: http://universaldependencies.org/docs/u/dep/cc.html
-
name: Parts of speech of expl
expr: _ <expl _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> The parts of speech of words marked as being expletive dependencies.
link: http://universaldependencies.org/u/dep/expl.html
-
name: Dependents of expl
expr: _ < (_ <expl _)
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> The dependents of a word marked as being an expletive dependency. This should be empty.
link: http://universaldependencies.org/u/dep/expl.html
-
name: Heads of expl
expr: _ >expl _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> Parts of speech of words which have an expletive dependency.
link: http://universaldependencies.org/u/dep/expl.html
-
name: Heads of xcomp
expr: _ >xcomp _
desc: <b>DEBUGGING TEST. NONZERO HITS DOES NOT MEAN THE DATA IS INVALID.</b> Parts of speech of words which have an open clausal complement dependency.
link: http://universaldependencies.org/u/dep/xcomp.html