en: "nothing but N" vs. cs: "nic než N" #799

Open
@nesor

I came across the Czech equivalent of English "nothing but ...", as in "He inherited nothing but debts." The single example in Czech PDT has "než" as deprel=mark & upos=SCONJ, while the following NOUN is deprel=advcl. This is how UDPipe parses similar Czech sentences.

UD 2.3 - Czech PDT:
https://lindat.mff.cuni.cz/services/kontext/view?ctxattrs=word%2Cdeprel&attr_vmode=mixed&pagesize=40&refs=%3Ddata.id&q=~shO32i5rtQLq&viewmode=kwic&attrs=word%2Cdeprel&corpname=ud_23_cs_pdt_a&attr_allpos=all

Is this intended? I'm mainly unhappy about the NOUN ending up as deprel=advcl. There's nothing clausal about it, is there? There are more English examples with "nothing but" in one of the treebanks, and they are consistently annotated in a more intuitive way: "but" is deprel=cc & upos=ADP, while the following NOUN is deprel=conj:

UD 2.3 - English EWT:
https://lindat.mff.cuni.cz/services/kontext/view?ctxattrs=word&attr_vmode=mouseover&pagesize=40&refs=%3Ddata.id&q=~HPQO0Pl1HrNb&viewmode=kwic&attrs=word&corpname=ud_23_en_ewt_a&attr_allpos=all
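
To make the contrast between the two annotations concrete, here is a minimal Python sketch. It is only an illustration under assumptions: the `Token` type and the `nominal_advcl` helper are hypothetical names, the Czech sentence is made up (it is not the PDT example), and the head attachments are my guess. Only the upos/deprel values of "než", "but" and the following NOUN are taken from the description above.

```python
from typing import List, NamedTuple


class Token(NamedTuple):
    """A reduced CoNLL-U row (hypothetical helper type): ID, FORM, UPOS, HEAD, DEPREL."""
    id: int
    form: str
    upos: str
    head: int
    deprel: str


# Made-up Czech sentence, annotated the way the issue describes the PDT example:
# "než" is SCONJ/mark and the following NOUN is advcl.
# Attaching the NOUN to "nic" is an assumption.
CS_TOKENS = [
    Token(1, "Nezdědil", "VERB", 0, "root"),
    Token(2, "nic", "PRON", 1, "obj"),
    Token(3, "než", "SCONJ", 4, "mark"),
    Token(4, "dluhy", "NOUN", 2, "advcl"),
    Token(5, ".", "PUNCT", 1, "punct"),
]

# English sentence from the issue, annotated the way the EWT examples are described:
# "but" is ADP/cc and the following NOUN is conj.
# Attaching the NOUN to "nothing" is an assumption.
EN_TOKENS = [
    Token(1, "He", "PRON", 2, "nsubj"),
    Token(2, "inherited", "VERB", 0, "root"),
    Token(3, "nothing", "PRON", 2, "obj"),
    Token(4, "but", "ADP", 5, "cc"),
    Token(5, "debts", "NOUN", 3, "conj"),
    Token(6, ".", "PUNCT", 2, "punct"),
]


def nominal_advcl(tokens: List[Token]) -> List[str]:
    """Return forms of NOUN tokens that are attached as advcl and have a 'mark' child."""
    heads_of_mark = {t.head for t in tokens if t.deprel == "mark"}
    return [
        t.form
        for t in tokens
        if t.upos == "NOUN" and t.deprel == "advcl" and t.id in heads_of_mark
    ]


print(nominal_advcl(CS_TOKENS))  # the Czech-style analysis is flagged
print(nominal_advcl(EN_TOKENS))  # the English-style analysis is not
```

A check like this, run over a whole treebank, would be one way to list the nominal "advcl" cases in question.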

I'd be happy just to learn that the Czech structure should be annotated more like its English equivalent.

Thanks!
