Skip to content

Appropriate relation for -th, as in 4th, 5th, etc, when -th is a separate token #1133

@AngledLuffa

Description

@AngledLuffa

This is what happens in Urdu:

Image

PART seems like a reasonable POS, but I find dep as the relation to be very unsatisfying. Is there a different relation type which would better represent the relation between 97 and th here?

In the Sindhi treebank, we'd usually been setting both the number and هين to NUM and having them point at the same head, but then I happened to notice that wasn't consistently done and wanted to unify the annotations to one standard. I'd be happy to unify them all to NUM (breaking with the Urdu treebank's analysis) if that seems like an okay approach.

The languages I can actually speak (even badly) tend to have -th annotated as part of the number, such as 4th, 第五, ...

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions