You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Tokens containing multiple adjacent digits are inverted (character order is reversed) in MWT text and sentence comment text throughout the corpus, for example here:
# sent_id = 5930# text = מ5491 עד 1989 היה זה אזור אסור.1-2מ5491________1ממADPADP_2case__219451945NUMNUM_7nmod__3עדעדADPADP_4case__419891989NUMNUM_7nmod__5היה_AUXAUXGender=Masc|Number=Sing|Person=3|Polarity=Pos|Tense=Past|VerbType=Cop7cop__6זהזהPRONPRONGender=Masc|Number=Sing|Person=37nsubj__7אזוראזורNOUNNOUNGender=Masc|Number=Sing0root__8אסוראסורADJADJGender=Masc|Number=Sing7amod_SpaceAfter=No9..PUNCTPUNCT_7punct__
The second year number in this sentence is correct in both the tokens and the sentence text. The first year number is inverted in the MWT and sentence text, but not in the actual token. I suspect this only(?) happens if there is a MWT, but it's hard to be sure for numbers that aren't obviously year numbers without having the original underlying text.
The text was updated successfully, but these errors were encountered:
Tokens containing multiple adjacent digits are inverted (character order is reversed) in MWT text and sentence comment text throughout the corpus, for example here:
https://github.com/UniversalDependencies/UD_Hebrew-HTB/blob/master/he_htb-ud-test.conllu#L6229-L6230
The second year number in this sentence is correct in both the tokens and the sentence text. The first year number is inverted in the MWT and sentence text, but not in the actual token. I suspect this only(?) happens if there is a MWT, but it's hard to be sure for numbers that aren't obviously year numbers without having the original underlying text.
The text was updated successfully, but these errors were encountered: