Added missing subtoken information #22

Open
wants to merge 43 commits into
base: dev
Changes from 1 commit
6f70cd4
Added missing subtoken information
amir-zeldes Jan 29, 2018
3d05af0
update to UD V2.8
amir-zeldes Jun 20, 2021
3ae6f86
Initial revisions
amir-zeldes Jun 21, 2021
fc86f18
markdown
amir-zeldes Jun 21, 2021
2350a84
remove redundant deprel subtypes
amir-zeldes Jun 22, 2021
0121d89
document added nsubj:pass, csubj:pass
amir-zeldes Jun 22, 2021
4b80da6
Unify demonstrative lemmas
amir-zeldes Jun 22, 2021
b49731e
pronominal copulas are PRON, have no verbal morphology
amir-zeldes Jun 22, 2021
dbda960
change CCONJ to ADV, SCONJ or ADP depending on deprel
amir-zeldes Jun 22, 2021
3b0311f
Demonstrative lemma adjustment
amir-zeldes Jun 22, 2021
382e292
SCONJ כדי
amir-zeldes Jun 23, 2021
5f3e62e
number lemmas
amir-zeldes Jun 23, 2021
5b2f4bb
fix corrupt text
amir-zeldes Jun 23, 2021
363a34a
attributive number articles are det
amir-zeldes Jun 23, 2021
d411bb3
fixed expression כל אימת ש
amir-zeldes Jun 23, 2021
2cdfed4
totally mangled sentence
amir-zeldes Jun 23, 2021
9e63046
major overhaul of number tokens
amir-zeldes Jun 23, 2021
56885fc
constructions with הרבה
amir-zeldes Jun 23, 2021
f63e8b9
more corrupt tokens
amir-zeldes Jun 23, 2021
cffeb0b
even more corrupt tokens
amir-zeldes Jun 23, 2021
a9e09c4
finite zero acl is also acl:relcl
amir-zeldes Jun 23, 2021
a4afed8
Put definiteness feature on clitic possessor, not on possessed NOUN
amir-zeldes Jun 23, 2021
8dd2038
add PronType=Art to fused ADP articles
amir-zeldes Jun 24, 2021
f782910
PROPN fixes
amir-zeldes Jun 24, 2021
dba89b0
More PROPN
amir-zeldes Jun 24, 2021
a0149e8
tense for participle VERB
amir-zeldes Jun 24, 2021
10b9b9e
manual correction
amir-zeldes Jun 24, 2021
17a149f
fix lots of broken year numbers
amir-zeldes Jun 26, 2021
906b6da
more broken clitics
amir-zeldes Jun 26, 2021
1ca7d13
add nmod:tmod and obl:tmod
amir-zeldes Jun 26, 2021
c45231e
fix all remaining MWTs with subtokens not matching text
amir-zeldes Jun 28, 2021
f3e7aeb
completely valid at UD validator level 3
amir-zeldes Jun 28, 2021
ffa1899
README
amir-zeldes Jun 28, 2021
39f7d6c
set compound:affix POS based on parent
amir-zeldes Jul 2, 2021
5a442cf
remove HebExistential feat
amir-zeldes Jul 8, 2021
14741d7
auto convert impersonal modals to head+csubj
amir-zeldes Jul 9, 2021
7d211fd
error correction
amir-zeldes Jul 12, 2021
22fe4f6
remove Case=Tem
amir-zeldes Jul 22, 2021
5c2645f
Remove Poss=Yes from pronouns with של
amir-zeldes Jul 26, 2021
a2234c1
extensive lemma corrections
amir-zeldes Jul 28, 2021
caa8f5f
remove Person=3 from כך
amir-zeldes Aug 3, 2021
60a177f
Revise Number and some lemmas for numerals
amir-zeldes Aug 11, 2021
e69eb10
Merge pull request #6 from IAHLT/dev
amir-zeldes Aug 11, 2021
manual correction
amir-zeldes committed Jun 24, 2021
commit 10b9b9e930c0c95c6c23b2eb93153959bc44d200
8 changes: 4 additions & 4 deletions he_htb-ud-train.conllu
@@ -6734,21 +6734,21 @@
 20 - - PUNCT PUNCT _ 19 punct _ SpaceAfter=No
 21-22 המקלע _ _ _ _ _ _ _ _
 21 ה ה DET DET Definite=Def|PronType=Art 22 det _ _
-22 מקלע מקלע NOUN NOUN Gender=Masc|Number=Sing 18 nmod _ _
+22 מקלע מקלע NOUN NOUN Gender=Masc|Number=Sing 18 appos _ _
 23-25 ולחבר _ _ _ _ _ _ _ _
 23 ו ו CCONJ CCONJ _ 25 cc _ _
 24 ל ל ADP ADP Definite=Def|PronType=Art 25 case _ _
 25 חבר חבר NOUN NOUN Gender=Masc|Number=Sing 18 conj _ _
-26 סכין סכין NOUN NOUN Gender=Fem,Masc|Number=Sing 25 nmod _ HebSource=ConvUncertainLabel|SpaceAfter=No
+26 סכין סכין NOUN NOUN Gender=Fem,Masc|Number=Sing 25 appos _ SpaceAfter=No
 27 " " PUNCT PUNCT _ 13 punct _ SpaceAfter=No
 28 , , PUNCT PUNCT _ 6 punct _ _
-29 ברי ברי CCONJ CCONJ _ 0 root _ _
+29 ברי ברי ADJ ADJ _ 0 root _ _
 30 כי כי SCONJ SCONJ _ 35 mark _ _
 31 תנועת תנועה NOUN NOUN Definite=Cons|Gender=Fem|Number=Sing 35 nsubj _ _
 32 " " PUNCT PUNCT _ 33 punct _ SpaceAfter=No
 33 כך כך PROPN PROPN _ 31 compound _ SpaceAfter=No
 34 " " PUNCT PUNCT _ 33 punct _ _
-35 מקדשת קידש VERB VERB Gender=Fem|HebBinyan=PIEL|Number=Sing|Person=1,2,3|Tense=Pres|VerbForm=Part|Voice=Act 29 advcl _ SpaceAfter=No
+35 מקדשת קידש VERB VERB Gender=Fem|HebBinyan=PIEL|Number=Sing|Person=1,2,3|Tense=Pres|VerbForm=Part|Voice=Act 29 csubj _ SpaceAfter=No
 36 , , PUNCT PUNCT _ 37 punct _ _
 37 למפרע למפרע ADV ADV _ 35 advmod _ SpaceAfter=No
 38 , , PUNCT PUNCT _ 40 punct _ _
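In the hunk above, each changed token (22, 26, 29, 35) appears twice, with the old version first and the new version directly after it. To see which CoNLL-U columns a manual correction touched, the two versions of a line can be compared field by field. The following is a minimal sketch, assuming the standard ten CoNLL-U columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The helper names `parse_token_line` and `changed_columns` are hypothetical; they are not part of this PR or of any UD tooling.

```python
# Column names of the CoNLL-U format, in order.
CONLLU_FIELDS = ["ID", "FORM", "LEMMA", "UPOS", "XPOS",
                 "FEATS", "HEAD", "DEPREL", "DEPS", "MISC"]


def parse_token_line(line):
    """Split one CoNLL-U token line into a dict keyed by column name."""
    # Assumption: real .conllu files are tab-separated, but the lines shown
    # on this page were flattened to spaces, so we split on any whitespace.
    # This only works when no field contains an internal space.
    cols = line.split()
    if len(cols) != len(CONLLU_FIELDS):
        raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
    return dict(zip(CONLLU_FIELDS, cols))


def changed_columns(old_line, new_line):
    """Return {column: (old_value, new_value)} for every differing column."""
    old = parse_token_line(old_line)
    new = parse_token_line(new_line)
    return {name: (old[name], new[name])
            for name in CONLLU_FIELDS if old[name] != new[name]}


# Old and new versions of tokens 29 and 26, copied from the hunk above.
OLD_29 = "29 ברי ברי CCONJ CCONJ _ 0 root _ _"
NEW_29 = "29 ברי ברי ADJ ADJ _ 0 root _ _"
OLD_26 = ("26 סכין סכין NOUN NOUN Gender=Fem,Masc|Number=Sing 25 nmod _ "
          "HebSource=ConvUncertainLabel|SpaceAfter=No")
NEW_26 = ("26 סכין סכין NOUN NOUN Gender=Fem,Masc|Number=Sing 25 appos _ "
          "SpaceAfter=No")

print(changed_columns(OLD_29, NEW_29))
# {'UPOS': ('CCONJ', 'ADJ'), 'XPOS': ('CCONJ', 'ADJ')}
print(changed_columns(OLD_26, NEW_26))
# {'DEPREL': ('nmod', 'appos'), 'MISC': ('HebSource=ConvUncertainLabel|SpaceAfter=No', 'SpaceAfter=No')}
```

Read this way, the commit retags token 29 from CCONJ to ADJ, relabels tokens 22 and 26 from nmod to appos (dropping the HebSource=ConvUncertainLabel note on 26), and changes token 35 from advcl to csubj while keeping its head at 29.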