Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

anotar as lendas de Casasnovas (2006) #353

Open
12 of 15 tasks
leoalenc opened this issue Jan 12, 2024 · 5 comments
Open
12 of 15 tasks

anotar as lendas de Casasnovas (2006) #353

leoalenc opened this issue Jan 12, 2024 · 5 comments
Assignees
Labels
corpus This issue pertains to corpus data enhancement New feature or request UD Annotation This issue relates to Universal Dependencies annotation

Comments

@leoalenc
Copy link
Contributor

leoalenc commented Jan 12, 2024

  • anotar a primeira sentença (linhas 1-2 da p. 64) da lenda "Urubú, Wirá-Wasú"
  • anotar as sentenças de 2 a 6
  • anotar as demais senteças
  • anotar uma sentença da segunda lenda
  • anotar as demais sentenças da segunda lenda
  • lendas 3, 4, 5, 6, 8 (até p. 85, No. 10)
  • renomear Casasnovas2006:8:11:200 --> Casasnovas2006:8:10:78
  • restante da lenda 8, lendas 9, 10, 11 (até p. 95, No. 26-27)
  • incluir no treebank as sentenças Casasnovas2006:11:20:151 até Casasnovas2006:11:29:160
  • restante da lenda 11
  • incluir Casasnovas2006:11:35:166
  • início da lenda 12
  • incluir Casasnovas2006:12:5:171
  • mais sentenças da lenda 12
  • restante da lenda 12
@leoalenc leoalenc added enhancement New feature or request corpus This issue pertains to corpus data UD Annotation This issue relates to Universal Dependencies annotation labels Jan 12, 2024
leoalenc added a commit that referenced this issue Jan 12, 2024
@leoalenc leoalenc changed the title anotar sentenças da lenda "Urubú, Wirá-Wasú" de Casasnovas (2006) anotar sentenças das lendas de Casasnovas (2006) Jan 23, 2024
@leoalenc leoalenc changed the title anotar sentenças das lendas de Casasnovas (2006) anotar as lendas de Casasnovas (2006) Jan 23, 2024
leoalenc added a commit that referenced this issue Jan 23, 2024
leoalenc added a commit that referenced this issue Apr 8, 2024
leoalenc added a commit that referenced this issue Apr 8, 2024
leoalenc added a commit that referenced this issue Apr 8, 2024
leoalenc added a commit that referenced this issue Aug 6, 2024
leoalenc added a commit that referenced this issue Aug 7, 2024
leoalenc added a commit that referenced this issue Aug 7, 2024
leoalenc added a commit that referenced this issue Aug 7, 2024
leoalenc added a commit that referenced this issue Aug 8, 2024
leoalenc added a commit that referenced this issue Aug 21, 2024
leoalenc added a commit that referenced this issue Aug 21, 2024
leoalenc added a commit that referenced this issue Aug 22, 2024
@leoalenc
Copy link
Contributor Author

leoalenc commented Sep 7, 2024

@juliana-gurgel , seguindo a nova política de expansão do treebank, incorporei neste commit as últimas sentenças do seu commit mais recente diretamente, sem passar por revisão minha prévia. Nos commits futuros deste repositório você pode acompanhar eventuais revisões que venham a ser feitas nessas sentenças.

@leoalenc
Copy link
Contributor Author

~/ud_tools/tools$ python3 validate.py --lang=yrl --max-err=0 ~/complin/nheengatu/data/corpus/universal-dependencies/yrl_complin-ud-test.conllu
[Line 25984 Sent Casasnovas2006:11:34:165]: [L1 Format extra-empty-line] Spurious empty line. Only one empty line is expected after every sentence.
[Line 26083 Sent Casasnovas2006:12:4:170 Node 8]: [L3 Syntax too-many-subjects] Multiple subjects [2, 7] not subtyped as ':outer'. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 26143 Sent Casasnovas2006:12:7:173]: [L2 Metadata text-form-mismatch] Mismatch between the text attribute and the FORM field. Form[15] is 'nẽ' but text is 'ne maã suí, sera, paá,...'
[Line 26153 Sent Casasnovas2006:12:7:173]: [L2 Metadata text-extra-chars] Extra characters at the end of the text attribute, not accounted for in the FORM fields: 'ne maã suí, sera, paá, Buburi.'
[Line 26186 Sent Casasnovas2006:12:9:175]: [L2 Metadata text-form-mismatch] Mismatch between the text attribute and the FORM field. Form[1] is 'Aape' but text is 'Aápe, paá, aintá, uyusua...'
[Line 26199 Sent Casasnovas2006:12:9:175]: [L2 Metadata text-extra-chars] Extra characters at the end of the text attribute, not accounted for in the FORM fields: 'Aápe, paá, aintá, uyusuantí, ta uyupirú ta upurungitá'
Format errors: 1
Metadata errors: 4
Syntax errors: 1
*** FAILED *** with 6 errors

@leoalenc
Copy link
Contributor Author

leoalenc commented Sep 11, 2024

@juliana-gurgel , no lugar de Casasnovas2006:11:35:166, há uma linha em branco adicional, causando o erro acima. Faltou mesmo essa sentença ou inexiste?

leoalenc added a commit that referenced this issue Sep 11, 2024
leoalenc added a commit that referenced this issue Sep 11, 2024
@juliana-gurgel
Copy link
Collaborator

juliana-gurgel commented Sep 11, 2024

@juliana-gurgel , no lugar de Casasnovas2006:11:35:166, há uma linha em branco adicional, causando o erro acima. Faltou mesmo essa sentença ou inexiste?

@leoalenc , duas sentenças (Casasnovas2006:11:35:166 e Casasnovas2006:12:5:171) estavam levando muito tempo para serem anotadas. Para acelerar a anotação, pulei-as por enquanto e retomarei a anotação quando terminar a lenda 12. Eu deveria ter colocado essa observação no último commit. Nos próximos farei isso.

leoalenc added a commit that referenced this issue Sep 17, 2024
leoalenc added a commit that referenced this issue Sep 19, 2024
@juliana-gurgel
Copy link
Collaborator

incluir Casasnovas2006:11:35:166

  • incluir Casasnovas2006:11:35:166
  • incluir Casasnovas2006:12:5:171
  • incluir Casasnovas2006:12:12:178
  • incluir Casasnovas2006:12:30:196

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
corpus This issue pertains to corpus data enhancement New feature or request UD Annotation This issue relates to Universal Dependencies annotation
Projects
None yet
Development

No branches or pull requests

3 participants