Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 1.26 KB

README.md

File metadata and controls

17 lines (13 loc) · 1.26 KB

UCCA-Annotated Hebrew Corpus: הנסיך הקטן (The Little Prince)

Pilot version (December 29, 2020)

This bundle contains 58 passages, with a total number of 21600 tokens. It is a pilot version (0.9) of the UCCA annotation of the book "The Little Prince" by Antoine de Saint-Exupéry in its classic translation by Arieh Lerner. The text is taken from the Ben Yehuda project (https://benyehuda.org/read/4041). While the annotation was conducted according to version 2 of UCCA's foundational layer guidelines, since this is a pilot version, the version of the corpus is 0.9.

Please note that since this is still unpublished work, if you decide to use this corpus, please add a link to this repository. We will make sure to add a proper reference once it is published.

Format and Source Code

Information about the format of the xml files and source code for reading and manipulating them are available at https://universalconceptualcognitiveannotation.github.io/.

Licensing

The UCCA annotation is distributed under the "Attribution-ShareAlike 3.0 Unported" license (http://creativecommons.org/licenses/by-sa/3.0/). Please follow the link for exact details.