Commit

Remove obsolete note on flat dir structure in Samples.
TomazErjavec committed Oct 16, 2024
1 parent 4bbd8dd commit 2bc9d17
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions CONTRIBUTING.md
@@ -3,7 +3,9 @@
## Git and GitHub

Sample data should be pushed to the Data branch of the ParlaMint repository directly into the samples folder
-(*`Samples/ParlaMint-XX`*) in a flat structure of files.
+(*`Samples/ParlaMint-XX`*) with a directory structure as explained in the [Section
+on Filenames and directory structure](https://clarin-eric.github.io/ParlaMint/#sec-files) of the
+Guidelines.

### Setup

@@ -99,8 +101,7 @@ also be run.
Once samples have been validated and incorporated into the ParlaMint GitHub repository the
complete corpus can be processed and submitted.

-First, pls. note that the samples in GitHub use a flat directory structure, while the complete
-corpus is structure differently. First, the linguistically non-annotated corpus should be stored
+First, the linguistically un-annotated corpus should be stored
in the directory named ParlaMint-XX.TEI/, while the linguistically annotated corpus should be
stored separately, in the directory named ParlaMint-XX.TEI.ana/. Second, the component files
should be stored in subdirectories, one for each year. Note that this is explained in the [Section
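
The layout described in the hunk above (one root directory for the un-annotated corpus, a separate `.TEI.ana` root for the annotated one, and one subdirectory per year) could be sketched roughly as follows. This is only an illustration: the helper name `component_dir` is hypothetical, and naming each year subdirectory by its four-digit year is an assumption; the Guidelines section linked in the diff remains the authoritative description.

```python
from pathlib import PurePosixPath


def component_dir(country: str, year: int, annotated: bool = False) -> PurePosixPath:
    """Return the directory that would hold component files for one year.

    Hypothetical helper, not part of the ParlaMint tooling.
    `country` stands in for the XX placeholder used in CONTRIBUTING.md.
    """
    # Un-annotated corpus: ParlaMint-XX.TEI/, annotated corpus: ParlaMint-XX.TEI.ana/
    root = f"ParlaMint-{country}.TEI" + (".ana" if annotated else "")
    # Assumption: the per-year subdirectory is named by the four-digit year.
    return PurePosixPath(root) / str(year)


print(component_dir("XX", 2024))
print(component_dir("XX", 2024, annotated=True))
```

Keeping the two roots apart mirrors the requirement that the annotated corpus be stored separately from the un-annotated one.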
