Skip to content

Commit 4ee2f1c

Browse files
committed
Licenses sync
1 parent 9ea936e commit 4ee2f1c

File tree

7 files changed

+2912
-4
lines changed

7 files changed

+2912
-4
lines changed
Lines changed: 69 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,69 @@
1+
<div id="faq-like">
2+
<p><h1 id="anchor">LINDAT Web Crawl Licence</h1></p>
3+
<div>(2024/12/13)</div>
4+
<hr></hr>
5+
<div class="well">
6+
<h2>License Terms</h2>
7+
<p>LINDAT Web Crawl Licence (further as “Agreement”) between The
8+
publisher of the dataset (“The Publisher”) and The user of the dataset
9+
(“The User”).</p>
10+
<p>The User agrees to use the collection data set collected by The
11+
Publisher (the "Collection"). The User agrees to abide by the following
12+
understandings, terms and conditions. These understandings, terms and
13+
conditions apply equally to all or to part of the Collection, including
14+
any updates or new versions of the Collection supplied under this
15+
Agreement.</p>
16+
<p><h3 id="anchor-1">Copyright</h3></p>
17+
<p>The Collection has been obtained by crawling the Internet. Due to the
18+
size of the Collection it has not been practicable to obtain permission
19+
from copyright owners to provide the Collection for the uses permitted
20+
under this Agreement (“Permitted Uses”).</p>
21+
<p>The User understands that all the documents in the Collection are
22+
documents which have been at some time made publicly available on the
23+
Internet and which have been collected using a process which respects
24+
the commonly accepted methods (such as robots.txt) for indicating that
25+
the documents should not be so collected.</p>
26+
<p>Owners of copyright in individual documents may choose to request
27+
deletion of these documents from the Collection.</p>
28+
<p>The limitation on permitted use contained in the following section is
29+
intended to reduce the risk of any action being brought by copyright
30+
owners, but if this happens The User agrees to bear all associated
31+
liability.</p>
32+
<p><h3 id="anchor-2">Permitted Uses</h3></p>
33+
<p>The Collection may only be used for research and development of
34+
natural-language processing, information-retrieval or
35+
document-understanding systems.</p>
36+
<p>Summaries, analyses and interpretations of the linguistic properties
37+
of the Collection may be derived and published, provided it is not
38+
possible to reconstruct the Collection from these summaries.</p>
39+
<p>Small excerpts of the Collection may be displayed to others or
40+
published in a scientific or technical context, solely for the purpose
41+
of describing the research and development carried out and related
42+
issues.</p>
43+
<p>All efforts must be made not to infringe the rights of any third
44+
party including, but not limited to, the authors and publishers of any
45+
excerpts used in accordance with the clauses above in this “Permitted
46+
Uses” section.</p>
47+
<p>The User must make sure that they only display the Collection to or
48+
share the Collection with persons who also signed this Agreement with
49+
The Publisher.</p>
50+
<p><h3 id="anchor-3">Agreement to Delete Data on Request</h3></p>
51+
<p>The User undertakes to delete within thirty days of receiving notice
52+
all copies of any nominated document that is part of the Collection
53+
whenever requested to do so by either The Publisher or by the owner of
54+
copyright for the particular document.</p>
55+
<p><h3 id="anchor-4">No Warranty</h3></p>
56+
<p>The Collection is provided "as is", without warranty of any kind,
57+
express or implied, including but not limited to the warranties of
58+
merchantability, fitness for a particular purpose and noninfringement.
59+
In no event shall The Publisher be liable for any claim, damages or
60+
other liability, whether in an action of contract, tort or otherwise,
61+
arising in any way of the use of the Collection.</p>
62+
<p><h3 id="anchor-5">Termination</h3></p>
63+
<p>Either The Publisher or The User may terminate this Agreement at any time
64+
by notifying the other party in writing. On termination of the Agreement
65+
The User shall destroy all copies of the Collection.</p>
66+
<p><h3 id="anchor-6">Applicable Law</h3></p>
67+
<p>This Agreement is governed by the laws of the Czech Republic.</p>
68+
</div>
69+
</div>
Lines changed: 143 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,143 @@
1+
<div id="faq-like">
2+
<h2 id="ufal-point-faq">CorefUD v1.3 License Agreement</h2>
3+
<div>
4+
(2025/04/10)
5+
</div>
6+
<hr />
7+
<div class="well">
8+
<h3>CorefUD v1.3 License Terms</h3>
9+
<p>CorefUD v1.3 (referred to as &#x201C;CorefUD&#x201D; in the rest of this document)
10+
is a collection of linguistic data. Each of the corpora has its own license terms
11+
and you (the &#x201C;User&#x201D;) are responsible for complying with the license terms
12+
applicable to those parts of CorefUD which you use. If you do not agree with the license terms,
13+
you must stop using CorefUD and destroy all copies of CorefUD data that you have obtained.</p>
14+
<br />
15+
<p class="alert alert-danger">You are specifically reminded that some of the corpora
16+
permit only non-commercial usage.</p>
17+
<br />
18+
<p>The license for every corpus included in the release is specified in the appropriate
19+
corpus directory.</p>
20+
<br />
21+
22+
<h3>Overview of the corpora and their license terms</h3>
23+
<table class="table table-striped">
24+
<thead>
25+
<tr><th>Corpus</th><th>License</th></tr>
26+
</thead>
27+
<tbody>
28+
<tr>
29+
<td>CorefUD_Ancient_Greek-PROIEL</td>
30+
<td>CC BY-NC-SA 4.0</td>
31+
</tr>
32+
<tr>
33+
<td>CorefUD_Ancient_Hebrew-PTNK</td>
34+
<td>CC BY-NC 4.0</td>
35+
</tr>
36+
<tr>
37+
<td>CorefUD_Catalan-AnCora</td>
38+
<td>CC BY 4.0</td>
39+
</tr>
40+
<tr>
41+
<td>CorefUD_Czech-PCEDT</td>
42+
<td>CC BY-NC-SA 3.0</td>
43+
</tr>
44+
<tr>
45+
<td>CorefUD_Czech-PDT</td>
46+
<td>CC BY-NC-SA 4.0</td>
47+
</tr>
48+
<tr>
49+
<td>CorefUD_English-GUM</td>
50+
<td>CC BY-NC-SA 4.0</td>
51+
</tr>
52+
<tr>
53+
<td>CorefUD_English-LitBank</td>
54+
<td>CC BY 4.0</td>
55+
</tr>
56+
<tr>
57+
<td>CorefUD_English-ParCorFull</td>
58+
<td>CC BY-NC 4.0</td>
59+
</tr>
60+
<tr>
61+
<td>CorefUD_French-ANCOR</td>
62+
<td>CC BY-NC-SA 4.0</td>
63+
</tr>
64+
<tr>
65+
<td>CorefUD_French-Democrat</td>
66+
<td>CC BY-SA 4.0</td>
67+
</tr>
68+
<tr>
69+
<td>CorefUD_German-ParCorFull</td>
70+
<td>CC BY-NC 4.0</td>
71+
</tr>
72+
<tr>
73+
<td>CorefUD_German-PotsdamCC</td>
74+
<td>CC BY-NC-SA 4.0</td>
75+
</tr>
76+
<tr>
77+
<td>CorefUD_Hindi-HDTB</td>
78+
<td>CC BY-NC-SA 4.0</td>
79+
</tr>
80+
<tr>
81+
<td>CorefUD_Hungarian-KorKor</td>
82+
<td>CC BY 4.0</td>
83+
</tr>
84+
<tr>
85+
<td>CorefUD_Hungarian-SzegedKoref</td>
86+
<td>CC BY 4.0</td>
87+
</tr>
88+
<tr>
89+
<td>CorefUD_Korean-ECMT</td>
90+
<td>CC BY 4.0</td>
91+
</tr>
92+
<tr>
93+
<td>CorefUD_Lithuanian-LCC</td>
94+
<td>CLARIN-LT End User License</td>
95+
</tr>
96+
<tr>
97+
<td>CorefUD_Norwegian-BokmaalNARC</td>
98+
<td>CC BY-SA 4.0</td>
99+
</tr>
100+
<tr>
101+
<td>CorefUD_Norwegian-NynorskNARC</td>
102+
<td>CC BY-SA 4.0</td>
103+
</tr>
104+
<tr>
105+
<td>CorefUD_Old_Church_Slavonic-PROIEL</td>
106+
<td>CC BY-NC-SA 4.0</td>
107+
</tr>
108+
<tr>
109+
<td>CorefUD_Polish-PCC</td>
110+
<td>CC BY 3.0</td>
111+
</tr>
112+
<tr>
113+
<td>CorefUD_Russian-RuCor</td>
114+
<td>CC BY-SA 4.0</td>
115+
</tr>
116+
<tr>
117+
<td>CorefUD_Spanish-AnCora</td>
118+
<td>CC BY 4.0</td>
119+
</tr>
120+
<tr>
121+
<td>CorefUD_Turkish-ITCC</td>
122+
<td>CC BY-NC-SA 4.0</td>
123+
</tr>
124+
</tbody>
125+
</table>
126+
127+
<h3>Licenses</h3>
128+
<table class="table">
129+
<thead>
130+
<tr><th>License</th><th>URL</th></tr>
131+
</thead>
132+
<tbody>
133+
<tr><td>CC BY 3.0</td><td><a href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</a></td></tr>
134+
<tr><td>CC BY 4.0</td><td><a href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</a></td></tr>
135+
<tr><td>CC BY-NC 4.0</td><td><a href="http://creativecommons.org/licenses/by-nc/4.0/">http://creativecommons.org/licenses/by-nc/4.0/</a></td></tr>
136+
<tr><td>CC BY-NC-SA 3.0</td><td><a href="http://creativecommons.org/licenses/by-nc-sa/3.0/">http://creativecommons.org/licenses/by-nc-sa/3.0/</a></td></tr>
137+
<tr><td>CC BY-NC-SA 4.0</td><td><a href="http://creativecommons.org/licenses/by-nc-sa/4.0/">http://creativecommons.org/licenses/by-nc-sa/4.0/</a></td></tr>
138+
<tr><td>CC BY-SA 4.0</td><td><a href="http://creativecommons.org/licenses/by-sa/4.0/">http://creativecommons.org/licenses/by-sa/4.0/</a></td></tr>
139+
<tr><td>CLARIN-LT End User License</td><td><a href="https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm">https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm</a></td></tr>
140+
</tbody>
141+
</table>
142+
</div>
143+
</div>

src/static-files/license-pcedt2.html

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,8 @@
66
-->
77

88
<div class="alert alert-info" role="alert">
9-
Location of the original license: <a href="http://ufal.mff.cuni.cz/pcedt2.0/en/pcedt_license.html">http://ufal.mff.cuni.cz/pcedt2.0/en/pcedt_license.html</a>. A copy of the original license (as on 13-Aug-2014) is given below.
9+
<p>The text of the license (as on 13-Aug-2014) is given below.</p>
10+
<p>NOTE: On 19-Jun-2024 the links to LDC were updated to reflect the current location of Treebank 3.</p>
1011
</div>
1112

1213
<h2 align="center" id="ufal-point-faq"><b>Prague Czech English Dependency Treebank 2.0 License Agreement</b></h2>
@@ -51,11 +52,11 @@ <h2 align="center" id="ufal-point-faq"><b>Prague Czech English Dependency Treeba
5152
<div class="well well-sm">
5253
<p>Prague Czech English Dependency Treebank 2.0 (PCEDT) is a parallel,
5354
bilingual corpus. The English Part is based on Penn Treebank 3 and
54-
therefore to use PCEDT, you need a license for <a href="http://www.ldc.upenn.edu/Catalog/catalogEntry.jsp?catalogId=LDC99T42">Treebank 3</a>.
55+
therefore to use PCEDT, you need a license for <a href="https://catalog.ldc.upenn.edu/LDC99T42">Treebank 3</a>.
5556
</p>
5657

5758
<p>The dependency annotation of the English data, as well as all the Czech data
58-
is licensed under the terms of <a href="http://www.ldc.upenn.edu/Catalog/catalogEntry.jsp?catalogId=LDC99T42">CC-BY-NC-SA 3.0</a>.
59+
is licensed under the terms of <a href="https://creativecommons.org/licenses/by-nc-sa/3.0/">CC-BY-NC-SA 3.0</a>.
5960
</p>
6061
</div>
6162

@@ -65,7 +66,7 @@ <h2 align="center" id="ufal-point-faq"><b>Prague Czech English Dependency Treeba
6566

6667
<div class="well well-sm">
6768
<p>By accepting this license and downloading the data you declare you have a valid
68-
license for <a href="http://www.ldc.upenn.edu/Catalog/catalogEntry.jsp?catalogId=LDC99T42">Treebank 3</a>.
69+
license for <a href="https://catalog.ldc.upenn.edu/LDC99T42">Treebank 3</a>.
6970
</p>
7071
</div>
7172

Lines changed: 45 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,45 @@
1+
<div id="faq-like">
2+
<h2 id="ufal-point-faq">The SELEXINI Corpora License Agreement</h2>
3+
<div>
4+
(2025/01/10)
5+
</div>
6+
<hr />
7+
<div class="well">
8+
<h3>License Terms</h3>
9+
<p>The SELEXINI Corpora is a collection of linguistic data. Each of the corpora has its own license terms and you (the “User”) are responsible for complying with the license terms applicable to those parts which you use. If you do not agree with the license terms, you must stop using the corpora and destroy all copies of the data that you have obtained.</p>
10+
<br />
11+
<p>The licenses for the annotations (columns 1,3-10) and words are different, which is indicated in the table below. All files in the bin/ and trial/ folders are licensed under CC BY 4.0.</p>
12+
<br />
13+
14+
<h3>Overview of the corpora and their license terms</h3>
15+
<table class="table table-striped">
16+
<thead>
17+
<tr><th>Original Corpora</th><th>Words (column 2)</th><th>Annotations (columns 1,3-10)</th></tr>
18+
</thead>
19+
<tbody>
20+
<tr>
21+
<td>HPLT</td>
22+
<td>CC0</td>
23+
<td>CC BY-NC-SA</td>
24+
</tr>
25+
<tr>
26+
<td>BigScience</td>
27+
<td>RAIL</td>
28+
<td>CC BY-NC-SA</td>
29+
</tr>
30+
</tbody>
31+
</table>
32+
33+
<h3>Licenses</h3>
34+
<table class="table">
35+
<thead>
36+
<tr><th>License</th><th>URL</th></tr>
37+
</thead>
38+
<tbody>
39+
<tr><td>CC0</td><td><a href="https://creativecommons.org/public-domain/cc0">https://creativecommons.org/public-domain/cc0</a></td></tr>
40+
<tr><td>RAIL</td><td><a href="https://huggingface.co/spaces/bigscience/license">https://huggingface.co/spaces/bigscience/license</a></td></tr>
41+
<tr><td>CC BY-NC-SA</td><td><a href="https://creativecommons.org/licenses/by-nc-sa/4.0/">https://creativecommons.org/licenses/by-nc-sa/4.0/</a></td></tr>
42+
</tbody>
43+
</table>
44+
</div>
45+
</div>

0 commit comments

Comments
 (0)