Skip to content
This repository was archived by the owner on Nov 7, 2020. It is now read-only.

Commit aefe64b

Browse files
committed
Add real data and simple scrips for preparation
1 parent 8957d07 commit aefe64b

File tree

3 files changed

+17
-0
lines changed

3 files changed

+17
-0
lines changed

dat/fb15k.tgz

7.31 MB
Binary file not shown.

dat/wordnet-mlj12.tar.gz

3.14 MB
Binary file not shown.

scripts/preprocess.sh

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
2+
cd ../dat
3+
4+
# Freebase
5+
tar zxvf fb15k.tgz
6+
cd FB15k
7+
cat freebase_mtr100_mte100-train.txt | cut -f 2 | sort | uniq > train.rellist
8+
cat freebase_mtr100_mte100-train.txt | cut -f 1,3 | perl -pe 's/\t/\n/g' | sort | uniq > train.entlist
9+
cat freebase_mtr100_mte100-train.txt freebase_mtr100_mte100-valid.txt freebase_mtr100_mte100-test.txt > whole.txt
10+
11+
# WordNet
12+
cd ..
13+
tar zxvf wordnet-mlj12.tar.gz
14+
cd wordnet-mlj12
15+
cat wordnet-mlj12-train.txt | cut -f 2 | perl -pe 's/\t/\n/g' | sort | uniq > train.rellist
16+
cat wordnet-mlj12-train.txt | cut -f 1,3 | perl -pe 's/\t/\n/g' | sort | uniq > train.entlist
17+
cat wordnet-mlj12-train.txt wordnet-mlj12-valid.txt wordnet-mlj12-test.txt > whole.txt

0 commit comments

Comments
 (0)