Commit ebed018

Update README.md
1 parent 6264bcb commit ebed018

1 file changed: +34 -1 lines changed

README.md

Lines changed: 34 additions & 1 deletion
@@ -1 +1,34 @@
# id2vec

**Operating id2vec on an Ubuntu machine:**

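First, run the following commands:

```bash
cd /path/to/id2vec
# Convert Windows (CRLF) line endings to Unix line endings.
dos2unix init.sh preprocess.sh train.sh scripts/splitData.sh
# Make the scripts executable; scripts/splitData.sh is invoked directly below.
chmod 744 init.sh preprocess.sh train.sh scripts/splitData.sh
```
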
Then run `init.sh` to install the necessary packages:

    ./init.sh

Then delete every file under `raw_data` that is not a `.ts` file, and split the data before preprocessing it:

    find raw_data ! -name '*.ts' -type f -exec rm -f {} +
    scripts/splitData.sh raw_data 80
    cd raw_data
    mv train_dir train_dir_tmp
    ../scripts/splitData.sh train_dir_tmp 80
    mv train_dir_tmp/test_dir train_dir_tmp/val_dir
    mv train_dir_tmp/* .
    cd ..
    rmdir raw_data/train_dir_tmp
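
As a sanity check after the split, the sketch below counts the `.ts` files in each resulting directory. It rests on assumptions that the commands above suggest but do not state: that `splitData.sh` creates `train_dir` and `test_dir` inside the directory it is given, so `raw_data` ends up with `train_dir`, `val_dir`, and `test_dir`. If the argument `80` is the training percentage, the two passes would give roughly a 64% / 16% / 20% split. The helpers `count_ts_files` and `report_split` are hypothetical and not part of id2vec.

```bash
#!/usr/bin/env bash
# Hypothetical helpers, not part of id2vec.

# Count the .ts files anywhere under the given directory.
count_ts_files() {
    find "$1" -type f -name '*.ts' | wc -l | tr -d ' '
}

# Print one line per expected split directory.
# Assumed layout: <root>/train_dir, <root>/val_dir, <root>/test_dir.
report_split() {
    local root=$1 dir
    for dir in train_dir val_dir test_dir; do
        if [ -d "$root/$dir" ]; then
            printf '%s: %s\n' "$dir" "$(count_ts_files "$root/$dir")"
        else
            printf '%s: missing\n' "$dir"
        fi
    done
}
```

After sourcing these functions, run `report_split raw_data` from the id2vec directory. A directory reported as `missing` probably means the layout assumption does not hold for your version of `splitData.sh`.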

To preprocess, run the following commands in a shell on your Ubuntu machine so that the script runs in the background:

    ./preprocess.sh &
    disown

Explanation: the `&` operator starts `preprocess.sh` as a background job and immediately returns control of the terminal to your shell. The `disown` command then removes that job from the shell's job table, so the shell no longer sends it a hangup signal (SIGHUP) when it exits. This way, even if your terminal session is terminated (as happens when your ssh connection to the machine drops), the process keeps running in the background.
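
One thing the `&` and `disown` approach leaves open is the script's output, which stays attached to the terminal and may fail to write once the terminal is gone. A minimal sketch of one possible alternative, assuming you want the output kept in a log file, uses the standard `nohup` utility with explicit redirection. The helper name `run_detached` and the log file name are hypothetical and not part of id2vec.

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of id2vec: start a command so that it
# survives the end of the terminal session and logs its output to a file.
run_detached() {
    local log_file=$1
    shift
    # nohup makes the command ignore SIGHUP. stdin comes from /dev/null and
    # stdout/stderr go to the log file, so nothing depends on the terminal.
    nohup "$@" < /dev/null > "$log_file" 2>&1 &
    local pid=$!
    # Remove the job from the shell's job table (best effort).
    disown "$pid" 2>/dev/null || true
    # Print the PID so the caller can check on the process later.
    echo "$pid"
}
```

For example, `pid=$(run_detached preprocess.log ./preprocess.sh)` starts the script, and `tail -f preprocess.log` follows its progress. `kill -0 "$pid"` reports whether it is still running.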
