- Implementation of Word2Vec using Continuous Bag of Words (CBOW)
- Intrinsic evaluation using the analogy test set from "Efficient Estimation of Word Representations in Vector Space"
- Implementation of Word2Vec using Skip-gram
- t-SNE visualization of the embeddings of analogy pairs
- Nearest-neighbors analysis for finding similar words
- TODO: filter to the N most common words in the training corpus and mark the rest as OOV
- TODO: download a larger dataset (the GloVe paper uses Gigaword5, Wikipedia 2014, and Common Crawl)
- TODO: Train GloVe embeddings
- TODO: increase the embedding (context vector) size to 300, depending on training speed
- TODO: evaluate on the WordSim-353 word similarity task used in the GloVe paper
- TODO: Extrinsic model evaluation (NER)
- TODO: Write unit tests for model training and inference on small data
- Developed with Python 3.9, but it should work on other Python 3 versions
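For reference, the CBOW objective listed above can be sketched in a few lines of NumPy: average the context embeddings, score every vocabulary word with a softmax, and take one SGD step toward the true center word. This is a minimal illustration only, not the code in `src/` — all names here (`cbow_step`, `W_in`, `W_out`) are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

vocab = ["the", "quick", "brown", "fox", "jumps"]
word_to_id = {w: i for i, w in enumerate(vocab)}
V, D = len(vocab), 8                    # vocabulary size, embedding dimension

W_in = rng.normal(0, 0.1, (V, D))       # input (context) embeddings
W_out = rng.normal(0, 0.1, (D, V))      # output (center-word) weights

def cbow_step(W_in, W_out, context_ids, center_id, lr=0.1):
    """One SGD step on the softmax cross-entropy loss; mutates W_in/W_out."""
    h = W_in[context_ids].mean(axis=0)  # average the context vectors
    scores = h @ W_out
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()                # softmax over the vocabulary
    loss = -np.log(probs[center_id])
    # Backpropagate: d(loss)/d(scores) = probs - one_hot(center)
    d_scores = probs.copy()
    d_scores[center_id] -= 1.0
    d_h = W_out @ d_scores
    W_out -= lr * np.outer(h, d_scores)
    W_in[context_ids] -= lr * d_h / len(context_ids)
    return loss
```

Repeated calls on the same (context, center) pair drive the loss down; full training in word2vec also uses negative sampling or hierarchical softmax rather than this full softmax.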
```shell
cd deep-learning-skunk-works/
export PYTHONPATH=$(pwd)
export PROJECT_ROOT=$(pwd)
pip install -r requirements.txt
```
- Set model name (e.g. cbow, skipgram, ...)
```shell
export MODEL='cbow'
```
- Launch tensorboard
```shell
tensorboard --logdir=data/$MODEL/models/
```
- Train model
```shell
python src/main.py --train --model $MODEL
```
- Evaluate model
```shell
python src/main.py --eval --model $MODEL
```
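The intrinsic evaluation and the nearest-neighbors analysis both reduce to cosine similarity in embedding space; analogies are typically solved with the 3CosAdd rule (vec(b) - vec(a) + vec(c), excluding the query words). A minimal sketch of both, assuming a plain embedding matrix plus a word-to-index dict — the function names are hypothetical, not this repo's API:

```python
import numpy as np

def nearest(emb, word_to_id, query_vec, exclude=(), k=3):
    """Return the k words whose embeddings are most cosine-similar to query_vec."""
    sims = (emb @ query_vec) / (
        np.linalg.norm(emb, axis=1) * np.linalg.norm(query_vec)
    )
    id_to_word = {i: w for w, i in word_to_id.items()}
    ranked = [id_to_word[i] for i in np.argsort(-sims)
              if id_to_word[i] not in exclude]
    return ranked[:k]

def analogy(emb, word_to_id, a, b, c):
    """Solve a : b :: c : ? with 3CosAdd, skipping the three query words."""
    v = emb[word_to_id[b]] - emb[word_to_id[a]] + emb[word_to_id[c]]
    return nearest(emb, word_to_id, v, exclude={a, b, c}, k=1)[0]
```

On the analogy test set, accuracy is simply the fraction of quadruples (a, b, c, d) for which `analogy(emb, word_to_id, a, b, c) == d`.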