word2veconlinelearning

The code is based on the Google word2vec, with online learning function added. This version only support negative sampling, and the hierarchical softmax function will be added in the future version. From the newly added function,every word has its own learning parameter,which update sepearately through the training process. In some experiments, this different updating strategy goes beyond the fastText and original word2vec in word-similarity tasks. If you need some new modifications of this code,please feel free to contact me at any time,my email is:zhenyutang2011@gmail.com Please feel free to try this code in your practical tasks.

Usage

run*.sh is the scripts for running word2vec. Everyone can get some running information from these scripts.

TransE + Word2vec

Adding triple information, like from freebase, when train Word2vec model, which make better word embedding.

Lexical Relational + Word2vec

Try to utilize synonyms corpus antonyms corpus and triple corpus in the CBOW model trainng, for better word embedding, which can discover more complex word relation representation and get better performance in synonyms and antonyms recognition.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
LexicalRelational_Word2vec		LexicalRelational_Word2vec
TransE_Word2vec		TransE_Word2vec
README.md		README.md
compute-accuracy.c		compute-accuracy.c
demo-analogy.sh		demo-analogy.sh
demo-classes.sh		demo-classes.sh
demo-phrase-accuracy.sh		demo-phrase-accuracy.sh
demo-phrases.sh		demo-phrases.sh
demo-word-accuracy.sh		demo-word-accuracy.sh
demo-word.sh		demo-word.sh
distance.c		distance.c
distance_fast.c		distance_fast.c
distance_for_inputfile_new.c		distance_for_inputfile_new.c
distance_for_inputfile_new_bak.c		distance_for_inputfile_new_bak.c
distance_txt.c		distance_txt.c
kmeans.c		kmeans.c
makefile		makefile
prepare.sh		prepare.sh
run_alpha.sh		run_alpha.sh
run_multiclass.sh		run_multiclass.sh
testMultiThreads.c		testMultiThreads.c
title_doc_pair.sample		title_doc_pair.sample
vec_for_wordlist.c		vec_for_wordlist.c
word-analogy.c		word-analogy.c
word2phrase		word2phrase
word2phrase.c		word2phrase.c
word2vec		word2vec
word2vec.c		word2vec.c
word2vec_multiclass		word2vec_multiclass
word2vec_multiclass.c		word2vec_multiclass.c
word2vec_title_context.py		word2vec_title_context.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

word2veconlinelearning

Usage

TransE + Word2vec

Lexical Relational + Word2vec

About

Releases

Packages

Languages

tangzhenyu/word2vecIncrementalLearning

Folders and files

Latest commit

History

Repository files navigation

word2veconlinelearning

Usage

TransE + Word2vec

Lexical Relational + Word2vec

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages