SWE Toolkit. Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints. A general framework to incorporate semantic knowledge into the popular data-driven learning process of word vectors. Applications include word similarity, sentence completion, etc. ACL-2015, Beijing, China

iunderstand/SWE

Semantic Word Embeddings (SWE)

SWE is a general framework for incorporating semantic knowledge into the data-driven learning of word embeddings in order to improve their quality. Under the SWE framework, semantic knowledge is expressed as a set of ordinal ranking inequalities, and the learning of word vectors is formulated as a constrained optimization problem: the data-derived objective function is optimized subject to all ordinal knowledge inequality constraints extracted from available knowledge resources such as thesauri, WordNet, and knowledge graphs. We have demonstrated that this constrained optimization problem can be solved efficiently by stochastic gradient descent (SGD), even with a large number of inequality constraints. Experimental results on four standard NLP tasks (word similarity, sentence completion, named entity recognition, and TOEFL synonym selection) show that the quality of the learned word vectors improves significantly when semantic knowledge is incorporated as inequality constraints during the learning of word embeddings.
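
One way to picture the constrained formulation: an ordinal constraint of the form sim(w_i, w_j) > sim(w_i, w_k) can be relaxed into a hinge penalty max(0, sim(w_i, w_k) - sim(w_i, w_j)), and SGD then updates only the vectors that take part in a violated constraint. The sketch below is a minimal illustration under those assumptions and is not the SWE_Train implementation. The function names, the use of cosine similarity, the toy vectors, and the learning rate are all hypothetical, and the data-derived (Skip-gram) term of the objective is left out.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def cosine_grad(u, v):
    """Gradient of cosine(u, v) with respect to u."""
    nu, nv = np.linalg.norm(u), np.linalg.norm(v)
    return v / (nu * nv) - cosine(u, v) * u / (nu ** 2)

def hinge_penalty(emb, constraints):
    """Sum of max(0, sim(i, k) - sim(i, j)) over constraints (i, j, k).

    Each triple (i, j, k) encodes the ordinal constraint sim(i, j) > sim(i, k).
    """
    return sum(
        max(0.0, cosine(emb[i], emb[k]) - cosine(emb[i], emb[j]))
        for i, j, k in constraints
    )

def sgd_step(emb, constraint, lr=0.1):
    """One SGD update on the hinge penalty of a single constraint (in place)."""
    i, j, k = constraint
    if cosine(emb[i], emb[k]) - cosine(emb[i], emb[j]) <= 0:
        return  # constraint already satisfied: the penalty has zero gradient
    # Take copies so every gradient is computed from the pre-update vectors.
    wi, wj, wk = emb[i].copy(), emb[j].copy(), emb[k].copy()
    emb[i] -= lr * (cosine_grad(wi, wk) - cosine_grad(wi, wj))
    emb[j] += lr * cosine_grad(wj, wi)   # raise sim(i, j)
    emb[k] -= lr * cosine_grad(wk, wi)   # lower sim(i, k)

# Toy example (hypothetical vectors): require sim(happy, glad) > sim(happy, sad).
emb = {
    "happy": np.array([1.0, 0.0]),
    "glad": np.array([0.0, 1.0]),
    "sad": np.array([1.0, 0.1]),
}
constraints = [("happy", "glad", "sad")]
print("penalty before:", hinge_penalty(emb, constraints))
for _ in range(200):
    for c in constraints:
        sgd_step(emb, c)
print("penalty after:", hinge_penalty(emb, constraints))
```

In a full training loop, a penalty step like this would be interleaved with the usual updates on the data-derived objective, with a weight balancing the two terms.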

The main SWE tools are:

  • SWE_Train: the main tool; supports training the SWE model as well as the Skip-gram model.
  • SWE_Test_WordSim: tool for applying word embeddings to word similarity.
  • SWE_Test_SentComplete: tool for applying word embeddings to sentence completion.
  • SWE_Test_SynSel: tool for applying word embeddings to synonym selection.
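
To give a feel for what a word similarity evaluation involves, the sketch below scores embeddings by comparing their cosine similarities with human ratings using Spearman rank correlation, a common choice for this task. It is an assumption about the evaluation protocol rather than a description of SWE_Test_WordSim; the function names, toy vectors, and ratings are hypothetical.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def ranks(values):
    """Rank of each value (0 = smallest). Assumes there are no ties."""
    order = np.argsort(values)
    r = np.empty(len(values))
    r[order] = np.arange(len(values))
    return r

def spearman(x, y):
    """Spearman correlation, computed as the Pearson correlation of the ranks."""
    return float(np.corrcoef(ranks(x), ranks(y))[0, 1])

def evaluate_word_similarity(emb, rated_pairs):
    """Correlate model similarities with human scores for (word_a, word_b, score)."""
    model = [cosine(emb[a], emb[b]) for a, b, _ in rated_pairs]
    human = [score for _, _, score in rated_pairs]
    return spearman(model, human)

# Toy data (hypothetical vectors and ratings).
emb = {
    "cat": np.array([1.0, 0.0]),
    "dog": np.array([0.9, 0.1]),
    "car": np.array([0.0, 1.0]),
}
rated_pairs = [("cat", "dog", 9.0), ("cat", "car", 1.0), ("dog", "car", 2.0)]
print(evaluate_word_similarity(emb, rated_pairs))
```

Real benchmarks contain tied ratings, so a production evaluation would need a tie-aware ranking instead of the simple `ranks` helper used here.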

SWE applications include:

  • Word Similarity
  • Sentence Completion
  • Named Entity Recognition
  • Synonym Selection
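
As an illustration of how embeddings can be applied to synonym selection, the sketch below picks the candidate whose vector is most similar to the target word. This is a minimal example under assumed conventions (cosine similarity, a dictionary of vectors) and is not taken from SWE_Test_SynSel; the function names and toy vectors are hypothetical.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

def select_synonym(emb, target, candidates):
    """Return the candidate with the highest cosine similarity to the target."""
    return max(candidates, key=lambda c: cosine(emb[target], emb[c]))

# Toy multiple-choice question (hypothetical vectors).
emb = {
    "big": np.array([1.0, 0.1]),
    "large": np.array([1.0, 0.2]),
    "small": np.array([-1.0, 0.1]),
    "green": np.array([0.0, 1.0]),
}
print(select_synonym(emb, "big", ["small", "large", "green"]))
```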

Quan Liu

E-mail: quanliu@mail.ustc.edu.cn

http://home.ustc.edu.cn/~quanliu/
