JobDeepSearch

Job Search Engine based on semantics and documents embedding Word2Vec

Notebook available on Google Colab.

Job Search and Ranking function based on Semantics and Word Embeddings

This notebook presents a mockup of a search and ranking function based on semantics of 20000 job descriptions (dataset extracted from Monster.com jobs). This methodology using word embeddings captures the context and the semantics of the analysed text, compared to a classic search function based on words counts per documents (Vector Space Model and Term Frequency-Inverse Document Frequency).

A word embedding Work2Vec model is build from these descriptions to capture the semantics and the context. This model is then enriched with a generic Word2Vec model based on a Google News corpus, the job descriptions being not sufficient to build a full language model.

The resulted ranking of a search is based on the cosine similarity between the query and the different job descriptions scored with the word embedding model (300 dim vector).

The TSNE dimension reduction method allows to visualise the job descriptions in a 3D space.

Possible improvement: TF-IDF weighting for job descriptions scoring

Dependencies

Numpy
Gensim for text processing and word2vec model
nltk
tensorflow for T-SNE visualisation

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
JobSearchWord2Vec.ipynb		JobSearchWord2Vec.ipynb
README.md		README.md
tensorboard.gif		tensorboard.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

JobDeepSearch

Job Search and Ranking function based on Semantics and Word Embeddings

Dependencies

About

Uh oh!

Releases

Packages

Languages

Benjamin-VdB/JobDeepSearch

Folders and files

Latest commit

History

Repository files navigation

JobDeepSearch

Job Search and Ranking function based on Semantics and Word Embeddings

Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages