Code, Reports and Data for some NLP Techniques/Algorithms
-

Multinomial Classification
-
The dataset is downloadable at https://s3.amazonaws.com/amazon-reviews-pds/tsv/amazon_reviews_us_Beauty_v1_00.tsv.gz. Used the NLTK package to preprocess the dataset: removed stop words and performed lemmatization. Used scikit-learn to extract TF-IDF features. Trained Perceptron, SVM, Logistic Regression, and Multinomial Naive Bayes models.
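
A minimal sketch of this pipeline is below, assuming the `review_body` and `star_rating` columns of the Amazon TSV; the project's exact label mapping, preprocessing details, and vectorizer settings are not recorded in this README.

```python
import nltk
import pandas as pd
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression, Perceptron
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import LinearSVC

nltk.download("stopwords")
nltk.download("wordnet")
stop_words = set(stopwords.words("english"))
lemmatizer = WordNetLemmatizer()

def preprocess(text):
    # Lowercase, drop stop words, and lemmatize the remaining tokens.
    return " ".join(lemmatizer.lemmatize(w) for w in str(text).lower().split()
                    if w not in stop_words)

df = pd.read_csv("amazon_reviews_us_Beauty_v1_00.tsv.gz", sep="\t",
                 usecols=["review_body", "star_rating"],
                 on_bad_lines="skip").dropna()
X_train, X_test, y_train, y_test = train_test_split(
    df["review_body"].map(preprocess), df["star_rating"],
    test_size=0.2, random_state=42)

# TF-IDF features over the cleaned text (the feature cap is illustrative).
vectorizer = TfidfVectorizer(max_features=50_000)
X_train_tfidf = vectorizer.fit_transform(X_train)
X_test_tfidf = vectorizer.transform(X_test)

# The four models named above; LinearSVC stands in for "SVM".
for model in (Perceptron(), LinearSVC(),
              LogisticRegression(max_iter=1000), MultinomialNB()):
    model.fit(X_train_tfidf, y_train)
    print(type(model).__name__)
    print(classification_report(y_test, model.predict(X_test_tfidf)))
```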

HMM POS Tagging using Greedy and Viterbi Algorithms
-
Created a vocabulary from the training data as input to the HMM (vocab.txt). Trained the HMM by computing transition and emission probabilities from the training counts. Implemented both the greedy and Viterbi decoding algorithms.
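
A minimal sketch of the two decoders is below, assuming the transition and emission probabilities were already estimated from training counts; `prior`, `trans`, and `emit` are illustrative names, not necessarily the project's actual data structures.

```python
import math

def greedy(words, tags, prior, trans, emit, unk=1e-10):
    """Greedy decoding: pick the locally best tag at each position.

    prior[t]      -- P(t at sentence start)
    trans[(s, t)] -- P(t | previous tag s)
    emit[(t, w)]  -- P(w | t)
    """
    path = []
    for i, w in enumerate(words):
        if i == 0:
            path.append(max(tags, key=lambda t: prior.get(t, unk) * emit.get((t, w), unk)))
        else:
            path.append(max(tags, key=lambda t: trans.get((path[-1], t), unk) * emit.get((t, w), unk)))
    return path

def viterbi(words, tags, prior, trans, emit, unk=1e-10):
    """Viterbi decoding: globally most probable tag sequence under the HMM."""
    # V[i][t] = best log-probability of any tag path ending in t at position i.
    V = [{t: math.log(prior.get(t, unk)) + math.log(emit.get((t, words[0]), unk))
          for t in tags}]
    back = [{}]
    for i in range(1, len(words)):
        V.append({})
        back.append({})
        for t in tags:
            # Best previous tag s, maximizing path score plus transition log-prob.
            s_best, score = max(((s, V[i - 1][s] + math.log(trans.get((s, t), unk)))
                                 for s in tags), key=lambda x: x[1])
            V[i][t] = score + math.log(emit.get((t, words[i]), unk))
            back[i][t] = s_best
    # Recover the best path by following backpointers from the best final tag.
    t = max(V[-1], key=V[-1].get)
    path = [t]
    for i in range(len(words) - 1, 0, -1):
        t = back[i][t]
        path.append(t)
    return path[::-1]
```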

Word2Vec, FNN and RNN
-
Generated Word2Vec features for the dataset using the Gensim library. Loaded the pretrained "word2vec-google-news-300" Word2Vec model and extracted word embeddings from it. Also trained a Word2Vec model on the dataset itself and compared its features against those from "word2vec-google-news-300". Trained a single perceptron and an SVM model on these features for the classification problem. Using the Word2Vec features, trained a feedforward multilayer perceptron network for classification, with two hidden layers of 100 and 10 nodes respectively. Also using the Word2Vec features, trained recurrent neural networks (RNNs) with both a gated recurrent unit (GRU) cell and an LSTM cell.
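
A minimal sketch of the feature extraction and the two network types is below. Gensim's downloader supplies the pretrained model; PyTorch, the three-class label set, and every dimension other than the 100- and 10-node hidden layers are assumptions.

```python
import gensim.downloader as api
import numpy as np
import torch
import torch.nn as nn

# Pretrained 300-dim Google News vectors, as named above.
w2v = api.load("word2vec-google-news-300")

# Training a Word2Vec model on the corpus itself would look like
# (parameters illustrative):
#   from gensim.models import Word2Vec
#   own = Word2Vec(tokenized_sentences, vector_size=300, window=5, min_count=10)

def sentence_vector(text, dim=300):
    # Average the vectors of in-vocabulary tokens; zeros if none are known.
    vecs = [w2v[w] for w in text.lower().split() if w in w2v]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim, dtype=np.float32)

class FNN(nn.Module):
    # Feedforward MLP with the two hidden layers (100 and 10 nodes) described above.
    def __init__(self, num_classes, dim=300):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, 100), nn.ReLU(),
            nn.Linear(100, 10), nn.ReLU(),
            nn.Linear(10, num_classes))

    def forward(self, x):
        return self.net(x)

class RNNClassifier(nn.Module):
    # Recurrent classifier over sequences of Word2Vec vectors; cell is "gru" or "lstm".
    def __init__(self, num_classes, dim=300, hidden=64, cell="gru"):
        super().__init__()
        rnn_cls = nn.GRU if cell == "gru" else nn.LSTM
        self.rnn = rnn_cls(dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, num_classes)

    def forward(self, x):             # x: (batch, seq_len, dim)
        h, _ = self.rnn(x)
        return self.out(h[:, -1])     # classify from the last time step

fnn = FNN(num_classes=3)              # hypothetical three-class problem
x = torch.tensor(np.stack([sentence_vector("this product is great")]))
print(fnn(x).shape)                   # torch.Size([1, 3])
# rnn = RNNClassifier(num_classes=3, cell="lstm"); rnn(torch.randn(4, 20, 300))
```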

DL Models on Named Entity Recognition (NER)
-
Used the CoNLL-2003 corpus to build neural networks for NER. Built a simple bidirectional LSTM (BLSTM) model on the training data with SGD as the optimizer. Used GloVe word embeddings to improve the BLSTM, and equipped the GloVe-based BLSTM model with a CNN module to capture character-level information.
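
A minimal sketch of the basic BLSTM tagger is below, with optional GloVe-initialized embeddings and SGD as the optimizer; PyTorch and all hyperparameters are assumptions, and the character-level CNN module is omitted for brevity.

```python
import torch
import torch.nn as nn

class BLSTMTagger(nn.Module):
    def __init__(self, vocab_size, num_tags, emb_dim=100, hidden=256, pretrained=None):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        if pretrained is not None:
            # Initialize from GloVe vectors: a (vocab_size, emb_dim) float tensor.
            self.emb.weight.data.copy_(pretrained)
        self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.classifier = nn.Linear(2 * hidden, num_tags)

    def forward(self, token_ids):                # (batch, seq_len)
        h, _ = self.lstm(self.emb(token_ids))    # (batch, seq_len, 2 * hidden)
        return self.classifier(h)                # per-token tag scores

model = BLSTMTagger(vocab_size=30_000, num_tags=9)  # CoNLL-2003 uses 9 BIO NER tags
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)
criterion = nn.CrossEntropyLoss()

# One illustrative training step on dummy data.
tokens = torch.randint(1, 30_000, (2, 12))
tags = torch.randint(0, 9, (2, 12))
loss = criterion(model(tokens).reshape(-1, 9), tags.reshape(-1))
loss.backward()
optimizer.step()
optimizer.zero_grad()
print(float(loss))
```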