Collection of pretrained ML and NLP models for the Urdu language.
Word2vec is a widely used for learns relationships between words and converting word into vector.
- Trained on 50,000 web news posts.
- Semantic Accuracy: 36.89%
- Syntactic Accuracy: 31.25%
- Demo (https://github.com/urduhack/models/blob/master/word-vectors/word2vec/web_news_data)
- Trained on whole wikipedia corpus.
- Semantic Accuracy: 59.59%
- Syntactic Accuracy: 37.50%
- Demo (https://github.com/urduhack/models/blob/master/word-vectors/word2vec/wikipedia)
Have a bug or a feature request? If you wish to remove or update some of the features, please file an issue first before sending a PR on the repo. please open a new issue.
Special thanks to everyone who contributed to getting the Urdu hack models to the current state
Support this project by becoming a sponsor. Your logo will show up here with a link to your website. [Become a sponsor]
Code released under the MIT License.