English - Indic -English Transliteration

This project is focused on word to to word transliteration from English to Indic languages and vice-versa. . This was done using seq2seq architecture using LSTM and GRU and LSTM with Bahdanau Attention mechanism.

Requirements

requirements.txt

In a Nutshell

We gathered the dataset(text) from Internet. You can go the input folder to get the dataset.
We have written code for seq2seq models using LSTM, GRU and LSTM with attention which can be found in src folder.
We then trained 8 models(4 for Eng-Indic & 4 for Indic-Eng) for each architecture here. The models can be found in models folder.
We have then hosted the webapp in streamlit. Link for the website to try it out.

In Details

├──  input
│    └── *.xml - datasets for English to Indic Languages.
│
│
├──  models  
│    └── *  - 8 models for 3 different architecture(LSTM, GRU and LSTM_attn).
│ 
│
├──  src
│    └── config.py  - configuration for different languages
│    └── dataset.py - dataset generator
│    └── language_preprocessing.py - preprocessing text
│    └── train.py - to train different models
│    └── webapp.py- streamlit webapp
│    └── gru.py- gru model
│    └── lstm.py- lstm model
│    └── lstm_attention.py- lstm_attn model

Future Work

This can be extended to transformer based models as well. Currently working on it.

Contributing

Any kind of enhancement or contribution is welcomed.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github/workflows		.github/workflows
input		input
models		models
notebooks		notebooks
src		src
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
init_setup.sh		init_setup.sh
requirements.txt		requirements.txt
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

English - Indic -English Transliteration

Requirements

Table Of Contents

In a Nutshell

In Details

Future Work

Contributing

About

Releases

Packages

Languages

License

mallapraveen/English-Indic-English-Transliteration

Folders and files

Latest commit

History

Repository files navigation

English - Indic -English Transliteration

Requirements

Table Of Contents

In a Nutshell

In Details

Future Work

Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages