BanglaGPT

Generative Pretrained Language Model for Bangla Language

BanglaGPT is a project to build a GPT-style generative language model for Bangla. It aims to address the shortage of language models for Bangla and to support applications such as chatbots, translation, and sentiment analysis. The goal is to empower developers and researchers who work with the Bangla language.

Tokenizer for Bangla Text

The first phase of the BanglaGPT project is complete: a custom tokenizer for Bangla text. The tokenizer splits sentences into tokens while accounting for the complexities of the language, which makes later analysis and processing easier.
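
To give a feel for what tokenizing Bangla text involves, here is a minimal sketch in Python. It is an illustration only, not the BanglaGPT tokenizer: the function name `simple_bangla_tokenize` and the regular expression are assumptions made for this example. It keeps runs of Bengali-block characters together so that vowel signs and conjuncts stay attached to their base consonants, and it treats punctuation such as the danda (।) as separate tokens.

```python
import re
import unicodedata
from typing import List

# Assumption: an illustrative word-level splitter, not the project's
# actual tokenizer, whose algorithm is not described in this README.
_TOKEN_RE = re.compile(
    r"[\u0980-\u09FF\u200C\u200D]+"  # Bengali-block letters, signs, digits, plus ZWNJ/ZWJ
    r"|[A-Za-z0-9]+"                 # Latin words and ASCII digits
    r"|\S"                           # any other non-space character, e.g. the danda (U+0964)
)


def simple_bangla_tokenize(text: str) -> List[str]:
    """Split text into Bangla words, Latin words, and single punctuation marks."""
    # NFC brings canonically equivalent character sequences into one consistent form.
    text = unicodedata.normalize("NFC", text)
    return _TOKEN_RE.findall(text)


if __name__ == "__main__":
    print(simple_bangla_tokenize("আমি ভাত খাই।"))
```

A tokenizer intended for GPT training would more likely learn subword units from a corpus; this sketch only shows the script-aware splitting step that such a pipeline might start from.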

Future Roadmap

We have an ambitious roadmap for the BanglaGPT project. The key areas we plan to focus on are:

  • GPT Model Training

  • Model Evaluation and Benchmarking

  • Model Optimization

  • Domain-Specific Adaptation

Contributing

We strongly encourage contributions from the open-source community to help us achieve the goals of the BanglaGPT project. Whether you are a researcher, developer, or language enthusiast, there are several ways you can contribute:

  • Testing and Feedback: Try out the tokenizer and tell us what you find. Report issues, suggest improvements, or share your experience working with the tokenizer.

  • Dataset Collection: Help us gather diverse datasets in Bangla for training and evaluation purposes. High-quality and representative datasets are crucial for building robust language models.

  • Model Training: Contribute to the training process by providing computational resources, offering machine learning expertise, or sharing preprocessed datasets that can be used to train the GPT model.

  • Documentation and Code: Improve the documentation of the project, write tutorials, or contribute to the codebase. Help us make BanglaGPT more accessible to the community.

Join the Community

Stay up to date with the latest news, announcements, and discussions around the BanglaGPT project.

We believe that by collaborating and pooling our efforts, we can build a powerful language model that empowers Bangla speakers and enables innovative applications in the Bangla language ecosystem.

Let's shape the future of Bangla language processing together with BanglaGPT!

Repositories

Showing 5 of 5 repositories
  • SuSastho.AI (Public)

    AI-powered Adolescent Health Chatbot, designed to provide confidential support on Sexual, Reproductive, and Mental Health (SRMH) for adolescents.

    Python, MIT license, updated Sep 27, 2024

  • bangla-gpt (Public)

    Training code for BanglaGPT model

    Python, updated Jul 12, 2023

  • .github (Public)

    Updated Jul 9, 2023

  • bangla-tokenizer (Public)

    Text Tokenizer for BanglaGPT

    Python, updated Jul 9, 2023

  • banglagpt.github.io

    HTML, updated Jul 8, 2023
