Kaggle project : CommonLit-Readability-Prize

CommonLit, Inc., is a nonprofit education technology organization serving over 20 million teachers and students with free digital reading and writing lessons for grades 3-12. Together with Georgia State University, an R1 public research university in Atlanta, they are challenging Kagglers to improve readability rating methods. In this competition, we had built algorithms to rate the complexity of reading passages for grade 3-12 classroom use. To accomplish this, we had used state of the art machine learning tools with a dataset that includes readers from a wide variety of age groups and a large collection of texts taken from various domains.
You can find three version of trials :
EDA + LSTM +CNN : In this version, we had passed by an exploration data analysis step , which had allowed us to explore and extraxct valuable insights. Then, we had used an LSTM model which had gave us a good start result in compare with the benchmark results of kagglers competitors. Finaly, we had test a CNN model , which had gave a slightly better result than LSTM .

Roberta Model : We had used roberta model, to resolve this problem. So , we had trained a Roberta model using our dataset , then we had fine tunned this model using our target, finally we had combined this model with LGBM model in order to define our model of forecasting.

Bert Model : As we had done with Roberta model , we had adopt the same approach for Bert model.Except that we combined bert with SVM model in order to make our forecasting.

Built With 🔨

Kaggle platform.
Python.
Pytorch.
Tensorflow.
Bert
Roberta
LSTM
CNN
LGBM
SVM

Live Demo

Live Demo Link

Install

To get a local copy up and running follow these simple example steps.

Open terminal
Clone this project by the command:

$ git clone git@github.com:Taher-web-dev/CommonLit-Readability-Prize.git

Then go to the main folder using the next command:

$ cd CommonLit-Readability-Prize

Prerequisites

IDE to edit and run the code (We use Jupyter Notebook 🔥).
Git to versionning your work.

Usage

Data scientist practioner
For anyone interested by NLP topics.

Authors

👤 Taher Haggui

GitHub: @TaherHaggui
LinkedIn: @TaherHaggui

🤝 Contributing

Contributions, issues, and feature requests are welcome!

Show your support

Give a ⭐️ if you like this project!

Acknowledgments

kaggle plarform 💘 (https://www.kaggle.com/)
My family's support 🙌

📝 License

This project is CommonLit licensed.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Bert		Bert
EDA-LSTM-CNN		EDA-LSTM-CNN
Roberta		Roberta
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kaggle project : CommonLit-Readability-Prize

Built With 🔨

Live Demo

Install

Prerequisites

Usage

Authors

🤝 Contributing

Show your support

Acknowledgments

📝 License

About

Releases

Packages

Languages

taher-software/CommonLit-Readability-Prize

Folders and files

Latest commit

History

Repository files navigation

Kaggle project : CommonLit-Readability-Prize

Built With 🔨

Live Demo

Install

Prerequisites

Usage

Authors

🤝 Contributing

Show your support

Acknowledgments

📝 License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages