Skip to content

DravidaKavacham is an open-source tool for detecting abusive content in Dravidian (Tamil and Malayalam), focused on harmful language targeting women. Developed as part of the DravidianLangTech@NAACL 2025 shared task, it uses NLP and machine learning for accurate text classification and content analysis.

License

Notifications You must be signed in to change notification settings

Luxshan2000/dravida-kavacham

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DravidaKavacham

DravidaKavacham is an open-source tool for detecting abusive content in Dravidian focused on harmful language targeting women.

Python 3.10+ PyTorch MIT License Downloads

🛠️ Installation

Install DravidaKavacham via PyPI

pip install dravida-kavacham

💡 Quick Start

Load the Library

from dravida_kavacham import AbuseDetector

# Initialize the detector
detector = AbuseDetector()

Predict Abusive Content

text = "மிகவும் அழகாக உள்ளது."
result = detector.predict(text)

if result == "Abusive":
    print("⚠️ Abusive content detected!")
else:
    print("✔️ Text is clean.")

Example notebook on Kaggle: visit


🚀 Key Features

  • Multilingual Detection: Designed for Tamil 🇮🇳 and Malayalam 🇮🇳 text.
  • Plug-and-Play: No complex setup. Just install, load, and detect!
  • Efficient Local Processing: Downloads the pre-trained model on first use.
  • Customizable: Extendable for additional use cases with minor adjustments.

🌍 Supported Languages

Language Script Status
Tamil Tamil script ✅ Supported
Malayalam Malayalam script ✅ Supported

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.


🙌 Acknowledgements

Special thanks to the dataset authors and owners for providing the valuable resources that made this project possible!


⭐ Support

If you like this project, please consider giving it a ⭐ on DravidaKavacham!

About

DravidaKavacham is an open-source tool for detecting abusive content in Dravidian (Tamil and Malayalam), focused on harmful language targeting women. Developed as part of the DravidianLangTech@NAACL 2025 shared task, it uses NLP and machine learning for accurate text classification and content analysis.

Topics

Resources

License

Stars

Watchers

Forks