Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle
-
Updated
Apr 26, 2018 - Jupyter Notebook
Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle
Contributed to a vision-driven accessibility tool translating sign language into text
🤟 Enhance sign language interpretation using transfer learning and multimodal features for accurate gesture recognition and robust evaluation methods.
Python | scikit-learn, SVM, SMOTE | Automates systematic literature review screening with 95%+ recall, reducing manual workload by 64%.
💬 Advanced NLP with Spacy Course
Labeling queue library for managing human labeling workflows
Evaluation and agreement scripts for the DISCOSUMO project. Each evaluation script takes both manual annotations as automatic summarization output. The formatting of these files is highly project-specific. However, the evaluation functions for precision, recall, ROUGE, Jaccard, Cohen's kappa and Fleiss' kappa may be applicable to other domains too.
Text Mining terms of service 🔏💻
Cross-Family LLM-Judge Agreement for Institutional RAG: 5 families, 9 judges. Validated on TREC RAG 2024 (kappa=0.4941) + BEIR scifact.
Statistical validation of labeling consistency across three independent raters for a handwritten digit classification dataset.
Measure how much your LLM judges actually agree. Inter-judge agreement metrics for LLM-as-a-judge evaluations.
LLM-as-judge framework with three named bias diagnostics: position-swap flip rate, length-pad flip rate, Cohen's kappa inter-judge agreement. 30-pair built-in eval set, Groq + Anthropic backends.
Vocational-training resource: structured information extraction from German job ads with local Qwen2.5-7B, Cohen's kappa inter-annotator agreement, and frontier-LLM make-or-buy comparison. FIDP, 2nd year.
Add a description, image, and links to the cohens-kappa topic page so that developers can more easily learn about it.
To associate your repository with the cohens-kappa topic, visit your repo's landing page and select "manage topics."