You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
EmoTa is an open-access Tamil Speech Emotion Recognition dataset with 936 utterances from 22 native speakers, covering five emotions (anger, happiness, sadness, fear, and neutrality). It supports emotion classification tasks and advances Tamil language processing.
Official repository for the paper 'From Multiple-Choice to Extractive QA: A Case Study for English and Arabic' (COLING 2025). Includes datasets, scripts, and benchmarks for converting MCQA to EQA across English, Modern Standard Arabic, and Arabic dialects
Morphology-enhanced neural models for Ancient Greek interlinear translation, achieving 35-38% BLEU improvements for English and Polish translations. Includes custom T5 implementations and training code. [LoResLM@COLING2025]
Relevant code and data for the COLING2025 paper: Evaluating LLMs' Capability to Identify Lexical Semantic Equivalence: Probing with the Word-in-Context Task