GitHub - rosakun/weakly-supervised-forc: Creating 40,000 weak labels for the 2025 FoRC shared task

Overview

This repository contains code for testing various models to weakly label a dataset and for the actual labeling process. This work is part of the 2025 Field-of-Research Classification (FoRC) Shared Task which is co-located at SNLP 2025.

Shared Task Background

The FoRC shared task concerns itself with the automatic classification of scientific research papers by their field of research. We focus on classifying computational linguistics papers, taken from the ACL Anthology, by at least one of a list of 181 hierarchically organized labels. The 2025 iteration adds a weakly labeled dataset of over 40,000 papers to the manually labeled dataset of 1500 papers used for last year's iteration. More details about the shared task can be found here.

Usage

This code can:

Convert data from the ACL Anthology corpus into the same format as the FoRC4CL dataset.
Pre- and postprocess FoRC4CL-format datasets.
Train and analyse simple ML models on the FoRC4CL train/test split.
Weakly label the ACL Anthology corpus using the simple models.
Train and score transformer models on the FoRC4CL train/test split.

Results

Results are soon to be published in an overview paper.

Contributions

Contributions are welcome! Please check our CodaBench if you'd like to submit a solution!

Contact

For any questions, please contact maria.francis@dfki.de or open an issue in this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
data		data
utils		utils
ACLData.py		ACLData.py
FoRC4CL.py		FoRC4CL.py
README.md		README.md
train-simple-models.ipynb		train-simple-models.ipynb
train-transformers.ipynb		train-transformers.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Shared Task Background

Usage

Results

Contributions

Contact

About

Releases

Packages

Languages

rosakun/weakly-supervised-forc

Folders and files

Latest commit

History

Repository files navigation

Overview

Shared Task Background

Usage

Results

Contributions

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages