MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
-
Updated
Jun 9, 2023 - Python
MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.
Neural Grammatical Error Correction for Romanian using Transformer
Public repository for the MultiGEC dataset.
Grammatical Error Correction with Contrastive Learning in Low Error Density Domains
Sequence to sequence learning for GEC task using several deep models.
Grammar Error Correction Based on Tensor2Tensor
Textshine is a seq2seq model for grammatical error correction.
Code, models, and data for "Enhancing Text Editing for Grammatical Error Correction: Arabic as a Case Study", ACL 2025
A fluency_based evaluation tool for Chinese Grammatical Error Correction (CGEC)
Tool for converting error corpora to parallel datasets
[ACL 2023] A Text Editing Repository for reproduction and innovation.
Grammatical Error Correction at the character level using Reformers.
Data collector script for Kurdish Kurmanji grammer error correction AI model
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
Add a description, image, and links to the gec topic page so that developers can more easily learn about it.
To associate your repository with the gec topic, visit your repo's landing page and select "manage topics."