Beijing Language and Culture University
- Beijing
Stars
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
TensorFlow code and pre-trained models for BERT
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
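Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing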
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
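Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.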
A synthetic data generator for text recognition
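Jiagu, a deep learning natural language processing toolkit: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keywords, text summarization, text clustering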
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
NCRF++, a Neural Sequence Labeling Toolkit. Easy to apply to any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Open-source release of the MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.
MaksTarnavskyi / gector-large
Forked from grammarly/gector
Improved version of GECToR
The Codebase for Quasi-Attention BERT Model for TABSA Tasks (AAAI '21)
Simple Chinese word segmentation with Keras
The official code of the 2023 ACL paper "Enhancing Grammatical Error Correction Systems with Explanations"
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
The official code of the "Efficient and Interpretable Grammatical Error Correction with Mixture of Experts" paper
SunBK201 / Dir8urp
Forked from JeffyLapter/Dir8urp
An Open-source Path Burp Tool.