Highlights
- Pro
Lists (32)
Sort Name ascending (A-Z)
abstractive_summarization
AE
annotation_tools
anomaly
augmentations
contrastive learning
crosslingual_domain_adaptation
data analysis
datasets
dev
distillation
evaluation
formers
IR
keywords
KNN-LM
LLM
LTR
NER
NMT
optimizers
parsers
plots
QA
RE
representation_learning
rl
speech recognition
SRL
text_generation
topic_modeling
transformers
Stars
Bringing BERT into modernity via both architecture changes and scaling
ReFT: Representation Finetuning for Language Models
Efficient, Flexible and Portable Structured Generation
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Efficiently find the best-suited language model (LM) for your NLP task
[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus
Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"
Everything about the SmolLM & SmolLM2 family of models
Efficient Triton Kernels for LLM Training
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
A massively parallel, high-level programming language
Toolkit for attaching, training, saving and loading of new heads for transformer models
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
Code for our paper accepted at EMNLP 2023 (Findings)
Port of OpenAI's Whisper model in C/C++
A Serverless Text Annotation Tool for Corpus Development
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.