Skip to content
View imvladikon's full-sized avatar

Highlights

  • Pro

Block or report imvladikon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Bringing BERT into modernity via both architecture changes and scaling

Python 769 42 Updated Dec 21, 2024

ReFT: Representation Finetuning for Language Models

Python 1,262 112 Updated Dec 19, 2024

Code for BLT research paper

Python 1,153 76 Updated Dec 12, 2024
Python 274 21 Updated Dec 11, 2024
Python 2,202 252 Updated Dec 20, 2024

Efficient, Flexible and Portable Structured Generation

C++ 530 26 Updated Dec 25, 2024

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 182 4 Updated Dec 19, 2024

Efficiently find the best-suited language model (LM) for your NLP task

Python 108 10 Updated Dec 4, 2024

[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus

Python 179 17 Updated Dec 23, 2024

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

Jupyter Notebook 404 19 Updated Dec 12, 2024

Everything about the SmolLM & SmolLM2 family of models

Python 1,474 72 Updated Dec 24, 2024

Get your documents ready for gen AI

Python 16,851 871 Updated Dec 19, 2024

Late Interaction Models Training & Retrieval

Python 204 9 Updated Nov 29, 2024

Efficient Triton Kernels for LLM Training

Python 4,059 231 Updated Dec 29, 2024

Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)

Python 359 22 Updated Oct 1, 2024

Structured Text Generation

Python 10,170 531 Updated Dec 28, 2024

A massively parallel, high-level programming language

Rust 17,832 439 Updated Dec 26, 2024

Toolkit for attaching, training, saving and loading of new heads for transformer models

Jupyter Notebook 254 22 Updated Dec 2, 2024

Tree-based indexes for neural-search

Python 28 2 Updated Mar 4, 2024

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 497 49 Updated Dec 11, 2024

🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers

Python 100 15 Updated Oct 22, 2024

Code for our paper accepted at EMNLP 2023 (Findings)

Python 12 Updated Jan 5, 2024

Port of OpenAI's Whisper model in C/C++

C++ 36,582 3,747 Updated Dec 24, 2024

A Serverless Text Annotation Tool for Corpus Development

JavaScript 54 19 Updated Dec 16, 2024

[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"

Python 31 3 Updated Oct 18, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,668 303 Updated Oct 28, 2024

Multilingual Evaluation of LLMs

6 Updated Oct 20, 2023
Jupyter Notebook 106 53 Updated Dec 19, 2023

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,355 58 Updated Dec 29, 2024
Next