Skip to content
View davidstap's full-sized avatar

Block or report davidstap

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,486 315 Updated Jul 15, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,481 244 Updated Feb 20, 2025

Official repository of the xLSTM.

Python 1,778 134 Updated Mar 18, 2025

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 2,338 351 Updated Mar 22, 2025

Minimum Bayes Risk Decoding for Hugging Face Transformers

Python 57 7 Updated Jun 3, 2024

The Art of Debugging

C 861 39 Updated Aug 3, 2024

DSPy: The framework for programming—not prompting—language models

Python 22,586 1,730 Updated Mar 21, 2025

an easy-to-use knn-mt toolkit

Python 104 12 Updated Aug 19, 2023

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 270 24 Updated Oct 20, 2022

Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. This paper is accepted by NAACL 2022 Main Conference.

Python 30 2 Updated Jul 16, 2022

MAFAND-MT

Jupyter Notebook 55 27 Updated Jul 9, 2024

Gale-Church sentence aligner with options for variable parameters

Python 17 7 Updated Oct 7, 2019

A template repo for Python packages

Python 481 75 Updated Feb 20, 2025

Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.

Python 236 42 Updated Apr 19, 2024

A tool that locates, downloads, and extracts machine translation corpora

Python 154 23 Updated Mar 21, 2025

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,227 466 Updated Aug 7, 2024

A Zotero plugin for syncing items and notes into Notion

TypeScript 2,643 112 Updated Mar 1, 2025

Create a Notion collection, synced with Zotero.

Python 76 15 Updated Aug 25, 2021

A Python library for working with and comparing language codes.

Python 345 28 Updated Dec 13, 2024

Expanding natural instructions

Python 984 192 Updated Dec 11, 2023

Yet Another Neural Machine Translation Toolkit

Python 179 31 Updated Mar 7, 2025

Zero -- A neural machine translation system

Python 150 19 Updated May 8, 2023

Style guides for Google-originated open-source projects

HTML 37,981 13,306 Updated Mar 7, 2025

A formatter for Python files

Python 13,863 898 Updated Mar 10, 2025

Useful localization tools with Python API for building localization & translation systems

Python 879 321 Updated Mar 21, 2025

NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021

Python 14 1 Updated May 18, 2021

Python port of Moses tokenizer, truecaser and normalizer

Python 492 59 Updated May 26, 2024

PRML algorithms implemented in Python

Jupyter Notebook 11,557 3,255 Updated Sep 27, 2024

Minimalist NMT for educational purposes

Python 690 215 Updated Jan 29, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,588 1,436 Updated Mar 18, 2025
Next