Skip to content
View NathanYanJing's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Cornell University

Highlights

  • Pro

Block or report NathanYanJing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A bibliography and survey of the papers surrounding o1

TeX 1,016 41 Updated Nov 16, 2024
JavaScript 19 1 Updated Oct 15, 2024

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 186 14 Updated Jan 1, 2025

Fast Diffusion Models with Transformers

Python 771 100 Updated Oct 25, 2024

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 728 42 Updated Mar 12, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,546 77 Updated Jan 4, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,434 57 Updated Aug 15, 2024

Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish

Jupyter Notebook 166 5 Updated Jul 31, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,971 2,258 Updated Dec 27, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,817 123 Updated Oct 30, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 829 49 Updated Dec 9, 2024

Reading list for research topics in state-space models

253 26 Updated Dec 22, 2024

The official Meta Llama 3 GitHub site

Python 27,780 3,182 Updated Aug 12, 2024

Puzzles for learning Triton

Jupyter Notebook 1,242 95 Updated Nov 18, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 28 8 Updated Sep 27, 2023

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,752 183 Updated Sep 28, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,680 197 Updated Mar 8, 2024

Mamba SSM architecture

Python 13,684 1,173 Updated Dec 6, 2024

utilities for decoding deep representations (like sentence embeddings) back to text

Python 756 85 Updated Jan 2, 2025

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

C++ 293 28 Updated Dec 28, 2024

Official inference library for Mistral models

Jupyter Notebook 9,842 874 Updated Nov 12, 2024

Code for Fast Training of Diffusion Models with Masked Transformers

Python 383 14 Updated May 15, 2024
Python 261 21 Updated Nov 22, 2023

Inference code for CodeLlama models

Python 16,130 1,881 Updated Aug 12, 2024

Chapyter: ChatGPT Code Interpreter in Jupyter Notebooks

Python 825 70 Updated Oct 20, 2023

Generative Models by Stability AI

Python 24,999 2,773 Updated Sep 4, 2024

An open-source visual programming environment for battle-testing prompts to LLMs.

TypeScript 2,441 191 Updated Dec 30, 2024

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Python 632 53 Updated Dec 27, 2024
Next