Skip to content
View EvergreenTree's full-sized avatar

Highlights

  • Pro

Block or report EvergreenTree

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unofficial Implementation of Selective Attention Transformer

Python 15 Updated Oct 31, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 13,340 1,636 Updated Feb 1, 2025

[ICLR2025] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Cuda 616 35 Updated Jan 23, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,205 191 Updated Jan 30, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,482 425 Updated Jan 12, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 1,666 126 Updated Jan 28, 2025
Python 605 62 Updated Jan 31, 2025

PiliPala 是使用Flutter开发的BiliBili第三方客户端,感谢使用。

Dart 9,189 444 Updated Dec 14, 2024

[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Python 99 7 Updated Nov 7, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,748 86 Updated Oct 31, 2024

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,519 1,240 Updated Jul 23, 2024

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 6,194 580 Updated Feb 1, 2025

JAX port of FLUX.1 models using flax.nnx

Python 22 Updated Sep 28, 2024

JAX Implementation of Black Forest Labs' Flux.1 family of models

Python 27 2 Updated Oct 20, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,015 163 Updated Mar 27, 2024

PB-LLM: Partially Binarized Large Language Models

Python 150 10 Updated Nov 20, 2023

Binarized Neural Network (BNN) for pytorch

Python 509 127 Updated Nov 6, 2023

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,698 226 Updated Jan 10, 2025

Next-Token Prediction is All You Need

Python 1,978 79 Updated Oct 24, 2024
Python 575 74 Updated Oct 26, 2024

G. Peyré, L. Chizat, F-X. Vialard, J. Solomon, Quantum Optimal Transport for Tensor Field Processing, Arxiv, 2016

C++ 10 2 Updated Apr 13, 2017

Repository for NPHardEval, a quantified-dynamic benchmark of LLMs

Jupyter Notebook 51 3 Updated Mar 26, 2024

Train transformer language models with reinforcement learning.

Python 10,942 1,449 Updated Feb 1, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,893 492 Updated Feb 1, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,476 983 Updated Jan 22, 2025

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,498 154 Updated Mar 16, 2024

A simple, easy-to-understand library for diffusion models using Flax and Jax. Includes detailed notebooks on DDPM, DDIM, and EDM with simplified mathematical explanations. Made as part of my journe…

Jupyter Notebook 17 Updated Oct 24, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,403 174 Updated Aug 1, 2024

Unofficial JAX implementations of deep learning research papers

Python 153 8 Updated Jun 25, 2022
Next