
s1: Simple test-time scaling

Python 3,794 426 Updated Feb 8, 2025

Linux running inside a PDF file via a RISC-V emulator

C 2,279 58 Updated Feb 2, 2025

Unofficial Implementation of Selective Attention Transformer

Python 15 Updated Oct 31, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 15,238 1,980 Updated Feb 1, 2025

[ICLR2025] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Cuda 633 38 Updated Feb 4, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,278 196 Updated Feb 5, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,502 427 Updated Jan 12, 2025

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,264 170 Updated Feb 7, 2025

PiliPala is a third-party BiliBili client developed with Flutter. Thanks for using it.

Dart 9,269 456 Updated Dec 14, 2024

[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Python 102 7 Updated Nov 7, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,753 86 Updated Oct 31, 2024

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 7,530 1,241 Updated Jul 23, 2024

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 9,404 868 Updated Feb 7, 2025

JAX port of FLUX.1 models using flax.nnx

Python 22 Updated Sep 28, 2024

JAX Implementation of Black Forest Labs' Flux.1 family of models

Python 27 2 Updated Oct 20, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,017 164 Updated Mar 27, 2024

PB-LLM: Partially Binarized Large Language Models

Python 150 11 Updated Nov 20, 2023

Binarized Neural Network (BNN) for pytorch

Python 509 127 Updated Nov 6, 2023

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,712 228 Updated Jan 10, 2025

Next-Token Prediction is All You Need

Python 1,983 79 Updated Oct 24, 2024
Python 578 74 Updated Oct 26, 2024

G. Peyré, L. Chizat, F-X. Vialard, J. Solomon, Quantum Optimal Transport for Tensor Field Processing, Arxiv, 2016

C++ 10 2 Updated Apr 13, 2017

Repository for NPHardEval, a quantified-dynamic benchmark of LLMs

Jupyter Notebook 51 3 Updated Mar 26, 2024

Train transformer language models with reinforcement learning.

Python 11,265 1,502 Updated Feb 7, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,916 494 Updated Feb 8, 2025

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,546 987 Updated Jan 22, 2025

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,503 155 Updated Mar 16, 2024

A simple, easy-to-understand library for diffusion models using Flax and Jax. Includes detailed notebooks on DDPM, DDIM, and EDM with simplified mathematical explanations. Made as part of my journe…

Jupyter Notebook 17 Updated Oct 24, 2024