Skip to content
View hellbell's full-sized avatar

Block or report hellbell

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of MambaMia (AAAI-26 Oral)

Python 4 Updated Jan 24, 2026

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,040 130 Updated Dec 8, 2025

Multi-Agent LLM Evaluation

Python 1 Updated Nov 18, 2025

Multi-Agent LLM Evaluation

Python 15 5 Updated Jan 27, 2026
Python 33 4 Updated Nov 14, 2025
Jupyter Notebook 114 3 Updated Nov 8, 2025

Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"

Python 41 3 Updated Oct 15, 2025

The best ChatGPT that $100 can buy.

Python 41,015 5,317 Updated Jan 29, 2026

reproduction of semantic segmentation using masked autoencoder (mae)

Python 170 14 Updated Feb 3, 2022

utilities for decoding deep representations (like sentence embeddings) back to text

Python 1,055 115 Updated Dec 27, 2025

Code for the Molmo Vision-Language Model

Python 864 87 Updated Dec 12, 2024
HTML 170 9 Updated Oct 27, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,701 2,028 Updated Jan 13, 2026

codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)

Python 743 72 Updated Dec 19, 2025

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 117 7 Updated Nov 1, 2025
Python 6 Updated Jul 16, 2025

🙌 OpenHands: AI-Driven Development

Python 67,275 8,375 Updated Jan 29, 2026

[NeurIPS 2025] Official PyTorch implementation of "Token Bottleneck: One Token to Remember Dynamics"

Python 25 Updated Jul 10, 2025

Dream 7B, a large diffusion language model

Python 1,157 74 Updated Nov 21, 2025
Python 220 10 Updated Oct 27, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,281 3,623 Updated Jan 29, 2026

A curated list for Efficient Large Language Models

Python 1,946 150 Updated Jun 17, 2025

Fully open data curation for reasoning models

Python 2,199 185 Updated Dec 2, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 66,619 8,114 Updated Jan 28, 2026

Source code of "C-SEO Bench: Does Conversational SEO Work?" NeurIPS D&B 2025

Jupyter Notebook 15 3 Updated Sep 28, 2025

Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025

Python 15 1 Updated Jan 12, 2026

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python 143 18 Updated May 29, 2025

Open-source framework for the research and development of foundation models.

HTML 741 73 Updated Jan 29, 2026

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.

Python 143 7 Updated Sep 13, 2025
Next