Skip to content
View xhchrn's full-sized avatar

Highlights

  • Pro

Block or report xhchrn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for studying the super weight in LLM

Jupyter Notebook 76 6 Updated Dec 3, 2024

AlphaFold 3 inference pipeline.

Python 5,974 721 Updated Feb 6, 2025

InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

Python 1,167 79 Updated Jan 17, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,271 1,030 Updated Feb 6, 2025

[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models

Python 231 13 Updated Nov 30, 2024

A simple, performant and scalable Jax LLM!

Python 1,615 314 Updated Feb 7, 2025

The IterativeDisplay class for MATLAB is designed to assist in displaying iterative process updates in a structured and customizable manner.

MATLAB 10 Updated Jun 8, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,942 2,303 Updated Jun 18, 2024

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook 659 78 Updated Jan 16, 2025

Simple clipboard manager to be integrated with rofi - Static binary available

Haskell 1,382 33 Updated Oct 1, 2023

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Python 329 27 Updated Jan 4, 2024

The official Meta Llama 3 GitHub site

Python 28,203 3,258 Updated Jan 26, 2025

LLM inference in C/C++

C++ 73,289 10,566 Updated Feb 6, 2025

Automatically exported from code.google.com/p/latex-bibitemstyler

C# 155 27 Updated Jun 7, 2021

Graph-Mamba: Towards Long-Range Graph Sequence Modelling with Selective State Spaces

Python 264 34 Updated Feb 2, 2024

Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)

Python 363 104 Updated Dec 21, 2021

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

C++ 4,780 541 Updated Oct 24, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,067 4,835 Updated Jan 31, 2025

Mamba SSM architecture

Python 13,896 1,199 Updated Jan 18, 2025

A Latex style and template for paper preprints (based on NIPS style)

TeX 1,208 325 Updated Jan 2, 2024

良性过拟合现象是深度学习方法揭示的关键奥秘之一:深度神经网络即使完全拟合噪声训练数据,似乎也能很好地预测。

Jupyter Notebook 2 Updated Jan 23, 2024

[NeurIPS'18, Spotlight oral] "Theoretical Linear Convergence of Unfolded ISTA and its Practical Weights and Thresholds", by Xiaohan Chen*, Jialin Liu*, Zhangyang Wang and Wotao Yin.

Python 60 23 Updated Dec 30, 2021

[ICLR 2023] "On Representing Mixed-Integer Linear Programs by Graph Neural Networks" by Ziang Chen, Jialin Liu, Xinshang Wang, Jianfeng Lu, Wotao Yin.

Python 41 12 Updated Aug 11, 2023

Optimization-based deep learning models can give explainability with output guarantees and certificates of trustworthiness.

Python 3 Updated May 17, 2024
Python 10 2 Updated Aug 2, 2023

LotteryFL: Empower Edge Intelligence with Personalized and Communication-Efficient Federated Learning (2021 IEEE/ACM Symposium on Edge Computing)

Python 41 6 Updated Nov 16, 2022
Jupyter Notebook 54 11 Updated May 14, 2024

Fit interpretable models. Explain blackbox machine learning.

C++ 6,378 738 Updated Feb 4, 2025
Next