- Skoltech
- Moscow, Russia
AutoAWQ-FP Public
Forked from casper-hansen/AutoAWQ. AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Python · MIT License · Updated Jun 18, 2025
ComfyUI Public
Forked from comfyanonymous/ComfyUI. The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Python · GNU General Public License v3.0 · Updated Mar 24, 2025
SpinQuant Public
Forked from facebookresearch/SpinQuant. Code repo for the paper "SpinQuant: LLM quantization with learned rotations"
Python · Other · Updated Feb 14, 2025
Triton-Puzzles Public
Forked from srush/Triton-Puzzles. Puzzles for learning Triton
Jupyter Notebook · Apache License 2.0 · Updated Feb 6, 2025
GPTQModel Public
Forked from ModelCloud/GPTQModel. Production-ready LLM compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
Python · Apache License 2.0 · Updated Nov 22, 2024
crosscoder-model-diff-replication Public
Forked from ckkissane/crosscoder-model-diff-replication. Open source replication of Anthropic's Crosscoders for Model Diffing
Python · Updated Oct 27, 2024
sae_vis Public
Forked from ckkissane/sae_vis. Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
HTML · MIT License · Updated Oct 27, 2024
diffusers Public
Forked from huggingface/diffusers. 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Python · Apache License 2.0 · Updated Oct 16, 2024
aqlm-bigcode-evaluation-harness Public
Forked from bigcode-project/bigcode-evaluation-harness. A framework for the evaluation of autoregressive code generation language models.
aqlm-evaluation-harness Public
A version of lm-evaluation-harness with support for AQLM intermediate checkpoints
marlin-scale-tuning Public
Forked from IST-DASLab/marlin. FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens. Version with scale finetuning.
Python · Apache License 2.0 · Updated Feb 29, 2024
consistencydecoder Public
Forked from openai/consistencydecoder. Consistency Distilled Diff VAE
Python · MIT License · Updated Nov 7, 2023
BK-SDM Public
Forked from Nota-NetsPresso/BK-SDM. A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ICCV'23 Demo] [ICML'23 Workshop]
Python · Other · Updated Oct 5, 2023
llm-foundry Public
Forked from mosaicml/llm-foundry. LLM sparse training code for MosaicML foundation models
Python · Apache License 2.0 · Updated Sep 11, 2023
Platypus Public
Forked from arielnlee/Platypus. Code for fine-tuning the Platypus family of LLMs using LoRA
Python · Updated Sep 5, 2023
DL4AGX Public
A fork of DL4AGX with support for timm models
Shell · Apache License 2.0 · Updated Aug 8, 2023
LM-Kernel-FT Public
Based on https://github.com/princeton-nlp/LM-Kernel-FT
Python · MIT License · Updated Jun 20, 2023
sparsegpt Public
Forked from IST-DASLab/sparsegpt. Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
Python · Apache License 2.0 · Updated Mar 26, 2023
PowerLawOptimization Public
Convergence analysis of problems with power-law spectra
Jupyter Notebook · Apache License 2.0 · Updated Mar 9, 2023
sparseml Public
Forked from neuralmagic/sparseml. Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Python · Apache License 2.0 · Updated Mar 2, 2023
HeterophilySpecificModels Public
Forked from heterophily-submit/HeterophilySpecificModels
flows4ad Public
Repository for anomaly detection on tabular data with normalising flows.
sparse_detectron2 Public
Forked from facebookresearch/detectron2. Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Python · Apache License 2.0 · Updated Dec 1, 2022
SparseCLIP Public
Repository for CLIP sparsification
heterophilous-graphs Public
Forked from OlegPlatonov/heterophilous-graphs. Training GNNs on heterophilous graphs.
Python · Updated Nov 12, 2022