Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.1k 178

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 857 71

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 813 107

  4. PanzaMail PanzaMail Public

    Python 292 19

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 277 22

  6. QUIK QUIK Public

    Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024

    C++ 180 13

Repositories

Showing 10 of 63 repositories
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 4 0 0 0 Updated Jul 14, 2025
  • qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    IST-DASLab/qutlass’s past year of commit activity
    C++ 7 Apache-2.0 0 0 0 Updated Jul 14, 2025
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 9 Apache-2.0 0 0 0 Updated Jun 30, 2025
  • QuEST Public

    Work in progress.

    IST-DASLab/QuEST’s past year of commit activity
    Jupyter Notebook 71 MIT 6 2 0 Updated Jun 29, 2025
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 71 MIT 5 3 0 Updated Jun 27, 2025
  • Yolov8-Pose-Detection-on-Browser Public Forked from akbartus/Yolov8-Pose-Detection-on-Browser

    Example of YOLOv8 pose detection (estimation) on browser. It shows implementations powered by ONNX and TFJS served through JavaScript without any frameworks. It demonstrates pose detection (estimation) on image as well as live web camera,

    IST-DASLab/Yolov8-Pose-Detection-on-Browser’s past year of commit activity
    HTML 0 MIT 3 0 0 Updated Jun 13, 2025
  • MoE-Quant Public

    Code for data-aware compression of DeepSeek models

    IST-DASLab/MoE-Quant’s past year of commit activity
    Python 37 5 1 0 Updated Jun 10, 2025
  • influence_distillation Public

    Official implementation of Influence Distillation: https://www.arxiv.org/abs/2505.19051

    IST-DASLab/influence_distillation’s past year of commit activity
    Python 3 0 1 0 Updated May 29, 2025
  • EvoPress Public
    IST-DASLab/EvoPress’s past year of commit activity
    Python 24 2 0 0 Updated May 12, 2025
  • PanzaMail Public
    IST-DASLab/PanzaMail’s past year of commit activity
    Python 292 Apache-2.0 19 4 6 Updated Apr 8, 2025

Most used topics

Loading…