Skip to content
@SHI-Labs

SHI Labs

Computer Vision, Machine Learning, and AI Systems & Applications

Pinned Loading

  1. Neighborhood-Attention-Transformer Neighborhood-Attention-Transformer Public

    Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

    Python 1.2k 88

  2. NATTEN NATTEN Public

    Fast Multi-dimensional Sparse Attention

    C++ 692 54

  3. Versatile-Diffusion Versatile-Diffusion Public

    Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

    Python 1.3k 85

  4. Prompt-Free-Diffusion Prompt-Free-Diffusion Public

    Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

    Python 757 38

  5. OneFormer OneFormer Public

    [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation

    Jupyter Notebook 1.7k 146

  6. Compact-Transformers Compact-Transformers Public

    Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)

    Python 539 84

Repositories

Showing 10 of 64 repositories
  • NATTEN Public

    Fast Multi-dimensional Sparse Attention

    SHI-Labs/NATTEN’s past year of commit activity
    C++ 692 MIT 54 12 6 Updated Dec 25, 2025
  • physical-ai-bench Public

    PAI-Bench: A Comprehensive Benchmark for Physical AI

    SHI-Labs/physical-ai-bench’s past year of commit activity
    Python 39 MIT 0 1 0 Updated Dec 3, 2025
  • Forget-Me-Not Public

    Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023

    SHI-Labs/Forget-Me-Not’s past year of commit activity
    Python 135 MIT 8 7 0 Updated Oct 22, 2025
  • VisPer-LM Public

    [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation

    SHI-Labs/VisPer-LM’s past year of commit activity
    Python 68 1 2 0 Updated Oct 17, 2025
  • T2I-Copilot Public

    T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)

    SHI-Labs/T2I-Copilot’s past year of commit activity
    Jupyter Notebook 39 MIT 2 0 0 Updated Oct 6, 2025
  • SHI-Labs/shi-labs.github.io’s past year of commit activity
    CSS 0 0 0 0 Updated Oct 5, 2025
  • IMG-Multimodal-Diffusion-Alignment Public

    IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025

    SHI-Labs/IMG-Multimodal-Diffusion-Alignment’s past year of commit activity
    Python 30 3 1 0 Updated Oct 1, 2025
  • StyleNAT Public

    New flexible and efficient image generation framework that sets new SOTA on FFHQ-256 with FID 2.05, 2022

    SHI-Labs/StyleNAT’s past year of commit activity
    Python 101 MIT 13 0 0 Updated Jun 26, 2025
  • SHI-Labs/Slow-Fast-Video-Multimodal-LLM’s past year of commit activity
    Python 27 1 2 0 Updated Apr 9, 2025
  • Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment Public

    Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025

    SHI-Labs/Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment’s past year of commit activity
    Python 38 2 1 0 Updated Mar 1, 2025

Most used topics

Loading…