Skip to content
@FoundationVision

FoundationVision

Bytedance's opensource FoundationVision models

Hi there 👋

This is FoundationVision official website repo

Popular repositories Loading

  1. VAR VAR Public

    [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

    Jupyter Notebook 8.5k 543

  2. ByteTrack ByteTrack Public

    [ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box

    Python 5.8k 1.1k

  3. LlamaGen LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.9k 89

  4. Infinity Infinity Public

    [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Python 1.5k 81

  5. GLEE GLEE Public

    [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.2k 74

  6. Waver Waver Public

    Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

    703 68

Repositories

Showing 10 of 20 repositories
  • InfinityStar Public

    [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

    FoundationVision/InfinityStar’s past year of commit activity
    Python 211 6 3 0 Updated Nov 9, 2025
  • Infinity Public

    [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    FoundationVision/Infinity’s past year of commit activity
    Python 1,489 MIT 81 53 4 Updated Oct 25, 2025
  • UniTok Public

    [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

    FoundationVision/UniTok’s past year of commit activity
    Python 441 MIT 10 11 0 Updated Sep 22, 2025
  • Waver Public

    Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

    FoundationVision/Waver’s past year of commit activity
    703 68 6 1 Updated Aug 27, 2025
  • BitVAE Public

    official training and inference code of bitwise tokenizer

    FoundationVision/BitVAE’s past year of commit activity
    Python 51 MIT 2 2 0 Updated May 18, 2025
  • VAR Public

    [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    FoundationVision/VAR’s past year of commit activity
    Jupyter Notebook 8,471 MIT 543 51 (1 issue needs help) 3 Updated May 18, 2025
  • Liquid Public

    (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

    FoundationVision/Liquid’s past year of commit activity
    Python 626 MIT 33 12 0 Updated Apr 8, 2025
  • GenerateU Public

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    FoundationVision/GenerateU’s past year of commit activity
    Python 184 MIT 8 15 0 Updated Mar 29, 2025
  • FlashVideo Public

    [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

    FoundationVision/FlashVideo’s past year of commit activity
    Python 449 Apache-2.0 24 13 (2 issues need help) 1 Updated Mar 4, 2025
  • UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    FoundationVision/UniRef’s past year of commit activity
    Python 236 MIT 15 4 0 Updated Feb 13, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…