Skip to content
View s9xie's full-sized avatar

Highlights

  • Pro

Block or report s9xie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best OSS video generation models

Python 1,608 152 Updated Nov 1, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,144 199 Updated Oct 31, 2024

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 592 26 Updated Oct 19, 2024

A viewer for json files exported from Slack workspaces.

C++ 201 13 Updated Sep 25, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,739 113 Updated Oct 30, 2024

Ongoing research training gaussian splatting at scale by distributed system

Python 359 19 Updated Aug 9, 2024

NASA/IBM HLS Foundation Model for downstream applications on Mars imagery

Jupyter Notebook 1 1 Updated Aug 21, 2024

Code release for ConvNeXt model

Python 5,753 696 Updated Jan 8, 2023

(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life

Python 312 11 Updated Jul 10, 2024

[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

C++ 2,260 173 Updated Sep 24, 2024

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 663 35 Updated Mar 12, 2024

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 523 33 Updated Jan 7, 2024
Jupyter Notebook 4 Updated Dec 22, 2023
Python 1 Updated Dec 23, 2023

An Instruction-tuned Audio-Visual Language Model for Hate Content Detection

Python 1 1 Updated Dec 23, 2023
Jupyter Notebook 2 Updated Dec 23, 2023

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,771 177 Updated Oct 31, 2024

A modern, highly customizable, responsive Jekyll template for course websites.

SCSS 280 108 Updated Sep 19, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,444 795 Updated Oct 29, 2024

Zoomable, animated scatterplots in the browser that scales over a billion points

TypeScript 1,024 61 Updated Oct 30, 2024

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,520 171 Updated Oct 31, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,073 1,393 Updated Sep 5, 2024

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,288 134 Updated Oct 5, 2023

Official Open Source code for "Scaling Language-Image Pre-training via Masking"

Python 404 15 Updated Mar 30, 2023

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,409 5,609 Updated Sep 18, 2024

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,699 194 Updated Dec 5, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,073 5,884 Updated Aug 19, 2024

Let us control diffusion models!

Python 30,244 2,720 Updated Feb 25, 2024
Python 494 53 Updated Oct 4, 2024
Next