View YicongHong's full-sized avatar

Asynchronous Blob Tracker for Event Cameras. 2024 IEEE Transactions on Robotics (TRO).

C++ 24 2 Updated Jan 18, 2025

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 191 15 Updated Jan 30, 2025

A Triton Kernel for incorporating Bi-Directionality in Mamba2

Python 60 Updated Dec 18, 2024

A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)

Python 290 20 Updated Nov 28, 2024
Python 356 15 Updated Oct 21, 2024

PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"

Python 155 12 Updated Jan 27, 2025
45 2 Updated Mar 31, 2024

Scalable Diffusion Models with State Space Backbone

Python 150 9 Updated Mar 7, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,948 635 Updated Jan 30, 2025

Implementation of MagViT2 Tokenizer in PyTorch

Python 587 33 Updated Jan 12, 2025

Official repository of "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment"

Python 628 68 Updated Dec 27, 2023

Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.

Jupyter Notebook 4,934 477 Updated Jan 30, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,188 212 Updated Nov 22, 2024

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

HTML 948 62 Updated Jan 8, 2025

An open-source implementation of Large Reconstruction Models

Python 1,028 59 Updated May 6, 2024

Generative Models by Stability AI

Python 25,161 2,786 Updated Sep 4, 2024

[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models

Python 246 15 Updated Apr 25, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,240 2,340 Updated Aug 12, 2024

[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Python 183 17 Updated Nov 7, 2023
Python 8,554 509 Updated Oct 9, 2024
Python 31 2 Updated Aug 19, 2023

Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation

Python 74 5 Updated May 31, 2023

[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation

Python 158 5 Updated Oct 8, 2024

A latent text-to-image diffusion model

Jupyter Notebook 69,330 10,286 Updated Jun 18, 2024

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Python 247 17 Updated Jul 23, 2024
Jupyter Notebook 3,210 303 Updated May 14, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,064 886 Updated Jan 26, 2025

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,795 201 Updated Dec 5, 2023

📷 Scripts for rendering Objaverse

Python 227 12 Updated Aug 17, 2023