Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.

Jupyter Notebook 4,934 477 Updated Jan 30, 2025

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,188 212 Updated Nov 22, 2024

OpenDriveLab / DriveLM

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

HTML 948 62 Updated Jan 8, 2025

3DTopia / OpenLRM

An open-source impl. of Large Reconstruction Models

Python 1,028 59 Updated May 6, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,161 2,786 Updated Sep 4, 2024

crockwell / Cap3D

[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models

Python 246 15 Updated Apr 25, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,240 2,340 Updated Aug 12, 2024

GengzeZhou / NavGPT

[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Python 183 17 Updated Nov 7, 2023

apple / ml-ferret

Python 8,554 509 Updated Oct 9, 2024

jialuli-luka / VLN-SIG

Python 31 2 Updated Aug 19, 2023

jialuli-luka / PanoGen

Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation

Python 74 5 Updated May 31, 2023

wz0919 / ScaleVLN

[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation

Python 158 5 Updated Oct 8, 2024

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 69,330 10,286 Updated Jun 18, 2024

MarSaKi / ETPNav

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Python 247 17 Updated Jul 23, 2024

google / prompt-to-prompt

Jupyter Notebook 3,210 303 Updated May 14, 2024

timothybrooks / instruct-pix2pix

Python 6,487 544 Updated Mar 3, 2024

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,064 886 Updated Jan 26, 2025

cvlab-columbia / zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,795 201 Updated Dec 5, 2023

allenai / objaverse-rendering

📷 Scripts for rendering Objaverse

Python 227 12 Updated Aug 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yicong Hong (Neo Orion) YicongHong

Achievements

Achievements

Highlights

Block or report YicongHong

Stars

ziweiWWANG / AEB-Tracker

jxiw / MambaInLlama

Hprairie / Bi-Mamba2

CompVis / zigma

mira-space / Mira

kyegomez / Jamba

MambaMixer / M2

feizc / DiS

facebookresearch / xformers

lucidrains / magvit2-pytorch

ckkelvinchan / BasicVSR_PlusPlus

superduper-io / superduper