euminds

Jintao euminds

cold cver CV & AI student Tired

3 followers · 32 following

Lists (1)

Sort

🚀 My stack

Starred repositories

DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 996 65 Updated Jan 6, 2025

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,843 1,037 Updated Dec 31, 2024

microsoft / MoGe

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 649 34 Updated Dec 8, 2024

alibaba / animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 846 65 Updated Oct 18, 2024

aigc-apps / EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,653 120 Updated Jan 6, 2025

VideoVerses / VideoVAEPlus

Python 232 5 Updated Jan 2, 2025

TianxingChen / RoboTwin

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins

Python 475 73 Updated Dec 19, 2024

EDiRobotics / mimictest

A simple testbed for robotics manipulation policies

Python 71 3 Updated Dec 5, 2024

TobiasLv / RAD

Python 26 2 Updated Dec 19, 2024

vision-x-nyu / thinking-in-space

Official repo and evaluation implementation of VSI-Bench

Python 289 16 Updated Dec 20, 2024

HRI-EU / flow_matching

Affordance-based Robot Manipulation with Flow Matching

Shell 86 8 Updated Jan 4, 2025

Tsingularity / dift

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Python 638 35 Updated May 14, 2024

vye16 / shape-of-motion

Python 854 63 Updated Aug 13, 2024

mit-han-lab / nunchaku

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Cuda 573 30 Updated Dec 9, 2024

zhwzhong / Guided-Depth-Map-Super-resolution-A-Survey

Guided Depth Map Super-resolution: A Survey (ACM CSUR 2023)

Python 144 21 Updated Dec 14, 2023

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 21,835 1,738 Updated Jan 6, 2025

IamCreateAI / Ruyi-Models

Python 410 24 Updated Jan 6, 2025

baaivision / See3D

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 593 14 Updated Dec 21, 2024

lucidrains / pi-zero-pytorch

Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence

Python 269 13 Updated Dec 31, 2024

MattWallingford / 360-1M

Python 30 3 Updated Dec 16, 2024

wenqsun / DimensionX

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,131 66 Updated Dec 7, 2024

ultralytics / ultralytics

Ultralytics YOLO11 🚀

Python 35,015 6,722 Updated Jan 6, 2025

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,144 950 Updated Jan 6, 2025

a-r-r-o-w / finetrainers

Memory-optimized training scripts for video models based on Diffusers

Python 657 69 Updated Jan 6, 2025

1zb / GeomDist

Python 183 3 Updated Dec 1, 2024

nerfstudio-project / nerfstudio

A collaboration friendly studio for NeRFs

Python 9,733 1,338 Updated Jan 6, 2025

MC-E / ReVideo

NeurIPS 2024

Python 339 11 Updated Sep 26, 2024

YangLing0818 / IterComp

IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Python 153 10 Updated Nov 1, 2024

EnVision-Research / MotionInversion

Official implementation of 'Motion Inversion For Video Customization'

Python 133 8 Updated Oct 22, 2024

EricGuo5513 / HumanML3D

HumanML3D: A large and diverse 3d human motion-language dataset.

Python 880 90 Updated Aug 18, 2024

Jintao euminds

Lists (1)

🚀 My stack

Starred repositories

image-to-video

text-to-video