hutchinsonian

🏠

Working from home

Zhang Jiahui hutchinsonian

🏠

Working from home

4 followers · 65 following

China

Highlights

Lists (9)

Sort

Starred repositories

pokerllm / pokerbench

7 Updated Jan 15, 2025

vision-x-nyu / thinking-in-space

Official repo and evaluation implementation of VSI-Bench

Python 325 21 Updated Jan 12, 2025

liudaizong / Awesome-3D-Visual-Grounding

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

108 4 Updated Jan 6, 2025

remyxai / VQASynth

Compose multimodal datasets 🎹

Python 261 13 Updated Dec 6, 2024

WeitaiKang / Robin3D

Improving 3D Large Language Model via Robust Instruction Tuning

45 3 Updated Oct 2, 2024

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,228 850 Updated Jan 15, 2025

wentaoyuan / RoboPoint

A Vision-Language Model for Spatial Affordance Prediction in Robotics

Python 79 5 Updated Oct 18, 2024

RoboFlamingo / RoboFlamingo

Code for RoboFlamingo

Python 334 28 Updated May 8, 2024

vimalabs / VIMABench

Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 290 34 Updated Sep 26, 2023

RayYoh / OCRM_survey

A Survey of Embodied Learning for Object-Centric Robotic Manipulation

168 13 Updated Oct 4, 2024

LostXine / LLaRA

LLaRA: Large Language and Robotics Assistant

Python 162 3 Updated Oct 2, 2024

landing-ai / vision-agent

Vision agent

Python 1,723 209 Updated Jan 15, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,715 221 Updated Dec 11, 2024

HCPLab-SYSU / Embodied_AI_Paper_List

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

1,011 73 Updated Dec 31, 2024

lucidrains / autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Python 342 9 Updated Nov 3, 2024

krennic999 / STAR

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

133 1 Updated Jun 18, 2024

Alpha-VLLM / Lumina-mGPT

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 534 22 Updated Aug 16, 2024

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 5,592 930 Updated Jan 15, 2025

Xnhyacinth / Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,164 41 Updated Jan 15, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,823 123 Updated Oct 30, 2024

facebookresearch / metaseq

Repo for external large-scale work

Python 6,518 727 Updated Apr 27, 2024

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 406 23 Updated Dec 24, 2024

DCDmllm / MorphTokens

Python 44 2 Updated May 6, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

31,009 1,696 Updated Aug 1, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,206 111 Updated Dec 11, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,907 112 Updated Jul 29, 2024

andrewyng / translation-agent

Python 5,036 596 Updated Aug 4, 2024

takuseno / d4rl-atari

Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)

Python 109 15 Updated Aug 30, 2024

ShareGPT4Omni / ShareGPT4Video

[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Zhang Jiahui hutchinsonian

Highlights

Lists (9)

Diffusion

Generation transformer

✨ Inspiration

LLM

MLLM

NeRF&3DGaussian

Rb&RL

Tools&Optimal

Trajectory

Starred repositories

Deep learning

Algorithm

3D