Starred repositories

7 Updated Jan 15, 2025

Official repo and evaluation implementation of VSI-Bench

Python 325 21 Updated Jan 12, 2025

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

108 4 Updated Jan 6, 2025

Compose multimodal datasets 🎹

Python 261 13 Updated Dec 6, 2024

Improving 3D Large Language Model via Robust Instruction Tuning

45 3 Updated Oct 2, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 8,228 850 Updated Jan 15, 2025

A Vision-Language Model for Spatial Affordance Prediction in Robotics

Python 79 5 Updated Oct 18, 2024

Code for RoboFlamingo

Python 334 28 Updated May 8, 2024

Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 290 34 Updated Sep 26, 2023

A Survey of Embodied Learning for Object-Centric Robotic Manipulation

168 13 Updated Oct 4, 2024

LLaRA: Large Language and Robotics Assistant

Python 162 3 Updated Oct 2, 2024

Vision agent

Python 1,723 209 Updated Jan 15, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,715 221 Updated Dec 11, 2024

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

1,011 73 Updated Dec 31, 2024

Implementation of Autoregressive Diffusion in Pytorch

Python 342 9 Updated Nov 3, 2024

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

133 1 Updated Jun 18, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 534 22 Updated Aug 16, 2024

Utilities intended for use with Llama models.

Python 5,592 930 Updated Jan 15, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,164 41 Updated Jan 15, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,823 123 Updated Oct 30, 2024

Repo for external large-scale work

Python 6,518 727 Updated Apr 27, 2024

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 406 23 Updated Dec 24, 2024
Python 44 2 Updated May 6, 2024

LLM101n: Let's build a Storyteller

31,009 1,696 Updated Aug 1, 2024

DataComp for Language Models

HTML 1,206 111 Updated Dec 11, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,907 112 Updated Jul 29, 2024

Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)

Python 109 15 Updated Aug 30, 2024

[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,026 47 Updated Oct 9, 2024

code for NeurIPS 2018 paper, "Sparse PCA from Sparse Linear Regression"

Python 1 Updated Mar 10, 2022