Organizations

@dmlc


LLM101n: Let's build a Storyteller

30,085 1,641 Updated Aug 1, 2024

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

446 13 Updated Oct 10, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 26,824 2,970 Updated Nov 13, 2024

Paper reading notes on Deep Learning and Machine Learning

Jupyter Notebook 1,130 176 Updated Jun 24, 2024

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

2,343 230 Updated Aug 15, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,688 98 Updated Oct 10, 2024

We write your reusable computer vision tools. 💜

Python 24,073 1,792 Updated Nov 12, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,854 130 Updated Jul 2, 2024

A curated list of foundation models for vision and language tasks

831 37 Updated Nov 8, 2024

Awesome papers & datasets specifically focused on long-term videos.

195 7 Updated Oct 17, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,128 1,400 Updated Sep 5, 2024

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 12,590 3,005 Updated Nov 12, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,702 1,965 Updated Sep 26, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,580 5,624 Updated Sep 18, 2024

An open-source framework for training large multimodal models.

Python 3,739 284 Updated Aug 31, 2024

🎢 Creating and sharing simulation environments for embodied and synthetic data research

Python 190 13 Updated Oct 19, 2023

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Python 4,449 422 Updated Sep 21, 2024

Visual tracking library based on PyTorch.

Python 3,243 605 Updated Aug 8, 2024

An ongoing paper list on new trends in 3D vision with deep learning

328 31 Updated Jun 17, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,853 3,307 Updated Jul 23, 2024

Visualization tool for Graph Neural Networks

TypeScript 239 27 Updated Sep 20, 2022

The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .

C++ 1,003 99 Updated Jul 22, 2024

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Jupyter Notebook 2,336 231 Updated Oct 7, 2024

Course notes for the whiteboard derivation series (first edition)

496 106 Updated May 16, 2021

PointTrack (ECCV2020 ORAL): Segment as Points for Efficient Online Multi-Object Tracking and Segmentation

Python 262 47 Updated Oct 3, 2023

The official PyTorch implementation of the paper "Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation".

Python 253 50 Updated Jul 23, 2023