Skip to content
View TonyLianLong's full-sized avatar
🏠
Working at home
🏠
Working at home

Organizations

@ocf

Block or report TonyLianLong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official inference repo for FLUX.1 models

Python 14,086 1,008 Updated Sep 13, 2024

[ECCV 2024] ControlCap: Controllable Region-level Captioning

Python 50 Updated Aug 7, 2024

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gra…

Python 183 6 Updated Sep 20, 2024

OMG-LLaVA and OMG-Seg codebase

Python 1,235 47 Updated Aug 16, 2024

Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).

Python 141 4 Updated Aug 8, 2024

A minimalist, open source online pastebin where the server has zero knowledge of pasted data. Data is encrypted/decrypted in the browser using 256 bits AES.

PHP 6,353 785 Updated Sep 15, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,276 374 Updated Sep 25, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,995 926 Updated Aug 21, 2024

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"

Python 445 14 Updated Aug 9, 2024

When do we not need larger vision models?

Python 319 9 Updated Aug 19, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,440 195 Updated Sep 8, 2024

CLAIR: A (surprisingly) simple semantic text metric with large language models.

Python 13 Updated Jan 28, 2024
Jupyter Notebook 1,121 545 Updated May 13, 2024

We release the DaTaSeg Objects365 Instance Segmentation Dataset introduced in the DaTaSeg paper, which can be used as an evaluation benchmark for weakly or semi supervised segmentation.

Jupyter Notebook 16 1 Updated Dec 9, 2023

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Python 294 30 Updated Jan 8, 2024
Python 627 56 Updated Sep 22, 2024

The official homepage of the COCO-Stuff dataset.

Shell 836 144 Updated Sep 9, 2022

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,270 107 Updated Jul 19, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,771 108 Updated Jul 29, 2024

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"

117 1 Updated Jun 13, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,211 48 Updated Aug 15, 2024
Python 2,480 183 Updated Sep 25, 2024

This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…

Python 266 12 Updated Feb 12, 2024

Your image is almost there!

Python 7,232 418 Updated Jul 26, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,658 92 Updated Sep 6, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,822 145 Updated Sep 25, 2024

Official repository for the paper PLLaVA

Python 561 37 Updated Jul 28, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,751 1,033 Updated Sep 24, 2024

Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation

Python 34 3 Updated Jun 1, 2024

[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"

Python 165 5 Updated May 7, 2024
Next