- UC Berkeley
- Berkeley, California
- https://tonylian.com/
- in/longlian
- @LongTonyLian
Stars
Official inference repo for FLUX.1 models
[ECCV 2024] ControlCap: Controllable Region-level Captioning
[CVPR 2024] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloading the trained model checkpoints, and example notebooks / gra…
Official repository for the paper "MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning" (https://arxiv.org/abs/2406.17770).
A minimalist, open-source online pastebin where the server has zero knowledge of pasted data. Data is encrypted and decrypted in the browser using 256-bit AES.
SGLang is a fast serving framework for large language models and vision language models.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World
When do we not need larger vision models?
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
CLAIR: A (surprisingly) simple semantic text metric with large language models.
We release the DaTaSeg Objects365 Instance Segmentation Dataset introduced in the DaTaSeg paper, which can be used as an evaluation benchmark for weakly- or semi-supervised segmentation.
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
The official homepage of the COCO-Stuff dataset.
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts…
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
A guidance language for controlling large language models.
Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation
[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"