scutyuanzhi

Kpillow scutyuanzhi

1 follower · 1 following

Achievements

Stars

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,517 603 Updated Mar 7, 2025

lucidrains / mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Python 317 8 Updated Jan 12, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,021 78 Updated Oct 24, 2024

lllyasviel / ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Python 4,923 389 Updated Aug 8, 2024

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,231 559 Updated Oct 24, 2024

shannanyinxiang / ViTEraser

Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 2024)

Python 47 2 Updated Jul 4, 2024

m-bain / webvid

Large-scale text-video dataset. 10 million captioned short videos.

Python 626 39 Updated Aug 14, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,467 2,828 Updated Sep 4, 2024

kyxscut / CG-GAN

Official PyTorch implementation of the CVPR 2022 paper: "Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator"

Python 88 9 Updated Sep 17, 2022

yeungchenwa / Recommendations-Diffusion-Text-Image

A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten ge…

232 8 Updated Dec 19, 2024

tyxsspa / AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,567 294 Updated Mar 7, 2025

yeungchenwa / FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Python 344 31 Updated Mar 14, 2024

lcy0604 / CTRNet

This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context".

Python 86 8 Updated Feb 21, 2023

sergeyk / rayleigh

Search image collections by multiple color palettes or by image color similarity.

Python 237 36 Updated Jan 9, 2016

JiauZhang / DragDiffusion

Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

Python 227 14 Updated Jul 19, 2023

MC-E / DragonDiffusion

ICLR 2024 (Spotlight)

Python 749 20 Updated Mar 2, 2024

openai / guided-diffusion

Python 6,601 851 Updated Jul 2, 2024

ali-vilab / composer

Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"

1,553 48 Updated Dec 26, 2023

mit-han-lab / proxylessnas

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

C++ 1,433 287 Updated Aug 30, 2024

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,648 190 Updated Dec 27, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,872 2,604 Updated Mar 4, 2025

X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,422 179 Updated Jan 23, 2025

OPPO-Mente-Lab / GlyphDraw

Text-To-Image Generation with Chinese Characters

Python 128 14 Updated Jul 20, 2023

deep-floyd / IF

Python 7,759 507 Updated Apr 14, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,604 2,926 Updated Sep 2, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,064 4,652 Updated Mar 1, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,939 5,745 Updated Mar 10, 2025

ankush-me / SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,066 623 Updated Aug 9, 2023

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,669 2,836 Updated Feb 25, 2024

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 69,854 10,355 Updated Jun 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kpillow scutyuanzhi

Achievements

Achievements

Block or report scutyuanzhi

Stars

QwenLM / Qwen2.5-VL

lucidrains / mmdit

baaivision / Emu3

lllyasviel / ControlNet-v1-1-nightly

yangjianxin1 / Firefly

shannanyinxiang / ViTEraser

m-bain / webvid

Stability-AI / generative-models

kyxscut / CG-GAN

yeungchenwa / Recommendations-Diffusion-Text-Image

tyxsspa / AnyText

yeungchenwa / FontDiffuser

lcy0604 / CTRNet

sergeyk / rayleigh

JiauZhang / DragDiffusion

MC-E / DragonDiffusion

openai / guided-diffusion

ali-vilab / composer

mit-han-lab / proxylessnas

AlibabaResearch / AdvancedLiterateMachinery

microsoft / unilm

X-PLUG / mPLUG-Owl

OPPO-Mente-Lab / GlyphDraw

deep-floyd / IF

Vision-CAIR / MiniGPT-4

lm-sys / FastChat

huggingface / diffusers

ankush-me / SynthText

lllyasviel / ControlNet

CompVis / stable-diffusion