ashun989
59 stars written in Python

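Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…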

Python 67,109 8,242 Updated Jan 21, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 30,107 9,520 Updated Aug 21, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,277 5,589 Updated Jan 25, 2025

Deep learning for image processing, including classification, object detection, etc.

Python 23,867 8,080 Updated Jan 12, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 21,557 3,156 Updated Jan 19, 2025

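Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.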

Python 18,678 1,945 Updated Apr 4, 2024

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,100 792 Updated Jan 11, 2025

Python bindings for llama.cpp

Python 8,470 1,020 Updated Jan 20, 2025

Edit videos with a text editor

Python 6,944 711 Updated Oct 5, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Int…

Python 5,153 445 Updated Jan 26, 2025

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Python 4,800 1,312 Updated Aug 8, 2024

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,353 196 Updated Feb 29, 2024

Visual tracking library based on PyTorch.

Python 3,309 608 Updated Aug 8, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,399 173 Updated Aug 1, 2024

Let ChatGPT truly learn how to go online and call APIs! 'EX-ChatGPT' can rival and even surpass NewBing

Python 1,995 329 Updated Mar 30, 2023

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,830 124 Updated Oct 30, 2024

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,214 44 Updated Dec 11, 2024

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 878 49 Updated Jul 6, 2024

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 783 42 Updated Aug 5, 2024

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 613 46 Updated Dec 30, 2024

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 559 38 Updated Jan 7, 2024

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 503 31 Updated May 8, 2024

Open-vocabulary Semantic Segmentation

Python 324 33 Updated Oct 16, 2024

[NeurIPS 2023] DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

Python 311 15 Updated Nov 3, 2023

[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

Python 274 13 Updated Apr 22, 2024

An official PyTorch implementation of the CRIS paper

Python 258 36 Updated Jun 9, 2024

Official code for "SRFormer: Permuted Self-Attention for Single Image Super-Resolution" (ICCV 2023) and SRFormerV2

Python 251 22 Updated Aug 18, 2024

[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 212 9 Updated Dec 22, 2024

JSeg is a Semantic segmentation toolbox based on MMSegmentation and Jittor

Python 208 13 Updated Jul 14, 2024