Skip to content
View jianjieluo's full-sized avatar
🤯
Busssssssssssssssssssssy
🤯
Busssssssssssssssssssssy

Block or report jianjieluo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[MM 2025] SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning

2 Updated Jul 21, 2025

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 365 27 Updated Aug 3, 2025
Python 4,028 414 Updated Jul 31, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 10,769 1,079 Updated Sep 1, 2025

[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges

1,573 47 Updated Sep 2, 2025

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 297 8 Updated Nov 13, 2024

Official implementation of OneDiffusion paper (CVPR 2025)

Python 650 19 Updated Dec 14, 2024

Stable Diffusion web UI

Python 156,249 28,994 Updated May 3, 2025

《可解释的机器学习--黑盒模型可解释性理解指南》,该书为《Interpretable Machine Learning》中文版

4,897 688 Updated Nov 28, 2023

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,532 501 Updated Aug 25, 2025

[ICLR 2025 Spotlight] An open-sourced LLM judge for evaluating LLM-generated answers.

Python 396 26 Updated Feb 11, 2025

A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.

Python 513 88 Updated Oct 15, 2023

Main source code of SRPO framework.

Python 33 2 Updated Aug 18, 2025

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 312 18 Updated Oct 7, 2024

实验室GPU服务器的LXD虚拟化

Shell 419 66 Updated Jun 30, 2022

A data augmentations library for audio, image, text, and video.

Python 5,031 308 Updated Jul 30, 2025

A deep learning library for video understanding research.

Python 3,479 426 Updated Jan 25, 2025

Effective Video Augmentation Techniques for Training Convolutional Neural Networks

Python 409 79 Updated Feb 13, 2024
JavaScript 1 Updated Jun 21, 2025

Unleashing Hour-Scale Video Training for Long Video-Language Understanding

Python 11 Updated Jun 24, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,393 86 Updated Jun 27, 2025

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

Python 58 Updated May 7, 2025

[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI

1,673 122 Updated Jul 21, 2025

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 4,115 459 Updated Feb 12, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,183 1,051 Updated Sep 4, 2025

Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

Jupyter Notebook 31 Updated Mar 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 57,593 7,059 Updated Sep 3, 2025

Official implementation of BLIP3o-Series

Python 1,460 60 Updated Sep 4, 2025
Next