Skip to content
@360CVGroup

360 AI Research

360人工智能研究院

👋 Who We Are

This is the 360 AI Research, our mission is to lead in tech innovations and deliver real-world values.
We focus on "multimodal + cross-modal learning" and "large model + zero/few shot learning",
conducting research in

  • 🔎 multi-modal comprehension

    • FG-CLIP: ICML2025, new generation of CLIP with strong fine grained discrimination capability
    • RzenEmbed: Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark
    • LMM-Det: ICCV2025, make large multimodal models excel in object detection
    • IAA: AAAI2025, LMM with plugin mechanism solving catastrophic forgetting
    • 360VL: Large multimodal model, 2nd-gen
    • SEEChat: Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM
    • OVD: KDD2023, open-world object detection, we also co-hosted open vocabulary detection contest 2023 with CSIG(中国图象图形学学会)
    • Zero: ACM MM2023, large scale open-sourced Chinese cross-modal data and benchmark
  • 🎨 multi-modal generation

    • EVTAR: End2End Virtual Try-on with Visual Reference
    • PlanGen: ICCV2025, unified layout planning and image generation
    • Qihoo-T2X: ICLR2025, efficient DiT architecture for text2any tasks
    • BDM: AAAI2025, Chinese-native image generation while compatible with SD eco-system, 1st-gen
    • HiCo: NeurIPS2024, layout controlled image generation
    • FancyVideo: Video generation from text&image, 1st-gen

🛒 Business & API

Check research.360.cn for contact and API portal

🔥 Hiring

Internship: we're hiring research interns in fileds of AIGC, LMM, and inference optimization, check 👉 JD here

Pinned Loading

  1. FG-CLIP FG-CLIP Public

    New generation of CLIP with fine grained discrimination capability, ICML2025

    Python 497 27

  2. RefVTON RefVTON Public

    End2End Virtual Try-on with Visual Reference

    Python 54 8

  3. RzenEmbed RzenEmbed Public

    Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark

    Python 23

  4. PlanGen PlanGen Public

    Unified layout planning and image generation, ICCV2025

    Python 39 1

  5. LMM-Det LMM-Det Public

    Make Large Multimodal Models excel in object detection, ICCV 2025

    Python 61 3

  6. HiCo_T2I HiCo_T2I Public

    Layout Conditioned Image Generation, NeurIPS2024

    Python 64 3

Repositories

Showing 10 of 19 repositories
  • RefVTON Public

    End2End Virtual Try-on with Visual Reference

    360CVGroup/RefVTON’s past year of commit activity
    Python 54 8 1 0 Updated Nov 19, 2025
  • .github Public

    Introduction to 360 AI Research

    360CVGroup/.github’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Nov 10, 2025
  • RzenEmbed Public

    Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark

    360CVGroup/RzenEmbed’s past year of commit activity
    Python 23 MIT 0 1 0 Updated Nov 6, 2025
  • FG-CLIP Public

    New generation of CLIP with fine grained discrimination capability, ICML2025

    360CVGroup/FG-CLIP’s past year of commit activity
    Python 497 Apache-2.0 27 34 0 Updated Oct 27, 2025
  • MiniCPM-o.cpp Public

    Inference of MiniCPM-o 2.6 in plain C/C++

    360CVGroup/MiniCPM-o.cpp’s past year of commit activity
    C++ 28 Apache-2.0 5 1 0 Updated Oct 14, 2025
  • FGCLIP-MCP Public

    MCP (Model Context Protocol) server for FG-CLIP embedding services.

    360CVGroup/FGCLIP-MCP’s past year of commit activity
    Python 2 Apache-2.0 0 0 0 Updated Sep 30, 2025
  • WISA Public

    World Simulator Assistant for Physics-Aware Text-to-Video Generation

    360CVGroup/WISA’s past year of commit activity
    Python 254 Apache-2.0 42 1 0 Updated Sep 22, 2025
  • HiCo_T2I Public

    Layout Conditioned Image Generation, NeurIPS2024

    360CVGroup/HiCo_T2I’s past year of commit activity
    Python 64 3 12 0 Updated Sep 3, 2025
  • llama.cpp Public Forked from ggml-org/llama.cpp

    LLM inference in C/C++

    360CVGroup/llama.cpp’s past year of commit activity
    C++ 0 MIT 14,416 0 0 Updated Aug 25, 2025
  • LMM-Det Public

    Make Large Multimodal Models excel in object detection, ICCV 2025

    360CVGroup/LMM-Det’s past year of commit activity
    Python 61 Apache-2.0 3 2 0 Updated Aug 1, 2025

Top languages

Loading…