Skip to content
View xiaoachen98's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report xiaoachen98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
xiaoachen98/README.md

Hi there πŸ‘‹

  • 🌱 I'm Lin Chen, a Ph.D. student in BIVLab, USTC.
  • πŸ”­ I’m working as a research intern at Shanghai AI Laboratory.
  • πŸ’¬ I'm currently looking for collaborations, feel free to contact me.

Research Projects

  • πŸ”₯ Large-scale high-quality video-text data and superior large video-language model: ShareGPT4Video.
  • πŸ”₯ An elite vision-indispensable multi-modal benchmark: MMStar.
  • πŸ”₯ Large-scale high-quality image-text data and superior large multi-modal model: ShareGPT4V.
  • More Stable "Drag" Editing: FreeDrag
  • Robust & Transferable Semantic Segmentation: DDB, DTP, Rein
  • Discriminator-free Adversarial Domain Adaption: DALN

Pinned Loading

  1. InternLM/InternLM-XComposer InternLM/InternLM-XComposer Public

    InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

    Python 2.7k 159

  2. open-compass/VLMEvalKit open-compass/VLMEvalKit Public

    Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

    Python 1.6k 221

  3. ShareGPT4Omni/ShareGPT4Video ShareGPT4Omni/ShareGPT4Video Public

    [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

    Python 1.3k 44

  4. MMStar-Benchmark/MMStar MMStar-Benchmark/MMStar Public

    [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

    Python 156 5

  5. Open-LLaVA-NeXT Open-LLaVA-NeXT Public

    An open-source implementation for training LLaVA-NeXT.

    Python 403 20

  6. LPengYang/FreeDrag LPengYang/FreeDrag Public

    Official Implementation of FreeDrag (CVPR 2024)

    Python 413 20