  • HUST(Huazhong University of Science and Technology)
  • Wuhan

pipixin321/README.md

Hi there 👋, I'm Huaxin Zhang

Huaxin Zhang github Google Scholar

Currently, I am an algorithm engineer.

🔭 Research-wise, I mainly focus on:

  • Multi-modal Large Language Models
  • Video Understanding

📫 Contact me by:

💬 News:

  • 2025-02-27: Holmes-VAU was accepted to CVPR 2025.
  • 2024-07-01: We released the code and model for "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM". [project page]
  • 2024-06-10: We released the code and model for "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities". [project page]
  • 2024-01-29: I started my internship at Baidu VIS, doing research on Multi-modal Large Language Models (MLLMs).
  • 2023-12-09: One paper on point-supervised temporal action localization was accepted to AAAI 2024.

Huaxin's github stats

Pinned

  1. HolmesVAU Public

    [CVPR 2025 Highlight] Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

    Python 99 4

  2. HolmesVAD Public

    Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"

    Python 142 7

  3. HR-Pro Public

    [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"

    Python 39 2

  4. GlanceVAD Public

    [ICME 2025 Oral] Official implementation of "GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection"

    Python 31 2

  5. Awesome-Video-MLLMs Public

    🔥 🔥 🔥 Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding 📹

    46 1

  6. Arcana Public

    Forked from syp2ysy/Arcana

    Implementation of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities"

    Python 2