Skip to content
View ZhngQ1's full-sized avatar

Highlights

  • Pro

Block or report ZhngQ1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ZhngQ1/README.md

Hi, I'm Qile (Charles) Zhang 👋

M.S. student in Electrical and Computer Engineering at UC San Diego, working on computer vision, multimodal AI, and efficient deep learning.

Research Interests

  • Multimodal Large Language Models — evaluation, benchmarking, tool-mediated reasoning
  • Efficient Deep Learning — model pruning, on-device deployment, federated learning
  • Generative Models — diffusion models, conditional image generation

Featured Projects

Project Description Status
MLLM-as-a-Judge Benchmark for evaluating MLLMs as judges of vision-task outputs Paper in prep
TAP-ViTs Task-adaptive pruning for deploying ViTs on edge devices arXiv
Noise-Level-Dependence Measuring how conditioning effectiveness varies with noise level in diffusion models Extended research
Face-Swap-Diffusion Exploring sampling schedulers and identity guidance mechanisms for conditional diffusion model-based face swapping Undergraduate Thesis

Publications

  1. Zhibo Wang, Zuoyuan Zhang, Xiaoyi Pang, Qile Zhang, et al. TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers. arXiv:2601.02437

  2. [Authors including Qile Zhang]. Evaluating Multi-modal Large Language Models as MLLM-as-Judge for Vision Tasks. In preparation.

Get in Touch

Email LinkedIn

Popular repositories Loading

  1. Measuring-Noise-Level-Dependence-in-Conditional-Image-Generation Measuring-Noise-Level-Dependence-in-Conditional-Image-Generation Public

    Quantitative framework for measuring how conditioning effectiveness varies with noise level in diffusion model inference (SD 1.5 & SDXL)

    TeX 7 1

  2. TAP-ViTs TAP-ViTs Public

    Task-adaptive pruning framework for deploying Vision Transformers on heterogeneous edge devices without accessing private data (arXiv 2601.02437)

    Python 1

  3. MLLM-as-a-Judge MLLM-as-a-Judge Public

    Benchmark for evaluating MLLMs as judges of vision-task outputs across intrinsic and tool-mediated settings

    Jupyter Notebook 1

  4. Instance-Segmentation-for-MLLM-as-a-Judge Instance-Segmentation-for-MLLM-as-a-Judge Public

    Instance segmentation data curation pipeline for the Vision-Judge benchmark (MLLM-as-Judge)

    Python 1

  5. Labelme-modified Labelme-modified Public

    Python 1

  6. Camera-Fingerprint-Removal-Using-Neural-Networks Camera-Fingerprint-Removal-Using-Neural-Networks Public

    Neural network-based camera fingerprint (sensor pattern noise) removal for image privacy protectionUsing AE to remove camera fingerprint

    Python