Skip to content
View stein-wang0226's full-sized avatar

Block or report stein-wang0226

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
stein-wang0226/README.md
Typing SVG

CV / Academic Page Scholar Notes Email

About Me

M.S. student @ Fudan University, School of Data Science, advised by Prof. Baojian Zhou · Knowledge Works Lab

Research Intern @ Alibaba — AI4S / Multimodal LLM

Research Interests: Large Language Models (Agents), Diffusion Language Models, AI for Science, Multimodal LLMs, Graph Neural Networks


Selected Publications

Paper Venue Role
Locality-aware Diffusion Language ModelingScatter & Jigsaw blockwise architectures bridging AR and Diffusion Preprint Sole 1st Author
STM: Spatio-Temporal Distance Model for Dynamic Graph Fraud Detection — SOTA + invention patent CCF-B 1st Author
SemDLM: Semantic Diffusion Language Modeling — 27.19 Test PPL on LM1B - 2nd Author

Open-Source Notes & Resources

Interview prep notes for LLM / Multimodal / RL. Feel free to star and use!

Resource Description
Agent Harness 深度解析 Agent 训练基础设施全景 — 四层架构、异步 Rollout、GRPO、Reward 工程、VERL/ROLL/RAGEN 框架对比
LLM 算法岗面试题库 (420+ 题) Transformer / RLHF / RL / Agentic RL / VLM / Agent / RAG / Infra / 手撕代码,含详解答案
LLM & Multimodal Interview Notes Transformer, LLM Architecture, RLHF/DPO, Multimodal (CLIP/LLaVA/GPT-4o), Engineering
RL for LLM Alignment Policy Gradient → PPO → GRPO, DQN/DDPG/TD3/SAC, RLHF Pipeline, DPO Derivation, RLVR
VLM Knowledge & Interview (2025-2026) Visual Encoder, VLM Architecture, Alignment, Resolution, MoE
Transformer Decoder 全流程 训练(并行) vs 推理(自回归+KV Cache)、张量维度推导、RoPE/Attention/MLP 维度链
Training Infra & Distributed Systems DDP/FSDP/ZeRO/TP/PP/3D Parallel, LoRA/QLoRA, Flash Attention 1-3, Mixed Precision
Inference Optimization & System Design KV Cache, PagedAttention, Quantization, Speculative Decoding, System Design
NLP & LLM Course Notes Tokenization, N-gram, Transformer, GPT, BERT, RLHF (Fudan CS40008)

Tech Stack

Python PyTorch HuggingFace DeepSpeed C++ LaTeX Linux Git


GitHub Stats



Last updated: May 2026

Pinned Loading

  1. trainable-masked-diffusion trainable-masked-diffusion Public

    code open-source from alibaba

    Python

  2. STM STM Public

    STM:Spatio-Temporal distance model

    Python 2

  3. llm-study-notes llm-study-notes Public

    HTML

  4. mllm-interview-notes mllm-interview-notes Public archive

    HTML 1

  5. algorithms algorithms Public

    algorithms/acm/machine_learning

    C++

  6. stein-wang0226 stein-wang0226 Public

    Profile README — points visitors to my personal homepage