I am a PhD candidate at the Network Security Lab, University of Washington, advised by Prof. Radha Poovendran. I work on LLM-related topics (e.g., safety, alignment, reasoning, post-training, synthetic data).
Website | HuggingFace | X
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions, all in one framework.
Open-source AI agent security infrastructure that intercepts and blocks dangerous agent behaviors before they happen. Just one command! Join us to build safer Human-AI Symbiosis!
Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
Your efficient and accurate answer verification system for RL training.