thu-coai
Conversational AI groups from Tsinghua University
Pinned Loading
Repositories
Showing 10 of 101 repositories
- AISafetyLab Public
AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
thu-coai/AISafetyLab’s past year of commit activity - LRM-Safety-Study Public
thu-coai/LRM-Safety-Study’s past year of commit activity - Agent-SafetyBench Public
thu-coai/Agent-SafetyBench’s past year of commit activity - JPS Public
[MM '25] JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering
thu-coai/JPS’s past year of commit activity - CharacterBench Public
[AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models
thu-coai/CharacterBench’s past year of commit activity - ShieldVLM Public
thu-coai/ShieldVLM’s past year of commit activity - SafetyBench Public
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
thu-coai/SafetyBench’s past year of commit activity - VPO Public
thu-coai/VPO’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…