Skip to content
View jiaxiaojunQAQ's full-sized avatar

Block or report jiaxiaojunQAQ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Safety at Scale: A Comprehensive Survey of Large Model Safety

22 Updated Feb 11, 2025

Code for "TrustRAG: Enhancing Robustness and Trustworthiness in RAG"

Python 17 1 Updated Jan 21, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

5 Updated Dec 25, 2024

A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.

TypeScript 2,549 186 Updated Feb 12, 2025
Jupyter Notebook 16 Updated Jan 20, 2025

This is the official repository of [[2408.10848\] Perception-guided Jailbreak against Text-to-Image Models](https://arxiv.org/abs/2408.10848) The paper is accepted by AAAI 2025.

Python 2 1 Updated Jan 27, 2025

Fine-tuning base models to build robust task-specific models

Python 27 5 Updated Apr 11, 2024

Every practical and proposed defense against prompt injection.

384 27 Updated May 31, 2024
Jupyter Notebook 240 40 Updated May 17, 2024

[CVPR2024]Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training

Python 6 Updated Jan 27, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 3,435 463 Updated Jan 26, 2025

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 317 30 Updated Feb 8, 2025

Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]

Python 60 2 Updated Sep 27, 2024

Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"

Python 23 1 Updated Dec 18, 2024
Python 10 1 Updated Nov 4, 2024

Aligning LMMs with Factually Augmented RLHF

Python 344 24 Updated Nov 1, 2023

The reinforcement learning codes for dataset SPA-VL

Python 28 Updated Jun 24, 2024

Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"

Python 15 1 Updated Nov 10, 2024

This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.

Jupyter Notebook 55 Updated Dec 20, 2024

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 261 7 Updated Sep 11, 2024

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Python 396 55 Updated Oct 3, 2023

A collection of resources that investigate social agents.

107 10 Updated Jan 21, 2025

Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"

Python 12 Updated Jan 2, 2025

Collection of evals for Inspect AI

Python 70 74 Updated Feb 12, 2025
Python 12 Updated Oct 20, 2024

Agent Security Bench (ASB)

Python 58 3 Updated Dec 15, 2024
Python 7 Updated Oct 31, 2024
Next