jiaxiaojunQAQ

jiaxiaojun jiaxiaojunQAQ

85 followers · 21 following

Nanyang Technological University
新加坡
https://jiaxiaojunqaq.github.io/

Achievements

Stars

xingjunm / Awesome-Large-Model-Safety

Safety at Scale: A Comprehensive Survey of Large Model Safety

22 Updated Feb 11, 2025

llm-as-a-judge / Awesome-LLM-as-a-judge

201 7 Updated Feb 5, 2025

HuichiZhou / TrustRAG

Code for "TrustRAG: Enhancing Robustness and Trustworthiness in RAG"

Python 17 1 Updated Jan 21, 2025

AriaUI / Awesome-GUI-Agent

Forked from showlab/Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

5 Updated Dec 25, 2024

bytedance / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.

TypeScript 2,549 186 Updated Feb 12, 2025

gq-max / AdvDiffVLM

Jupyter Notebook 16 Updated Jan 20, 2025

deepseek-ai / DeepSeek-R1

72,871 9,398 Updated Feb 8, 2025

LeLiang-SJTU / PGJ-T2l

This is the official repository of [[2408.10848\] Perception-guided Jailbreak against Text-to-Image Models](https://arxiv.org/abs/2408.10848) The paper is accepted by AAAI 2025.

Python 2 1 Updated Jan 27, 2025

wagner-group / prompt-injection-defense

Fine-tuning base models to build robust task-specific models

Python 27 5 Updated Apr 11, 2024

tldrsec / prompt-injection-defenses

Every practical and proposed defense against prompt injection.

384 27 Updated May 31, 2024

kjappelbaum / gptchem

Jupyter Notebook 240 40 Updated May 17, 2024

tokaka22 / CVPR24-Focus-on-Hiders

[CVPR2024]Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training

Python 6 Updated Jan 27, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 3,435 463 Updated Jan 26, 2025

AriaUI / Aria-UI

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 317 30 Updated Feb 8, 2025

lancopku / agent-backdoor-attacks

Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]

Python 60 2 Updated Sep 27, 2024

facebookresearch / jailbreak-objectives

Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"

Python 23 1 Updated Dec 18, 2024

yibo-miao / T2VSafetyBench

Python 10 1 Updated Nov 4, 2024

llava-rlhf / LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Python 344 24 Updated Nov 1, 2023

EchoseChen / SPA-VL-RLHF

The reinforcement learning codes for dataset SPA-VL

Python 28 Updated Jun 24, 2024

alenai97 / PEFT-MLLM

Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"

Python 15 1 Updated Nov 10, 2024

qingjiesjtu / USC

This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.

Jupyter Notebook 55 Updated Dec 20, 2024

Huang-yihao / Personalization-based_backdoor

6 Updated Dec 18, 2024

RLHF-V / RLHF-V

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 261 7 Updated Sep 11, 2024

composable-models / llm_multiagent_debate

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Python 396 55 Updated Oct 3, 2023

FudanDISC / SocialAgent

A collection of resources that investigate social agents.

107 10 Updated Jan 21, 2025

MaTengSYSU / HIMRD-jailbreak

Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"

Python 12 Updated Jan 2, 2025

UKGovernmentBEIS / inspect_evals

Collection of evals for Inspect AI

Python 70 74 Updated Feb 12, 2025

lionel-w2 / FAP

Python 12 Updated Oct 20, 2024

agiresearch / ASB

Agent Security Bench (ASB)

Python 58 3 Updated Dec 15, 2024

PKU-ML / PAT

Python 7 Updated Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jiaxiaojun jiaxiaojunQAQ

Achievements

Achievements

Block or report jiaxiaojunQAQ

Stars

xingjunm / Awesome-Large-Model-Safety

llm-as-a-judge / Awesome-LLM-as-a-judge

HuichiZhou / TrustRAG

AriaUI / Awesome-GUI-Agent

bytedance / UI-TARS-desktop

gq-max / AdvDiffVLM

deepseek-ai / DeepSeek-R1

LeLiang-SJTU / PGJ-T2l

wagner-group / prompt-injection-defense

tldrsec / prompt-injection-defenses

kjappelbaum / gptchem

tokaka22 / CVPR24-Focus-on-Hiders

SamuelSchmidgall / AgentLaboratory

AriaUI / Aria-UI

lancopku / agent-backdoor-attacks

facebookresearch / jailbreak-objectives

yibo-miao / T2VSafetyBench

llava-rlhf / LLaVA-RLHF

EchoseChen / SPA-VL-RLHF

alenai97 / PEFT-MLLM

qingjiesjtu / USC

Huang-yihao / Personalization-based_backdoor

RLHF-V / RLHF-V

composable-models / llm_multiagent_debate

FudanDISC / SocialAgent

MaTengSYSU / HIMRD-jailbreak

UKGovernmentBEIS / inspect_evals

lionel-w2 / FAP

agiresearch / ASB

PKU-ML / PAT