Highlights
- Pro
Stars
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"
Official GitHub repository for the paper "Adversarial Attacks on Robotic Vision Language Action Models"
[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
This is the official GitHub repository of the paper "Dia-LLaMA: Towards Large Language Model-driven CT Report Generation"
This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Report Generation".
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
[USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction"
This repository provides a benchmark for prompt Injection attacks and defenses
Repository accompanying the paper https://openreview.net/pdf?id=sSAp8ITBpC
SoK: Evaluating Jailbreak Guardrails for Large Language Models
Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).
A framework to evaluate the generalization capability of safety alignment for LLMs
The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Code Implementation of Adversarial Prompt Evaluation paper
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
Safety at Scale: A Comprehensive Survey of Large Model Safety
[ICLR 2025 Spotlight] The official implementation of our ICLR2025 paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".