Official Implementation of Learning to Refuse: Towards Mitigating Privacy Risks in LLMs
RETURN is avaliable in data/RETURN.jsonl
. You also can access RETURN directly on Hugging Face.
from datasets import load_dataset
dataset = load_dataset("zhliu/RETURN")
# Clone the repository
git clone git@github.com:zhliu0106/learning-to-refuse.git
cd learning-to-refuse
# Create and activate conda environment
conda create -n refuse python==3.10
conda activate refuse
# Install dependencies
pip install -r requirements.txt
bash scripts/data_process.sh
bash scripts/run.sh
Note: Due to differences in hardware environments and random seed settings, there might be slight variations in the experimental results.
@article{liu2024learning,
title={Learning to refuse: Towards mitigating privacy risks in llms},
author={Liu, Zhenhua and Zhu, Tong and Tan, Chuanyuan and Chen, Wenliang},
journal={arXiv preprint arXiv:2407.10058},
year={2024}
}