🗺️ Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors

Michelle S. Lam, Fred Hohman, Dominik Moritz, Jeffrey P. Bigham, Kenneth Holstein, Mary Beth Kery
UIST, 2025. https://arxiv.org/abs/2409.18203

AI policy sets boundaries on acceptable behavior for AI models, but this is challenging in the context of large language models (LLMs): how do you ensure coverage over a vast behavior space? We introduce policy maps, an approach to AI policy design inspired by the practice of physical mapmaking. Instead of aiming for full coverage, policy maps aid effective navigation through intentional design choices about which aspects to capture and which to abstract away.

This repository contains the code for Policy Projector, a prototype tool for designing LLM policy maps, as described in our paper. With Policy Projector, an AI practitioner can survey the landscape of model input-output pairs, define custom regions (e.g., "violence"), and navigate these regions with if-then policy rules that act on LLM outputs (e.g., if output contains "violence" and "graphic details," then rewrite without "graphic details"). Policy Projector supports interactive policy authoring through LLM classification and steering, along with a map visualization that reflects the AI practitioner's work. In an evaluation with 12 AI safety experts, our system helps policy designers craft policies around problematic model behaviors such as incorrect gender assumptions and handling of immediate physical safety threats.
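To make the if-then rule idea concrete, here is a minimal, self-contained sketch. It is not the Policy Projector API: the names Concept, PolicyRule, and drop_sentences_matching are hypothetical, and simple keyword matching and sentence filtering stand in for the LLM classification and steering that the tool uses.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class Concept:
    """Hypothetical region definition; keywords stand in for an LLM classifier."""
    name: str
    keywords: Tuple[str, ...]

    def matches(self, text: str) -> bool:
        lowered = text.lower()
        return any(keyword in lowered for keyword in self.keywords)


@dataclass(frozen=True)
class PolicyRule:
    """If every concept in `if_concepts` matches the output, apply `action`."""
    if_concepts: Tuple[Concept, ...]
    action: Callable[[str], str]

    def apply(self, output: str) -> str:
        if all(concept.matches(output) for concept in self.if_concepts):
            return self.action(output)
        return output  # rule does not fire; output is left unchanged


def drop_sentences_matching(concept: Concept) -> Callable[[str], str]:
    """Crude stand-in for LLM rewriting: remove sentences that match a concept."""
    def action(text: str) -> str:
        sentences = [s.strip() for s in text.split(".") if s.strip()]
        kept = [s for s in sentences if not concept.matches(s)]
        return (". ".join(kept) + ".") if kept else ""
    return action


# Illustrative concepts mirroring the example rule in the text above.
violence = Concept("violence", ("fight", "attack"))
graphic = Concept("graphic details", ("blood", "gore"))

# If output contains "violence" and "graphic details",
# then rewrite without "graphic details".
rule = PolicyRule((violence, graphic), drop_sentences_matching(graphic))

rewritten = rule.apply("The knights fight at dawn. Blood covers the field.")
untouched = rule.apply("The knights fight at dawn.")
```

In this sketch the rule fires only when both concepts are present, so the second call returns its input unchanged. In the real tool, both the matching and the rewriting steps would be model-driven rather than keyword-based.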

Development Guide

This repo contains the following components:

policy-projector: the Policy Projector Python package
map-visualization-app: the Policy Projector interactive interface
notebooks: sample Python notebooks for data processing

Installation

Install uv.
Create a new virtual environment with uv by running uv venv, then activate it with source .venv/bin/activate.
Install the dependencies with uv sync.
Launch Jupyter Lab from the repository root with jupyter lab . and run notebooks/preprocess_data.ipynb to download and prepare the sample dataset.
In Jupyter Lab, enable extensions, then run notebooks/PolicyProjector_CPU+OpenAI.ipynb to see the policy maps widget.
Check out the map visualization web viewer in map-visualization-app. Each component has further documentation and development instructions.
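As an optional sanity check after the steps above, a small script along these lines can confirm that the basics are in place before opening the notebooks. This is only a sketch and not part of the repository: the function name check_setup is hypothetical, and it assumes the script is run from the repository root with the virtual environment activated.

```python
import shutil
import sys
from pathlib import Path

# Notebook paths as named in the installation steps.
NOTEBOOKS = (
    "notebooks/preprocess_data.ipynb",
    "notebooks/PolicyProjector_CPU+OpenAI.ipynb",
)


def check_setup(repo_root="."):
    """Return a dict mapping each setup check to True or False."""
    root = Path(repo_root)
    report = {
        # uv must be installed and discoverable on PATH.
        "uv on PATH": shutil.which("uv") is not None,
        # Inside a virtual environment, sys.prefix differs from sys.base_prefix.
        "virtual environment active": sys.prefix != sys.base_prefix,
    }
    for notebook in NOTEBOOKS:
        report["found " + notebook] = (root / notebook).is_file()
    return report


if __name__ == "__main__":
    for label, ok in check_setup().items():
        print("[{}] {}".format("ok" if ok else "missing", label))
```

The script only reads the environment and file system. It does not install anything or check that uv sync has completed.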

Contributing

When making contributions, refer to the CONTRIBUTING guidelines and read the CODE OF CONDUCT.

BibTeX

To cite our paper, please use:

@article{lam2025policy,
    title={{Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors}},
    author={Lam, Michelle S. and Hohman, Fred and Moritz, Dominik and Bigham, Jeffrey P. and Holstein, Kenneth and Kery, Mary Beth},
    journal={Symposium on User Interface Software and Technology},
    organization={ACM},
    year={2025},
    doi={10.1145/3746059.3747680}
}

License

This code is released under the LICENSE terms.
