A comprehensive toolkit for implementing, analyzing, and validating AI value alignment based on Anthropic's "Values in the Wild" research.
graph TD
subgraph Core["Core Framework"]
Extract[Extraction Module]
Taxonomy[Taxonomy Module]
Anon[Anonymization Module]
Simulate[Simulation Module]
Analyze[Analysis Module]
end
subgraph Data["Data Resources"]
ValueData[Value Taxonomies]
Samples[Chat Samples]
Frequencies[Value Frequencies]
end
subgraph Workflows["Workflows"]
Extract_Flow[Value Extraction]
Analysis_Flow[Distribution Analysis]
Simulation_Flow[Chat Simulation]
Anonymization_Flow[Privacy-Preserving Anonymization]
end
%% Core Module Relationships
Extract --> Taxonomy
Simulate --> Taxonomy
Analyze --> Taxonomy
%% Data Relationships
ValueData --> Taxonomy
Frequencies --> Simulate
Samples --> Extract
Extract --> Samples
%% Workflow Relationships
Extract_Flow --> Extract
Extract_Flow --> Anon
Analysis_Flow --> Analyze
Simulation_Flow --> Simulate
Anonymization_Flow --> Anon
%% Privacy Layer
Anon -.-> Extract
Anon -.-> Analyze
classDef core fill:#f9f,stroke:#333,stroke-width:2px
classDef data fill:#bbf,stroke:#333,stroke-width:1px
classDef flow fill:#bfb,stroke:#333,stroke-width:1px
class Extract,Taxonomy,Anon,Simulate,Analyze core
class ValueData,Samples,Frequencies data
class Extract_Flow,Analysis_Flow,Simulation_Flow,Anonymization_Flow flow
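The data flow in the diagram above (anonymize, then extract, then look values up in the taxonomy, then analyze) can be sketched in a few lines. This is only an illustration under stated assumptions: `TAXONOMY`, `anonymize`, `extract_values`, and `analyze` are hypothetical names, not the toolkit's actual API. The taxonomy entries and sample chats are invented placeholders, and keyword matching stands in for whatever extraction method the toolkit really uses.

```python
import re
from collections import Counter

# Hypothetical taxonomy: value name -> top-level category.
# Placeholder data for illustration, not taken from the toolkit or the paper.
TAXONOMY = {
    "helpfulness": "practical",
    "transparency": "epistemic",
    "empathy": "social",
}

# Simple email pattern used as a stand-in for the privacy layer.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def anonymize(text: str) -> str:
    """Mask email addresses before any further processing."""
    return EMAIL.sub("[EMAIL]", text)


def extract_values(text: str) -> list:
    """Toy extraction: report taxonomy values that appear verbatim in the text."""
    lowered = text.lower()
    return [value for value in TAXONOMY if value in lowered]


def analyze(samples) -> Counter:
    """Count top-level categories across anonymized samples."""
    counts = Counter()
    for sample in samples:
        clean = anonymize(sample)
        for value in extract_values(clean):
            counts[TAXONOMY[value]] += 1
    return counts


# Invented sample chats, used only to exercise the pipeline.
samples = [
    "Thanks for the helpfulness and transparency, reach me at a@example.com",
    "I appreciated the empathy in that reply",
]
category_counts = analyze(samples)
print(category_counts)
```

Running anonymization first mirrors the dashed "Privacy Layer" edges in the diagram: extraction and analysis only ever see masked text.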
This project uses uv for Python dependency management and make for workflow automation.
- Python 3.9+
- uv (Python package manager)
- Make
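Before running any setup, the prerequisites listed above can be checked with a short script. This is a minimal sketch, assuming the tools are installed under the executable names `uv` and `make`. The function name `check_prerequisites` is hypothetical and not part of the toolkit.

```python
import shutil
import sys

# Minimum Python version stated in the prerequisites list.
MIN_PYTHON = (3, 9)


def check_prerequisites(tools=("uv", "make")):
    """Return a dict mapping each prerequisite name to True (present) or False."""
    status = {"python": sys.version_info >= MIN_PYTHON}
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        status[tool] = shutil.which(tool) is not None
    return status


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```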
sequenceDiagram
participant User
participant Make
participant UV
participant Python
User->>Make: make setup
Make->>UV: uv venv .venv
UV-->>Make: Virtual environment created
Make->>UV: uv pip install -r requirements.txt
UV-->>Make: Dependencies installed
Make-->>User: Setup complete
User->>Make: make activate
Make-->>User: Environment activation instructions
User->>Make: make download-all
Make->>Python: Download papers and datasets
Python-->>Make: Downloads complete
Make-->>User: Resources ready
- Clone the repository:
  git clone https://github.com/defrecord/value-alignment-toolkit.git
  cd value-alignment-toolkit
- Set up the environment:
  make setup
  This will create a virtual environment using uv and install all dependencies.
- Activate the environment:
  source .venv/bin/activate # or use 'make activate' for instructions
- Download required resources:
  make download-all
- Run a sample analysis:
  make sample-analysis
- src/: Core implementation modules
  - extraction/: Value extraction algorithms
  - simulation/: Chat system simulation
  - anonymization/: Privacy-preserving techniques
  - analysis/: Statistical tools and visualizations
  - taxonomy/: Value hierarchy implementation
- data/: Data resources and outputs
  - values/: Reference data including value frequencies and taxonomies
  - samples/: Generated and anonymized conversation datasets
- tools/: Utility scripts
  - download/: Scripts to fetch relevant research papers and resources
  - validation/: Tools for testing and validating the implementation
- docs/: Documentation
  - tutorials/: Implementation guides and usage examples
  - paper/: Summaries of research methodology and key findings
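To confirm that a checkout matches the layout described above, a small script can report any missing directories. This is a sketch only: the directory names come from the list above, while `EXPECTED_DIRS` and `missing_dirs` are hypothetical names and not part of the toolkit. Some directories (for example under data/) may only appear after the download or analysis targets have run.

```python
from pathlib import Path

# Directories named in the project structure list.
EXPECTED_DIRS = [
    "src/extraction",
    "src/simulation",
    "src/anonymization",
    "src/analysis",
    "src/taxonomy",
    "data/values",
    "data/samples",
    "tools/download",
    "tools/validation",
    "docs/tutorials",
    "docs/paper",
]


def missing_dirs(root="."):
    """Return the expected directories that do not exist under root."""
    base = Path(root)
    return [d for d in EXPECTED_DIRS if not (base / d).is_dir()]


if __name__ == "__main__":
    missing = missing_dirs(".")
    if missing:
        print("Missing directories:")
        for d in missing:
            print(f"  {d}")
    else:
        print("All expected directories are present.")
```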
Run make help to see all available commands.
[Appropriate license information]
This work builds on Anthropic's "Values in the Wild" paper by Saffron Huang, Esin Durmus, et al.