Value Alignment in LLMs

This repository includes codes for two papers presented in EMNLP 2025 regarding value alignment in LLMs:

EMNLP 2025 Main: Mind the Value-Action Gap: Do LLMs Act in Alignment with Their Values?
EMNLP 2025 WiNLP Workshop: ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs

These two papers aim to answer the core concerning questions related to value alignment:

RQ1: How can we systematically capture human values and evaluate the extent to which LLM aligns with them? (Value Alignment between Humans & LLMs, EMNLP 2025 WiNLP Workshop)
RQ2: To what extent do LLM-generated value statements align with their value-informed actions? (Value Alignment between LLM's value claim & the corresponding actions, EMNLP 2025 Main)

Overview

To address RQ1 -- Human-AI Value Alignment -- we propose ValueCompass, a framework for systematically measuring value alignment between LLMs and humans across contextual scenarios. See below figure for an overview.

To address RQ2 -- LLM's Value-Action Alignment -- we propose ValueActionLens Framework, associated with the VIA (Value-Informed Dataset) dataset, to assess the alignment between LLMs’ stated values & value-informed actions. See below figure for an example of GPT4o's Value-Action Gap.

Evaluating Value Alignment

To evaluate the value alignment in LLMs, codes are released in this directory: Value-Action Alignment Tasks.

VIA Dataset

The full VIA dataset can be accessed in this directory: Value-Informed Dataset (VIA)

@article{shen2024valuecompass,
    title={Valuecompass: A framework of fundamental values for human-ai alignment},
    author={Shen, Hua and Knearem, Tiffany and Ghosh, Reshmi and Yang, Yu-Ju and Mitra, Tanushree and Huang, Yun},
    journal={arXiv preprint arXiv:2409.09586},
    year={2024}
}


@article{shen2025mind,
  title={Mind the Value-Action Gap: Do LLMs Act in Alignment with Their Values?},
  author={Shen, Hua and Clark, Nicholas and Mitra, Tanushree},
  journal={arXiv preprint arXiv:2501.15463},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
figures		figures
outputs		outputs
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Value Alignment in LLMs

Overview

Evaluating Value Alignment

VIA Dataset

About

Uh oh!

Releases

Packages

Contributors 2

Languages

huashen218/value_action_gap

Folders and files

Latest commit

History

Repository files navigation

Value Alignment in LLMs

Overview

Evaluating Value Alignment

VIA Dataset

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages