Multilingual-Human-Value-Concepts

Welcome to the official repository for code and data accompanying our paper titled Exploring Multilingual Human Value Concepts in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?.

The repe code is based on the representation-engineering project.

The lang_sim.pt file is computed by running lang2vec_try.ipynb, which relies on lang2vec.

Preparing

Our primary experimental data, the Multilingual human VALUE dataset(MVALUE) dataset, is provided in Google Drive.

Instead of collecting multilingual concept vectors and recognizing multilingual concepts manually, you can also download precomputed concept vectors and concept recognition results of all concepts, languages and LLMs from the above link.

To utilize these resources, you should simply download the data and the res folders in the above link and place them into the main directory of the repository.

[Optional] Collecting Multilingual Concept Vectors

Collect vectors for llama2-chat-7B as:

python collect_vector.py --lang en fr zh es pt vi ca id ja ko fi hu ta te sw ny --concept morality deontology utilitarianism fairness truthfulness toxicity harmfulness --model-name llama2-chat --model-size 7B

please replace model-name and model-size for other LLMs.

Recognizing Multilingual Concepts

[Optional] Perform concept recognition for llama2-chat-7B as:

python recognize_concept.py --lang en fr zh es pt vi ca id ja ko fi hu ta te sw ny --concept morality deontology utilitarianism fairness truthfulness toxicity harmfulness --model-name llama2-chat --model-size 7B

please replace model-name and model-size for other LLMs.

After obtaining all recognition results, you can generate the multilingual concept recognition accuracy (Figure 2 and Table 6 in the paper) as:

python recognize_concept.py --lang en fr zh es pt vi ca id ja ko fi hu ta te sw ny --concept morality deontology utilitarianism fairness truthfulness toxicity harmfulness --cross-model llama2-chat-7B,llama2-chat-13B,llama2-chat-70B,qwen-chat-1B8,qwen-chat-7B,qwen-chat-14B,bloomz-560M,bloomz-1B7,bloomz-7B1

Cross-lingual Consistency and Transferability Analysis

Perform consistency analysis, outputting the results in Figure 4&7 and Table 1:

python consistency_analysis.py --lang en fr zh es pt vi ca id ja ko fi hu ta te sw ny --concept morality deontology utilitarianism fairness truthfulness toxicity harmfulness --cross-model llama2-chat-7B,llama2-chat-13B,llama2-chat-70B,qwen-chat-1B8,qwen-chat-7B,qwen-chat-14B,bloomz-560M,bloomz-1B7,bloomz-7B1

Perform transferablity analysis, outputting Figure 5&8 in the paper:

python transferability_analysis.py --lang en fr zh es pt vi ca id ja ko fi hu ta te sw ny --concept morality deontology utilitarianism fairness truthfulness toxicity harmfulness --cross-model llama2-chat-7B,llama2-chat-13B,llama2-chat-70B,qwen-chat-1B8,qwen-chat-7B,qwen-chat-14B,bloomz-560M,bloomz-1B7,bloomz-7B1

Cross-Lingual Value Alignment Control

TBC

Citation

@article{DBLP:journals/corr/abs-2402-18120,
  author       = {Shaoyang Xu and
                  Weilong Dong and
                  Zishan Guo and
                  Xinwei Wu and
                  Deyi Xiong},
  title        = {Exploring Multilingual Human Value Concepts in Large Language Models:
                  Is Value Alignment Consistent, Transferable and Controllable across
                  Languages?},
  journal      = {CoRR},
  volume       = {abs/2402.18120},
  year         = {2024},
  url          = {https://doi.org/10.48550/arXiv.2402.18120},
  doi          = {10.48550/ARXIV.2402.18120},
  eprinttype    = {arXiv},
  eprint       = {2402.18120},
  timestamp    = {Tue, 26 Mar 2024 10:51:46 +0100},
  biburl       = {https://dblp.org/rec/journals/corr/abs-2402-18120.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
__pycache__		__pycache__
repe		repe
.gitignore		.gitignore
README.md		README.md
collect_vector.py		collect_vector.py
consistency_analysis.py		consistency_analysis.py
lang.pt		lang.pt
lang2vec_try.ipynb		lang2vec_try.ipynb
lang_sim.pt		lang_sim.pt
recognize_concept.py		recognize_concept.py
transferability_analysis.py		transferability_analysis.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multilingual-Human-Value-Concepts

Preparing

[Optional] Collecting Multilingual Concept Vectors

Recognizing Multilingual Concepts

Cross-lingual Consistency and Transferability Analysis

Cross-Lingual Value Alignment Control

Citation

About

Releases

Packages

Languages

shaoyangxu/Multilingual-Human-Value-Concepts

Folders and files

Latest commit

History

Repository files navigation

Multilingual-Human-Value-Concepts

Preparing

[Optional] Collecting Multilingual Concept Vectors

Recognizing Multilingual Concepts

Cross-lingual Consistency and Transferability Analysis

Cross-Lingual Value Alignment Control

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages