Code for "Leveraging Large Language Models for Predictive Analysis of Human Misery"

Dataset - https://huggingface.co/datasets/bansalaman18/misery-index

misery_index_game_show.py - code for running the LLM as a game show contestant
utils.py - functions for generating responsed from LLM functions
eval.py - evaluating the responses
Misery_Data.csv - data containing miserable situations, their annotated misery index values, contestant responses, and other metadata
- Run python misery_index_game_show.py <SEED> <MODEL_NAME>
  - <SEED> - 12, 123, 1234
  - <MODEL_NAME> - gpt-3.5-turbo, gpt-4, gpt-4-turbo, gpt-4o, gpt-4o-mini, o1, o1-mini, gemini-1.5-pro

Progress

Game-show setting

Column headers are seeds.

	12	123	1234
gpt-3.5-turbo	✅	✅	✅
gpt-4	✅	❌	❌
gpt-4-turbo	✅	✅	✅
gpt-4o-mini	✅	✅	✅
gpt-4o	✅	✅	✅
o1-preview	❌	❌	❌
o1-mini	❌	❌	❌
gemini-1.5-pro	❌	❌	❌

Game-show setting with chain-of-thought
Directly predicting the misery index
Binary Comparisons between situations

Links used for gathering data

https://bobbymgsk.wordpress.com/category/the-misery-index/
https://jericho.blog/2021/02/03/the-misery-index-data/ - contains most of the data in a spreadsheet - https://docs.google.com/spreadsheets/d/151WjFwDdhIURf48subj6SDOdra0XVIEo0xulnBMMfRo/edit#gid=1169151367

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Code for "Leveraging Large Language Models for Predictive Analysis of Human Misery"

Dataset - https://huggingface.co/datasets/bansalaman18/misery-index

Progress

Links used for gathering data

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
__pycache__		__pycache__
full_responses_MODEL_gpt-3.5-turbo_SEED_12		full_responses_MODEL_gpt-3.5-turbo_SEED_12
full_responses_MODEL_gpt-3.5-turbo_SEED_123		full_responses_MODEL_gpt-3.5-turbo_SEED_123
full_responses_MODEL_gpt-3.5-turbo_SEED_1234		full_responses_MODEL_gpt-3.5-turbo_SEED_1234
full_responses_MODEL_gpt-4-turbo_SEED_12		full_responses_MODEL_gpt-4-turbo_SEED_12
full_responses_MODEL_gpt-4-turbo_SEED_123		full_responses_MODEL_gpt-4-turbo_SEED_123
full_responses_MODEL_gpt-4-turbo_SEED_1234		full_responses_MODEL_gpt-4-turbo_SEED_1234
full_responses_MODEL_gpt-4_SEED_12		full_responses_MODEL_gpt-4_SEED_12
full_responses_MODEL_gpt-4o-mini_SEED_12		full_responses_MODEL_gpt-4o-mini_SEED_12
full_responses_MODEL_gpt-4o-mini_SEED_123		full_responses_MODEL_gpt-4o-mini_SEED_123
full_responses_MODEL_gpt-4o-mini_SEED_1234		full_responses_MODEL_gpt-4o-mini_SEED_1234
full_responses_MODEL_gpt-4o_SEED_12		full_responses_MODEL_gpt-4o_SEED_12
full_responses_MODEL_gpt-4o_SEED_123		full_responses_MODEL_gpt-4o_SEED_123
full_responses_MODEL_gpt-4o_SEED_1234		full_responses_MODEL_gpt-4o_SEED_1234
.gitattributes		.gitattributes
Misery_Data.csv		Misery_Data.csv
README.md		README.md
eval.py		eval.py
full_responses_MODEL_gpt-3.5-turbo_SEED_1234_IS_CORRECT.json		full_responses_MODEL_gpt-3.5-turbo_SEED_1234_IS_CORRECT.json
full_responses_MODEL_gpt-3.5-turbo_SEED_1234_PRED_ANSWERS.json		full_responses_MODEL_gpt-3.5-turbo_SEED_1234_PRED_ANSWERS.json
full_responses_MODEL_gpt-3.5-turbo_SEED_1234_RESPONSES.json		full_responses_MODEL_gpt-3.5-turbo_SEED_1234_RESPONSES.json
full_responses_MODEL_gpt-3.5-turbo_SEED_123_IS_CORRECT.json		full_responses_MODEL_gpt-3.5-turbo_SEED_123_IS_CORRECT.json
full_responses_MODEL_gpt-3.5-turbo_SEED_123_PRED_ANSWERS.json		full_responses_MODEL_gpt-3.5-turbo_SEED_123_PRED_ANSWERS.json
full_responses_MODEL_gpt-3.5-turbo_SEED_123_RESPONSES.json		full_responses_MODEL_gpt-3.5-turbo_SEED_123_RESPONSES.json
full_responses_MODEL_gpt-3.5-turbo_SEED_12_IS_CORRECT.json		full_responses_MODEL_gpt-3.5-turbo_SEED_12_IS_CORRECT.json
full_responses_MODEL_gpt-3.5-turbo_SEED_12_PRED_ANSWERS.json		full_responses_MODEL_gpt-3.5-turbo_SEED_12_PRED_ANSWERS.json
full_responses_MODEL_gpt-3.5-turbo_SEED_12_RESPONSES.json		full_responses_MODEL_gpt-3.5-turbo_SEED_12_RESPONSES.json
full_responses_MODEL_gpt-4-turbo_SEED_1234_IS_CORRECT.json		full_responses_MODEL_gpt-4-turbo_SEED_1234_IS_CORRECT.json
full_responses_MODEL_gpt-4-turbo_SEED_1234_PRED_ANSWERS.json		full_responses_MODEL_gpt-4-turbo_SEED_1234_PRED_ANSWERS.json
full_responses_MODEL_gpt-4-turbo_SEED_1234_RESPONSES.json		full_responses_MODEL_gpt-4-turbo_SEED_1234_RESPONSES.json
full_responses_MODEL_gpt-4-turbo_SEED_123_IS_CORRECT.json		full_responses_MODEL_gpt-4-turbo_SEED_123_IS_CORRECT.json
full_responses_MODEL_gpt-4-turbo_SEED_123_PRED_ANSWERS.json		full_responses_MODEL_gpt-4-turbo_SEED_123_PRED_ANSWERS.json
full_responses_MODEL_gpt-4-turbo_SEED_123_RESPONSES.json		full_responses_MODEL_gpt-4-turbo_SEED_123_RESPONSES.json
full_responses_MODEL_gpt-4-turbo_SEED_12_IS_CORRECT.json		full_responses_MODEL_gpt-4-turbo_SEED_12_IS_CORRECT.json
full_responses_MODEL_gpt-4-turbo_SEED_12_PRED_ANSWERS.json		full_responses_MODEL_gpt-4-turbo_SEED_12_PRED_ANSWERS.json
full_responses_MODEL_gpt-4-turbo_SEED_12_RESPONSES.json		full_responses_MODEL_gpt-4-turbo_SEED_12_RESPONSES.json
full_responses_MODEL_gpt-4_SEED_12_IS_CORRECT.json		full_responses_MODEL_gpt-4_SEED_12_IS_CORRECT.json
full_responses_MODEL_gpt-4_SEED_12_PRED_ANSWERS.json		full_responses_MODEL_gpt-4_SEED_12_PRED_ANSWERS.json
full_responses_MODEL_gpt-4_SEED_12_RESPONSES.json		full_responses_MODEL_gpt-4_SEED_12_RESPONSES.json
full_responses_MODEL_gpt-4o-mini_SEED_1234_IS_CORRECT.json		full_responses_MODEL_gpt-4o-mini_SEED_1234_IS_CORRECT.json
full_responses_MODEL_gpt-4o-mini_SEED_1234_PRED_ANSWERS.json		full_responses_MODEL_gpt-4o-mini_SEED_1234_PRED_ANSWERS.json
full_responses_MODEL_gpt-4o-mini_SEED_1234_RESPONSES.json		full_responses_MODEL_gpt-4o-mini_SEED_1234_RESPONSES.json
full_responses_MODEL_gpt-4o-mini_SEED_123_IS_CORRECT.json		full_responses_MODEL_gpt-4o-mini_SEED_123_IS_CORRECT.json
full_responses_MODEL_gpt-4o-mini_SEED_123_PRED_ANSWERS.json		full_responses_MODEL_gpt-4o-mini_SEED_123_PRED_ANSWERS.json
full_responses_MODEL_gpt-4o-mini_SEED_123_RESPONSES.json		full_responses_MODEL_gpt-4o-mini_SEED_123_RESPONSES.json
full_responses_MODEL_gpt-4o-mini_SEED_12_IS_CORRECT.json		full_responses_MODEL_gpt-4o-mini_SEED_12_IS_CORRECT.json
full_responses_MODEL_gpt-4o-mini_SEED_12_PRED_ANSWERS.json		full_responses_MODEL_gpt-4o-mini_SEED_12_PRED_ANSWERS.json
full_responses_MODEL_gpt-4o-mini_SEED_12_RESPONSES.json		full_responses_MODEL_gpt-4o-mini_SEED_12_RESPONSES.json
full_responses_MODEL_gpt-4o_SEED_1234_IS_CORRECT.json		full_responses_MODEL_gpt-4o_SEED_1234_IS_CORRECT.json
full_responses_MODEL_gpt-4o_SEED_1234_PRED_ANSWERS.json		full_responses_MODEL_gpt-4o_SEED_1234_PRED_ANSWERS.json
full_responses_MODEL_gpt-4o_SEED_1234_RESPONSES.json		full_responses_MODEL_gpt-4o_SEED_1234_RESPONSES.json
full_responses_MODEL_gpt-4o_SEED_123_IS_CORRECT.json		full_responses_MODEL_gpt-4o_SEED_123_IS_CORRECT.json
full_responses_MODEL_gpt-4o_SEED_123_PRED_ANSWERS.json		full_responses_MODEL_gpt-4o_SEED_123_PRED_ANSWERS.json
full_responses_MODEL_gpt-4o_SEED_123_RESPONSES.json		full_responses_MODEL_gpt-4o_SEED_123_RESPONSES.json
full_responses_MODEL_gpt-4o_SEED_12_IS_CORRECT.json		full_responses_MODEL_gpt-4o_SEED_12_IS_CORRECT.json
full_responses_MODEL_gpt-4o_SEED_12_PRED_ANSWERS.json		full_responses_MODEL_gpt-4o_SEED_12_PRED_ANSWERS.json
full_responses_MODEL_gpt-4o_SEED_12_RESPONSES.json		full_responses_MODEL_gpt-4o_SEED_12_RESPONSES.json
misery_index_example_full_conv.txt		misery_index_example_full_conv.txt
misery_index_game_show.py		misery_index_game_show.py
results.csv		results.csv
utils.py		utils.py

abhi1nandy2/Misery_Data_Exps_GitHub

Folders and files

Latest commit

History

Repository files navigation

Code for "Leveraging Large Language Models for Predictive Analysis of Human Misery"

Dataset - https://huggingface.co/datasets/bansalaman18/misery-index

Progress

Links used for gathering data

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages