Skip to content

Commit 476e61b

Browse files
Update README.md
1 parent 8ac395e commit 476e61b

File tree

1 file changed

+8
-3
lines changed

1 file changed

+8
-3
lines changed

README.md

Lines changed: 8 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -45,10 +45,9 @@ All available datasets for Instruction Tuning of Large Language Models
4545
- LLM instruction generation for a diverse set of corpus samples (27,739 instructions and long text pairs)
4646
- LLaVA Visual Instruct 150K: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K
4747
- GPT-generated multimodal instruction-following data
48+
-
4849

49-
### Misc
50-
- OIG: https://huggingface.co/datasets/laion/OIG
51-
- Superset of some of the datasets here
50+
### Preference Datasets (can be used to train the reward model)
5251
- HH-RLHF: https://huggingface.co/datasets/Anthropic/hh-rlhf
5352
- Contains human ratings of harmfulness and helpfulness of model outputs. The dataset contains ~160K human-rated examples, where each example in this dataset consists of a pair of responses from a chatbot, one of which is preferred by humans.
5453
- OpenAI WebGPT: https://huggingface.co/datasets/openai/webgpt_comparisons
@@ -57,7 +56,13 @@ All available datasets for Instruction Tuning of Large Language Models
5756
- Contains ~93K examples, each example consists of feedback from humans regarding the summarizations generated by a model. Human evaluators chose the superior summary from two options.
5857
- Stanford Human Preferences Dataset (SHP): https://huggingface.co/datasets/stanfordnlp/SHP
5958
- 385K collective human preferences over responses to questions/instructions in 18 different subject areas
59+
- Stack Exchange Preferences: https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences
60+
61+
### Misc
62+
- OIG: https://huggingface.co/datasets/laion/OIG
63+
- Superset of some of the datasets here
6064
- oa_leet10k: https://huggingface.co/datasets/ehartford/oa_leet10k
6165
- LeetCode problems solved in multiple programming languages
66+
6267

6368

0 commit comments

Comments
 (0)