Skip to content

Commit b491aaa

Browse files
Resolve merge conflicts with origin/main
- Keep both DSPy GEPA and GRPO vLLM entries in _toctree.yml - Keep both entries in index.md latest notebooks section
2 parents f3fee96 + 9088f98 commit b491aaa

File tree

3 files changed

+12844
-4
lines changed

3 files changed

+12844
-4
lines changed

notebooks/en/_toctree.yml

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -90,9 +90,8 @@
9090
title: Fine-tuning LLMs for Function Calling with the xLAM Dataset
9191
- local: dspy_gepa
9292
title: Optimizing Language Models with DSPy GEPA
93-
94-
95-
93+
- local: grpo_vllm_online_training
94+
title: Efficient Online Training with GRPO and vLLM in TRL
9695
- title: Computer Vision Recipes
9796
isExpanded: false
9897
sections:

notebooks/en/grpo_vllm_online_training.ipynb

Lines changed: 12841 additions & 0 deletions
Large diffs are not rendered by default.

notebooks/en/index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,11 +8,11 @@ applications and solving various machine learning tasks using open-source tools
88
Check out the recently added notebooks:
99

1010
- [Optimizing Language Models with DSPy GEPA](dspy_gepa)
11+
- [Efficient Online Training with GRPO and vLLM in TRL](grpo_vllm_online_training)
1112
- [Fine-tuning LLMs for Function Calling with the xLAM Dataset](function_calling_fine_tuning_llms_on_xlam)
1213
- [Post training an VLM for reasoning with GRPO using TRL](fine_tuning_vlm_grpo_trl)
1314
- [TRL GRPO Reasoning with Advanced Reward](trl_grpo_reasoning_advanced_reward)
1415
- [Fine-Tuning a Vision Language Model with TRL using MPO](fine_tuning_vlm_mpo)
15-
- [Fine tuning a VLM for Object Detection Grounding using TRL](fine_tuning_vlm_object_detection_grounding)
1616

1717
You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).
1818

0 commit comments

Comments
 (0)