From 641f825513cbfd3d26b0ccb5716a3e37fdb0f368 Mon Sep 17 00:00:00 2001
From: burtenshaw
Date: Mon, 16 Dec 2024 09:59:53 +0100
Subject: [PATCH] improve prose

---
 docs/source/index.mdx | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/source/index.mdx b/docs/source/index.mdx
index ad02efcf94..217506448a 100644
--- a/docs/source/index.mdx
+++ b/docs/source/index.mdx
@@ -7,9 +7,9 @@
 TRL is a full stack library where we provide a set of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step (SFT), Reward Modeling step (RM) to the Proximal Policy Optimization (PPO) step.
 The library is integrated with 🤗 [transformers](https://github.com/huggingface/transformers).
 
-## Learn post-training
+## Learn
 
-Learn post-training with the 🤗 [smol course](https://github.com/huggingface/smol-course).
+Learn post-training with TRL and other libraries in 🤗 [smol course](https://github.com/huggingface/smol-course).
 
 ## API documentation
 