Azure · sharvin2187 · May 29, 2025 · May 28, 2025
@@ -5,7 +5,7 @@
    "id": "c8348fe9",
    "metadata": {},
    "source": [
-    "# <span style=\"font-size:0.8em;\">📝 Agenda</span>\n",
+    "# <span style=\"font-size:0.8em;\">📝 Plan of Action</span>\n",
     "\n",
     "This notebook guides you through the end-to-end process of fine-tuning the **Qwen2.5-7B-Instruct** model into a **reasoning model** using medical data on **Azure ML**. Qwen2.5-7B-Instruct is an instruction-tuned large language model developed by Alibaba Cloud, based on their Qwen2.5-7B foundation model. It is optimized for following human instructions across a wide range of tasks, such as question answering, code generation, and language understanding. In this walkthrough, one will learn how to enhance the model's reasoning capabilities using **Reinforced Fine-Tuning (RFT)** techniques, with a focus on **GRPO (**G**roup **R**elative **P**olicy **O**ptimization)**.\n",
     "\n",