Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs #31629
+226 −0