tinylora

Here are 2 public repositories matching this topic...

Inspired by the TinyLoRA method from "Learning to Reason in 13 parameters", this project uses extremely parameter-efficient reinforcement learning to fine-tune Qwen2.5-Coder-3B-Instruct for OI (Olympiad in Informatics) problems.

  • Updated Feb 12, 2026
  • Python
