TencentCloudADP · Lightblues · Jan 4, 2026 · Jan 4, 2026 · Jan 4, 2026 · Jan 4, 2026
diff --git a/README.md b/README.md
@@ -1,11 +1,10 @@
-# <img src="docs/assets/logo.svg" alt="Youtu-agent Logo" height="24px"> Youtu-Agent: A simple yet powerful agent framework that delivers with open-source models
+# <img src="docs/assets/logo.svg" alt="Youtu-agent Logo" height="24px"> Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
 
 <div align="center">
 <a href="https://tencentcloudadp.github.io/youtu-agent/"><img src=https://img.shields.io/badge/📖-Documentation-blue.svg></a>
-<!-- <a href=https://arxiv.org/abs/2502.14345><img src=https://img.shields.io/badge/arXiv-2502.14345-b31b1b.svg></a> -->
 <a href=https://github.com/TencentCloudADP/youtu-agent><img src=https://img.shields.io/badge/GitHub-Tencent-blue.svg></a>
 <a href=https://deepwiki.com/TencentCloudADP/youtu-agent><img src=https://img.shields.io/badge/DeepWiki-Tencent-blue.svg></a>
-<a href=https://arxiv.org/abs/2510.08191><img src=https://img.shields.io/badge/arXiv-2510.08191-b31b1b.svg></a>
+<a href=https://arxiv.org/abs/2512.24615><img src=https://img.shields.io/badge/arXiv-2512.24615-b31b1b.svg></a>
 </div>
 
 <p align="center">
@@ -25,16 +24,17 @@
 <img src="docs/assets/mascot.png" alt="Youtu-agent Logo" width="200" align="left" style="margin-right:20px;">
 
 Key highlights:
-- **Verified performance**: Achieved 71.47% on WebWalkerQA (pass@1) and 72.8% on GAIA (text-only subset, pass@1), using purely `DeepSeek-V3` series models (without Claude or GPT), establishing a strong open-source starting point.
-- **Automatic Tool and Agent Generation**: Built on modular design and structured configuration system, Youtu-Agent supports automatic generation of tools and agents through LLMs, enabling rapid adaptation to new application scenarios and reducing manual configuration overhead.
-- **Automatic Agent Experience Learning**: Through [Training-Free GRPO](https://arxiv.org/abs/2510.08191), agents can continuously optimize their capabilities using historical experiences, delivering significant performance improvements with minimal cost.
-- **Agent Training**: Youtu-Agent also supports parameter fine-tuning of agent models through end-to-end RL training to enhance model capabilities.
+- **Verified performance**: Achieves state-of-the-art performance on WebWalkerQA (71.47%) and GAIA (72.8%) using purely open-weight models (e.g., `DeepSeek-V3`), establishing a strong open-source baseline.
+- **Automated Agent Generation**: Introduces two paradigms: a **Workflow** mode for standard tasks and a **Meta-Agent** mode for complex requirements. The framework supports automated generation of tool code, prompts, and configurations, achieving over 81% tool synthesis success rate.
+- **Continuous Experience Learning**: The **Agent Practice** module enables low-cost continuous evolution via [Training-Free GRPO](https://arxiv.org/abs/2510.08191). Agents accumulate experience and improve performance (e.g., +5.4% on AIME 2025) through in-context optimization without parameter updates.
+- **Scalable and Stable Agent RL**: The **Agent RL** module provides a complete pipeline for end-to-end reinforcement learning. By integrating with distributed frameworks, it addresses stability and scalability challenges, achieving 40% training speedup and scaling to 128 GPUs.
 - **Open-source friendly & cost-aware**: Optimized for accessible, low-cost deployment without reliance on closed models.
-- **Practical use cases**: Out-of-the-box support for tasks like CSV analysis, literature review, personal file organization, and podcast and video generation (coming soon).
-- **Flexible architecture**: Built on [openai-agents](https://github.com/openai/openai-agents-python), with extensible support for diverse model APIs (form `DeepSeek` to `gpt-oss`), tool integrations, and framework implementations.
+- **Practical use cases**: Out-of-the-box support for tasks like data analysis, literature review, personal file organization, retrieval-augmented generation, and PPT generation.
+- **Flexible architecture**: Built on [openai-agents](https://github.com/openai/openai-agents-python), with extensible support for diverse model APIs (from `DeepSeek` to `gpt-oss`), tool integrations, and framework implementations.
 
 ## 🗞️ News
 
+- 🚀 [2026-01-04] **Youtu Tip & Youtu-LLM Released!** We are excited to introduce [**Youtu Tip**](https://github.com/TencentCloudADP/youtu-tip), a macOS desktop AI assistant powered by offline models (via Ollama) for automating tasks like file reading and web browsing. We also released **Youtu-LLM** (1.96B params, 128K context), specialized for STEM, coding, and agentic tasks. [Official Site](https://youtu-tip.com/); [GitHub](https://github.com/TencentCloudADP/youtu-tip).
 - 🚀 [2025-12-10] **Youtu-Agent x Agent-Lightning training integration available!** We've collaborated with the [Agent-Lightning](https://github.com/microsoft/agent-lightning/) team to implement efficient model training in verious scenarios. With ours efforts, training can now seamlessly scale to multi-node deployment on 128 GPUs. See details in the [rl/agl branch](https://github.com/TencentCloudADP/youtu-agent/tree/rl/agl).
 - 🎉 [2025-11-12] **Training-Free GRPO now available in main branch!** The agent practice module powered by [Training-Free Group Relative Policy Optimization](https://arxiv.org/abs/2510.08191) is now integrated into the main branch. Enhance your agents' performance without fine-tuning at minimal cost (~$8 for RL runs). See our [Agent Practice Documentation](https://tencentcloudadp.github.io/youtu-agent/practice/) for usage and examples on math reasoning and web search tasks.
 - 📢 [2025-11-03] New examples: we add the [**PPT generation**](examples/ppt_gen/README.md) and [**RAG**](configs/agents/examples/rag.yaml) examples.
@@ -43,9 +43,11 @@ Key highlights:
 
 <details>
 <summary><b>📰 Previous announcements</b></summary>
+
 - 📺 [2025-09-09] We hosted a live sharing the design philosophy and basic usage of `Youtu-Agent`. [[video](https://www.bilibili.com/video/BV1mypqz4EvS)] [[documentation](https://doc.weixin.qq.com/doc/w3_AcMATAZtAPICNLgt3CbnxRWaYWnW4)].
 - 🎁 [2025-09-02] [Tencent Cloud International](https://www.tencentcloud.com/) offers new users of the DeepSeek API **3 million free tokens** (**Sep 1 – Oct 31, 2025**). [Try it out](https://www.tencentcloud.com/document/product/1255/70381) for free if you want to use DeepSeek models in `Youtu-Agent`! For enterprise agent solutions, also check out [Agent Development Platform](https://adp.tencentcloud.com) (ADP).
 - 📺 [2025-08-28] We hosted a live sharing updates about DeepSeek-V3.1 and how to use it in the `Youtu-Agent` framework. [[video](https://www.bilibili.com/video/BV1XwayzrETi/)] [[documentation](https://doc.weixin.qq.com/doc/w3_AcMATAZtAPICNvcLaY5FvTOuo7MwF)].
+
 </details>
 
 ## 🌟 Benchmark Performance
@@ -401,6 +403,15 @@ We welcome contributions from the community! If you'd like to help improve Youtu
 If you find this work useful, please consider citing:
 
 ```bibtex
+@misc{youtu_agent,
+      title={Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization}, 
+      author={Yuchen Shi and Yuzheng Cai and Siqi Cai and Zihan Xu and Lichao Chen and Yulei Qin and Zhijian Zhou and Xiang Fei and Chaofan Qiu and Xiaoyu Tan and Gang Li and Zongyi Li and Haojia Lin and Guocan Cai and Yong Mao and Yunsheng Wu and Ke Li and Xing Sun},
+      year={2025},
+      eprint={2512.24615},
+      archivePrefix={arXiv},
+      primaryClass={cs.AI},
+      url={https://arxiv.org/abs/2512.24615}, 
+}
 @misc{training_free_grpo,
       title={Training-Free Group Relative Policy Optimization}, 
       author={Tencent Youtu Lab},
@@ -410,13 +421,4 @@ If you find this work useful, please consider citing:
       primaryClass={cs.CL},
       url={https://arxiv.org/abs/2510.08191}, 
 }
-
-@misc{youtu-agent-2025,
-  title={Youtu-agent: A Simple yet Powerful Agent Framework},
-  author={Tencent Youtu Lab},
-  year={2025},
-  publisher = {GitHub},
-  journal = {GitHub repository},
-  howpublished = {\url{https://github.com/TencentCloudADP/youtu-agent}},
-}
 ```
diff --git a/README_JA.md b/README_JA.md
@@ -2,10 +2,9 @@
 
 <div align="center">
 <a href="https://tencentcloudadp.github.io/youtu-agent/"><img src=https://img.shields.io/badge/📖-文档-blue.svg></a>
-<!-- <a href=https://arxiv.org/abs/2502.14345><img src=https://img.shields.io/badge/arXiv-2502.14345-b31b1b.svg></a> -->
 <a href=https://github.com/TencentCloudADP/youtu-agent><img src=https://img.shields.io/badge/GitHub-腾讯-blue.svg></a>
 <a href=https://deepwiki.com/TencentCloudADP/youtu-agent><img src=https://img.shields.io/badge/DeepWiki-Tencent-blue.svg></a>
-<a href=https://arxiv.org/abs/2510.08191><img src=https://img.shields.io/badge/arXiv-2510.08191-b31b1b.svg></a>
+<a href=https://arxiv.org/abs/2512.24615><img src=https://img.shields.io/badge/arXiv-2512.24615-b31b1b.svg></a>
 </div>
 
 <p align="center">
@@ -23,16 +22,18 @@
 <img src="docs/assets/mascot.png" alt="Youtu-agent Logo" width="200" align="left" style="margin-right:20px;">
 
 主なハイライト：
-- **パフォーマンスの検証**：WebWalkerQAではpass@1で71.47%、GAIA（純テキストサブセット）ではpass@1で72.8%を達成しました。これは`DeepSeek-V3`シリーズのモデルのみを使用しており（ClaudeやGPTは使用していません）、強力なオープンソースの出発点を築きました。
-- **自動ツール＆エージェント生成**：モジュール設計と構造化されたコンフィグシステムに基づいて、Youtu-AgentはLLMを通じたツールとエージェントの自動生成をサポートしており、新しいアプリケーションシナリオへの迅速な対応とマニュアル設定のオーバーヘッド削減を実現します。
-- **自動エージェント経験学習**：[Training-Free GRPO](https://arxiv.org/abs/2510.08191)を通じて、エージェントは過去の経験を活用してエージェント機能を継続的に最適化し、最小限のコストで大幅なパフォーマンス向上を実現します。詳しくは[エージェント学習ドキュメント](https://tencentcloudadp.github.io/youtu-agent/practice/)をご覧ください。
-- **エージェントトレーニング**：Youtu-Agentはエージェントモデルのパラメータファインチューニングもサポートし、エンドツーエンドのRLトレーニングを通じてモデル能力を向上させます。
-- **オープンソースに優しく、コストに敏感**：アクセスしやすく、低コストでのデプロイを最適化しており、クローズドなモデルに依存しません。
-- **実際の使用例**：CSV分析、文献レビュー、個人ファイルの整理、ポッドキャストやビデオの生成などのタスクを箱から出してすぐにサポートします。（近日公開予定）
-- **柔軟なアーキテクチャ**：[openai-agents](https://github.com/openai/openai-agents-python)に基づいて構築されており、`DeepSeek`から`gpt-oss`までのさまざまなモデルAPI、ツールの統合、フレームワークの実装と互換性があります。
+- **検証済みのパフォーマンス**: オープンウェイトモデル（`DeepSeek-V3`など）のみを使用して、WebWalkerQA（71.47%）およびGAIA（72.8%）で最先端のパフォーマンスを達成し、強力なオープンソースのベースラインを確立しました。
+- **自動エージェント生成**: 標準的なタスク向けの **Workflow** モードと、複雑な要件向けの **Meta-Agent** モードの2つのパラダイムを導入しました。ツールコード、プロンプト、設定の自動生成をサポートし、ツー合成の成功率は81%を超えています。
+- **継続的な経験学習**: **Agent Practice** モジュールは [Training-Free GRPO](https://arxiv.org/abs/2510.08191) を通じて低コストでの継続的な進化を実現します。エージェントはパラメータ更新なしにインコンテキスト最適化を通じて経験を蓄積し、パフォーマンスを向上させます（例：AIME 2025で+5.4%）。
+- **スケーラブルで安定したエージェントRL**: **Agent RL** モジュールは、エンドツーエンドの強化学習のための完全なパイプラインを提供します。分散フレームワークとの統合により、安定性とスケーラビリティの課題に対処し、トレーニング速度を40%向上させ、128 GPUまでスケーリングしました。
+- **オープンソースフレンドリーでコスト意識が高い**: クローズドモデルに依存せず、アクセスしやすく低コストでデプロイできるように最適化されています。
+- **実用的なユースケース**: データ分析、文献レビュー、個人ファイル整理、検索拡張生成（RAG）、PPT生成などのタスクをすぐにサポートします。
+- **柔軟なアーキテクチャ**: [openai-agents](https://github.com/openai/openai-agents-python)に基づいて構築されており、`DeepSeek`から`gpt-oss`までの多様なモデルAPI、ツール統合、フレームワーク実装への拡張可能なサポートを提供します。
 
 ## 🗞️ ニュース
 
+- 🚀 [2026-01-04] **Youtu Tip & Youtu-LLM リリース！** 私達は、オフラインモデル（Ollama経由）を使用してファイル読み取りやWebブラウジングなどのタスクを自動化するmacOSデスクトップAIアシスタント、[**Youtu Tip**](https://github.com/TencentCloudADP/youtu-tip) を紹介できることを嬉しく思います。また、STEM、コーディング、エージェントタスク向けに特化した **Youtu-LLM**（1.96Bパラメータ、128Kコンテキスト）もリリースしました。[公式サイト](https://youtu-tip.com/); [GitHub](https://github.com/TencentCloudADP/youtu-tip)。
+
 - 🚀 [2025-12-10] **Youtu-Agent x Agent-Lightning トレーニング統合が利用可能！** [Agent-Lightning](https://github.com/microsoft/agent-lightning/) チームと協力して、コード/数学と検索環境での効率的なモデルトレーニングを実装しました（ReTool、SearchR1）。Youtu-Agentの最適化により、分散トレーニングは128 GPUのマルチノードデプロイにシームレスにスケールできるようになりました。詳細は [rl/agl ブランチ](https://github.com/TencentCloudADP/youtu-agent/tree/rl/agl) をご覧ください。
 - 🎉 [2025-11-12] **Training-Free GRPO がメインブランチで利用可能に！** [Training-Free Group Relative Policy Optimization](https://arxiv.org/abs/2510.08191) によるエージェント学習モジュールがメインブランチに統合されました。ファインチューニング不要で、極めて低コスト（約8ドル）でエージェントの性能を向上させることができます。数学推論とウェブ検索タスクの使用方法と例については、[エージェント学習ドキュメント](https://tencentcloudadp.github.io/youtu-agent/practice/)をご覧ください。
 - 📢 [2025-11-03] 新しい例を追加しました：[**PPT生成**](examples/ppt_gen/README.md)と[**RAG**](configs/agents/examples/rag.yaml)の例を追加しました。
@@ -41,9 +42,11 @@
 
 <details>
 <summary><b>📰 過去のお知らせ</b></summary>
+
 - 📺 [2025-09-09] 「Youtu-Agent」の設計思想と基本的な使い方についてライブ配信を行いました。[[動画](https://www.bilibili.com/video/BV1mypqz4EvS)] [[資料](https://doc.weixin.qq.com/doc/w3_AcMATAZtAPICNLgt3CbnxRWaYWnW4)]。
 - 🎁 [2025-09-02] [テンセントクラウド国際サイト](https://www.tencentcloud.com/)では、DeepSeek APIの新規ユーザーに対して**300万枚の無料トークン**を提供しています（**2025年9月1日から2025年10月31日まで**）。`Youtu-Agent`でDeepSeekモデルを使用したい場合は、[無料トライアルをクリック](https://www.tencentcloud.com/document/product/1255/70381)してください！企業向けのエージェントソリューションについては、[エージェント開発プラットフォームADP](https://adp.tencentcloud.com)もご覧ください。
 - 📺 [2025-08-28] 新しくリリースされたDeepSeek-V3.1モデルについてライブ配信を行い、`Youtu-Agent`フレームワークでの使用方法を紹介しました。[[動画](https://www.bilibili.com/video/BV1XwayzrETi/)] [[資料](https://doc.weixin.qq.com/doc/w3_AcMATAZtAPICNvcLaY5FvTOuo7MwF)]。
+
 </details>
 
 ## 🌟 ベンチマークパフォーマンス
@@ -319,6 +322,15 @@ python scripts/run_eval.py --config_name ww --exp_id <your_exp_id> --dataset Web
 この研究が役立つと思われる場合は、引用を検討してください：
 
 ```bibtex
+@misc{youtu_agent,
+      title={Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization}, 
+      author={Yuchen Shi and Yuzheng Cai and Siqi Cai and Zihan Xu and Lichao Chen and Yulei Qin and Zhijian Zhou and Xiang Fei and Chaofan Qiu and Xiaoyu Tan and Gang Li and Zongyi Li and Haojia Lin and Guocan Cai and Yong Mao and Yunsheng Wu and Ke Li and Xing Sun},
+      year={2025},
+      eprint={2512.24615},
+      archivePrefix={arXiv},
+      primaryClass={cs.AI},
+      url={https://arxiv.org/abs/2512.24615}, 
+}
 @misc{training_free_grpo,
   title={Training-Free Group Relative Policy Optimization},
   author={Tencent Youtu Lab},
@@ -328,13 +340,4 @@ python scripts/run_eval.py --config_name ww --exp_id <your_exp_id> --dataset Web
   primaryClass={cs.CL},
   url={https://arxiv.org/abs/2510.08191},
 }
-
-@misc{youtu-agent-2025,
-  title={Youtu-agent: A Simple yet Powerful Agent Framework},
-  author={Tencent Youtu Lab},
-  year={2025},
-  publisher = {GitHub},
-  journal = {GitHub repository},
-  howpublished = {\url{https://github.com/TencentCloudADP/youtu-agent}},
-}
 ```