Improve VBot navigation robustness via domain randomization and reward reweighting#4
Merged
Logic-TARS merged 5 commits intomainfrom Feb 11, 2026
Merged
Conversation
…tness Co-authored-by: Logic-TARS <99871356+Logic-TARS@users.noreply.github.com>
Co-authored-by: Logic-TARS <99871356+Logic-TARS@users.noreply.github.com>
Co-authored-by: Logic-TARS <99871356+Logic-TARS@users.noreply.github.com>
Co-authored-by: Logic-TARS <99871356+Logic-TARS@users.noreply.github.com>
Copilot
AI
changed the title
[WIP] Improve VBot robustness with domain randomization
Improve VBot navigation robustness via domain randomization and reward reweighting
Feb 11, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
VBot robot dog exhibits 30-40% success rate on navigation tasks due to instability - small perturbations cause falls. Target: 70-80% success with <32° body tilt tolerance.
Changes
Domain Randomization (
cfg.py)Added
DomainRandomizationdataclass with systematic parameter variation:Reward Function Reweighting (
cfg.py)Increased stability penalties to prioritize posture control:
orientation: -0.05 → -0.20 (4x increase)lin_vel_z: -0.30 (suppress vertical oscillation)ang_vel_xy: -0.15 (penalize roll/pitch rates)contact_stability: 0.1 (reward 2+ feet on ground)action_smoothness: -0.01 (penalize jerky motions)Implementation (
vbot_section001_np.py)reset(): Apply randomization to 12 actuated DOFs
_compute_reward(): Integrate new stability terms with configurable weights
Testing & Validation
verify_implementation.sh: Grep-based code verification (no runtime dependencies)test_robustness.py: Success rate and stability angle measurement across trialsvalidate_config.py: Configuration parameter validationExpected Impact
Notes
hasattr()checksOriginal prompt
问题描述
VBot 机器狗模型虽然已收敛并具备行走能力,但在测试集上的鲁棒性不足:
优化目标
通过引入域随机化(Domain Randomization)和调整姿态惩罚权重来提升鲁棒性:
修复内容
1️⃣ 配置层优化 (cfg.py)
1.1 添加域随机化配置
1.2 调整奖励函数权重(姿态稳定性提升)
2️⃣ 环境实现层 (vbot_section001_np.py)
2.1 在 reset() 中实现域随机化
2.2 添加新奖励项计算
2.3 步态恢复机制(可选高级功能)