[Newton] Add Warp based inhand_manipulation env #4413

hujc7 · 2026-01-17T09:10:08Z

Description

Add warp env for inhand_manipulation

Fixes # (issue)

Type of change

New feature (non-breaking change which adds functionality)

Screenshots

Please attach before and after screenshots of the change if applicable.

Checklist

I have read and understood the contribution guidelines
I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

1. con success 10.67 2. used the same MJWarpSolver as torch env

hujc7 · 2026-01-20T18:03:53Z

Attaching performance data

Performance summary (torch → warp)

Metric	Torch	Warp	Δ% (torch→warp)
Action processing mean (us, N=9600)	1456.55	36.98	-97.46%
Newton simulation mean (us, N=9600)	17591.24	19028.51	+8.17%
Post-processing mean (us, N=4800)	6522.74	171.14	-97.38%
Total step mean (us, N=4800)	45333.22	39187.63	-13.56%
Throughput (steps/s)	62826	68045	+8.31%
Iteration time (s)	1.04	0.96	-7.69%
Collection time (s)	0.801	0.699	-12.73%
Learning time (s)	0.242	0.264	+9.09%

Δ% is computed as ((\text{warp} - \text{torch}) / \text{torch} \times 100%), so negative time Δ% = less time (better).

allegro_warp.txt
allegro_torch.txt

greptile-apps · 2026-01-20T18:11:12Z

Greptile Overview

Greptile Summary

This PR implements a Warp-accelerated in-hand manipulation environment for the Allegro Hand robot, enabling high-performance parallel simulation across 8192 environments using GPU kernels.

Key changes:

New InHandManipulationWarpEnv class with 20+ Warp kernels for parallel computation of observations, rewards, resets, and physics
Complete environment configuration (AllegroHandWarpEnvCfg) with Newton solver settings optimized for contact-rich manipulation
Gymnasium registration for Isaac-Repose-Cube-Allegro-Direct-Warp-v0 environment
Three RL framework configurations (RSL-RL, RL Games, SKRL) with matching hyperparameters for multi-framework compatibility
Task involves reorienting a cube to match a target pose using 16-DOF Allegro hand with reward shaping for position, orientation, and action penalty

Architecture:

Inherits from DirectRLEnvWarp base class
Uses Warp kernels for GPU-accelerated computations (actions, observations, rewards, resets)
Implements both "full" (124-dim) and "openai" observation modes
Includes non-finite value sanitization to prevent NaN propagation

Minor issues found:

Unused z_unit_vecs initialized but never referenced
Comment clarity could be improved in rotation_distance function

Confidence Score: 4/5

This PR is safe to merge with minimal risk, pending validation that tests pass.
The implementation follows established patterns from the IsaacLab codebase and appears well-structured. The Warp kernels are logically sound with proper input/output separation. Minor style issues (unused variable, comment clarity) don't affect functionality. The main uncertainty is lack of test coverage mentioned in the checklist, and the PR author hasn't verified that changes generate no warnings or that tests pass.
Verify inhand_manipulation_warp_env.py works correctly with both observation modes and handles edge cases in quaternion operations.

Important Files Changed

Filename	Overview
source/isaaclab_tasks_experimental/isaaclab_tasks_experimental/direct/inhand_manipulation/inhand_manipulation_warp_env.py	Adds complete Warp-accelerated in-hand manipulation environment with 20+ GPU kernels for parallel simulation across 8192 environments.
source/isaaclab_tasks_experimental/isaaclab_tasks_experimental/direct/allegro_hand/allegro_hand_warp_env_cfg.py	Configuration file defining Allegro Hand environment parameters, solver settings, and reward scales.
source/isaaclab_tasks_experimental/isaaclab_tasks_experimental/direct/allegro_hand/init.py	Registers the Isaac-Repose-Cube-Allegro-Direct-Warp-v0 environment with Gymnasium and configures RL agent entry points.

Sequence Diagram

sequenceDiagram
    participant User
    participant Gym
    participant InHandManipulationWarpEnv
    participant DirectRLEnvWarp
    participant WarpKernels
    participant Hand as Articulation (Hand)
    participant Object as Articulation (Object)
    
    User->>Gym: register environment
    Gym->>InHandManipulationWarpEnv: create with AllegroHandWarpEnvCfg
    InHandManipulationWarpEnv->>DirectRLEnvWarp: __init__(cfg)
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _setup_scene()
    InHandManipulationWarpEnv->>Hand: create Articulation(robot_cfg)
    InHandManipulationWarpEnv->>Object: create Articulation(object_cfg)
    InHandManipulationWarpEnv->>WarpKernels: initialize_rng_state()
    InHandManipulationWarpEnv->>WarpKernels: initialize_goal_constants()
    InHandManipulationWarpEnv->>WarpKernels: initialize_xyz_unit_vecs()
    
    User->>InHandManipulationWarpEnv: step(actions)
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _pre_physics_step(actions)
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _apply_action()
    InHandManipulationWarpEnv->>WarpKernels: apply_actions_to_targets()
    InHandManipulationWarpEnv->>Hand: set_joint_position_target()
    InHandManipulationWarpEnv->>DirectRLEnvWarp: simulate physics
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _get_dones()
    InHandManipulationWarpEnv->>WarpKernels: compute_intermediate_values()
    InHandManipulationWarpEnv->>WarpKernels: get_dones()
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _get_observations()
    InHandManipulationWarpEnv->>WarpKernels: compute_full_observations()
    InHandManipulationWarpEnv->>WarpKernels: sanitize_and_print_once()
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _get_rewards()
    InHandManipulationWarpEnv->>WarpKernels: compute_rewards()
    InHandManipulationWarpEnv->>WarpKernels: update_consecutive_successes_from_stats()
    InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _reset_target_pose()
    InHandManipulationWarpEnv->>WarpKernels: reset_target_pose()
    
    alt Reset Required
        InHandManipulationWarpEnv->>InHandManipulationWarpEnv: _reset_idx(mask)
        InHandManipulationWarpEnv->>WarpKernels: reset_object()
        InHandManipulationWarpEnv->>Object: update root_link_pose_w
        InHandManipulationWarpEnv->>WarpKernels: reset_hand()
        InHandManipulationWarpEnv->>Hand: update joint_pos/joint_vel
        InHandManipulationWarpEnv->>WarpKernels: reset_successes()
    end
    
    InHandManipulationWarpEnv-->>User: obs, reward, done, info

greptile-apps

_{2 files reviewed, 2 comments}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-01-20T18:11:17Z