@InternRobotics

Intern Robotics

Building inclusive infrastructure for Embodied AI, from Shanghai AI Lab.

Toolchain - Training, Inference and Evaluation

  • InternUtopia: A simulation platform for versatile Embodied AI research and development.
  • InternManip: An all-in-one robot manipulation learning suite (5 pretrained models, 3 benchmarks, and more coming soon).
  • InternNav: An open platform for building generalized navigation foundation models (with 6 mainstream benchmarks and 10+ baselines).
  • InternHumanoid: A versatile, all-in-one toolbox for whole-body humanoid robot control.
  • InternSR: An open-source toolbox for vision-based embodied spatial intelligence.

Models, Datasets and Research

  • Humanoids/Legged Robots

    • Datasets:
      • InternData-H1: The largest open-source 3D human motion dataset with text annotations, comprising 2.5k hours and 1.9M episodes.
    • Models and Research:
      • UniHSI: Unified Human-Scene Interaction via Prompted Chain-of-Contacts
      • HIMLoco: Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response
      • 🏆HoST [Best Systems Paper Finalist at RSS 2025]: Learning Humanoid Standing-up Control across Diverse Postures
      • HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit
  • Manipulation

    • Datasets:
      • InternData-A1: A hybrid synthetic-real manipulation dataset integrating 5 heterogeneous robots, 15 skills, and 200+ scenes, emphasizing multi-robot collaboration under dynamic scenarios.
      • InternData-M1: A large-scale synthetic dataset for generalizable pick-and-place over 80K objects, with open-ended instructions covering object recognition, spatial and commonsense reasoning, and long-horizon tasks.
    • Models and Research:
      • InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
      • InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
      • F1-VLA: Visual foresight generation for planning-based control
      • VLAC: A generalist vision-language-action-critic model for robotic real-world reinforcement learning
      • Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
      • RoboSplat: Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation
      • GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
  • Navigation

    • Datasets:
      • InternData-N1: A high-quality navigation dataset with the most diverse scenes and extensive randomization across embodiments/viewpoints, including 3k+ scenes and 830k VLN data.
    • Models and Research:
      • InternVLA-N1: An Open Dual-System Vision-Language Navigation Foundation Model with Learned Latent Plans
      • NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance
      • StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling
      • VLN-PE: A Holistic Study of Physical and Visual Disparities in Vision-and-Language Navigation
  • AIGC for Embodied AI

    • Datasets:
      • OmniWorld: A large-scale, multi-domain, multi-modal dataset that enables significant performance improvements in 4D reconstruction and video generation.
    • Models and Research:
      • MeshCoder: Generate Structured 3D Object Blender Code from Point Clouds
      • Infinite-Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation
      • Aether: Geometric-Aware Unified World Modeling
  • 3D Vision and Embodied Perception

    • EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
    • 🏆PointLLM [Best Paper Candidate at ECCV 2024]: Empowering Large Language Models to Understand Point Clouds
    • MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
    • OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
  • 3D Assets for Embodied AI

    • InternScenes: A large-scale interactive indoor scene dataset with realistic layouts, 40,000 diverse scenes and 1.96M 3D objects.

Pinned

  1. InternUtopia Public

    A simulation platform for versatile Embodied AI research and development.

    Python 1.1k 65

  2. InternScenes Public

    [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.

    Python 187 6

  3. InternManip Public

    An all-in-one robot manipulation learning suite for policy model training and evaluation on various datasets and benchmarks.

    Python 147 11

  4. InternNav Public

    InternRobotics' open platform for building generalized navigation foundation models.

    Jupyter Notebook 364 33

  5. InternHumanoid Public

    A versatile, all-in-one toolbox for whole-body humanoid robot control.

    Python 136 3

  6. InternSR Public

    InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.

    Python 44

Repositories

Showing 10 of 54 repositories
  • InternNav Public

    InternRobotics' open platform for building generalized navigation foundation models.

    Jupyter Notebook 364 33 11 1 Updated Oct 30, 2025
  • internrobotics.github.io Public

    Documentation of Intern Robotics Platform & Toolkits

    Python 2 3 0 1 Updated Oct 30, 2025
  • GenManip Public

    [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"

    Python 75 MIT 0 3 0 Updated Oct 29, 2025
  • EgoThinker Public

    Official implementation of EgoThinker at NeurIPS 2025

    Python 8 0 1 0 Updated Oct 29, 2025
  • Aether Public

    [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

    Python 511 MIT 4 2 0 Updated Oct 26, 2025
  • InternVLA-M1 Public

    InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

    Python 219 MIT 10 3 0 Updated Oct 24, 2025
  • MMSI-Bench Public

    [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

    Python 55 0 0 0 Updated Oct 23, 2025
  • internvla-m1.github.io
    Astro 0 0 0 0 Updated Oct 23, 2025
  • Humanoid-Goalkeeper Public

    [arXiv 2025] Official implementation of "Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints"

    Python 88 3 0 0 Updated Oct 22, 2025
  • MV-CoLight Public

    [NeurIPS 2025] MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation

    Python 6 MIT 1 0 0 Updated Oct 21, 2025
