🚀 A modular decision-making framework combining graph-based physical perception with LLM-driven reasoning, tested in dynamic and uncertain environments like obstacle avoidance and Tetris!
| gpt-4o-mini | gpt-4o | VLM | APEX(gpt-4o) |
|---|---|---|---|
![]() |
![]() |
![]() |
![]() |
gpt-4o-mini
gpt-4o
VLM
APEX(gpt-4o-mini)
APEX(gpt-4o)
APEX is a hybrid framework designed for reasoning and decision-making under uncertainty. It uses a Graphormer model to capture potential physical interactions and a language model (LLM) to interpret simulation results and generate high-level strategies.
- Graphormer Trigger: Detects potential physical hazards by computing edge-level attention scores.
- Physical Simulator: Predicts action outcomes under current physics.
- LLM Planner: Uses simulation summaries to suggest safe and effective actions.
- Ask LLM to answer physical questions without using python.
- Predicts optimal block placement
- Avoids overflow & line gaps using reasoning
- Difficulty levels:
Simple,Medium,Hard - Metrics:
- CFR: Collision-Free Rate
- IAR: Invalid Action Rate
- AST: Average Survival Time
- See as the corresponding folders.
conda create -n apex python=3.11
conda activate apexIf you find this work helpful, please cite our paper💡
@misc{huang2025apexempoweringllmsphysicsbased,
title={APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight},
author={Wanjing Huang and Weixiang Yan and Zhen Zhang and Ambuj Singh},
year={2025},
eprint={2505.13921},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2505.13921},
}







