Skip to content

Commit

Permalink
Update OSWorld
Browse files Browse the repository at this point in the history
  • Loading branch information
Timothyxxx authored Oct 11, 2024
1 parent 850edfb commit 856e7df
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -188,6 +188,7 @@ Multimodal Large Language Models (MLLMs) are gaining increasing popularity in bo
2. <mark>EgoPlan-Bench</mark> **"EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning"**. *Yi Chen, Yuying Ge, Yixiao Ge, et al.*. arXiv 2023. [[Paper](https://arxiv.org/abs/2312.06722)] [[Github](https://github.com/ChenYi99/EgoPlan)].
3. <mark>PCA-EVAL</mark> **"Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond"**. *Liang Chen, Yichi Zhang, Shuhuai Ren, et al.*. arXiv 2023. [[Paper](https://arxiv.org/abs/2310.02071)] [[Github](https://github.com/pkunlp-icler/PCA-EVAL/)].
4. <mark>OpenEQA</mark> **"OpenEQA: Embodied Question Answering in the Era of Foundation Models"**. *Majumdar, Arjun and Ajay, Anurag and Zhang, et al.*. CVPR 2024. [[Paper](https://openaccess.thecvf.com/content/CVPR2024/papers/Majumdar_OpenEQA_Embodied_Question_Answering_in_the_Era_of_Foundation_Models_CVPR_2024_paper.pdf)] [[Github](https://open-eqa.github.io/)].
5. <mark>OSWorld</mark> **"OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments"**. *Tianbao Xie, Danyang Zhang, Jixuan Chenet al.*. NeurIPS 2024. [[Paper](https://arxiv.org/abs/2404.07972)] [[Github](https://os-world.github.io/)].

**Mobile Agency**
1. <mark>Mobile-Eval</mark> **"Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception"**. *Junyang Wang, Haiyang Xu, Jiabo Ye, et al.*. ICLR 2024. [[Paper](https://arxiv.org/abs/2401.16158)] [[Github](https://github.com/X-PLUG/MobileAgent)].
Expand Down

0 comments on commit 856e7df

Please sign in to comment.