feat(agent): add GUI-Owl model support #910

Open
Fizza-Mukhtar wants to merge 1 commit into trycua:main from Fizza-Mukhtar:feat/gui-owl-agent
@Fizza-Mukhtar

Summary

This PR adds first-class agent support for GUI-Owl, a model family from the Mobile-Agent-V3 project, enabling it to be used within the existing CUA agent framework.

The integration follows the same design and conventions as existing agents (e.g. Gemini), ensuring consistency and ease of extension.

What’s included

New GUIOwlAgent implementation (gui_owl.py)

Registration via the agent registry using @register_agent("gui-owl")

Agent discoverable through list_agents() and loadable via get_agent("gui-owl")

No breaking changes to existing agents or configs
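
For reviewers unfamiliar with the pattern, the decorator-based registration listed above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual cua_bench code: the `_AGENTS` dictionary, the duplicate-name check, and the choice to return `None` for unknown names are all assumptions made for the example, and the `GUIOwlAgent` body here is only a placeholder.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical stand-in for the agent registry; the real cua_bench
# registry may be structured differently.
_AGENTS: Dict[str, type] = {}


def register_agent(name: str) -> Callable[[type], type]:
    """Return a class decorator that records the class under `name`."""
    def decorator(cls: type) -> type:
        # Assumption: duplicate names are rejected rather than overwritten.
        if name in _AGENTS:
            raise ValueError(f"agent {name!r} is already registered")
        _AGENTS[name] = cls
        return cls  # the class itself is returned unchanged
    return decorator


def list_agents() -> List[str]:
    """Return the registered agent names in sorted order."""
    return sorted(_AGENTS)


def get_agent(name: str) -> Optional[type]:
    """Return the class registered under `name`, or None if unknown."""
    return _AGENTS.get(name)


@register_agent("gui-owl")
class GUIOwlAgent:
    """Placeholder; the real implementation lives in gui_owl.py."""
```

With this shape, importing the module that defines the class is what triggers registration, so no existing agent or config needs to change for the new name to become discoverable.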

Verification / Testing

The following local, non-GPU tests were performed:

from cua_bench.agents import list_agents, get_agent

assert "gui-owl" in list_agents()
assert get_agent("gui-owl") is not None
print(list_agents())         # registered agent names
print(get_agent("gui-owl"))  # resolved agent class

Output confirms successful registration and loading:

['cua-agent', 'gemini', 'gui-owl']
<class 'cua_bench.agents.gui_owl.GUIOwlAgent'>

Notes on evaluation

Full benchmark runs (e.g. ScreenSpot / OSWorld / Android Control) were not executed locally due to hardware (GPU) limitations.
This PR focuses on infrastructure-level integration, enabling downstream users and CI environments with appropriate hardware to run full evals without additional changes.

Why this matters

GUI-Owl is a model family focused on GUI automation. With this integration it can be loaded through the agent registry like any other agent in the CUA framework.

closes #374

@vercel

vercel bot commented Jan 25, 2026

@Fizza-Mukhtar is attempting to deploy a commit to the Cua Team on Vercel.

A member of the Team first needs to authorize it.

@Fizza-Mukhtar

@f-trycua @ddupont808
Hi! Whenever you get time, could you please take a look at this PR?
Thanks a lot 🙏
