MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
-
Updated
Mar 20, 2026 - Jupyter Notebook
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
[AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic alignment bottlenecks in GUI agents through efficient, guided exploration.
Mano-P: Open-source GUI-VLA agent for edge devices. #1 on OSWorld (specialized, 58.2%). Runs locally on Apple M4 Mac mini/MacBook — no data leaves your device.Mano-P 是一个开源 GUI-VLA 项目,支持在 Mac mini/MacBook 上或通过算力棒本地运行推理,实现纯视觉驱动的跨平台 GUI 自动化操作。数据完全本地处理,支持复杂多步骤任务规划与执行。
Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning
A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method.
Code for paper "Improved GUI Grounding via Iterative Narrowing"
Add a description, image, and links to the gui-grounding topic page so that developers can more easily learn about it.
To associate your repository with the gui-grounding topic, visit your repo's landing page and select "manage topics."