Skip to content

TruhnLab/computer-use-agents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

💻 Computer-Use Agents Evaluation

This repository contains all relevant code and material for our evaluation of autonomous computer-use agents on a Hospital Information System (HIS). The materials are intended to provide an unified entry point for agent implementations and task definitions, enable reproducibility and comparison of different computer-use agent architectures and support research on autonomous GUI interaction in clinical software systems.


Computer-Use Agents

Three autonomous agents were evaluated in our experiments:

Agent A (omniparser_mcp branch)

  • GUI perception handled locally via OmniParser MCP server
  • Screenshots converted into structured textual representations
  • Reduced model token usage, increased local computation
  • Integrated vision, planning, and reasoning
  • Screenshots provided directly as image inputs alongside OCR results
  • Lower local compute requirements, higher token usage

Agent C (openai_gpt54_computer branch)

  • Successor of the computer-use-preview model
  • Uses OpenAIs GPT-5.4 model with an updated computer tool

All agents receive natural-language task instructions (listed in tasks.json) to iteratively perceive the GUI, plan actions, and execute mouse and keyboard interactions. For comparability, they share the same predefined action set (mouse movement, clicks, scrolling, text input, keyboard shortcuts).

Execution Environment

All experiments were conducted using HospitalRun, an open-source, offline demo Hospital Information System (HIS).

System:

  • Windows 11
  • Intel Core i9-13900K
  • 128 GB RAM
  • NVIDIA RTX 4090 (24 GB)

Deployment

Instructions for running or using the agents can be found in the respective branches.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors