OpenFlo

An open-source web automation framework powered by large multimodal models.

Developed under Nexus Labs

This project is based on Avenir-Web, which itself builds upon the SeeAct framework developed by the OSU NLP Group.

OpenFlo enables autonomous web agents to perform tasks on any website using vision-language models. The system combines robust browser automation with intelligent action prediction to execute complex workflows.

Repository Layout

src/seeact/: core agent implementation (SeeActAgent)
- agent/: main agent logic
  - agent.py: central agent class and execution flow
  - config.py: configuration loading and validation
  - reporting.py: result saving and summary generation
  - evaluation.py: task success evaluation and termination logic
  - executor.py: action execution logic
  - predictor.py: LLM interaction and action prediction
src/run_agent.py: single-process runner (demo + batch)
src/config/*.toml: sample configs
data/: example data and task files

Requirements

Python >=3.9 (src/pyproject.toml)
A browser for Playwright (Chromium recommended)
An API key for your chosen provider (OpenRouter preferred)

Setup

From the repository root:

# Create a conda environment
conda create -n seeact python=3.11
conda activate seeact

# Install the package in editable mode
pip install -e src

# Set up Playwright and install browser kernels
playwright install

Set an API key:

export OPENROUTER_API_KEY="your-key"

You can also put keys in src/config/*.toml under [api_keys]. Environment variables take precedence.

Running

Run scripts from src/ (paths in configs are written relative to src/):

cd src

Batch Mode (JSON list of tasks)

In your config (src/config/auto_mode.toml), set experiment.task_file_path, then run:

python run_agent.py -c config/auto_mode.toml

Task JSON Format

Batch mode expects a JSON array of tasks like:

[
  {
    "task_id": "task_001",
    "confirmed_task": "Find the official API docs for X",
    "website": "https://example.com/"
  }
]

Configuration Overview

Configs are TOML files; see src/config/auto_mode.toml.

[basic]
- save_file_dir: output root directory
- default_task, default_website: defaults for single-task runs
[experiment]
- task_file_path: JSON tasks list for batch mode
- overwrite: skip or overwrite existing task output folders
- max_op, max_continuous_no_op, highlight
[model]
- name: model identifier (commonly openrouter/...)
- temperature, rate_limit
- optional: reasoning_model, checklist_model, completion_eval_model
[api_keys]
- openrouter_api_key (or set OPENROUTER_API_KEY)
- optional: gemini_api_key (or set GEMINI_API_KEY)
[playwright]
- headless, viewport, tracing, save_video, locale, geolocation

Outputs

Each task writes to basic.save_file_dir/<task_id>/:

agent.log: per-task execution log
result.json: final summary (handled by src/seeact/agent/reporting.py)
config.toml: resolved config snapshot
all_predictions.json: recorded LLM I/O for the task
screenshots/: screen_<step>.png and sometimes screen_<step>_labeled.png

The runners also write run-level logs to src/logs/.

Troubleshooting

Missing API key: set OPENROUTER_API_KEY (preferred) or configure [api_keys]
Playwright browser not found: run python -m playwright install chromium
Want to watch the browser: set playwright.headless = false
Config paths look wrong: run from src/ or pass an absolute -c config path

Attribution

OpenFlo is built upon Avenir-Web, which extends the original SeeAct framework by the OSU NLP Group.

If you use this work, please cite the original SeeAct paper:

@article{zheng2024seeact,
  title={GPT-4V(ision) is a Generalist Web Agent, if Grounded},
  author={Zheng, Boyuan and Gou, Boyu and Kil, Jihyung and Sun, Huan and Su, Yu},
  journal={arXiv preprint arXiv:2401.01614},
  year={2024}
}

License

This project maintains the same license as the original SeeAct framework. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data/online_tasks		data/online_tasks
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenFlo

Repository Layout

Requirements

Setup

Running

Batch Mode (JSON list of tasks)

Task JSON Format

Configuration Overview

Outputs

Troubleshooting

Attribution

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Onflow-AI/OpenFlo

Folders and files

Latest commit

History

Repository files navigation

OpenFlo

Repository Layout

Requirements

Setup

Running

Batch Mode (JSON list of tasks)

Task JSON Format

Configuration Overview

Outputs

Troubleshooting

Attribution

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages