Princeton AI for Accelerating Invention Lab
Authors: Aiden Yiliu Li, Xinyue Hao, Shilong Liu, Mengdi Wang
Avenir-Web is an autonomous web agent framework designed for reliable execution of long-horizon tasks on complex, dynamic web interfaces. Addressing challenges in element grounding and long-term task tracking, it introduces a modular architecture combining Mixture of Grounding Experts (MoGE), Experience-Imitation Planning (EIP), and Adaptive Memory with Task-Tracking Checklists. This approach establishes a new open-source state-of-the-art on the Online-Mind2Web benchmark, bridging the performance gap with proprietary models in real-world deployments.
- 2026-03-05: Added repository-level
SKILL.mdto define the Avenir-Web agent skill workflow (mode selection, instruction design, single/batch execution, API requirements, and reporting contract).
Requirements:
- Python
3.11(recommended;>=3.9supported) - Playwright browsers (Chromium recommended)
- A model provider API key (OpenRouter preferred)
From the repository root:
conda create -n avenir-web python=3.11
conda activate avenir-web
pip install -e src
python -m playwright install chromiumRecommended (environment variable):
export OPENROUTER_API_KEY="your-key"Or set it in src/config/batch_experiment.toml under [api_keys] (openrouter_api_key = "..."). Environment variables take precedence.
The example configuration runs a batch from data/example.json and writes artifacts to the directory configured by basic.save_file_dir in src/config/batch_experiment.toml.
cd src
python run_agent.py -c config/batch_experiment.tomlFrom the repository root:
python example.py --task "Find the official API docs for X" --website "https://example.com/"This repository now includes a dedicated skill specification at SKILL.md.
The skill defines how an agent should:
- choose run mode (
headless,headed,demo) - write high-quality task instructions
- run single-task and batch workflows consistently
- handle API key requirements (
OPENROUTER_API_KEYfirst, TOML fallback) - report outcomes with evidence and next-step recommendations
If you are using an agentic coding assistant that supports skills, point it to this file as the canonical operating guide for Avenir-Web runs.
Avenir-Web features a specialized Demo Mode designed for high-quality screen recordings and live presentations. When enabled, the agent provides:
- Cursor Visuals: A GPU-accelerated, dynamic cursor overlay with tech-minimalist light effects and clear click/keypress feedback.
- Real-time Dashboard: A native GUI window synchronized with the browser, displaying the agent's current strategy, task progress, and live status.
- Visual Feedback: Reaction-based animations (e.g., impact ripples, shockwaves) that make the agent's decision-making process transparent and engaging.
Watch Avenir-Web in action: YouTube Demo
To enable Demo Mode, ensure browser.mode = "demo" is set in your configuration.
For each task, outputs are written under basic.save_file_dir/<task_id>/ (configured in TOML):
agent.log: per-task execution logresult.json: final summaryconfig.toml: resolved config snapshotllm_records.json: recorded LLM I/Oscreenshots/:screen_<step>.png
Runner-level logs are written under src/logs/.
The primary configuration entry point is src/config/batch_experiment.toml:
[basic]: output directory (save_file_dir)[model]: model name, temperature, and (optional) specialist models (e.g., checklist/strategist)[api_keys]: API keys (environment variables still take precedence)[experiment]: task file path, overwrite policy, max operations[playwright]: headless/headful, viewport, geolocation
- Missing API key: set
OPENROUTER_API_KEY(preferred) or configure[api_keys] - Playwright browser not found: run
python -m playwright install chromium - Config paths look wrong: run from
src/or pass an absolute path to-c
This project was developed with support from Princeton AI for Accelerating Invention Lab.
This repository is provided for research use. Model outputs may be incorrect, incomplete, or unsafe; you are responsible for reviewing actions and complying with applicable laws and website terms of service when running web automation.
- Aiden Yiliu Li: yiliu.li@outlook.com
- Shilong Liu: slongliu86@gmail.com
This project is licensed under the Apache License 2.0 - see the LICENSE.txt file for details.
Copyright © 2026 Princeton AI for Accelerating Invention Lab. All rights reserved.