WebHarbor – We “dock” the real websites into local for web agents! [R]

Disclosure: Some links in this article are affiliate links. AI Maestro may earn a commission if you make a purchase, at no…

By AI Maestro May 14, 2026 1 min read
WebHarbor – We “dock” the real websites into local for web agents! [R]

“`html

Hello! Excited to share our latest community-driven research project: WebHarbor: Docking Real Websites for Evolving GUI Agent Environments. This initiative involves packaging 15 popular websites (like Amazon, GitHub, BBC News) into self-contained Flask + SQLite apps in a single Docker image. The control plane allows resetting each site to its original state in less than one second by human-in-the-loop coding agents like Claude Code or CodeX.

Why WebHarbor: Running web agent benchmarks on the live web is challenging due to factors such as reCAPTCHA, geo-blocking, content drift, and network issues. This project provides a lightweight, easy-to-reset environment for both evaluation and training of web agents, addressing these challenges by allowing controlled experiments without exposing real-world websites.

  1. WebHarbor Project Page: Access detailed information about the project including how to contribute or review PRs.
  2. HuggingFace Dataset: Utilize this dataset for further research and development on WebHarbor.
  3. Contribute Guide: Learn how to create new mirror sites using a pipeline, human verification, and contributing to the final paper.

Welcome suggestions and discussions!

“`

“`html

Suggested Resources:

“`


Originally published at reddit.com. Curated by AI Maestro.

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

Name
Scroll to Top