“`html
Hello! Excited to share our latest community-driven research project: WebHarbor: Docking Real Websites for Evolving GUI Agent Environments. This initiative involves packaging 15 popular websites (like Amazon, GitHub, BBC News) into self-contained Flask + SQLite apps in a single Docker image. The control plane allows resetting each site to its original state in less than one second by human-in-the-loop coding agents like Claude Code or CodeX.
Why WebHarbor: Running web agent benchmarks on the live web is challenging due to factors such as reCAPTCHA, geo-blocking, content drift, and network issues. This project provides a lightweight, easy-to-reset environment for both evaluation and training of web agents, addressing these challenges by allowing controlled experiments without exposing real-world websites.
- WebHarbor Project Page: Access detailed information about the project including how to contribute or review PRs.
- HuggingFace Dataset: Utilize this dataset for further research and development on WebHarbor.
- Contribute Guide: Learn how to create new mirror sites using a pipeline, human verification, and contributing to the final paper.
Welcome suggestions and discussions!
“`
“`html
Suggested Resources:
- WebHarbor Project Page — Detailed information about the project.
- HuggingFace Dataset — Useful for further research and development on WebHarbor.
- WebHarbor GitHub — Access the code repository to contribute or review PRs.
“`
Originally published at reddit.com. Curated by AI Maestro.
Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

![WebHarbor – We “dock” the real websites into local for web agents! [R]](https://ai-maestro.online/wp-content/uploads/2026/05/webharbor-we-dock-the-real-websites-into-local-for-web-agent-1024x576.jpg)


