Every AI coding agent shares the same blind spot: it writes frontend code but cannot see what the browser renders. The human becomes a relay station — screenshot, copy error, paste back. This is the exact “Human in the Middle” pattern that HOTL architecture is designed to eliminate.
This post compares how five major platforms solve (or fail to solve) this problem as of March 2026.
The Problem
Agent writes code → Human opens browser → Human screenshots → Human pastes back
↑ |
└──────────────────── wasted cycle ────────────────────────────┘
What we want:
Agent writes code → Agent navigates localhost → Agent screenshots → Agent fixes
Five Platforms Compared
1. Antigravity IDE — The Gold Standard
Google’s agentic IDE has the most complete browser integration. The agent launches localhost, navigates pages, clicks elements, reads console output, and takes screenshots — all built-in, zero configuration.
Key differentiator: Artifacts system. Every screenshot and browser recording becomes a verifiable deliverable. The Manager view shows what the agent saw, not just what it wrote.
- Built-in browser subagent with click, scroll, type, DOM capture, video recording
- Automatic visual regression detection (before/after screenshots)
- Free in public preview, supports Gemini 3 Pro + Claude Sonnet 4.5
2. Claude Code CLI — Two Paths
Path A: /chrome beta — Install the “Claude in Chrome” extension, then the agent can control your browser directly. Convenient but limited: same-machine only, JS dialogs block the agent, Brave/Arc not supported.
Path B: chrome-devtools-mcp — The universal MCP server that works with everything. 26 tools covering navigation, input, emulation, performance, network, and debugging. --isolated mode runs a fresh Chrome instance with a temporary profile.
For most developers, Path B is more reliable. Path A is more convenient when it works.
3. Codex CLI/App — MCP Only
OpenAI’s Codex has no built-in browser capability. It relies entirely on MCP servers (Playwright or Chrome DevTools). The sandbox restricts network access by default — you must explicitly approve localhost connections.
Known issues: Playwright MCP connection timeouts reported in GitHub issues. Works better in CLI than in VS Code extension.
4. Gemini CLI — Experimental Browser Agent
Since v0.31.0 (Feb 27, 2026), Gemini CLI has an experimental browser subagent that uses the accessibility tree + a vision model for coordinate-based clicking. Novel approach — the agent “sees” the page through accessibility labels and screenshots analyzed by Gemini’s vision model.
Free with generous API limits. Weekly release cadence means the API may change.
5. Chrome DevTools MCP — The Universal Bridge
Not a platform, but the tool that makes all platforms work. Google’s official MCP server connects any AI agent to Chrome via CDP (Chrome DevTools Protocol).
| Fact | Value |
|---|---|
| Node.js requirement | 20.19+ (not 22+ as some docs claim) |
| Chrome requirement | 146+ Stable (released 2026-03-10) |
| Tools | 26 (navigation, input, emulation, performance, network, debugging) |
| Supported agents | Claude Code, Claude Desktop, Gemini CLI, Cursor, Copilot, Codex |
| Mac + Windows | Both supported |
Install for Claude Code:
claude mcp add chrome-devtools --scope user npx chrome-devtools-mcp@latest
Always use --isolated mode in production — it creates a temporary Chrome profile that auto-deletes on close.
The Matrix
| Capability | Claude Code | Codex | Gemini CLI | Antigravity |
|---|---|---|---|---|
| Built-in browser | Beta | No | Experimental | Full |
| MCP support | Yes | Yes | Yes | Yes |
| See localhost | Yes | Yes (approve) | Yes | Yes |
| Auto screenshot | Yes | Yes | Yes | Auto + artifacts |
| Console errors | Yes | Via MCP | Via MCP | Built-in |
| Mac + Windows | Both | Both | Both | Both |
| Free | No | Limited | Yes | Yes (preview) |
What This Means for Development Teams
If you’re on Claude Code (like us): install chrome-devtools-mcp. It’s the most mature, most portable option. One install, works everywhere.
If you’re starting fresh: try Antigravity IDE. Its browser integration is a generation ahead. The Artifacts system alone changes how you verify frontend work.
If you’re cost-sensitive: Gemini CLI is free and improving weekly. The experimental browser agent is rough but functional.
Universal truth: the chrome-devtools-mcp server is the safety net. No matter which platform you use, it provides the deepest DevTools integration available via MCP.
Safety Boundaries
Browser control is powerful and dangerous. Our rules:
- Always
--isolatedmode — never connect to personal Chrome profile - Never browse financial sites (banking, trading platforms)
- Never access cookies/localStorage of non-development sites
- Close debugging port when done
- Disable Google telemetry:
--no-performance-crux --no-usage-statistics
The agent enhances development workflows. It does not touch personal financial processes.