Which tool sees better?

Same ugly page. Same prompt. 5 rounds. Three independent AI agents, each with a different observation tool. The only variable: how much the tool reveals about the page's visual state.

1 starting page · 3 tools · 5 rounds each · 0 human hints
Before: 20+ deliberate design problems (Comic Sans, invisible CTA, clashing colors, tiny text, no visual hierarchy).

After 5 rounds:
Experiment A: chrome-cdp-ex perceive
Experiment B: Playwright snapshot
Experiment C: Tool C snapshot

What each tool revealed

Playwright
Tool: browser_snapshot returns element roles and names only: no colors, no spacing, no font sizes, no layout.
Result: Clean white/blue theme. Proper hierarchy. Professional, but the agent relied on reading the CSS source to find visual issues.
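The role-and-name view is easy to picture with a mock tree (the page and names below are invented for illustration, not Playwright output): everything visual is already gone before the agent sees it.

```python
# Minimal sketch of what an accessibility-tree snapshot conveys:
# element roles and accessible names only -- no colors, spacing, or fonts.
# The tree below is a hypothetical page, not real browser_snapshot output.

def render_ax_tree(node, depth=0):
    """Flatten an accessibility node into indented 'role "name"' lines."""
    line = "  " * depth + f'{node["role"]} "{node.get("name", "")}"'
    lines = [line]
    for child in node.get("children", []):
        lines.extend(render_ax_tree(child, depth + 1))
    return lines

page = {
    "role": "document", "name": "Acme Landing",
    "children": [
        {"role": "heading", "name": "Welcome to Acme"},
        # The snapshot cannot say this CTA is invisible on the page:
        {"role": "button", "name": "Sign up"},
    ],
}

print("\n".join(render_ax_tree(page)))
```

From this view the "invisible CTA" problem is undetectable: the button is present and named, so the agent must go read the stylesheet to learn anything about its appearance.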
chrome-cdp-ex
Tool: perceive returns layout dimensions, background colors, font sizes, contrast hints, scroll position, and bounding coordinates; the agent knows what things look like.
Result: Polished dark SaaS theme. Google Fonts, gradient text, SVG icons, differentiated CTA styles, social proof, multi-column footer.
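To make the contrast-hint idea concrete, here is a minimal sketch. The record fields are assumptions about what a perceive-style payload might carry, not chrome-cdp-ex's actual schema, but the contrast math is the standard WCAG ratio.

```python
# Hypothetical shape of one observed element, plus the WCAG contrast
# ratio a "contrast hint" could be derived from. Field names are
# assumptions, not chrome-cdp-ex's real output format.
from dataclasses import dataclass

def _channel(c8):
    # sRGB 8-bit channel -> linear value (WCAG relative-luminance formula)
    c = c8 / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4

def luminance(rgb):
    r, g, b = (_channel(c) for c in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b

def contrast_ratio(fg, bg):
    lo, hi = sorted([luminance(fg), luminance(bg)])
    return (hi + 0.05) / (lo + 0.05)

@dataclass
class VisualRecord:
    selector: str
    bbox: tuple           # (x, y, width, height) in CSS pixels
    background: tuple     # RGB
    color: tuple          # RGB
    font_size_px: float

    @property
    def contrast_hint(self):
        ratio = contrast_ratio(self.color, self.background)
        # 4.5:1 is the WCAG AA threshold for body text
        return "ok" if ratio >= 4.5 else f"low ({ratio:.1f}:1)"

# Near-white text on a near-white background: the "invisible CTA" case.
cta = VisualRecord("#signup", (40, 300, 160, 44), (250, 250, 250), (240, 240, 240), 11.0)
print(cta.bbox, cta.font_size_px, cta.contrast_hint)
```

With a record like this, the invisible CTA and the tiny text are flagged in the observation itself; no trip through the stylesheet is needed.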
Tool C
Tool: snapshot returns an accessibility tree with element refs. Similar to Playwright but more compact (~400 vs ~3,500 tokens).
Result: Dark/white hybrid. Solid layout, icon placeholders, "Most Popular" badge. Clean but less refined.
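The token gap is mostly serialization density. A toy comparison of the same hypothetical tree in two encodings (neither string is either tool's real wire format):

```python
# Same two nodes, two encodings: verbose nested JSON vs one compact
# 'ref role "name"' line per element. Illustrative only.
import json

nodes = [
    {"ref": "e1", "role": "heading", "name": "Welcome to Acme"},
    {"ref": "e2", "role": "button", "name": "Sign up"},
]

# Verbose: pretty-printed JSON, every key spelled out per node.
verbose = json.dumps({"snapshot": {"children": nodes}}, indent=2)

# Compact: one line per node, refs still usable for follow-up actions.
compact = "\n".join(f'{n["ref"]} {n["role"]} "{n["name"]}"' for n in nodes)

print(len(verbose), "chars verbose vs", len(compact), "chars compact")
```

Fewer characters per element means more of the page fits in the agent's context window each round, even though both encodings carry the same role/name/ref information.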

Experiment Design

Starting file: identical challenge.html, no hint comments
Agent: independent Claude Code session per tool
Prompt: same structure; only tool names differ
Rounds: exactly 5 per agent
Source access: all agents can read the HTML source
Observation tool: the only variable
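The protocol can be sketched as a loop: same prompt template, five rounds per agent, only the observation tool swapped. run_agent_round is a hypothetical stand-in for driving one Claude Code session, not part of the actual setup.

```python
# Hedged sketch of the experiment loop. Tool names follow the write-up;
# run_agent_round is a hypothetical callback, not a real API.

TOOLS = ["playwright.browser_snapshot", "chrome-cdp-ex.perceive", "tool-c.snapshot"]
ROUNDS = 5
PROMPT = "Improve the visual design of challenge.html, observing the page with {tool}."

def run_experiment(run_agent_round):
    """run_agent_round(tool, round_no, prompt) -> result summary (hypothetical)."""
    results = {}
    for tool in TOOLS:
        for round_no in range(1, ROUNDS + 1):
            results[(tool, round_no)] = run_agent_round(
                tool, round_no, PROMPT.format(tool=tool)
            )
    return results

# Dry run with a stub agent: 3 tools x 5 rounds = 15 entries.
log = run_experiment(lambda tool, n, prompt: f"{tool} round {n}")
print(len(log))
```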

All three agents dramatically improved the page. The difference is in how quickly they identified visual issues and how much polish they achieved — the agent with richer visual context made more nuanced design decisions.