Show HN: I made an AI that reviews iPhone apps – 1h of autonomous GUI work

I ran a GUI agent for ~1 hour that installs and reviews iPhone apps on a real device. Understudy: browses the App Store in Chrome, mirrors a real iPhone via macOS, explores the app, records screenshots/video, stitches a narrated review with FFmpeg, uploads to YouTube. Key architecture: split the work into typed child sessions so context doesn’t explode. Six stages: scrape listing, install via mirroring, exploratory testing, targeted checks, media capture, compose+upload+cleanup. Workers = deterministic I/O. Skills = agentic decisions. Robustness wins here. Re‑ground every action from the live screenshot so unexpected dialogs don’t kill the run. Keep device control deterministic, collect screenshots + UI dumps + logs, and compose media locally (FFmpeg) for an auditable pipeline. MIT license. Takeaway for builders: long GUI agents need clear separation — deterministic workers for device/browser ops, agentic skills for discovery, session isolation, and artifact bundling + human review bef

Show HN: I made an AI that reviews iPhone apps – 1h of autonomous GUI work

Related Articles

Learn Something Old Every Day, Part XVIII: How Does FPU Detection Work?

“Learn to Code” Is Dead… Learn to Think Instead

How One File Makes Claude Code Actually Follow Your Instructions

LeetCode Solution: 121. Best Time to Buy and Sell Stock

The Feature Took 2 Hours to Build — and 2 Weeks to Fix

Related Articles

How-To
Learn Something Old Every Day, Part XVIII: How Does FPU Detection Work?
Lobsters • 1h ago

How-To
“Learn to Code” Is Dead… Learn to Think Instead
Medium Programming • 3h ago

How-To
How One File Makes Claude Code Actually Follow Your Instructions
Medium Programming • 4h ago

How-To
LeetCode Solution: 121. Best Time to Buy and Sell Stock
Dev.to Tutorial • 4h ago

How-To
The Feature Took 2 Hours to Build — and 2 Weeks to Fix
Medium Programming • 5h ago