Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps

I wanted to test mobile apps in plain English instead of relying on brittle selectors like XPath or accessibility IDs. With a vision-based agent, that part actually works well. It can look at the screen, understand intent, and perform actions across Android and iOS.

The bigger problem showed up around how tests are defined and maintained. When test flows are kept outside the codebase (written manually or generated from PRDs), they quickly go out of sync with the app. Keeping them updated becomes a lot of effort, and they lose reliability over time.

I then tried generating tests directly from the codebase (via MCP). That improved sync, but introduced high token usage and slower generation.

The shift for me was realizing test generation shouldn’t be a one-off step. Tests need to live alongside the codebase so they stay in sync and have more context. I kept the execution vision-based (no brittle selectors), but moved test generation closer to the repo.

I’ve open sourced the core pieces: 1
