Why I stopped using DOM locators for End-to-End tests altogether—and replaced them with computer vision

If you’ve ever written End-to-End (E2E) tests using Selenium, Cypress, or Playwright, you know the ultimate pain: flakiness. You write a perfect test. It runs green. Two weeks later, a frontend developer changes a wrapper div, renames a CSS class, or adds a promotional pop-up. Suddenly, your CI/CD pipeline is red, and your test fails with the dreaded ElementNotInteractableException. I’ve spent years in QA automation, and I realized we are testing UIs fundamentally wrong. Humans don't look at the DOM to click a button. We look at the screen. So, why do our automated tests rely on hidden HTML structures? I decided to fix this and built AIQA Systems - an automation approach that ditches DOM locators completely in favor of Computer Vision (CV) and Large Language Models (LLMs). Here is how it works, what it costs, and why I believe this is the future of QA. ❌ The Problem with the DOM-based Approach Standard automation relies on finding elements via IDs, XPaths, or CSS selectors. // The old

Why I stopped using DOM locators for End-to-End tests altogether—and replaced them with computer vision

Related Articles

Why i think Mythos is gonna be game changing after using Opus for a CTF

I Am Very Fond of the Pipeline Operator

Understand ARP in byte level

1SubML: Plan vs Reality

Group Lasso with Overlaps: the Latent Group Lasso approach

Related Articles

News
Why i think Mythos is gonna be game changing after using Opus for a CTF
Reddit Programming • 2h ago

News
I Am Very Fond of the Pipeline Operator
Reddit Programming • 3h ago

News
Understand ARP in byte level
Reddit Programming • 4h ago

News
1SubML: Plan vs Reality
Lobsters • 7h ago

News
Group Lasso with Overlaps: the Latent Group Lasso approach
Dev.to • 10h ago