Guide

Automate a 40-Step SOP Without Writing a Single Selector

David Park||6 min
Ctrl+R

Imagine a finance team that has to manually run a forty-step reconciliation every week. The process is documented in a standard operating procedure. Someone on the team reads each step, clicks the right buttons, copies data between spreadsheets, and checks results. It takes two hours. The process is stable, the steps do not change, and everyone knows how to run it. The problem is the backlog. Every time a new system rolls out or a UI refresh happens, the existing bot breaks. The team has to rebuild the automation from scratch. That is the reality for many automation programs. Bots become fragile islands that require constant babysitting. A 40-step process that should take minutes instead becomes a monthly maintenance project.

Why RPA breaks here

Traditional RPA relies on selectors, xpaths, and object IDs. A bot captures the position of a button or a text field. When the underlying UI changes, the selector no longer points to the right element. The bot clicks the wrong thing or fails to find the expected result. Gartner estimates that up to 70 percent of RPA maintenance time goes into fixing broken bots after system updates. In a 40-step process, a single selector failure can halt the entire run. The bot has to be paused, debugged, and then rebuilt. For a process that runs once a week, the cost of that rebuild can exceed the value of the automation itself. The vulnerability is not just the development time. It is the risk of missed deadlines, incorrect reconciliations, and manual workarounds that defeat the purpose of automation. The process is documented as a human SOP, but the bot is built as a fragile, non-recoverable script.

What changes with computer use agents

  • Agents see the screen just like a human does. They use vision and reading to understand the current state.
  • No selectors, xpaths, or object IDs are required. The agent adapts when the UI changes.
  • When something unexpected happens, the agent reads the error, identifies the mismatch, and recalculates the next action instead of halting.
  • A human SOP in plain English can be fed directly to the agent. The agent follows the steps as written.
  • Agents work across any application, including legacy systems, Citrix environments, and virtual desktops where traditional RPA struggles.

The one line a VP of automation should remember: A computer use agent treats your SOP as live documentation, not a fragile bot script.

The 40-step process in practice

Take the same forty-step reconciliation. The human SOP lists actions like open the ERP, navigate to the transactions tab, filter by date, download the CSV, open the analytics tool, upload the file, review the dashboard values, copy the totals, paste them into the spreadsheet, save the sheet, and then repeat for the next entity. With traditional RPA, each of those steps requires a developer to identify the exact UI element and hardcode the selector. If the ERP refreshes the navigation menu, the bot fails. With a computer use agent, you upload the SOP and let the agent run through the steps. The agent watches the screen, reads the labels, and clicks the buttons it sees. If the menu changes, it simply clicks the new label it can read. If a dialog appears asking for confirmation, the agent reads the message and clicks the appropriate button. The bot does not need to know the underlying UI structure. It only needs to follow the human instructions. This makes the automation durable across releases, updates, and even different environments.

How to move without the risk

You do not have to rip out your existing RPA investment overnight. Start by identifying one high-pain process that is both SOP-driven and frequently interrupted by UI changes. A common example is a multi-step data entry or review task that touches multiple systems. The process should have documented steps that people actually follow. Partner with the operations team to capture the SOP in plain language. Feed it into a computer use agent and run a pilot on a non-production environment. Measure how much time the agent saves compared with manual execution. Focus on exception handling. Ask the agent to run through known error scenarios and see how it handles them. If the pilot succeeds, expand the approach to other processes. Over time, you can phase in more agent-based automation while keeping your stable, high-volume RPA bots intact. This hybrid approach lets you capture the long tail of changing UIs and SOP-driven work without abandoning your current investments.

The path forward is not to replace RPA entirely, but to add a layer of durable automation that handles the processes RPA cannot. Computer use agents let you turn a human SOP into working automation without writing a single selector. They survive UI changes, recover from exceptions, and work across any application. Ready to see how a computer use agent can automate your next 40-step process without the fragility of traditional RPA? Book a demo with the Coasty team at https://cal.com/coasty/15min .

Want to see this in action?

View Case Studies
Try Coasty Free