Guide

Your AI Customer Support Is Embarrassing You (Here's How to Actually Fix It With a Computer Use Agent)

Sophia Martinez||8 min
+W

U.S. companies lose $1.6 trillion every single year because their customer service is bad. Not because they have too few agents. Not because customers are too demanding. Because they keep reaching for the wrong tools and calling it innovation. Klarna is the most honest example of this. In 2023, they fired 700 customer service reps, pointed at their shiny AI chatbot, and told the world this was the future. By May 2025, their CEO was on Bloomberg admitting customers hated it and they were hiring humans again. That story should terrify every ops leader reading this, but not for the reason most people think. The lesson isn't 'AI bad.' The lesson is that a dumb chatbot that can't actually do anything is not the same as a real AI computer use agent. And the difference between those two things is the difference between a PR disaster and a support operation that actually scales.

The Chatbot Era Is Over and It Failed Spectacularly

Legacy chatbots fail 63% of customer interactions. That stat comes from banking data, but if you've ever tried to get a refund from a chatbot at 11pm, you already knew that. The thing is, companies keep deploying these broken tools because they're cheap upfront and the failure is diffuse. You don't get one giant bill that says 'your chatbot cost you 4,200 customers this quarter.' You just bleed slowly. Customers stop responding. Churn ticks up. Your NPS quietly crumbles. The Klarna situation is just the rare case where a company was honest about it in public. Most companies are living that same story right now and pretending they aren't. The problem with traditional chatbot automation is structural. These tools work off scripts and decision trees. They can answer 'what are your hours' and 'where is my order' if someone types the exact right words. The moment a customer says something unexpected, or needs an action taken inside an actual system, the bot hits a wall and routes to a human anyway. You haven't automated support. You've just added a frustrating extra step before the human.

What 'Actually Automating' Customer Support Looks Like in 2025

  • A real computer use agent doesn't just answer questions. It opens your CRM, finds the ticket, reads the account history, and takes action inside the actual software.
  • Manual support tickets cost between $15 and $25 each to process. A computer-using AI handles the same ticket for a fraction of that, at 3am, with no queue.
  • The best computer use setups handle refund processing, order updates, account changes, and escalation routing without a human touching the keyboard once.
  • AI computer use agents work across real desktops and browsers, not just APIs. That means they work with your legacy tools too, the ones that were never built for automation.
  • Teams using proper AI agent automation report handling 10x more tickets with the same headcount. That's not a rounding error. That's a structural change in how support works.
  • 47% of companies that don't use AI in support plan to implement it in 2025. Most of them will buy another chatbot and wonder why nothing changed.

Klarna fired 700 humans, replaced them with AI, then rehired humans six months later because the AI couldn't actually do the job. The problem wasn't using AI. The problem was using the wrong kind of AI.

Why Most 'AI Support' Tools Are Still Just Chatbots in a Trench Coat

Here's what nobody in the vendor space wants to say out loud: most AI customer support tools are still fundamentally chatbots with a GPT wrapper slapped on top. They generate better-sounding responses. They handle more natural language. But they still can't open a browser tab, log into your Zendesk instance, pull up a customer account, issue a partial refund, and send a confirmation email. They talk. They don't act. This is the core distinction between a language model bolted onto a chat widget and a true AI computer use agent. Computer use means the AI actually controls a computer, the cursor moves, forms get filled, buttons get clicked, data gets entered into real systems. It's the difference between a customer service rep who can only talk and one who can actually do the work. OpenAI's Operator and Anthropic's Computer Use features both gesture at this capability, and both are still in research preview or limited rollout as of 2025, with significant reliability gaps on complex real-world tasks. On OSWorld, the gold-standard benchmark for real-world computer use tasks, Claude Sonnet 4.5 scores 61.4%. That's not bad for a general-purpose model. But it's not good enough to trust with your live customer support queue either.

The Right Stack: What Automated Customer Support Actually Needs

If you're building a support automation system that won't embarrass you in six months, you need to think in three layers. First, intake and triage. This is where natural language understanding matters. The AI needs to correctly classify what the customer wants, pull context from their account, and decide whether this is something it can handle autonomously or something that needs a human. Second, action execution. This is where 99% of chatbot solutions fall apart. The AI needs to actually do things inside real software. That means navigating UIs, filling forms, reading dynamic page content, and handling multi-step workflows across multiple tools. No API shortcut. No pre-built connector that breaks every time your software updates. Real computer control. Third, escalation and handoff. Not every ticket should be fully automated. A good computer use agent knows when to hand off and does it with full context already written up, so the human agent doesn't start from zero. Build these three layers right and you stop bleeding customers. Build only the first layer, which is what most chatbot vendors sell you, and you get the Klarna situation.

Why Coasty Is the Computer Use Agent Built for This

I'm going to be direct here because I think the tool matters and I've seen what actually works. Coasty is the highest-performing computer use agent available right now, scoring 82% on OSWorld. For context, that's 20+ percentage points ahead of Claude Sonnet 4.5, which is one of the strongest general-purpose models on the market. That gap isn't marketing. It shows up in production. Coasty controls real desktops, real browsers, and real terminals. It doesn't need your legacy tools to have an API. It works the way a human agent works, by looking at the screen and taking action, except it doesn't need sleep, doesn't have bad days, and can run as a swarm of parallel agents handling multiple tickets simultaneously. For customer support specifically, this means you can point Coasty at your actual support workflow, your actual CRM, your actual ticketing system, and it learns to execute the same steps a trained human agent would. Refunds, account updates, order modifications, escalation notes, all of it. The desktop app is solid, cloud VMs are available for teams that don't want to manage infrastructure, and there's a free tier if you want to test it on real workflows before committing. BYOK is supported too, so you're not locked into one model provider. This is what the Klarna AI experiment should have been. Not a chatbot. A computer use agent that actually does the work.

Here's my honest take after watching this space for years. The companies that are going to win on customer support in the next two years are not the ones with the most human agents and not the ones with the most aggressive chatbot deployments. They're the ones that figure out what real computer use automation looks like and build it into their ops now, before their competitors do. Klarna's story is not a cautionary tale about AI. It's a cautionary tale about confusing hype for capability. A chatbot that talks but can't act is not automation. It's a speed bump with a friendly UI. If you want support that actually scales, that handles the full action loop from reading a ticket to resolving it inside real software, you need a real computer use agent. The benchmark doesn't lie. 82% on OSWorld is not a coincidence. Go try Coasty at coasty.ai. Start with the free tier. Point it at something real. You'll understand within an hour why the chatbot era is done.

Want to see this in action?

View Case Studies
Try Coasty Free