The OSWorld Benchmark Results Are In, and Most AI Agents Should Be Embarrassed - Coasty Blog