Engineering
May 26, 2026
A new state of the art for computer use
We just hit a new state of the art on OSWorld. Here's how we built the agent, what surprised us along the way, and why topping the benchmark is just the starting line.
The Pointer blog