The model, codenamed “Spud,” is designed to complete complex multi-step tasks with minimal human direction. It sets new benchmarks in agentic coding, computer use, and knowledge work, while matching ...