Anthropic reran its "Project Fetch" robotics test and found its newer Claude models could outperform the previous generation.
Claude AI robotics benchmark shows Opus 4.7 finishing physical robot programming in 9 minutes, against 181 minutes for ...