Anthropic’s Computer Use mode shows strengths and limitations in new study

Claude can perform impressively complex tasks, but it will also make stupid mistakes from time to time.