Anthropic Just Launched there Most Intelligent Models Claude 4 Opus and Claude 4 With Ultimate Benchmarks And Become the Top LLM for Codeing
Anthropic claims Claude Opus 4 has achieved a 72.5% score on SWE-bench, a rigorous software engineering benchmark, outperforming OpenAI’s GPT-4.1, which scored 54.6% when it launched in April. The achievement establishes Anthropic as a formidable challenger in the increasingly crowded AI market.
