Anthropic just released the next generation of Claude models: Claude Opus 4 and Claude Sonnet 4!
The models achieve SOTA performance in SWE-bench with a score of 72.5% for Opus and 72.7% for Sonnet
Along with the model release, the models get a bunch of new abilities and features:
- Extended thinking with tool use (beta): Both models can use tools—like web search—during extended thinking so the models can to alternate between reasoning and tool use to provide way better responses
- Parallel tool execution: Both models can now use tools in parallel
- Improved prompt adherence: The models now follow instructions more precisely than previous models
- Better memory capabilities: The new models are better at extracting and saving key facts to maintain continuity and build tacit knowledge over time
The pricing for the models remains consistent with previous Opus and Sonnet models: Opus 4 at $15/$75 per million tokens (input/output) and Sonnet 4 at $3/$15. Both models are available on the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.
The Anthropic API also received some updates that should help build more powerful AI agents:
- code execution tool
- MCP connector
- Files API
- caching prompts for up to one hour
Lastly, Claude Code is now generally available and it now supports background tasks via GitHub Actions and native integrations with VS Code and JetBrains.
The models are already available in AI IDEs such as Cursor and GitHub Copilot.