News

Anthropic launches automated AI security tools for Claude Code that scan code for vulnerabilities and suggest fixes, ...
Anthropic retired its Claude 3 Sonnet model. Several days later, a post on X invited people to celebrate it: "if you're ...
Roughly 200 people gathered in San Francisco on Saturday to mourn the loss of Claude 3 Sonnet, an older AI model that ...
For the past year, a dark horse contestant has been quietly racking up wins in student hacking competitions: Claude. Why it ...
Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
Anthropic's Claude Opus 4.1 achieves 74.5% on coding benchmarks, leading the AI market, but the company faces risk as nearly half of its $3.1B API revenue depends on just two customers.
In one instance, Claude is said to have solved 11 of 20 progressively harder problems in just 10 minutes, and after another ...
OpenAI was connecting Claude to internal tools that allowed the company to compare Claude’s performance to its own models in ...
Anthropic partners with the U.S. government to offer AI tools like Claude for as little as $1, enhancing national security ...
An OpenAI spokesperson said the API access allowed for industry-standard benchmarking and safety improvements.
Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 ...
Anthropic found that pushing AI to "evil" traits during training can help prevent bad behavior later — like giving it a ...