benchmark.space
Anthropic
Claude Sonnet 4.5
2 benchmarks
SWE-bench Verified: 77.2% (#14 of 40)
OSWorld: 61.4% (#10 of 16)