benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
arxiv
Qwen 3 32B + CodeStruct
1 benchmarks
SWE-bench Verified
#75 of 77
16.0%