benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
YouTube · 2026-01-28
"OpenAI OSS 20B, a relatively small model and smallest size on the list yet able to achieve high SWE-bench Verified score."
Red Stapler
YouTube channel - local AI benchmarking
SWE-bench Verified
gpt-oss-20b
view original source →
all researcher takes →