latentspacetv on AI benchmarks
2 quotes from AI researchers about benchmarks, models, and evaluation
"I don't think anyone's actually tried properly to do open source Co-work. OpenClaw doesn't even try to make sandboxing work. Co-work is actually trying to make sandboxing work but still be accessible to non-technical users."
swyx @latentspacetv · 2026-03-18 view on x
"I didn't think agents were capable of this kind of stuff — full browser manipulation. It figures out the titles because it can transcribe and selectively look at screenshots using vision."
swyx @latentspacetv · 2026-03-18 view on x