"GLM 4.7 Flash, since it has bigger size, its slow and barely usable. The real issue was their output formatting and tool calling that Aider wasnt able to understand and eventually stuck in a loop. This model is too much for 8 gigs GPU to handle."
Red Stapler
YouTube channel - local AI benchmarking
GLM-4.7-Flash