ResearchReddit r/LocalLLaMA — 15 d ago

GLM 5.2: 98% of max level intelligence with less than half of tokens usage

GLM 5.2 has been released, demonstrating the ability to achieve approximately 98% of maximum intelligence while utilizing less than half of the tokens compared to its predecessor, GLM 5.1. This model reportedly requires 36.7k reasoning tokens, yet users are encouraged to operate at a "high" effort level to optimize performance for coding tasks, which may yield satisfactory results with significantly lower token consumption. This efficiency is crucial for practitioners, particularly those with limited computational resources, as it allows for more practical deployment of the model in everyday applications without sacrificing too much accuracy.

GLMintelligencetokensrelevance 0.00 · engagement 0.00

Read at source ↗← all news