GLM-5 and MiniMax M2.5, two new open-weight models available in Kilo Code, were benchmarked across three TypeScript coding tasks: bug hunting, legacy code refactoring, and API implementation from an OpenAPI spec. GLM-5 scored 90.5/100, excelling at greenfield development with 94 test cases, reusable middleware, and zero bugs.
