Three Chinese AI labs (Moonshot AI, ZAI (Zhipu AI), and MiniMax) have rapidly emerged as leaders in open-source LLM development, challenging closed-source models from OpenAI and Anthropic. Moonshot AI pioneered quantization-aware training with Kimi K2 Thinking, achieving state-of-the-art performance while optimizing for real-world inference. ZAI's GLM-4.7 model focuses on agentic capabilities and practical tool use and, at $3/month, is positioned as a cheaper alternative to Claude. MiniMax pivoted from linear attention to standard GQA, and its M2 release topped SWE-bench among open-source models. Unlike research-focused labs such as DeepSeek, this trio emphasizes application-driven development, targeting coding agents, tool use, and long-context capabilities with conservative but practical architectures.
