GLM-4.7-Flash is a 30B-A3B Mixture-of-Experts (MoE) language model that balances performance and efficiency for lightweight deployment. It posts strong results on benchmarks such as AIME 25, GPQA, and SWE-bench Verified, outperforming comparable models in several categories. The model supports local deployment.
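Local deployment of an open-weight MoE model like this is commonly done through an inference server such as vLLM. A minimal sketch, assuming the weights are published on Hugging Face under an id like `zai-org/GLM-4.7-Flash` (the repo id here is an assumption, not confirmed by this card):

```shell
# Install vLLM and serve the model locally.
# NOTE: the repo id below is hypothetical -- check the official model
# card for the actual published name before running.
pip install vllm
vllm serve zai-org/GLM-4.7-Flash \
  --tensor-parallel-size 1 \
  --max-model-len 32768
# vLLM exposes an OpenAI-compatible API at http://localhost:8000/v1,
# so any OpenAI client library can query the model once it is up.
```

Because only ~3B of the 30B parameters are active per token, an MoE model of this size can run on a single high-memory GPU, which is what makes it suitable for lightweight local serving.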