OpenAI's gpt-oss models use MXFP4, a 4-bit floating-point data type that cuts weight memory by roughly 75% compared to BF16. MXFP4 stores weights in small micro-scaling blocks (32 elements in the OCP Microscaling specification) that share a single power-of-two scale factor, preserving dynamic range even though each element occupies only 4 bits (E2M1: one sign bit, two exponent bits, one mantissa bit). This is what lets a 120-billion-parameter model fit on a single 80 GB GPU rather than requiring several. Because token generation is typically memory-bandwidth bound, the smaller weights can also translate into substantially faster decoding and lower hardware costs, though the reduced precision may cost some output quality compared to higher-precision formats.
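To make the block-scaling idea concrete, here is a minimal NumPy sketch of MXFP4-style quantization. It is an illustrative simulation, not OpenAI's implementation: it picks a shared power-of-two scale per 32-element block, then snaps each scaled element to the nearest value representable in FP4 E2M1 (magnitudes 0, 0.5, 1, 1.5, 2, 3, 4, 6).

```python
import numpy as np

# The 8 non-negative values representable in FP4 E2M1
FP4_E2M1 = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])
# Full signed grid (duplicate zero is harmless for nearest-value snapping)
GRID = np.concatenate([-FP4_E2M1[::-1], FP4_E2M1])

def mxfp4_quant_dequant(x, block=32):
    """Simulate MXFP4: per-block shared power-of-two scale + snap to the E2M1 grid."""
    x = np.asarray(x, dtype=np.float64)
    out = np.empty_like(x)
    for i in range(0, len(x), block):
        blk = x[i:i + block]
        amax = np.abs(blk).max()
        # Shared scale: power of two chosen so the block's largest magnitude
        # lands near 6.0, the largest representable E2M1 magnitude.
        scale = 2.0 ** np.floor(np.log2(amax / 6.0)) if amax > 0 else 1.0
        scaled = blk / scale
        # Snap each scaled element to its nearest representable FP4 value
        idx = np.abs(scaled[:, None] - GRID[None, :]).argmin(axis=1)
        out[i:i + block] = GRID[idx] * scale
    return out
```

With 4 bits per element plus one 8-bit scale shared across 32 elements, storage works out to 4.25 bits per weight, which is where the roughly-75%-smaller-than-BF16 figure comes from.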