Phoronix is a leading source for Linux hardware reviews, benchmarking, and open-source news. With performance analysis, hardware testing, and coverage of the latest developments in the Linux ecosystem, Phoronix provides insights for Linux enthusiasts, developers, and system administrators, helping them make informed decisions about hardware compatibility, performance optimization, and software selection.

Phoronix

KTransformers 0.5.3 has been released with AVX2-only inference support for Mixture of Experts (MoE) models. BF16, FP8, and GPTQ-INT4 workloads can now run on CPUs that lack AVX-512 or AMX (such as Intel Core/Ultra consumer processors), which makes local LLM inference viable on a broader range of hardware. The release also includes NUMA-aware deployment improvements for multi-socket systems, lower idle CPU overhead, speculative decoding enhancements, and other fixes.
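To see which of these instruction-set tiers a given Linux machine falls into, the CPU feature flags in /proc/cpuinfo can be inspected. The following is a minimal sketch and not KTransformers' own detection logic: the function names and the tier labels ("amx", "avx512", "avx2", "baseline") are hypothetical, chosen only for illustration, while the flag names (`amx_tile`, `avx512f`, `avx2`) are the ones the Linux kernel reports on x86.

```python
from pathlib import Path


def parse_cpu_flags(cpuinfo_text: str) -> set[str]:
    """Return the feature flags from the first 'flags' line of /proc/cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            return set(value.split())
    # No 'flags' line (e.g. non-x86 systems): report nothing detected.
    return set()


def pick_simd_tier(flags: set[str]) -> str:
    """Pick the widest tier available; the tier names are illustrative assumptions."""
    if "amx_tile" in flags:
        return "amx"
    if "avx512f" in flags:
        return "avx512"
    if "avx2" in flags:
        return "avx2"
    return "baseline"


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    text = cpuinfo.read_text() if cpuinfo.exists() else ""
    print(pick_simd_tier(parse_cpu_flags(text)))
```

A machine that prints "avx2" here has neither AVX-512 nor AMX, which is the class of CPU the new AVX2-only path is aimed at.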

KTransformers Adds AVX2 MoE Support For Viable Performance On CPUs Without AMX/AVX-512