Microsoft has open-sourced bitnet.cpp, an efficient inference framework for 1-bit LLMs optimized to run on CPUs. It delivers speedups of up to 6.17x and cuts energy consumption by up to 82.2%, making it possible to run large 100-billion-parameter models locally without GPUs. This democratizes access to large language models.
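The "1-bit" models that bitnet.cpp targets (BitNet b1.58) constrain weights to the ternary values {-1, 0, +1}, which is what makes CPU inference so cheap: matrix multiplies reduce to additions and subtractions. A minimal sketch of the absmean ternary quantization described in the BitNet b1.58 paper, assuming a per-tensor scale (bitnet.cpp's actual kernels and packing format are more involved):

```python
import numpy as np

def absmean_quantize(W: np.ndarray, eps: float = 1e-5):
    """Quantize a weight matrix to ternary {-1, 0, +1} with absmean scaling.

    Illustrative sketch only; not bitnet.cpp's real code path.
    """
    gamma = np.abs(W).mean()                     # per-tensor scale factor
    W_ternary = np.clip(np.round(W / (gamma + eps)), -1, 1)
    return W_ternary.astype(np.int8), gamma

# Example: quantize a small random weight matrix
rng = np.random.default_rng(0)
W = rng.standard_normal((4, 4)).astype(np.float32)
W_q, gamma = absmean_quantize(W)

# At inference time the model uses W ≈ gamma * W_q, so the heavy
# matmul only ever touches {-1, 0, +1} integer values.
W_approx = gamma * W_q
```

Because each weight needs only ~1.58 bits (log2 of 3 states), memory traffic shrinks dramatically compared with 16-bit weights, which is where the reported speed and energy gains come from.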

From marktechpost.com