Llama.cpp is an active open-source community that implements Meta's LLaMA architecture in C/C++. It runs everywhere, serves as a Schelling point for low-level features, has a custom model format, and focuses on a single model architecture.
Sort: