Llama.cpp is an actively developed open-source project that implements Meta's LLaMA architecture in C/C++. It runs everywhere, serves as a Schelling point for low-level features, has a custom model format, and focuses on a single model architecture.

2m read time. From matt-rickard.com
