llama.cpp
High-performance local LLM inference
C/C++ local inference project supporting GGUF models and multi-platform deployment.
- Pricing
- Open source
- Region
- US
- Chinese
- Partial
- API
- Available
- Website
- github.com
- Updated
- 2026-07-04
Features
- Easy to start
- Useful across workflows
- Actively updated
Use cases
- Daily productivity
- Content creation
Pros
- Mature product
- Active ecosystem
Things to note
- Advanced features often require a paid plan
- Availability may vary by region