Loading...
Run LLMs on CPU with quantization
llama.cpp enables running large language models on consumer hardware using CPU with optimized quantization techniques. Supports many model formats.
Admin
Good start but needs more features to compete with established options.