Yapay Zeka ModelleriDIY NAS Achieves 18 tok/s on 80B LLM Using Integrated Graphics
A technology enthusiast has successfully optimized a homemade NAS server to run an 80-billion-parameter large language model using only the CPU's integrated graphics unit, without an external GPU. This achievement redefines the limits of personal servers, achieving a processing speed of 18 tokens per second.






















