1
Show HN: Llama CPU Benchmarks
TurboQuant号称8倍速,实测CPU端到端慢2.2倍,Qwen准确率还降17个百分点,别被合成数据骗了。
Article URL: https://deemwar-products.github.io/llama-cpu-benchmarks/ Comments URL: https://news.ycombinator.com/item?id=48212222 Points: 1 # Comments…
TurboQuant号称8倍速,实测CPU端到端慢2.2倍,Qwen准确率还降17个百分点,别被合成数据骗了。
Article URL: https://deemwar-products.github.io/llama-cpu-benchmarks/ Comments URL: https://news.ycombinator.com/item?id=48212222 Points: 1 # Comments…