1
Show HN: Glq LLM quantization using E8 lattice
基于E8晶格的新型LLM量化方法GLQ,8权重一组映射为16位索引,精度2-8 bpw,开源有代码。
I have with the help of AI create an open source method of E8 LLM code book quantization library called glq. I was interested in creating Glq as a PC …