A Compression Tool for LLM Reads. Est. 60-95% Fewer Tokens
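A token compression tool built for LLMs that claims to cut token usage by 60-95%. Reading papers or feeding context to a model? It can substantially lower inference cost and speed up processing.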
Article URL: https://github.com/chopratejas/headroom Comments URL: https://news.ycombinator.com/item?id=48155888 Points: 3 # Comments: 0