牛哥精选 · 所有

1

⚡ 效率工具 Hacker News Ask 2026-07-11

Ask HN: How are you controlling Token Costs?

真实数据揭示LLM编码代理90%时间浪费在重读上下文，手把手教你控制token成本

I have been using LLMs & Coding Agent since early 2024. A large problem with Coding Agents & LLMs in general is context compression. To give you some …

token成本上下文压缩编码代理 claude cod 成本控制

2

🤖 AI·大模型 Hacker News LLM 2026-07-06

Compressor V2: three compression layers for a 50% LLM agent cost cut

Edgee推出Compressor V2，三层次压缩策略将LLM Agent成本砍半，性能与效率兼得。

Article URL: https://www.edgee.ai/blog/posts/introducing-compressor-v2-three-compression-layers-measured-end-to-end-for-a-50-cost-reduction Comments U…

compressor llm成本上下文压缩 agent优化 edgee

3

📝 深度技术 arXiv AI 2026-06-23

Governance Decay: How Context Compaction Silently Erases Safety Constraints in Long-Horizon LLM Agents

揭示LLM长对话中上下文压缩悄然抹除安全护栏的隐患，值得AI安全领域关注。

arXiv:2606.22528v1 Announce Type: new Abstract: Modern LLM agents increasingly rely on context compaction, summarization, or eviction to keep long-run…

llm agent 上下文压缩安全约束治理退化长上下文

4

🔓 开源项目 Hacker News AI 2026-06-20

Show HN: AgentArk – open-source self-hosted AI agent OS

开源自托管AI Agent OS，ArkDistill压缩噪点输出60-90%，大幅节省上下文空间！

Article URL: https://github.com/agentark-ai/AgentArk Comments URL: https://news.ycombinator.com/item?id=48606186 Points: 3 # Comments: 0

agentark ai agent 开源自托管上下文压缩

5

🤖 AI·大模型 Hacker News AI 2026-06-20

Show HN: Zero-config session-taste packer for AI agents

零配置AI agent上下文压缩工具，从56K token骤降至1.9K，自动学习你的编程风格，兼容任意agent。

Article URL: https://github.com/dvcoolarun/taste-ai Comments URL: https://news.ycombinator.com/item?id=48608707 Points: 1 # Comments: 0

ai agent 上下文压缩代码风格学习零配置开源

6

🤖 AI·大模型 Hacker News LLM 2026-06-17

Tokdiet a local proxy that cuts LLM token spend ~70% without quality loss

本地代理tokdiet通过智能上下文优化，在不牺牲回答质量的前提下，将LLM token开销削减高达70%，实测基准验证效果。

Article URL: https://github.com/agiwhitelist/tokdiet Comments URL: https://news.ycombinator.com/item?id=48563156 Points: 1 # Comments: 1

tokdiet 本地代理 llm token优化成本削减

7

⚡ 效率工具 Hacker News LLM 2026-06-10

Show HN: Lore – LLM proxy for coding agent context and memory management

比压缩算法好2倍：Lore用智能记忆管理让AI编码代理告别68分钟/天的重复解释，总召回率提升至2.6倍。

Article URL: https://withlore.ai/ Comments URL: https://news.ycombinator.com/item?id=48464573 Points: 4 # Comments: 0

lore llm代理代码智能体上下文管理记忆管理

8

🤖 AI·大模型 Hacker News LLM 2026-06-09

TokenTamer A proxy that reduces LLM token usage through context compression

实时压缩代码上下文，降低LLM API成本50-80%，可直接嵌入的代理工具。

Article URL: https://github.com/borhen68/TokenTamer Comments URL: https://news.ycombinator.com/item?id=48458633 Points: 1 # Comments: 1

token压缩 llm成本优化上下文压缩开源代理实时压缩

9

📝 深度技术 arXiv AI 2026-06-09

Decision-Aware Memory Cards: Counterfactual-Inspired Context Selection and Compression for Tool-Using LLM Agents

反事实推理优化LLM Agent记忆管理，创新性地解决工具使用中的上下文选择与压缩难题。

arXiv:2606.08151v1 Announce Type: new Abstract: Tool-using LLM agents often fail not because relevant text is absent, but because decisive evidence is…

llm代理反事实推理上下文压缩记忆管理工具使用

10

📝 深度技术 arXiv AI 2026-06-02

ACON: Optimizing Context Compression for Long-horizon LLM Agents

最新论文提出ACON方法，优化长时域LLM代理的上下文压缩，显著提升推理效率与准确性。

arXiv:2510.00615v3 Announce Type: replace Abstract: Large language models (LLMs) are increasingly deployed as agents in dynamic real-world environment…

上下文压缩 llm代理长上下文 acon 效率优化

11

📝 深度技术 arXiv AI 2026-05-28

Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor

提出思考即压缩的观点，把大模型推理能力重新定义为上下文压缩机制，视角极为新颖

arXiv:2605.28713v1 Announce Type: new Abstract: Context compression aims to shorten long context inputs with minimal information loss for LLM inferenc…

推理模型上下文压缩大模型原理信息论思维机制

12

📝 深度技术 arXiv AI 2026-05-25

Parallel Context Compaction for Long-Horizon LLM Agent Serving

针对长时LLM Agent的上下文溢出问题，提出并行压缩方法，减少数十秒推理阻塞。

arXiv:2605.23296v1 Announce Type: new Abstract: Long-horizon LLM agents accumulate growing conversation histories that eventually exceed the model's c…

llm代理上下文压缩推理优化并行计算摘要技术

13

📝 深度技术 arXiv 机器学习 2026-05-20

Compress the Context, Keep the Commitments: A Formal Framework for Verifiable LLM Context Compression

提出可验证的LLM上下文压缩形式化框架，压缩同时保持承诺完整性，AI安全新思路。

arXiv:2605.17304v1 Announce Type: new Abstract: LLM context is not just tokens; it is a set of commitments. Long-running conversations accumulate goal…

llm上下文压缩可验证性形式化框架承诺保持 ai安全

🐂 牛哥精选