Niu Ge Picks (牛哥精选) · All


📝 Deep Tech · arXiv AI · 2026-05-23

Meta-Soft: Leveraging Composable Meta-Tokens for Context-Preserving KV Cache Compression

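Compresses the KV cache with composable meta-tokens, efficiently preserving context information and further speeding up large-model inference.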

arXiv:2605.22337v1. Abstract: The KV cache used in large language models has linearly growing time complexity, so LLMs face memory b…

KV cache compression · meta-token · context preservation · LLM inference optimization · composable meta-tokens

