LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws
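Re-examines LLMs through the lens of Shannon information theory, revealing the deep connection between model capacity and scaling laws. Frontier research from ICML 2026.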
arXiv:2605.23901v1 Announce Type: cross Abstract: Existing scaling laws for Large Language Models (LLMs), predominantly monotonic power laws, fail to …