Ghosted Layers: Unconstrained Activation Alignment for Recovering Layer-Pruned LLMs
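Proposes Ghosted Layers, a training-free method that recovers the performance of layer-pruned LLMs by using activation alignment to resolve the hidden-state mismatch.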
arXiv:2605.15491v1 Announce Type: cross Abstract: Layer pruning removes entire Transformer decoder blocks from large language models, but introduces a…