1
Prompt Compression in Diffusion Large Language Models: Evaluating LLMLingua-2 on LLaDA
评估LLMLingua-2在扩散大模型LLaDA上的提示压缩效果,探索高效推理新路径。
arXiv:2605.17932v1 Announce Type: new Abstract: Prompt compression reduces inference cost and context length in large language models, but prior evalu…