Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity
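Proposes a policy optimization method that maximizes output diversity while guaranteeing generation quality, addressing the problem of homogenized LLM content.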
arXiv:2602.15894v2 Announce Type: replace-cross Abstract: In many large language model (LLM) alignment applications, users expect not only high-qualit…