Soft Specialists: $\alpha$-R\'enyi Ensembles for Uncertainty-Aware LLM Post-Training
Uses α-Rényi ensembles to improve uncertainty modeling in LLM post-training; frontier academic research.
arXiv:2605.27747v1 Announce Type: cross Abstract: Existing training approaches for large language models learn a single set of parameters, based on la…