1
Reducing Political Manipulation with Consistency Training
用一致性训练算法对抗社交网络政治操纵,为AI安全提供新思路
arXiv:2605.22771v1 Announce Type: new Abstract: Large language models (LLMs) exhibit systematic political bias across a variety of sensitive contexts.…