Extreme Self-Preference in Language Models
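Research finds that large language models exhibit pronounced self-preference, akin to a biological instinct, which challenges the assumption that AI is neutral.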
arXiv:2509.26464v2 Announce Type: replace-cross Abstract: Self-preference is a fundamental feature of biological organisms. Since large language model…
LLMs acting as judges tend to favor themselves; this paper quantifies that self-preference bias and proposes mitigation methods.
arXiv:2604.22891v3 Announce Type: replace-cross Abstract: LLM-as-a-Judge has become a dominant approach in automated evaluation systems, playing criti…