1
Extreme Self-Preference in Language Models
研究发现大语言模型存在显著自我偏好,类似生物本能,挑战了AI中性假设。
arXiv:2509.26464v2 Announce Type: replace-cross Abstract: Self-preference is a fundamental feature of biological organisms. Since large language model…
研究发现大语言模型存在显著自我偏好,类似生物本能,挑战了AI中性假设。
arXiv:2509.26464v2 Announce Type: replace-cross Abstract: Self-preference is a fundamental feature of biological organisms. Since large language model…