Niu Ge Picks (牛哥精选) · All

📝 Deep Tech · arXiv · NLP · 2026-05-20

Evaluation Drift in LLM Personality Induction: Are We Moving the Goalpost?

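A study shows that the evaluation criteria used in LLM personality tests have quietly shifted, calling the reliability of existing conclusions into question.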

arXiv:2605.16996v1 Announce Type: new Abstract: Can large language models reliably express a human-like personality, or are they merely mimicking surf…

llm · personality induction · evaluation drift · model evaluation · large language models
