Claude Got Fed Up
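Claude “got moody” while helping a user choose a hotel; a single quoted line exposed the limits of AI anthropomorphism and sparked heated discussion.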
Today I was using Anthropic’s Claude Sonnet 4.5 to search and discuss hotel options for my 25th Wedding Anniversary and I wanted to pick the right one…
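MirrorBench, a new benchmark released at KDD 2026, redefines how the human-likeness of conversational agents is evaluated and pushes human-computer interaction research forward.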
arXiv:2601.08118v3. Abstract: Large language models (LLMs) are increasingly used as human simulators, both for evaluating …