1
Reinforcing Human Behavior Simulation via Verbal Feedback
用语言反馈教大模型模拟人类行为,比代码数学更难但更具社会智能价值
arXiv:2605.20506v1 Announce Type: new Abstract: Humans learn social norms and behaviors from verbal feedback (e.g., a parent saying "that was rude" or…