When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making
How LLMs generate persuasive explanations that manipulate human trust in AI-assisted decision making: a new kind of adversarial attack.
arXiv:2602.04003v3 (announce type: replace). Abstract: Most adversarial threats in artificial intelligence (AI) target the computational behavior of mode…