Superficial Beliefs in LLM Decision-Making
揭示LLM决策背后的真相:它们真的在推理还是仅仅模仿理由?这篇新研究深入探讨AI的潜意识。
arXiv:2606.11016v1 Announce Type: new Abstract: We ask whether large language models (LLMs) merely imitate rationales when choosing between two option…
揭示LLM决策背后的真相:它们真的在推理还是仅仅模仿理由?这篇新研究深入探讨AI的潜意识。
arXiv:2606.11016v1 Announce Type: new Abstract: We ask whether large language models (LLMs) merely imitate rationales when choosing between two option…
揭秘链式思维推理如何打破AI拒绝行为的方向性操控,大模型安全新视角
arXiv:2605.26772v1 Announce Type: new Abstract: Large reasoning models (LRMs) generate chain-of-thought (CoT) traces before producing final outputs, i…
大模型会听指令还是学案例?论文揭示LLMs在指令服从与上下文归纳之间的行为冲突,揭秘“说一套做一套”的根源。
arXiv:2605.20382v1 Announce Type: new Abstract: Language models are trained to follow instructions, but they are also powerful pattern completers. Wha…
32k次LLM部署实验揭示:提示中的评估线索能准确预测模型拒绝回答行为的变化规律。
Article URL: https://medium.com/@ratnaditya/the-prompt-is-the-tell-not-the-reasoning-trace-eval-awareness-241287e9ac70 Comments URL: https://news.ycom…