1
Reasoning models struggle to control their chains of thought, and that’s good
OpenAI研究发现推理模型难以自主控制思维链,这反而强化了可监控性,为AI安全提供新思路。
OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeg…