1
Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution
被ICML 2026收录,提出逐步置信度归因方法,精准诊断黑盒大模型的多步推理失败原因。
arXiv:2605.19228v1 Announce Type: cross Abstract: Large Language Models have achieved strong performance on reasoning tasks with objective answers by …