ProbeLLM: Automating Principled Diagnosis of LLM Failures
A new framework that automates the diagnosis of LLM failures, using principled methods to locate the root causes of LLM errors.
arXiv:2602.12966v2 Announce Type: replace Abstract: Understanding how and why large language models (LLMs) fail is becoming a central challenge as mod…