1
Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges
因果框架揭穿LLM法官的"伪推理",让你看清AI评测中的隐藏偏见
arXiv:2605.23970v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as automatic judges for summarization and dialogue …