1
One Token to Fool LLM-as-a-Judge
只需一个token就能轻松骗过LLM评判者,揭示AI评估体系的安全软肋。
arXiv:2507.08794v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly trusted as automated judges, assisting evaluat…
只需一个token就能轻松骗过LLM评判者,揭示AI评估体系的安全软肋。
arXiv:2507.08794v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly trusted as automated judges, assisting evaluat…
因果框架揭穿LLM法官的"伪推理",让你看清AI评测中的隐藏偏见
arXiv:2605.23970v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as automatic judges for summarization and dialogue …