1
One Token to Fool LLM-as-a-Judge
只需一个token就能轻松骗过LLM评判者,揭示AI评估体系的安全软肋。
arXiv:2507.08794v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly trusted as automated judges, assisting evaluat…
只需一个token就能轻松骗过LLM评判者,揭示AI评估体系的安全软肋。
arXiv:2507.08794v3 Announce Type: replace-cross Abstract: Large language models (LLMs) are increasingly trusted as automated judges, assisting evaluat…