1
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs
不再是优化提示词,而是进化攻击方法本身:一篇提出用进化算法自动合成LLM越狱攻击的论文,攻防研究者必读。
arXiv:2511.12710v2 Announce Type: replace Abstract: Automated red teaming frameworks for Large Language Models (LLMs) have become increasingly sophist…