Faster-GCG: Efficient Discrete Optimization Jailbreak Attacks against Aligned Large Language Models
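Proposes Faster-GCG, an algorithm that significantly speeds up discrete-optimization jailbreak attacks and efficiently breaches the defenses of aligned large language models.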
arXiv:2410.15362v2 Announce Type: replace Abstract: Aligned Large Language Models (LLMs) have attracted significant attention for their safety, partic…