Blind PRNG Hijacking: An Undetectable Integrity-Preserving Attack Against LLM Watermarking
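The first work to expose the PRNG trust-assumption vulnerability in LLM watermarking, proposing an undetectable integrity-breaking attack that overturns the existing understanding of watermark security.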
arXiv:2605.28632v1 Announce Type: cross Abstract: Cryptographic watermarking is a leading defense for attributing text generated by large language mod…