It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs
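How does complementary self-distillation preserve contextual integrity in large language models? This study proposes a new dual-model collaboration scheme, offering a fresh approach to LLM safety alignment.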
arXiv:2605.20258v1 Announce Type: new Abstract: Contextual Integrity (CI) defines privacy not merely as keeping information hidden, but as governing i…