1
Distinguishable Deletion: Unifying Knowledge Erasure and Refusal for Large Language Model Unlearning
提出可区分删除方法,统一知识擦除与拒绝机制,为LLM去学习难题提供新思路。
arXiv:2605.16776v1 Announce Type: new Abstract: Mitigating sensitive and harmful outputs is fundamental to ensuring safe deployment of LLMs. Existing …