1
Non-linear Interventions on Large Language Models
突破线性干预局限,提出大模型非线性特征干预新方法,为理解LLM内部表征开辟新路径
arXiv:2605.14749v1 Announce Type: cross Abstract: Intervention is one of the most representative and widely used methods for understanding the interna…