When Probing Accuracy Saturates, Fragility Resolves: A Complementary Metric for LLM Pre-Training Analysis
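When standard probing accuracy saturates, a "fragility" measure is introduced as a complementary metric, offering a new perspective on LLM pre-training analysis.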
arXiv:2606.11375v1 Announce Type: cross Abstract: Standard linear probing declares a property "encoded" when a classifier on hidden states achieves hi…