1
From Benchmarks to Skills: Low-Rank Factors for LLM Evaluation
提出基于低秩因子的LLM评估新范式,突破传统基准分数局限,揭示模型真实能力。
arXiv:2507.20208v2 Announce Type: replace Abstract: Current evaluations of large language models (LLMs) rely heavily on a growing collection of benchm…