1
Dimension-Level Intent Fidelity Evaluation for Large Language Models: Evidence from Structured Prompt Ablation
提出维度级意图保真度评估框架,通过结构化提示消融实验揭示LLM的意图还原与形式复制差异。
arXiv:2605.14517v1 Announce Type: cross Abstract: Holistic evaluation scores capture overall output quality but do not distinguish whether a model rep…