Generalization or Memorization? Brittleness Testing for Chess-Trained Language Models
Using brittleness testing to uncover how language models play chess: genuine understanding of the rules, or rote memorization?
arXiv:2605.17565v1 Announce Type: cross Abstract: Recent work has fine-tuned language models on chess data and reported high benchmark scores as evide…