1
MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models
新基准MixRea评估LLM在显式与隐式推理上的表现,揭示推理能力的短板。
arXiv:2605.20128v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly integrated into high-stakes decision-making. Inspired by…