What are the Right Symmetries for Formal Theorem Proving?
探讨形式定理证明中对称性的选择,为自动推理提供新视角。
arXiv:2605.22257v1 Announce Type: cross Abstract: Formal theorem provers based on large language models (LLMs) are highly sensitive to superficial var…
探讨形式定理证明中对称性的选择,为自动推理提供新视角。
arXiv:2605.22257v1 Announce Type: cross Abstract: Formal theorem provers based on large language models (LLMs) are highly sensitive to superficial var…
基于智能体的自动证明优化框架,提升数学证明的简洁性与可读性,ICLR 2025 顶会论文。
arXiv:2410.04753v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have been used to generate formal proofs of mathematical theore…
美团开源LongCat-Flash-Prover,AI数学定理证明新SOTA,MiniF2F通过率97.1%
在常规的数学解题中,模型只需要“答对最终数值”即可,但数学定理证明不同,它要求极度严苛的逻辑链条,任何一句自然语言的模棱两可,都可能导致整个证明的崩塌。那么,如何让 AI 从“猜答案”走向“严谨证明”,成为复杂推理具有挑战的课题。为了解答这个问题,我们开源了专门用于数学形式化与定理证明的模型 —— …
LeanSearch v2提出全局前提检索,一次性找出Lean 4定理所需全部引理,突破现有单步或语义匹配局限。
arXiv:2605.13137v2 Announce Type: replace-cross Abstract: Proving theorems in Lean 4 often requires identifying a scattered set of library lemmas whos…
结合Lean与理论计算机科学,可规模生成形式-非形式配对的定理证明挑战,助力AI数学推理研究。
arXiv:2508.15878v2 Announce Type: replace-cross Abstract: Formal theorem proving (FTP) has emerged as a critical foundation for evaluating the reasoni…
提出最小化agent基线,系统对比AI定理证明器架构,核心特性包括迭代改进、库搜索与上下文管理。
arXiv:2602.24273v3 Announce Type: replace Abstract: We propose a minimal agentic baseline that enables systematic comparison across different AI-based…