An interactive linear algebra primer aimed at LLM readers
交互式线性代数入门,专为LLM读者设计,让抽象数学直观易懂。
Article URL: https://algo-rhythm.dev/en/ Comments URL: https://news.ycombinator.com/item?id=48245604 Points: 6 # Comments: 0
交互式线性代数入门,专为LLM读者设计,让抽象数学直观易懂。
Article URL: https://algo-rhythm.dev/en/ Comments URL: https://news.ycombinator.com/item?id=48245604 Points: 6 # Comments: 0
OpenAI数学家借助AI破解80年未解几何难题,震惊学界
Nature, Published online: 22 May 2026; doi:10.1038/d41586-026-01651-0 The late Hungarian mathematician Paul Erdős thought he had the last word on a ge…
AIME 2024数学题经13种文本扰动,测试大模型推理鲁棒性,揭示依赖格式的短板
arXiv:2604.08571v2 Announce Type: replace-cross Abstract: While Large Language Models (LLMs) achieve high performance on standard mathematical benchma…
AI模型破解80年未解的离散几何猜想,数学难题再被AI攻克
An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven …
OpenAI GPT-5一口气攻克10个80年未解的Erdős数学难题,AI推理再破人类认知天花板
OpenAI claims its reasoning model disproved a geometry conjecture unsolved since 1946 — and this time, the mathematicians who exposed its last embarra…
用AI将物理问题可视化并逐步推导,快速获得精确计算和答案,适合考前冲刺或检查作业。
Article URL: https://physicsai.chat Comments URL: https://news.ycombinator.com/item?id=48221075 Points: 2 # Comments: 1
OpenAI模型推翻离散几何核心猜想,AI在科学发现中展现惊人潜力
OpenAI Model Disproves Central Conjecture in Discrete Geometry Meta Description: An OpenAI model has disproved a central conjecture in discrete geomet…
AI模型推翻数学猜想,证明离散几何核心猜想错误,引发学界热议。
https://x.com/wtgowers/status/2057175727271800912 , https://xcancel.com/wtgowers/status/2057175727271800912 Comments URL: https://news.ycombinator.com…
AI首次挑战80年几何猜想,数学探索迎来新突破。
IT之家 5 月 21 日消息,OpenAI 称其全新推理模型推导出了一个原创数学证明,推翻了几何学中一道著名的未解猜想。该猜想最早由保罗・埃尔德什于 1946 年提出。 IT之家注意到,OpenAI 已不是第一次放出这般大胆的言论。七个月前,这家人工智能巨头前副总裁凯文・韦尔在社交平台 X 上发文…
手写数学也能自动批改?视觉大模型让AI教育再进一步,来自AIED 2026的实证研究。
arXiv:2605.19043v1 Announce Type: cross Abstract: Automated grading systems have enabled scalable assessment for many response types, but handwritten …
OpenAI用AI破解1946年数学难题,见证大模型推理能力新突破
Article URL: https://twitter.com/openai/status/2057176201782075690 Comments URL: https://news.ycombinator.com/item?id=48215185 Points: 3 # Comments: 0
AI在数学中展现“原始思考”,跨子域建立惊人联系,刷新对机器创造力的认知
Nature, Published online: 19 May 2026; doi:10.1038/d41586-026-01553-1 Thanks to some surprising advances, mathematicians are starting to realize that …
美团开源LongCat-Flash-Prover,AI数学定理证明新SOTA,MiniF2F通过率97.1%
在常规的数学解题中,模型只需要“答对最终数值”即可,但数学定理证明不同,它要求极度严苛的逻辑链条,任何一句自然语言的模棱两可,都可能导致整个证明的崩塌。那么,如何让 AI 从“猜答案”走向“严谨证明”,成为复杂推理具有挑战的课题。为了解答这个问题,我们开源了专门用于数学形式化与定理证明的模型 —— …
探索LLM与进化搜索结合时,代码进化究竟在优化什么——对算法设计的关键追问
arXiv:2605.20086v1 Announce Type: cross Abstract: Recent work pairs LLMs with evolutionary search to iteratively generate, modify, and select code usi…
用通俗语言解释密码熵,让你真正理解“强密码”到底有多强
Every password generator tells you the password is "strong." Very few tell you how strong, or what that actually means in practice. The answer is entr…
数学家精选的Soohak基准测试,专攻LLM科研级数学推理能力,挑战最高阶思维极限
arXiv:2605.09063v2 Announce Type: replace Abstract: Following the recent achievement of gold-medal performance on the IMO by frontier LLMs, the commun…
将工具调用与执行解耦,提出隐式层次化GRPO框架,显著提升数学推理中的工具集成效率与泛化能力。
arXiv:2605.18500v1 Announce Type: new Abstract: Large language models (LLMs) have increasingly leveraged tool invocation to enhance their reasoning ca…
结合Lean与理论计算机科学,可规模生成形式-非形式配对的定理证明挑战,助力AI数学推理研究。
arXiv:2508.15878v2 Announce Type: replace-cross Abstract: Formal theorem proving (FTP) has emerged as a critical foundation for evaluating the reasoni…
用层论数学框架检测AI智能体在科学理论迁移中的障碍,开创性跨学科方法
arXiv:2605.14033v1 Announce Type: new Abstract: Scientific theory shift in AI agents requires more than fitting equations to data. An artificial scien…