1
OPT-Engine: Benchmarking the Limits of LLMs in Optimization Modeling via Complexity Scaling
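This paper pinpoints the capability boundaries of LLMs in optimization modeling: pure-text reasoning loses robustness sharply as problem complexity rises, tool-integrated reasoning handles the arithmetic but fails to maintain global constraints, and automatic constraint formulation is the core bottleneck for current SOTA models. It offers a structured roadmap of the hurdles that next-generation reasoning models must clear.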
arXiv:2601.19924v2 Announce Type: replace-cross Abstract: We investigate the capabilities and scalability of Large Language Models (LLMs) in optimizat…