1
BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction
首个专为工程建造设计的LLM基准,物理对齐交互测试揭示大模型真实建造能力。
arXiv:2510.16559v5 Announce Type: replace Abstract: Engineering construction automation aims to transform natural language specifications into physica…