1
OpenDeepThink: Parallel Reasoning via Bradley--Terry Aggregation
提出OpenDeepThink方法,用Bradley-Terry模型实现并行推理,无需真实验证器即可筛选最佳候选,为LLM推理扩展新路径。
arXiv:2605.15177v1 Announce Type: new Abstract: Test-time compute scaling is a primary axis for improving LLM reasoning. Existing methods primarily sc…