1
UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing
提出基于校准不确定度的LLM级联路由方案,在保持性能的同时降低推理成本。
arXiv:2605.18796v1 Announce Type: new Abstract: LLM cascades and model routing promise lower inference cost by sending easy queries to a small model a…