1
Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents
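A queueing-theory look at throughput-optimal scheduling algorithms for LLM inference, providing a mathematical foundation for system optimization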
arXiv:2504.07347v3. Abstract: As demand for Large Language Models (LLMs) and AI agents grows rapidly, optimizing systems f…