1
Compass: SLO-aware Query Planner for Compound AI Serving at Scale
面向大规模复合AI服务的SLO感知查询规划器,提出保障响应时间与服务质量的新方案。
arXiv:2504.16397v2 Announce Type: replace-cross Abstract: The rise of compound AI serving that integrates multiple operators in a pipeline enables end…