1
E2LLM: Towards Efficient LLM Serving in Heterogeneous Edge/Fog Environments
异构边缘/雾环境中大模型高效服务的新方案,解决部署延迟与资源优化难题。
arXiv:2606.03770v1 Announce Type: cross Abstract: Large Language Models (LLMs) have become integral to modern applications, yet their deployment remai…