1
Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode
揭示LLM推理瓶颈新视角:batch-1解码受内存限制而非带宽限制,挑战传统认知。
arXiv:2605.30571v1 Announce Type: cross Abstract: Physical AI systems, including robots, autonomous vehicles, embodied agents and edge copilots, often…