DCC: Data-Centric Compilation of Machine Learning Kernels for Processing-In-Memory Architectures
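A data-centric compilation method for processing-in-memory architectures that optimizes machine learning kernels from a dataflow perspective to break through the memory wall bottleneck.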
arXiv:2511.15503v2 Announce Type: replace-cross Abstract: High-performance Host processors can integrate Processing-In-Memory (PIM) devices, which can…