1
Show HN: GPT-2 inference in pure C#, 0 bytes allocated per token
纯C#实现GPT-2推理引擎,零内存分配无GC压力,性能媲美ONNX Runtime,对.NET开发者极具吸引力。
Article URL: https://github.com/DevOnBike/Overfit Comments URL: https://news.ycombinator.com/item?id=48172293 Points: 1 # Comments: 0