GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
(twitter.com)
27 points
by laxmena
2 hours ago |
8 comments
()
()
()