16:12
2026-06-16
twitter.com
artificial-intelligence
GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
A developer implemented a full Transformer with KV cache on an FPGA, achieving over 56,000 tokens per second at only 80 MHz, without using a GPU or CPU. The design was created gate by gate as a customβ¦