# Kog AI

> Entity coverage from Web Pulse
> Last updated: 2026-05-29T18:00:36.070164+00:00
> 2 articles mentioning **Kog AI**

- [Real-time LLM Inference on Standard GPUs: 3k tokens/s per request](https://wpnews.pro/news/real-time-llm-inference-on-standard-gpus-3k-tokens-s-per-request) — 2026-05-29
- [Building a single-kernel, latency-optimized LLM inference engine on AMD MI300X GPUs](https://wpnews.pro/news/building-a-single-kernel-latency-optimized-llm-inference-engine-on-amd-mi300x) — 2026-05-28