cd/entity/Delayed Tensor ParallelismΒ· homeβ€Ί entitiesβ€Ί Delayed Tensor Parallelism
grep -l @delayed tensor parallelism /news/*.json | wc -l β†’ 1

@Delayed Tensor Parallelism

mentions 1 type Person feed RSS
16:16
2026-05-28
blog.kog.ai
large-language-models

Delayed Tensor Parallelism for Faster Transformer Inference

Kog Team researchers introduced Delayed Tensor Parallelism (DTP), a Transformer architecture that hides communication overhead behind computation and weight streaming to accelerate batch-size-one infe…

// co-occurs with top 4 entities