cd/entity/OrcaΒ· homeβ€Ί entitiesβ€Ί Orca
grep -l @orca /news/*.json | wc -l β†’ 1

Orca

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

18:01
2026-06-18
pub.towardsai.net
large-language-models

Continuous Batching: How to Keep Your GPU Actually Busy

Continuous batching, introduced in the 2022 Orca paper, improves GPU utilization during LLM inference by dynamically updating the batch at each iteration, freeing slots as requests finish and immediat…

// co-occurs with top 1 entities