AIME25

mentions 2 type Organization feed RSS

// recent coverage 2 mentions

15:18

2026-06-04

github.com

large-language-models

KVarN: Native vLLM KV-cache quantization back end by Huawei

Huawei released KVarN, a native KV-cache quantization back end for vLLM that delivers up to 5x more cache capacity and 1.3x the throughput of FP16 while maintaining FP16-level accuracy. The calibratio…

18:22

2026-05-16

research.nvidia.com

large-language-models

iGRPO: Self-Feedback-Driven LLM Reasoning

Researchers introduced Iterative Group Relative Policy Optimization (iGRPO), a two-stage reinforcement learning method that improves large language model reasoning by having the model generate and ref…

// co-occurs with top 8 entities

GRPO 1 iGRPO 1 Nemotron-H-8B-Base-8K 1 DeepSeek-R1 Distilled 1 OpenReasoning-Nemotron-7B 1 AceReason-Math 1 AIME24 1 Huawei 1