cd/entity/Gemma· home entities Gemma
grep -l @gemma /news/*.json | wc -l → 76

Gemma

mentions 76 type Organization page 2/4 feed RSS
sameAs · en.wikipedia.org

// recent coverage 76 mentions

18:03
2026-06-20
hackster.io
ai-tools

Offline AI Voice Assistant on Raspberry Pi 4 with Gemma

A developer built a fully offline voice assistant on a Raspberry Pi 4 or 5 using local AI models. The device records audio, processes it with Whisper for speech-to-text, runs a local language model vi…

20:25
2026-06-19
lmsys.org
large-language-models

The next generation of speculative decoding: DFlash and Spec V2

Modal and Z Lab released DFlash, a speculative decoding model for Qwen 3.5 397B-A17B, achieving over 4.3x throughput versus baseline and 1.5x versus MTP on HumanEval at concurrency 1. The model uses a…

01:53
2026-06-18
letsdatascience.com
large-language-models

Google releases OpenRL for LLM fine-tuning

Google released OpenRL, an open-source API for fine-tuning large language models on Kubernetes clusters, aiming to decouple infrastructure from AI research and improve GPU utilization by running multi…

00:00
2026-06-17
runagentrun.co.uk
ai-infrastructure

OpenRouter fans prompts to match Claude Fable 5

OpenRouter launched Fusion, a routing layer that sends a single prompt to multiple AI models in parallel and synthesizes their outputs, achieving performance comparable to Anthropic's Claude Fable 5 a…

14:16
2026-06-16
byteiota.com
large-language-models

Local LLMs vs Claude for Coding: The 70% Problem

A Hacker News thread on June 16 revealed that local LLMs like Qwen 3.6 35B-A3B handle about 70% of daily coding tasks but fall short on complex multi-file reasoning, creating a gap akin to a junior ve…

00:00
2026-06-16
tomtunguz.com
large-language-models

5x for Free : The Local Coding Stack

A Hacker News thread reveals that local AI coding models, led by Qwen 3.6 35B-A3B and harness Pi, are increasingly replacing cloud-based tools like Claude and GPT, offering privacy, zero cost, and off…

22:29
2026-06-14
arxiv.org
large-language-models

Still: Amortized KV Cache Compaction in a Single Forward Pass

Researchers introduced Still, a per-layer Perceiver model that compacts KV cache in a single forward pass, enabling efficient long-context language model deployment. On Qwen and Gemma models, Still ou…

07:01
2026-06-14
coles.codes
large-language-models

Local Models in Mid-2026

Open-weight local language models in mid-2026 have nearly matched frontier performance for everyday tasks, driven by engineering advances in sparse attention and mixture-of-experts architectures. Deep…

22:21
2026-06-11
idlemachines.co.uk
artificial-intelligence

DiffusionGemma: Discrete diffusion in a large language model

DeepMind released DiffusionGemma, a new large language model that uses discrete diffusion to generate entire sequences in parallel instead of autoregressive token-by-token generation. The model achiev…

17:50
2026-06-11
lesswrong.com
large-language-models

Failing to Ragebait the New Gemma

Researchers from the SPAR Research Fellowship found that Google's Gemma 4 language model shows significantly reduced emotional instability compared to its predecessor, Gemma 3, which frequently exhibi…

14:00
2026-06-11
coles.codes
large-language-models

Local models in mid-2026: the engineering that closed the gap

Local large language models have nearly caught up to frontier models for everyday tasks as of mid-2026, driven by engineering advances in sparse attention and mixture-of-experts architectures that red…

20:00
2026-06-10
simonwillison.net
large-language-models

DiffusionGemma

Google released DiffusionGemma, a new open-weight AI model under the Apache 2 license, available on Hugging Face. NVIDIA is hosting the model for free on its NIM cloud API, where it generated 2,409 to…

← prev page 2 / 4 next →
// co-occurs with top 8 entities