Gemma 4 bug fixes and Research Request

wpnews.pro


Source: discuss.huggingface.co · Published: 2026-06-19T10:44Z · Topic: large-language-models


A critical bug in Google's Gemma 4 causes it to malform tool calls under real load, affecting vLLM, llama.cpp, Ollama, and oobabooga. A developer open-sourced a diagnosis, repair, and experimental LoRA that reduces but doesn't eliminate the issue, calling for community help with richer data.


Gemma 4 has an ecosystem-wide agentic bug: under real load (long context, reasoning, and content generation together) it malforms its own tool calls and then loops on the broken output. The bug has been reported across vLLM, llama.cpp, Ollama, and oobabooga, and Google has not fixed it.

I open-sourced a full diagnosis, a reproduction harness, a working parser-level repair, a stock NVFP4 quant and recipe, and an experimental format LoRA, with all raw data. Honest findings: the LoRA reduces the slip but doesn't fully eliminate it yet. A BF16 control ruled out quantization as the cause. The LoRA overfit my synthetic data, so a bigger, more diverse dataset is likely the real fix.
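To illustrate what a parser-level repair could look like in general, here is a minimal Python sketch. It is not the repair from the open-sourced work. It assumes, purely for illustration, that a tool call is a JSON object with `name` and `arguments` keys, which may not match Gemma 4's actual tool-call format. The function names `repair_tool_call` and `_close_brackets` are hypothetical.

```python
import json
import re
from typing import Optional


def _close_brackets(text: str) -> str:
    """Close an unterminated string and any unclosed braces or brackets."""
    stack = []          # closers still owed, innermost last
    in_string = False
    escaped = False
    for ch in text:
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch in "{[":
            stack.append("}" if ch == "{" else "]")
        elif ch in "}]" and stack:
            stack.pop()
    if in_string:
        text += '"'
    else:
        # A dangling comma before the closers we add would be invalid JSON.
        text = text.rstrip().rstrip(",")
    return text + "".join(reversed(stack))


def repair_tool_call(raw: str) -> Optional[dict]:
    """Best-effort recovery of a tool call from possibly malformed output.

    ASSUMPTION: a tool call is a JSON object with "name" and "arguments".
    Returns None when nothing usable can be recovered.
    """
    start = raw.find("{")
    if start == -1:
        return None
    text = raw[start:]
    # Naive trailing-comma removal; it could also alter a string value
    # that happens to contain ",}" or ",]".
    no_commas = re.sub(r",\s*([}\]])", r"\1", text)
    candidates = [text, no_commas, _close_brackets(no_commas)]
    decoder = json.JSONDecoder()
    for candidate in candidates:
        try:
            # raw_decode parses the first JSON value and ignores trailing text.
            obj, _ = decoder.raw_decode(candidate)
        except json.JSONDecodeError:
            continue
        if isinstance(obj, dict) and "name" in obj and "arguments" in obj:
            return obj
    return None
```

For example, `repair_tool_call('{"name": "search", "arguments": {"q": "gemma"')` recovers the full call by appending the two missing braces. A repair like this only patches the output after generation; it does not stop the model from producing malformed calls in the first place.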

Everything's here, Built with Gemma. I'm looking for help pushing the LoRA further with richer data.


Read original on discuss.huggingface.co → discuss.huggingface.co/t/gemma-4-bug-fixes-and-r…
