Gemma 4 E2B running in-browser at 255 tok/s


Source: huggingface.co · Published: 2026-06-17T21:30Z · Topic: large-language-models


A new Hugging Face Space demonstrates Gemma 4 E2B running in-browser via WebGPU at 255 tokens per second, showcasing efficient on-device AI inference.


Article URL: https://huggingface.co/spaces/webml-community/gemma-4-webgpu-kernels

Comments URL: https://news.ycombinator.com/item?id=48577195

Points: 3


