Fun Local LLM Comparisons with Gemma, Granite, and Qwen

wpnews.pro

cd /news/large-language-models/fun-local-llm-comparisons-with-gemma… · home › topics › large-language-models › article

[ARTICLE · art-16435] src=ekorbia.com ↗ pub=2026-05-28T12:50Z topic=large-language-models verified=true sentiment=↑ positive

Fun Local LLM Comparisons with Gemma, Granite, and Qwen

Ekorbia v0.2 introduces a comparison-chat mode that runs two to three local large language models against the same prompt in parallel. Testing Gemma 4, IBM Granite 4.1, and Qwen 3.5 on a 32 GB M1 Max MacBook Pro revealed that Granite incorrectly selected Chicago deep-dish pizza as a runner-up to New York City's best pizza, while Gemma adopted a "grumpy librarian" voice when explaining Hacker News. The feature requires significant memory, as running three large models in parallel can exceed 32 GB and cause swapping that slows performance.

read2 min views11 publishedMay 28, 2026

← All posts

#

Fun local LLM comparisons with Gemma, Granite, and Qwen

Ekorbia v0.2 features a comparison-chat mode that runs 2-3 local models against the same prompt in parallel. Here are a few fun prompts running across Gemma 4 (e2b), IBM Granite 4.1 (8B), and Qwen 3.5 (4B) on my 32 GB M1 Max MacBook Pro.

1. The Pizza Question #

New York City is widely regarded as having the best pizza due to its iconic thin-crust style. The models were initially reluctant to give a single answer until I attached the following additional prompt: "Provide clear, concise, opinionated answers to comparison or 'best' questions. Each comparison should have a single winner and a runner-up with a short explanation."

The New York City and Naples answers are acceptable but Granite is clearly wrong here with the runner-up of Chicago deep-dish pizza! And no mention of New Haven style pizza anywhere?

2. Explain Hacker News #

It’s a sprawling, perpetually messy digital common room.

Gemma is the most fun here, carrying the 'grumpy librarian' voice across multiple paragraphs while Granite and Qwen provide more serious answers with a sprinkling of grumpy librarian at the beginning and the end.

3. Will robots take over? #

There is no consensus among experts that unchecked AI growth will inevitably lead to a robot takeover of Earth.

All three models take the question seriously and none think we are doomed to a Terminator like future.

Things to watch for with Ekorbia comparison mode. #

Memory matters. Three large models running in parallel can blow past 32 GB on my MacBook. Ollama will swap them in and out, which makes the "parallel" feel serial. - First-token latency varies wildly. A column that's still showing dots while another is mid-paragraph isn't broken — it's cold-. - Granite 4.1 (8B) is fast. It's worth a try if you've mostly been using Qwen or Gemma.

Send us yours #

Got a prompt that produces a hilarious three-way disagreement? Open an issue with the prompt and the three outputs and we'll feature the best ones in a follow-up.

source & further reading

ekorbia.com — original article Ekorbia v0.3 — Windows and Linux are here Ekorbia v0.2 — chat groups and comparison-chat mode Ekorbia v0.1 — first release of a local LLM client

~/api · this article 200

$curl api.wpnews.pro/v1/news/fun-local-llm-comparison…

Read original on ekorbia.com → ekorbia.com/blog/2026-05-25-fun-local-llm-compar…

mentioned entities

Gemma

Granite

Qwen

Ekorbia

IBM

New York City

Naples

Chicago

metadata

slugfun-local-llm-comparisons-with-gemma-granite-and-qwen

topic#large-language-models

secondary4 topics

sentimentpositive

canonicalekorbia.com

navigation

← prevUsing the Claude Code desktop ap…

next →Apple's iOS 27 Siri Overhaul and…

── more in #large-language-models 4 stories · sorted by recency

infoworld.com · 10 Jul · #large-language-models

IBM Bob expands beyond code generation to orchestrate the entire SDLC

dev.to · 12 Jul · #large-language-models

Open-Weight LLM API Integration: Your Practical Guide to Connecting and Calling Community Models

dev.to · 12 Jul · #large-language-models

AI Fundamentals - Part 3: Giving AI Knowledge Beyond Its Training

dev.to · 12 Jul · #large-language-models

Simple Benchmark Review: Ollama on Jetson Nano

── more on @gemma 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 8 Jul · #artificial-intelligence

SpaceXAI unveils Grok 4.5 AI model ahead of July 2026 public release

wpnews · 8 Jul · #artificial-intelligence

xAI Launches Grok 4.5 With Pricing Built to Undercut Anthropic's Opus 4.8

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required