I scaled a pure Spiking Neural Network (SNN) to 1.088B parameters from scratch. Ran out of budget, but here is what I found


Source: dev.to · Published: 2026-06-17T10:26Z


An 18-year-old independent developer scaled a pure Spiking Neural Network (SNN) to 1.088 billion parameters from scratch, achieving convergence with a loss of 4.4 after 27,000 steps despite budget constraints. The model exhibited 93% sparsity, cross-lingual emergence of Russian text, and a spontaneous memory routing shift at larger scales. The developer has open-sourced the code and checkpoint on GitHub.


Hey everyone. I'm an 18-year-old indie dev, and I've been experimenting with Spiking Neural Networks (SNNs) for language modeling. A lot of papers (like SpikeBERT) mention that training 1B+ SNNs directly from random initialization fails due to vanishing gradients, so people usually fall back on ANN-to-SNN conversion or distillation. I wanted to see if I could force one to converge purely in the spike domain. I had to stop at 27k steps because my wallet is literally empty lol, but by then the loss had come down to 4.4.
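For anyone new to the term, here is a minimal sketch of what training "in the spike domain" usually involves: a leaky integrate-and-fire (LIF) neuron that emits hard 0/1 spikes in the forward pass, plus a smooth surrogate derivative that stands in for the step function during backprop. This is not the code from the repo. The function names (`lif_step`, `surrogate_grad`), the fast-sigmoid surrogate, the soft reset, and all constants are assumptions picked for illustration.

```python
def lif_step(v, x, beta=0.9, threshold=1.0):
    """One leaky integrate-and-fire step: decay, integrate, spike, reset."""
    v_pre = beta * v + x                        # leaky integration of the input current
    spike = 1.0 if v_pre >= threshold else 0.0  # hard threshold (Heaviside step)
    v_next = v_pre - spike * threshold          # soft reset: subtract threshold on a spike
    return spike, v_next, v_pre


def surrogate_grad(v_pre, threshold=1.0, slope=10.0):
    """Fast-sigmoid surrogate for d(spike)/d(v_pre).

    The true derivative of the step function is zero almost everywhere,
    so the backward pass swaps in this smooth bump centred on the threshold.
    """
    return 1.0 / (1.0 + slope * abs(v_pre - threshold)) ** 2


# Drive one neuron with a short, made-up input sequence.
v = 0.0
spikes, grads = [], []
for x in [0.3, 0.4, 0.5, 0.1, 0.9]:
    s, v, v_pre = lif_step(v, x)
    spikes.append(s)
    grads.append(surrogate_grad(v_pre))

print(spikes)  # the neuron fires on the 3rd and 5th inputs
print(grads)   # largest where the membrane potential is near the threshold
```

The surrogate only carries a meaningful gradient for neurons whose membrane potential sits close to the threshold. That is one common explanation for why deep SNNs trained from scratch run into vanishing gradients.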

Here are the most interesting things that happened:

Massive sparsity: The model maintains ~93% sparsity, meaning only about 7% of neurons fire per token. Silent neurons produce no activations, so inference should be much cheaper on activation memory than in a dense model.

Cross-lingual emergence: Around step 25K, it started generating structurally correct Russian text out of nowhere, even though Russian wasn't explicitly targeted or upweighted in the dataset mix.

Memory routing shift: As I scaled the architecture from 600M to 1B parameters, the model spontaneously shifted 39% of its activation routing into the persistent memory module. It basically learned on its own that memory is more valuable at a larger scale.

Limitations (being honest): The text generation is still janky and nowhere near GPT-2 fluency yet. The loss (4.4) is high, mostly because I couldn't train it longer. But proving that a 1B pure SNN can converge from random init feels like a solid milestone.
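To make the sparsity and loss figures above a bit more concrete, here is a small sketch of how they can be read. Two assumptions, since the post doesn't spell them out: sparsity is taken as the fraction of neurons that stay silent for a token, and the loss is taken as mean per-token cross-entropy in nats (the usual default in deep learning frameworks). The helper names `firing_rate` and `perplexity` and the 100-neuron toy layer are made up for illustration.

```python
import math


def firing_rate(spikes):
    """Fraction of entries equal to 1 in a flat list of 0/1 spikes."""
    return sum(spikes) / len(spikes)


def perplexity(loss_nats):
    """Perplexity for a mean cross-entropy loss measured in nats."""
    return math.exp(loss_nats)


# Toy layer (hypothetical): 100 neurons, 7 of them fire for this token.
spikes = [1] * 7 + [0] * 93
rate = firing_rate(spikes)   # fraction of neurons that fired
sparsity = 1.0 - rate        # fraction that stayed silent

ppl = perplexity(4.4)        # loss value reported in the post

print(f"firing rate {rate:.2f}, sparsity {sparsity:.2f}")
print(f"perplexity {ppl:.1f}")
```

Under those assumptions, a loss of 4.4 works out to a perplexity of roughly 81. Perplexity depends on the tokenizer, so that number isn't directly comparable with models that use a different vocabulary.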

I'm sharing this because I'd love some harsh technical feedback.

Does anyone here have experience with neuromorphic hardware? Would an architecture like this map well to Loihi?

If anyone has tips on pushing SNN loss lower or stabilizing surrogate gradients further, I'm all ears. The code, architecture details, and the full 12GB training checkpoint (weights + optimizer states) are on my GitHub: https://github.com/gtausa197-svg/-Project-Nord-Spiking-Neural-Network-Language-Model.git
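As a rough sanity check on the checkpoint size, here is a back-of-the-envelope sketch. It assumes fp32 values and an Adam-style optimizer that keeps two extra states per parameter. The post states neither, so treat both as guesses. `checkpoint_bytes` is a hypothetical helper, not something from the repo.

```python
PARAMS = 1_088_000_000  # 1.088B parameters, as stated in the post


def checkpoint_bytes(n_params, bytes_per_value=4, optimizer_states_per_param=2):
    """Estimate checkpoint size: weights plus optimizer states.

    Assumes every value is stored at the same width (4 bytes = fp32)
    and ignores small extras such as step counters or RNG state.
    """
    weights = n_params * bytes_per_value
    optimizer = n_params * bytes_per_value * optimizer_states_per_param
    return weights + optimizer


total = checkpoint_bytes(PARAMS)
total_gib = total / 2**30

print(f"{total:,} bytes, about {total_gib:.1f} GiB")
```

With those assumptions the estimate lands a little above 12 GiB, which is in the same ballpark as the 12GB figure quoted above.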


