Cosmicgpt – A GPT-in-space simulator to research SpaceX AI satellite viability

wpnews.pro

cd /news/artificial-intelligence/cosmicgpt-a-gpt-in-space-simulator-t… · home › topics › artificial-intelligence › article

[ARTICLE · art-31286] src=github.com ↗ pub=2026-06-17T15:09Z topic=artificial-intelligence verified=true sentiment=· neutral

Cosmicgpt – A GPT-in-space simulator to research SpaceX AI satellite viability

A new open-source simulator called Cosmicgpt models how space radiation, including cosmic-ray bit flips and other faults, affects GPT inference on satellites. The tool injects single-event effects into model weights, activations, and KV cache across orbits like LEO and SAA, generating reports on failure modes and output degradation. It aims to help research the viability of running AI models on SpaceX satellites.

read3 min views27 publishedJun 17, 2026

Simulate what happens to GPT inference under space conditions — cosmic-ray bit flips and other radiation-induced faults corrupting a model's weights, activations, KV cache, and output.

See what radiation does to an AI model's output: a single-run report and an environment comparison.

See DESIGN.md for goals and the conditions we model, and ARCHITECTURE.md for the technical design.

The end-to-end loop covers the full Single-Event-Effect taxonomy across three corruptible regions, with faults either hand-specified or derived from a physical radiation environment: build a seeded nanoGPT (with a real KV cache), generate a clean baseline, get faults (manual or from the flux scheduler), inject them (weight mutations, activation forward-hooks, KV-cache mutations), regenerate with the same sampling seed, and diff.

Fault kinds (--kind

): SEU (single bit flip), MBU (multi-bit upset), STUCK_AT (cell pinned 0/1), SEL (latch-up — a whole tensor zeroed), SET (transient activation glitch), SEFI (NaN/garbage cascade). Regions (--region

): weight, activation (incl. lm_head

→ logits), kv_cache. Environments (--orbit

): LEO, SAA, POLAR, GEO, INTERPLANETARY, SOLAR_STORM, with an optional solar-flare burst window raising λ(t) mid-inference.

Every run also reports a failure mode (silent_correct / subtle_wrong / repetition / garbage / nan_garbage / crash), time-to-failure, and mean KL divergence of the output distribution, and can emit a per-step RunTrace JSON (the data the upcoming visualizations consume).

cosmicgpt run --orbit SAA --flux-mult 1e4 --tokens 120
cosmicgpt run scenarios/mission_solar_storm.yaml
cosmicgpt run --orbit SOLAR_STORM --flux-mult 1e4 --report report.html
cosmicgpt report runs/storm/trace.json -o report.html
cosmicgpt compare --orbits LEO,SAA,SOLAR_STORM -o comparison.html

Reports are fully self-contained (inline CSS + inline SVG, no external assets, no matplotlib) so they're emailable and archivable.

python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

cosmicgpt run scenarios/walking_skeleton.yaml

cosmicgpt run --kind SEFI --n-flips 1 --tokens 120 --fault-seed 3
cosmicgpt run --kind SEL  --n-flips 8 --tokens 100

pytest

Single faults on low-impact sites(biases, low mantissa bits) are routinelymasked— realistic: most cosmic-ray hits do nothing visible. Exponent/sign flips andSEL are far more destructive than mantissa flips.SET(transient activation glitch) is gentle: without persistence it affects one step, and only if it lands on the emitted position.- The model now has a real KV cache(--region kv_cache

): a strike there is mutated once butpersists, because every later token re-reads the corrupted entry through attention. Region is independent of fault kind —--region weight|activation|kv_cache

. A single short inference in LEO is essentially fault-free at realistic upset rates; meaningful corruption needs the SAA, a solar storm, or long exposure. With a flareburst window, divergence visibly begins right when the flux spikes.

The model is a small, seeded, randomly-initialized char-level GPT, so the baseline text is gibberish — but that's fine for the skeleton: the point is to demonstrate the fault-injection loop and that flips (especially in the float exponent) measurably corrupt the output. Train a coherent model later via scripts/train_tiny.py

(roadmap).

src/cosmicgpt/
  model/        nanogpt.py (+KV cache), adapter.py, sites.py   # model + fault registry
  faults/       bitops.py, types.py, injector.py               # taxonomy + injection
  environment/  flux.py, presets.py, scheduler.py              # scaled-physical flux
  eval/         runner, metrics, classify, trace               # loop + metrics + RunTrace
  viz/          svg, diffview, timeline, report                # inline-SVG/HTML reports
  config.py, cli.py
scenarios/      walking_skeleton.yaml, sefi_cascade.yaml, mission_solar_storm.yaml
tests/          test_bitops, test_injection, test_kvcache, test_scheduler, test_eval, test_viz

See ARCHITECTURE.md §11. Next (step 6): mitigation wrappers (ECC / TMR voting / scrubbing / NaN guards) with cost-benefit experiments, then a pluggable larger-GPT backend to test whether findings generalize.

source & further reading

github.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/cosmicgpt-a-gpt-in-space…

Read original on github.com → github.com/davedx/cosmicgpt

mentioned entities

Cosmicgpt

SpaceX

nanoGPT

LEO

SAA

GEO

SOLAR_STORM

INTERPLANETARY

metadata

slugcosmicgpt-a-gpt-in-space-simulator-to-research-spacex-ai-satellite-viability

topic#artificial-intelligence

secondary4 topics

sentimentneutral

canonicalgithub.com

navigation

← prevTowards Conversational AI for Di…

next →llms.txt vs llms-full.txt: What'…

── more in #artificial-intelligence 4 stories · sorted by recency

ai.2it.onl · 2 Aug · #artificial-intelligence

Testing LLM Concurrency on Consumer Hardware (RTX 5060)

marktechpost.com · 2 Aug · #artificial-intelligence

NVIDIA AI Releases Molt: A PyTorch-Native Agentic Reinforcement Learning Framework

dev.to · 2 Aug · #artificial-intelligence

From Agents to Infrastructure: Building Secure, Local-First AI Assistants with Go and Rust

dev.to · 2 Aug · #artificial-intelligence

Your Agent Pays a Tax on Every Tool It Never Calls

── more on @cosmicgpt 3 stories trending now

wpnews · 1 Aug · #ai-products

OpenAI Atlas Shuts Down August 9: Migration Guide

wpnews · 1 Aug · #developer-tools

I Built a Portable AI Skill That Safely Upgrades .NET Applications

wpnews · 1 Aug · #ai-agents

Quality Isn't Accidental — Maker/Checker Separation and Automated Validation

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required