Not just RAG over chunks. Not vector search. Real memory β the kind humans have.
That question grabbed me and wouldn't let go. So I read about the hippocampus, Ebbinghaus forgetting curves, complementary learning systems, slow-wave sleep replay. And then I built it.
The Architecture #
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 5-TIER MEMORY ARCHITECTURE β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β β
β TIER 1+2 EPISODIC BUFFER Brain: Hippocampus β
β βββββββββββββββββββββββββββββββ Speed: <1ms β
β 64 working + 256 episodic items β
β Ebbinghaus decay: n^0.3 Β· e^(-Ξ»t) Β· importance β β Forget threshold: 0.05 | Promote threshold: 0.65 β
β Access-based reinforcement on every read β
β β β
β TIER 3 SEMANTIC STORE Brain: Neocortex β
β βββββββββββββββββββββββββββββββ Speed: ~50ms β
β ChromaDB v2 Β· all-mpnet-base-v2 (768-dim) β β Hybrid search: dense + BM25 β Reciprocal Rank Fusion β
β β β
β TIER 4 KNOWLEDGE GRAPH Brain: Association Cortex β
β βββββββββββββββββββββββββββββββ Speed: ~100ms β
β spaCy NER + 30 keyword patterns β
β NetworkX + SQLite Β· Multi-hop reasoning β
β Auto-relation inference: uses/works_on/depends_on β
β β β
β TIER 5 COLD ARCHIVE Brain: Distributed Cortex β
β βββββββββββββββββββββββββββββββ Speed: async β
β Filesystem JSON Β· YYYY/MM organization β
β Full-text search Β· Thaw to active Β· Compact summaries β
β β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β CONSOLIDATION PIPELINE (Sleep Analog) β
β βββββββββββββββββββββββββββββββββββ β
β Decay β Cluster β Merge(LLM) β Rescore β Promote β β
β FindRelations(LLM) β Archive β Neurogenesis β
β Quick: 60ms (every 5min) | Full: ~3s (on idle) β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β STANDBY NEURON AGENTS β
β βββββββββββββββββββββββββββββββββββ β
β ββββββββββββ ββββββββββββ ββββββββββββ β
β β Personal β β Tech β β Projects β ...N agents β
β β Agent β β Agent β β Agent β β
β β β β β β β β
β β DEEP π€ β β LIGHT π‘ β β DEEP π€ β β
β β 0 RAM β β ~3KB RAM β β 0 RAM β β
β β 0 tokens β β ready β β 0 tokens β β
β ββββββββββββ ββββββββββββ ββββββββββββ β
β β
β Wake: trigger patterns + centroid similarity β
β Vote: all agents score β top K form consensus panel β
β Sleep: return to idle after task (zero token consumption) β
β Spawn: Neurogenesis creates agents from memory clusters β
β Prune: inactive agents auto-removed after 30 days β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
The Two Novel Things #
1. Standby Neuron Agents
Here's what hit me: biological neurons don't all fire at once. Only ~2% are active at any moment. The rest wait, silent, consuming almost nothing.
So I built agents that work the same way:
They wake on trigger pattern matching + embedding similarity, vote in consensus panels (all agents score the query, top K get activated), communicate via sparse blackboard, and return to sleep.
2. Neurogenesis
When the system notices a cluster of memories forming around a new topic β say, 6+ memories about Minecraft β it automatically spawns a new specialized agent for that domain.
If an agent hasn't been woken in 30 days? It gets pruned. Nobody else does either of these.
Sleep as a Feature #
The consolidation pipeline runs like biological sleep β reorganizing, merging duplicates, strengthening important memories, letting the rest decay:
Quick mode: 60ms, zero LLM calls, runs every 5 minutes
Full mode: ~3 seconds, LLM-powered merge + relation discovery,
triggers on idle detection (15+ minutes of inactivity)
Does It Work? #
Episodic Buffer 41/41 β
Memory Integration 32/32 β
Semantic Store 26/26 β
Knowledge Graph 37/37 β
Consolidation 31/31 β
Standby Agents 42/42 β
Cold Archive 27/27 β
Cross-Tier (E2E) 88/88 β ββββββββββββββββββββββββββββ
TOTAL 324/324 β
Why I'm Sharing This #
I'm 18, from Slovakia. This started as a random vibecoding project β a voice assistant. But the memory problem grabbed me and wouldn't let go.
The long-term thing that drives me: I believe better memory for AI could eventually help with conditions like Alzheimer's. Computational memory prosthesis. That's the direction I want to explore.
**GitHub:** [github.com/FogyXT/JARVIS](https://github.com/FogyXT/JARVIS)
**License:** AGPL-3.0
Thanks for reading.