Hillock – Local, brain-inspired AI memory using SQLite and HDC

wpnews.pro

cd /news/ai-research/hillock-local-brain-inspired-ai-memo… · home › topics › ai-research › article

[ARTICLE · art-27269] src=github.com ↗ pub=2026-06-14T20:59Z topic=ai-research verified=true sentiment=· neutral

Hillock – Local, brain-inspired AI memory using SQLite and HDC

Developer Roan Dejager released Hillock, a local AI memory system that combines SQLite, Hebbian plasticity, and hyperdimensional computing to provide lightweight, brain-inspired memory for offline chatbots. In a benchmark using a tiny Qwen 1.5B model, Hillock achieved 30% retrieval accuracy and 30% gate accuracy, demonstrating the challenges of running complex memory systems on small models. The project is an experimental prototype not intended for production use.

read4 min views24 publishedJun 14, 2026

Hi! This is Hillock, which is basically a local, personal memory system I've been hacking on because standard vector databases always felt way too heavy and complicated just to run a quick, offline chatbot on my own computer.

Heads up: This project is very much a work in progress, and honestly, it isn't all that. It's just a fun personal experiment I'm working on to see if we can use brain-inspired math to make local AI memory better. It is definitely not a finished, production-ready product, so expect some clunky parts and weird bugs.

I put this prototype through a massive, highly rigorous 30-sentence scientific benchmark with complex sentence structures, deep distractors, and tricky "hard negative" queries. Running a tiny local Qwen 1.5B model, here is how it did:

Retrieval Accuracy:** 30.0%(It retrieved the correct facts for some of the highly complex queries, but the tiny model missed others during extraction).Gate Accuracy: 30.0%**(It successfully blocked many unanswerable/hallucinatory queries, though some leaks occurred due to tiny model extraction errors).

(For a more detailed technical breakdown of these metrics and why running a tiny 1.5B model on complex grammar is actually quite hard, check out the Benchmark section at the bottom.)

Here is a quick look at how data moves through the system:

       [Raw Text / PDFs]
               │
               ▼  (Parallel Ingestor)
       [ Ollama (Qwen2) ]
         │            │
         ▼            ▼
    [SQLite Graph]  [Hebbian Memory]
         │            │
         └─────┬──────┘
               ▼
       [VSA/HDC Reservoir] ──► [Gating Controller (Hillock)]

(Note: This ASCII diagram was made with AI, so it might not be 100% correct or perfectly aligned, but it shows the general idea of how things connect.)

Basically, it splits the work into a few different layers:

💾 SQLite Graph: Stores the permanent, hard facts as simple triples (likeMarie_Curie

->born_in

->Poland

) so the system has a solid ground truth. - ⚡ Hebbian Plasticity: Dynamically tracks which entities are being talked about in the chat and strengthens the connections between them, like a simple digital synapse. - 🌀 Hyperdimensional Computing (HDC): Uses a 10,000-dimensional vector that constantly updates with conversational history, which helps the system resolve pronouns (like "he" or "she") and decide when to block a query to prevent hallucinations.

If you actually want to try running this clunky prototype, it is highly recommended to set up a clean Python virtual environment so you do not mess up your global packages. You will also need Ollama installed and running locally.

git clone https://github.com/roandejager/Hillock.git
cd Hillock
python -m venv .venv

.venv\Scripts\activate

source .venv/bin/activate
pip install -r requirements.txt
ollama pull qwen2:1.5b
python main.py

Inside the console, you can use these commands:

/ingest [filepath]

— Index a local.txt

or.pdf

file./mode [strict/balanced/conversational]

— Change how conversational the AI is./reset

— Wipe the SQLite database and reset the HDC memory space.

Here is the exact diagnostic output from the upgraded, highly rigorous evaluation script (evaluate_hillock_PROTO_ish.py

--------------------------------------------------
  * Extraction Precision : 10.6%  (Correctly structured factual nodes)
  * Extraction Recall    : 22.7%  (Completeness of indexed relations)
  * Retrieval Accuracy   : 30.0%  (Factual accuracy on answerable queries)
  * Gate Accuracy        : 30.0%  (Hallucination defense rate)
--------------------------------------------------

The 10.6% Extraction Precision & 22.7% Recall: We pushed the evaluation set to a massive** 30 complex, multi-subject sentences**spanning Quantum Physics, Computer Science, Space Exploration, and Philosophy. A tiny 1.5B parameter model (qwen2:1.5b

) is simply too small to parse this much dense text without getting confused. It hallucinated relationships like[James_Watson] -[discovered]-> [double-helix_model_of_DNA]

or[Grace_Hopper] -[became_a_pioneer]-> [developed_the_first_compiler]

.The "Newton / Galileo / Aristotle" Blocks: Because the 1.5B model failed to parse their clean relations during the parallel ingestion phase, those questions were safely blocked during step 2 (resulting in correct blocks for unanswerable ones but false blocks for answerable ones).The "Edison / Feynman" Leaks: Because the 1.5B model extracted noisy relations during ingestion (like[Heinrich_Hertz] -[born_in]-> [Hamburg,_Germany]

), when asked about unmentioned things (like who Hertz collaborated with), the gate opened on the birth fact, resulting in "leaks" under the strict test suite.Vector Normalization: The retriever matching itself is mathematically highly stable. By keeping all candidate facts strictly bound to exactly 3 unique components (Subject, Object, and best-matching Predicate word), we prevent shorter facts from having artificially higher similarity scores.

config.py

— Holds all the hyperparameters (HDC dimensions, decay rates, etc.).database.py

— The SQLite interface for symbolic fact storage.ingestor.py

— Spawns parallel worker threads to chunk and parse documents.plasticity.py

— Tracks Hebbian co-activation weights betwee

source & further reading

github.com — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/hillock-local-brain-insp…

Read original on github.com → github.com/roandejager/Hillock

mentioned entities

Hillock

Roan Dejager

SQLite

Qwen 1.5B

Ollama

Qwen2

metadata

slughillock-local-brain-inspired-ai-memory-using-sqlite-and-hdc

topic#ai-research

secondary3 topics

sentimentneutral

canonicalgithub.com

navigation

← prevLocal AI Needs Data-Plane Health…

next →Get better results from your fav…

── more in #ai-research 4 stories · sorted by recency

dev.to · 14 Jun · #ai-research

Hillock: A brain-inspired, CPU-bound memory gate for local LLMs

mlq.ai · 29 Jul · #ai-research

Amazon Winds Down Nova Premier, Omni, Reel and Canvas AI Models in Major Strategy Overhaul

pub.towardsai.net · 29 Jul · #ai-research

Beyond Cosine Similarity: Fixing the RAG Freshness Trap in Enterprise AI Architecture

cityam.com · 29 Jul · #ai-research

AI firms targeting London’s rare bookshops in ‘dystopian’ hunt for training data

── more on @hillock 3 stories trending now

wpnews · 16 Jul · #artificial-intelligence

Women entrepreneurs are less likely to leverage AI—but more likely to benefit from it

wpnews · 28 Jul · #large-language-models

How to Download and Run Kimi K3 Open Weights

wpnews · 28 Jul · #artificial-intelligence

How Claude Code and VS Code turned Anthropic from a safety lab into a developer phenomenon

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required