My AI memory benchmark said 98.3%. The number was true — and worthless.
A developer building Bastra Recall, an MIT-licensed MCP memory server for Claude, discovered that their initial 98.3% benchmark was misleading because it queried each memory with its own trigger phras…