19:13
2026-06-21
discuss.huggingface.co
large-language-models
OLMo-core + Engram graft: small-scale debug comparison
A debug comparison between a base OLMo3 600M model and an Engram memory variant showed the grafted model achieved lower training and evaluation cross-entropy loss and faster gradient norm stabilizatio…