08:27
2026-06-24
discuss.huggingface.co
large-language-models
🧠 I built a novel triple-hybrid LLM (Mamba + Attention + 32-expert MoE) from scratch for ~$50: Titan v1 complete, Titan v2 first cycle done, expanding dataset now
A developer built a novel triple-hybrid LLM combining Mamba, Attention, and a 32-expert Mixture of Experts architecture from scratch for approximately $50, completing Titan v1 and the first training c…