Jamba

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

08:27

2026-06-24

discuss.huggingface.co

large-language-models

🧠 I built a novel triple-hybrid LLM (Mamba + Attention + 32-expert MoE) from scratch for ~$50 — Titan v1 complete, Titan v2 first cycle done, expanding dataset now

A developer built a novel triple-hybrid LLM combining Mamba, Attention, and a 32-expert Mixture of Experts architecture from scratch for approximately $50, completing Titan v1 and the first training c…

// co-occurs with top 7 entities

Titan 1 Mamba 1 Mixture of Experts 1 GPT-2 1 FineWeb 1 Chinchilla 1 LLaMA 1