{"slug": "a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache", "title": "A comprehensive, bilingual guide to Transformers: From foundations to KV-cache compression & attention dynamics", "summary": "Carles Marin released an open-source bilingual guide on Transformer architectures, covering attention dynamics, KV-cache compression, and advanced concepts like grokking. The resource includes reproducible code and interactive elements for practical learning.", "body_md": "Hi everyone,\n\nI’d like to share an open-source resource I’ve been working on: a comprehensive, bilingual (English & Spanish) guide on Transformer architectures.\n\nMy goal was to create a bridge between the mathematical foundations of attention mechanisms and their practical implementation. The guide goes beyond the basics and dives deep into low-level mechanics, including:\n\n**Attention Dynamics:** From scratch implementations to understanding attention collapse.\n\n**Context & Memory:** Exploring KV-cache compression and long-context challenges.\n\n**Advanced Concepts:** Grokking, optimization, and structural analysis.\n\nThe theoretical explanations are backed by reproducible code and interactive elements (like the TAF Agent framework I’ve been developing for browser-based LLM testing).\n\nYou can read it here: English: [https://karlesmarin.github.io/transformers-guide/en/index.html](https://www.google.com/search?q=https://karlesmarin.github.io/transformers-guide/en/index.html) *(Nota: asegúrate de que esta URL existe o cámbiala por la correcta)* Spanish: [Cómo Atienden los Transformers](https://karlesmarin.github.io/transformers-guide/es/index.html)\n\nI’d love to hear feedback from the community, especially regarding the visualization of attention states and optimization techniques. Contributions or suggestions are more than welcome!\n\nCheers, Carles", "url": "https://wpnews.pro/news/a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache", "canonical_source": "https://discuss.huggingface.co/t/a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache-compression-attention-dynamics/177222#post_1", "published_at": "2026-06-29 03:54:17+00:00", "updated_at": "2026-06-29 04:12:29.441089+00:00", "lang": "en", "topics": ["large-language-models", "artificial-intelligence", "machine-learning", "ai-research", "developer-tools"], "entities": ["Carles Marin", "TAF Agent", "Transformers"], "alternates": {"html": "https://wpnews.pro/news/a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache", "markdown": "https://wpnews.pro/news/a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache.md", "text": "https://wpnews.pro/news/a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache.txt", "jsonld": "https://wpnews.pro/news/a-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache.jsonld"}}