A comprehensive, bilingual guide to Transformers: From foundations to KV-cache compression & attention dynamics

wpnews.pro

cd /news/large-language-models/a-comprehensive-bilingual-guide-to-t… · home › topics › large-language-models › article

[ARTICLE · art-42951] src=discuss.huggingface.co ↗ pub=2026-06-29T03:54Z topic=large-language-models verified=true sentiment=↑ positive

A comprehensive, bilingual guide to Transformers: From foundations to KV-cache compression & attention dynamics

Carles Marin released an open-source bilingual guide on Transformer architectures, covering attention dynamics, KV-cache compression, and advanced concepts like grokking. The resource includes reproducible code and interactive elements for practical learning.

read1 min views1 publishedJun 29, 2026

Hi everyone,

I’d like to share an open-source resource I’ve been working on: a comprehensive, bilingual (English & Spanish) guide on Transformer architectures.

My goal was to create a bridge between the mathematical foundations of attention mechanisms and their practical implementation. The guide goes beyond the basics and dives deep into low-level mechanics, including:

Attention Dynamics: From scratch implementations to understanding attention collapse.

Context & Memory: Exploring KV-cache compression and long-context challenges.

Advanced Concepts: Grokking, optimization, and structural analysis.

The theoretical explanations are backed by reproducible code and interactive elements (like the TAF Agent framework I’ve been developing for browser-based LLM testing).

You can read it here: English: https://karlesmarin.github.io/transformers-guide/en/index.html (Nota: asegúrate de que esta URL existe o cámbiala por la correcta) Spanish: Cómo Atienden los Transformers

I’d love to hear feedback from the community, especially regarding the visualization of attention states and optimization techniques. Contributions or suggestions are more than welcome!

Cheers, Carles

source & further reading

discuss.huggingface.co — original article Rakarrack-0.6.1 port making progress! ( AI assisted ) Cloud Storage Poll Welcome to Haiku basic(Haiku Docs, Haiku slide and Haiku sheets)

~/api · this article 200

$curl api.wpnews.pro/v1/news/a-comprehensive-bilingua…

Read original on discuss.huggingface.co → discuss.huggingface.co/t/a-comprehensive-bilingu…

mentioned entities

Carles Marin

TAF Agent

Transformers

metadata

sluga-comprehensive-bilingual-guide-to-transformers-from-foundations-to-kv-cache

topic#large-language-models

secondary4 topics

sentimentpositive

canonicaldiscuss.huggingface.co

navigation

← prevRegional revival via mega projec…

next →NewMax takes the privacy route i…

── more in #large-language-models 4 stories · sorted by recency

discuss.huggingface.co · 29 Jun · #large-language-models

Hey everyone! 👋 I just published an open-source, bilingual (EN/ES) guide on the inner workings of Transformers

arxiv.org · 29 Jun · #large-language-models

Verifiable Geometry Problem Solving: Solver-Driven Autoformalization and Theorem Proposing

arxiv.org · 29 Jun · #large-language-models

Large Language Model Teaches Visual Students: Cross-Modality Transfer of Fine-Grained Conceptual Knowledge

arxiv.org · 29 Jun · #large-language-models

Grounded Iterative Language Planning: How Parameterized World Models Reduce Hallucination Propagation in LLM Agents

── more on @carles marin 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 Jun · #ai-agents

OpenCode v1.17: Session Snapshots Undo Your AI Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required