Hey everyone! 👋 I just published an open-source, bilingual (EN/ES) guide on the inner workings of Transformers.

If you are interested in the exact math and mechanics behind attention collapse or KV-cache compression, or just want a solid visual step-by-step walkthrough from scratch, you might find it useful. It includes reproducible code and connects with my TAF Agent project for practical testing.

Check it out here: Cómo Atienden los Transformers (How Transformers Attend) https://karlesmarin.github.io/transformers-guide/es/index.html (there is a language toggle at the top).

Feedback and PRs on GitHub are very welcome!