Leanpub Book LAUNCH 🚀 My Adventures with Large Language Models: Build foundational LLMs from Transformers to DeepSeek, from scratch, in PyTorch by Prathamesh S.

wpnews.pro

cd /news/large-language-models/leanpub-book-launch-my-adventures-wi… · home › topics › large-language-models › article

[ARTICLE · art-32905] src=leanpub.com ↗ pub=2026-06-04T22:18Z topic=large-language-models verified=true sentiment=↑ positive

Leanpub Book LAUNCH 🚀 My Adventures with Large Language Models: Build foundational LLMs from Transformers to DeepSeek, from scratch, in PyTorch by Prathamesh S.

Prathamesh S. launched a Leanpub book titled 'My Adventures with Large Language Models' that teaches readers to build five LLM architectures from scratch in PyTorch, including GPT-2, Llama 3.2, and DeepSeek. The book targets ML engineers and researchers seeking hands-on code-level understanding of modern LLMs beyond GPT-2.

read2 min views17 publishedJun 4, 2026

Books My Adventures with Large Language Models walks you through building five real LLM architectures from scratch in PyTorch, starting from a vanilla encoder-decoder Transformer and ending at DeepSeek's Multi-Head Latent Attention and Mixture-of-Experts.

Welcome to the Leanpub Launch video for My Adventures with Large Language Models: Build foundational LLMs from Transformers to DeepSeek, from scratch, in PyTorch by Prathamesh S.!

About the Book #

Most LLM tutorials stop at GPT-2. This book doesn't.

My Adventures with Large Language Models walks you through building five real LLM architectures from scratch in PyTorch, starting from a vanilla encoder-decoder Transformer and ending at DeepSeek's Multi-Head Latent Attention and Mixture-of-Experts.

Every chapter has runnable, end-to-end code. No pseudocode, no hand-waving. You type it, you run it, you understand it.

What you'll build:

Chapter 1: A vanilla encoder-decoder Transformer for English-to-Hindi translation. The fundamentals, implemented from the ground up.

Chapter 2: GPT-2 (124M parameters) from scratch, then load real OpenAI pretrained weights to verify your implementation works.

Chapter 3: Llama 3.2-3B by swapping exactly four components of your GPT-2. LayerNorm becomes RMSNorm. Learned positional encodings become RoPE. GELU becomes SwiGLU. Multi-Head Attention becomes Grouped-Query Attention. Then load Meta's pretrained weights.

Chapter 4: KV cache, Multi-Query Attention, and Grouped-Query Attention for inference optimisation.

Chapter 5: DeepSeek's full architecture. Multi-Head Latent Attention (with the absorption trick and decoupled RoPE), DeepSeekMoE (shared experts, fine-grained segmentation, auxiliary-loss-free load balancing), Multi-Token Prediction, and FP8 quantisation.

The code repository is open source: https://github.com/S1LV3RJ1NX/mal-code

This book is for ML engineers, researchers, and senior developers who know Python and PyTorch and want to understand modern LLMs at the level of code, not slides or blog posts. If you've read Raschka or watched Karpathy and want to go further, into Llama, GQA, MLA, and MoE, this is the book.

About the Author #

Prathamesh is a Senior Forward Deployed Engineer at TrueFoundry, where he helps enterprises and startups solve real problems with LLMs and agents. He wrote this book because he wanted a resource that went past GPT-2 and into the architectures actually running in production. He is based in Bangalore, India. Portfolio: https://psaraf.pages.devFollow the author here!

source & further reading

leanpub.com — original article Leanpub Book LAUNCH 🚀 Rethinking Performance Engineering for Agentic AI by Kandasamy Selvaraj Leanpub Book LAUNCH 🚀 Architect the Agent: The CCAR-F Certification Guide by Thomas De Vos Leanpub Book LAUNCH 🚀 Securing Enterprise AI Agents: A Field Guide to Bounded AI Autonomy, AgentSecOps, and MCP Security by Thomas De Vos

~/api · this article 200

$curl api.wpnews.pro/v1/news/leanpub-book-launch-my-a…

Read original on leanpub.com → leanpub.com/blog/leanpub-book-launch-my-adventur…

mentioned entities

Prathamesh S.

Leanpub

PyTorch

OpenAI

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required