Home Home >
Reference >
Computers >
Building Large Language Models from Scratch

Click for an excerpt

Building Large Language Models from Scratch

Design, Train, and Deploy LLMs with PyTorch

Dilyan Grigorov

Apress
Publication date : 2026-04-26

See accessibility information

Add to basket

£54,99

Download | LCP DRM 🛈 Adobe DRM 🛈

Streaming

Add to my wish list

Louise Reader

Read on Louise Reader App.

Get £8.25 by recommending this book

This book is a complete, hands-on guide to designing, training, and deploying your own Large Language Models (LLMs)—from the foundations of tokenization to the advanced stages of fine-tuning and reinforcement learning. Written for developers, data scientists, and AI practitioners, it bridges core principles and state-of-the-art techniques, offering a rare, transparent look at how modern transformers truly work beneath the surface.

Starting from the essentials, you’ll learn how to set up your environment with Python and PyTorch, manage datasets, and implement critical fundamentals such as tensors, embeddings, and gradient descent. You’ll then progress through the architectural heart of modern models, covering RMS normalization, rotary positional embeddings (RoPE), scaled dot-product attention, Grouped Query Attention (GQA), Mixture of Experts (MoE), and SwiGLU activations, each explored in depth and built step by step in code. As you advance, the book introduces custom CUDA kernel integration, teaching you how to optimize key components for speed and memory efficiency at the GPU level—an essential skill for scaling real-world LLMs. You’ll also gain mastery over the phases of training that define today’s leading models:

Pretraining - Building general linguistic and semantic understanding.
Midtraining - Expanding domain-specific capabilities and adaptability.
Supervised Fine-Tuning (SFT) - Aligning behavior with curated, task-driven data.
Reinforcement Learning from Human Feedback (RLHF) - Refining responses through reward-based optimization for human alignment.

The final chapters guide you through dataset preparation, filtering, deduplication, and training optimization, culminating in model evaluation and real-world prompting with a custom TokenGenerator for text generation and inference.

By the end of this book, you’ll have the knowledge and confidence to architect, train, and deploy your own transformer-based models, equipped with both the theoretical depth and practical expertise to innovate in the rapidly evolving world of AI.

What You’ll Learn

How to configure and optimize your development environment using PyTorch
The mechanics of tokenization, embeddings, normalization, and attention mechanisms.
How to implement transformer components like RMSNorm, RoPE, GQA, MoE, and SwiGLU from scratch.
How to integrate custom CUDA kernels to accelerate transformer computations.
The full LLM training pipeline: pretraining, midtraining, supervised fine-tuning, and RLHF.
Techniques for dataset preparation, deduplication, model debugging, and GPU memory management.
How to train, evaluate, and deploy a complete GPT-like architecture for real-world tasks.

Who this book is for:

Software developers, data scientists, machine learning engineers and AI enthusiasts looking to build their models from scratch.

Les livres numériques peuvent être téléchargés depuis l'ebookstore Numilog ou directement depuis une tablette ou smartphone.

PDF : format reprenant la maquette originale du livre ; lecture recommandée sur ordinateur et tablette
EPUB : format de texte repositionnable ; lecture sur tous supports (ordinateur, tablette, smartphone, liseuse)

Votre support de lecture	Format	Protection	Application
Ordinateur	-EPUB -PDF	DRM Adobe LCP	Lecture en ligne (streaming) Adobe Digital Editions(DRM Adobe) Thorium Reader (LCP)
Tablette et smartphone iOS / Android	EPUB PDF	LCP DRM Adobe	Application Louise Reader : IOS / Android(ne lit pas les fichiers protégés par Adobe DRM) Adobe Digital Edition : IOS/Android(Lit uniquement la DRM Adobe)
Liseuse	EPUB	DRM Adobe	Module de lecture de la liseuse
Liseuse Diva	EPUB	LCP DRM Adobe	Module de lecture de la liseuse Diva

Consultez l’aide pour en savoir plus.

About the book

Author

Dilyan Grigorov

Imprint

Apress

Collection

n.c

Publication date

2026-04-26

Pages

530 pages

Print ISBN

9798868822964

Language

English

Ebook informations

EAN PDF

9798868822971

Price

£54.99

Copy count 5

Print count 53

File size 21146 Ko

Informations accessibility

EAN EPUB

9798868822971

Price

£54.99

Copy count 5

Print count 53

File size 8868 Ko

Informations accessibility

Compatibility

To check the compatibility with your devices,
see help page

About author(s)

Dilyan Grigorov

Dilyan Grigorov is a software developer with a passion for Python software development, generative deep learning & machine learning, data structures, and algorithms. He is an advocate for open source and the Python language itself. He has 16 years of industry experience programming in Python and has spent 5 of those years researching and testing Generative AI solutions. His passion for them stems from his background as an SEO specialist dealing with search engine algorithms daily. He enjoys engaging with the software community, often giving talks at local meetups and larger conferences. In his spare time, he enjoys reading books, hiking in the mountains, taking long walks, playing with his son, and playing the piano.

You may also be interested in...