Name: Build a Large Language Model (From Scratch)
ISBN: 9781633437166

Build a Large Language Model (From Scratch)

作者: Sebastian Raschka
出版社: Manning
出版年: 2024-10-29
页数: 400
定价: USD 47.99
装帧: 平装
ISBN: 9781633437166

豆瓣评分

评价人数不足

评价:

内容简介 · · · · · ·

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up!

(展开全部)

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up!

In Build a Large Language Model (from Scratch), you’ll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through creating your own LLM, explaining each stage with clear text, diagrams, and examples. You’ll go from the initial design and creation to pretraining on a general corpus, all the way to finetuning for specific tasks.

Build a Large Language Model (from Scratch) teaches you how to:

Plan and code all the parts of an LLM

Prepare a dataset suitable for LLM training

Finetune LLMs for text classification and with your own data

Use human feedback to ensure your LLM follows instructions

Load pretrained weights into an LLM

The large language models (LLMs) that power cutting-edge AI tools like ChatGPT, Bard, and Copilot seem like a miracle, but they’re not magic. This book demystifies LLMs by helping you build your own from scratch. You’ll get a unique and valuable insight into how LLMs work, learn how to evaluate their quality, and pick up concrete techniques to finetune and improve them.

The process you use to train and develop your own small-but-functional model in this book follows the same steps used to deliver huge-scale foundation models like GPT-4. Your small-scale LLM can be developed on an ordinary laptop, and you’ll be able to use it as your own personal assistant.

about the book

Build a Large Language Model (from Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into constructing LLMs, including building a data loading pipeline, assembling their internal building blocks, and finetuning techniques. As you go, you’ll gradually turn your base model into a text classifier tool, and a chatbot that follows your conversational instructions.

search inside this book

about the reader

For readers who know Python. Experience developing machine learning models is useful but not essential.

about the author

Sebastian Raschka has been working on machine learning and AI for more than a decade. Sebastian joined Lightning AI in 2022, where he now focuses on AI and LLM research, developing open-source software, and creating educational material. Prior to that, Sebastian worked at the University of Wisconsin-Madison as an assistant professor in the Department of Statistics, focusing on deep learning and machine learning research. He has a strong passion for education and is best known for his bestselling books on machine learning using open-source software.

喜欢读"Build a Large Language Model (From Scratch)"的人也喜欢 · · · · · ·

: Advanced Algorithms and Data Str...

: Reactive Design Patterns

: Database Internals 8.0

: 生成式AI入门与AWS实战 9.1

: Data Pipelines with Apache Airflow

: 深度学习入门2 9.6

: Algorithms 9.8

: Software Architecture: The Hard Pa... 8.7

: 大语言模型：原理与工程实践 7.5

: Transactional Information Systems

我来说两句

短评 · · · · · · ( 全部 32 条 )

Build a Large Language Model (From Scratch)的书评 · · · · · · ( 全部 4 条 )

热门只看本版本的评论

一步 2025-07-14 10:59:40 人民邮电出版社2025版

很棒的大语言模型入门书籍

这篇书评可能有关键情节透露

跟着书完成了一个自己的大模型，从模型实现/预训练/微调走完流程，github仓库地址： https://github.com/mcuking/PocketLLM 作者做到了深入浅出，用各种图形象的展示了大模型中的技术原理，尤其是注意力机制实现。不过美中不足的是，缺少了反向传播用到的技术详解，比如梯度下... (展开)

0回应

豆豆爸爸 2025-07-04 15:22:36 人民邮电出版社2025版

如何手搓一个“微型GPT”小模型，看这本书就够了

在如今AI 大模型霸屏的时代，想不想弄清楚像ChatGPT、DeepSeek这些大模型到底是怎样造出来的？这本在 GitHub仓库上打星58.1K的书像一位导师，手把手一步步教你从0到1来构建和应用大模型。这本书的作者是Sebastian Raschka，他是一位在人工智能和数据科学领域著名专家，他有个... (展开)

0回应

评评你好看 2025-06-17 17:22:04 人民邮电出版社2025版

这是一本初学AI非常好的书

我买了十多本AI学习方面的书。作为人工智能方面的初学者，只有这本书是让我能一点点跟着学习的。作者真的是从零开始，由浅入深地教会我去理解晦涩的原理，用浅显易懂的方式告诉我如何去理解某个名称或者某个知识点。之前看其他书籍视频都没搞明白到底什么是自注意力，读这本书... (展开)

0回应

Heartbeats 2025-06-17 16:08:52 人民邮电出版社2025版

一本易上手的跟练书

Github上很火的开源项目LLMs-from-scratch，中文版纸书4月份才上市，没想到电子版这么快就上架了微信读书，这是一本实操性很强的书，不仅开发了一个小型的类GPT-2大语言模型，还实现了数据集处理、模型预训练、针对特定任务的微调，涵盖了构建大模型的整个流程。作者把模型的核... (展开)

0回应

> 更多书评 4篇

读书笔记 · · · · · ·

我来写笔记

按有用程度
按页码先后
最新笔记

展开收起
Chapter 1 Understanding large language model

夏嘉莫察瓦绒 (余生北国，虽闻飞鱼之名...)

LLM is trained on a large, diverse dataset to develop a broad understanding of language. The success behind LLMs can be attributed to the transformer architecture that underpins many LLMs and the vast amounts of data on which LLMs are trained, allowing them to capture a wide variety of linguistic nuances, contexts, and patterns that would be challenging to encode manually. LLMs are trained on v...
2024-11-22 10:06:06
展开收起
Appendix A: an introduction to pytorch

夏嘉莫察瓦绒 (余生北国，虽闻飞鱼之名...)

Here is the basic concept about AI/Machine Learning/Deep Learning AI is fundamentally about creating coputer systems capable of performing tasks that ususally require human intelligence. These tasks include understanding natural language, recognizing patterns, and making decisions. Machine learning focuses on developing and improving learning algorithms. the key idea behind machine learning is ...
2024-11-20 08:11:59

展开收起
Chapter 1 Understanding large language model

夏嘉莫察瓦绒 (余生北国，虽闻飞鱼之名...)

LLM is trained on a large, diverse dataset to develop a broad understanding of language. The success behind LLMs can be attributed to the transformer architecture that underpins many LLMs and the vast amounts of data on which LLMs are trained, allowing them to capture a wide variety of linguistic nuances, contexts, and patterns that would be challenging to encode manually. LLMs are trained on v...
2024-11-22 10:06:06
展开收起
Appendix A: an introduction to pytorch

夏嘉莫察瓦绒 (余生北国，虽闻飞鱼之名...)

Here is the basic concept about AI/Machine Learning/Deep Learning AI is fundamentally about creating coputer systems capable of performing tasks that ususally require human intelligence. These tasks include understanding natural language, recognizing patterns, and making decisions. Machine learning focuses on developing and improving learning algorithms. the key idea behind machine learning is ...
2024-11-20 08:11:59

展开收起
Chapter 1 Understanding large language model

夏嘉莫察瓦绒 (余生北国，虽闻飞鱼之名...)

LLM is trained on a large, diverse dataset to develop a broad understanding of language. The success behind LLMs can be attributed to the transformer architecture that underpins many LLMs and the vast amounts of data on which LLMs are trained, allowing them to capture a wide variety of linguistic nuances, contexts, and patterns that would be challenging to encode manually. LLMs are trained on v...
2024-11-22 10:06:06
展开收起
Appendix A: an introduction to pytorch

夏嘉莫察瓦绒 (余生北国，虽闻飞鱼之名...)

Here is the basic concept about AI/Machine Learning/Deep Learning AI is fundamentally about creating coputer systems capable of performing tasks that ususally require human intelligence. These tasks include understanding natural language, recognizing patterns, and making decisions. Machine learning focuses on developing and improving learning algorithms. the key idea behind machine learning is ...
2024-11-20 08:11:59

论坛 · · · · · ·

在这本书的论坛里发言

+ 加入购书单

这本书的其他版本 · · · · · · ( 全部2 )

人民邮电出版社（2025）
9.6分 63人读过

展开有售 (2)

以下书单推荐 · · · · · · ( 全部 )

谁读这本书? · · · · · ·

空栈

7月14日想读

XYZ

7月10日在读

哈皮

7月8日想读

之言

7月8日想读

> 69人在读

> 71人读过

> 561人想读

二手市场 · · · · · ·

在豆瓣转让有561人想读，手里有一本闲着?

订阅关于Build a Large Language Model (From Scratch)的评论:
feed: rss 2.0

Build a Large Language Model (From Scratch)

内容简介 · · · · · ·

喜欢读"Build a Large Language Model (From Scratch)"的人也喜欢 · · · · · ·

短评 · · · · · · ( 全部 32 条 )

0 有用无牙仔最乖了 2025-05-21 10:12:45 北京

1 有用 Jayceexu 2024-12-26 07:27:19 波多黎各

1 有用以地之名 2024-12-21 12:09:53 美国

3 有用 yinchaoonline 2024-10-17 08:42:39 中国香港

0 有用后火Backfire 2025-04-20 18:05:34 英国

Build a Large Language Model (From Scratch)的书评 · · · · · · ( 全部 4 条 )

很棒的大语言模型入门书籍

如何手搓一个“微型GPT”小模型，看这本书就够了

这是一本初学AI非常好的书

一本易上手的跟练书

读书笔记 · · · · · ·

展开 收起
Chapter 1 Understanding large language model

展开 收起
Appendix A: an introduction to pytorch

展开 收起
Chapter 1 Understanding large language model

展开 收起
Appendix A: an introduction to pytorch

展开 收起
Chapter 1 Understanding large language model

展开 收起
Appendix A: an introduction to pytorch

论坛 · · · · · ·

这本书的其他版本 · · · · · · ( 全部2 )

以下书单推荐 · · · · · · ( 全部 )

谁读这本书? · · · · · ·

二手市场 · · · · · ·

Build a Large Language Model (From Scratch)

内容简介 · · · · · ·

喜欢读"Build a Large Language Model (From Scratch)"的人也喜欢 · · · · · ·

短评 · · · · · · ( 全部 32 条 )

0 有用 无牙仔最乖了 2025-05-21 10:12:45 北京

1 有用 Jayceexu 2024-12-26 07:27:19 波多黎各

1 有用 以地之名 2024-12-21 12:09:53 美国

3 有用 yinchaoonline 2024-10-17 08:42:39 中国香港

0 有用 后火Backfire 2025-04-20 18:05:34 英国

读书笔记 · · · · · ·

论坛 · · · · · ·

这本书的其他版本 · · · · · · ( 全部2 )

以下书单推荐 · · · · · · ( 全部 )

谁读这本书? · · · · · ·

二手市场 · · · · · ·

0 有用无牙仔最乖了 2025-05-21 10:12:45 北京

1 有用以地之名 2024-12-21 12:09:53 美国

0 有用后火Backfire 2025-04-20 18:05:34 英国