Since their introduction in 2017, transformers have quickly become the dominant architecture for achieving state-of-the-art results on a variety of natural language processing tasks. If you're a data scientist or coder, this practical book (now revised in full color) shows you how to train and scale these large models using Hugging Face Transformers, a Python-based deep learning library.
Transformers have been used to write realistic news stories, improve Google Search queries, and even create chatbots that tell corny jokes. In this guide, authors Lewis Tunstall, Leandro von Werra, and Thomas Wolf, among the creators of Hugging Face Transformers, use a hands-on approach to teach you how transformers work and how to integrate them in your applications. You'll quickly learn a variety of tasks they can help you solve.
- Build, debug, and optimize transformer models for core NLP tasks, such as text classification, named entity recognition, and question answering
- Learn how transformers can be used for cross-lingual transfer learning
- Apply transformers in real-world scenarios where labeled data is scarce
- Make transformer models efficient for deployment using techniques such as distillation, pruning, and quantization
- Train transformers from scratch and learn how to scale to multiple GPUs and distributed environments
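To give a flavor of the hands-on approach the blurb describes, here is a minimal sketch using the library's pipeline API for the first task in the list, text classification. The specific checkpoint named below is an illustrative assumption, not one prescribed by the book.

```python
from transformers import pipeline

# Downloads a pretrained checkpoint from the Hugging Face Hub on first use.
# The checkpoint choice here is an assumption for illustration only.
classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

result = classifier("Transformers have quickly become the dominant NLP architecture.")
print(result)  # e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```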
I was afraid this book would be redundant given how much information one can find online about transformers and the Hugging Face platform. Still, it turned out to be a very concise and pragmatic introduction to the topic and a valuable reference book with dozens of tips for training and tailoring your data tasks to the transformers paradigm.
This is a super cool book on NLP using the HuggingFace 🤗 ecosystem. It's well-written, and you can read it quite quickly (except for two very technical but important chapters). I would recommend it to anyone who has basic experience with deep learning and wants to dive into NLP.
Both a great primer on the subject and a nice collection of more advanced 'gotchas'. Would have loved to see a bit more on the 'in production' side of things, but a great read nonetheless :-)
As a programmer I found this book initially interesting for laying the groundwork for a more fundamental understanding of transformers and why they are so hyped. I really hoped, naively, that I would read something groundbreaking in terms of computation, and also hoped to come away able to use transformers in future programming undertakings.
I was wrong. But that is really not the book's fault. The technology around transformers and LLMs strikes me as a black box where the underlying abstractions are not only difficult to understand but also difficult to put to use without tremendous effort and data. I truly do not understand where this is applicable outside of text-heavy domains, and therefore I am a bit disappointed in the technology.
At the same time, the book has put a bit of calmness in me when I read stories of so-called AI products and software, because now I know at least the fundamentals and how hard they really are to put to use. All those abstractions remind me of the inner workings of an RDBMS, and I know that RDBMSs are a career path in their own right. Hence I am also starting to believe that, at this stage, transformers have a long and steep learning curve before you can really put them to use, and the underlying technology must not change so much that it becomes difficult to follow over time.
I liked the book; it introduced me to a lot of things that are outside my daily work, and I will definitely glance through the fundamentals many times again.
Accessibly written, with useful code examples and lots of directly actionable information on how to use HuggingFace tools. The chapters on making models efficient for production and on dealing with situations in which few labels are available are especially illuminating. It seems as if HuggingFace has developed a number of useful abstraction layers to make ML engineers more productive, especially around storing and accessing both models and data in a straightforward manner. Training your own neural network and deploying it has never been as easy as it is today, and this book is a useful introduction to the ecosystem. The authors state that as of writing, 20,000 models had been shared on the model hub, but by the time I checked, the number was already 10 times as high. The explanations of model architectures and of technical details such as self-attention are also well written, though given the current pace of technological change, they are sure to need another revision in only a couple of years.
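As a small illustration of the abstraction layers this reviewer mentions, any checkpoint shared on the Hub can be pulled down by name with from_pretrained. The checkpoint and label count below are illustrative assumptions; note the classification head is freshly initialized and would still need fine-tuning.

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# An assumed checkpoint name, used here only to show the loading pattern
checkpoint = "distilbert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
# Attaches a new (randomly initialized) two-label classification head
model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

inputs = tokenizer("Loading pretrained weights takes one line.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits.shape)  # torch.Size([1, 2])
```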
This book does a good job of introducing and explaining the concepts behind transformers. I especially like the Named Entity Recognition section, which also explains how to do model debugging.
I wish that this book focused a bit more on the Hugging Face ecosystem rather than only on the transformers part. When tackling your own custom problem, you will often have to deal with Hugging Face tokenizers and datasets. In my opinion they are vital to solving an NLP problem with Hugging Face, and this book sadly does not give them enough attention.
Overall I did very much enjoy this book; however, I am still left a bit hungry with respect to solving real-world problems using Hugging Face.
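For readers wondering what the tokenizers and datasets side this reviewer refers to looks like, here is a minimal sketch: load a corpus with the datasets library and tokenize it in bulk with Dataset.map. The imdb dataset and DistilBERT checkpoint are assumptions chosen for illustration, not examples singled out by the review.

```python
from datasets import load_dataset
from transformers import AutoTokenizer

# Assumed corpus and checkpoint, used only to demonstrate the pattern
dataset = load_dataset("imdb", split="train")
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

def tokenize(batch):
    # Truncate long reviews to the model's maximum input length
    return tokenizer(batch["text"], truncation=True)

# batched=True passes many examples per call, which is much faster
tokenized = dataset.map(tokenize, batched=True)
print(tokenized.column_names)  # ['text', 'label', 'input_ids', 'attention_mask']
```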
Very good overview of the capabilities of the transformer architecture. Great examples. However, more complicated math topics are glossed over in a very inelegant way. The math typography is really bad. Some of the examples are really contrived.
Really nice overview of all things Transformers. Sometimes very vendor-centric, but they tried their best to be as neutral and inclusive as possible. One complaint I have is that it kinda starts midway, without any context about Transformers, but maybe that's fine given the target audience of the book.
A lot of code and technical details. I read half of the book and scanned through the other half. I wish there were more details on architecture design choices instead. Nevertheless, a very well-written book.
A highly recommended book for anyone looking to understand how Transformer architecture works, combining theoretical concepts with practical code examples to ensure a thorough grasp of these models.
I’m impressed by how clearly the author explains the architectures and their applications. The clarity definitely makes complex concepts approachable to both technical and non-technical people.