
Deep Learning with Python

Deep learning is applicable to a widening range of artificial intelligence problems, such as image classification, speech recognition, text classification, question answering, text-to-speech, and optical character recognition. It is the technology behind photo tagging systems at Facebook and Google, self-driving cars, speech recognition systems on your smartphone, and much more.

In particular, deep learning excels at solving machine perception problems: understanding the content of image, video, or sound data. Here's a simple example: say you have a large collection of images, and you want tags associated with each image, for example, "dog," "cat," etc. Deep learning can allow you to create a system that learns how to map such tags to images from examples alone. This system can then be applied to new images, automating the task of photo tagging. A deep learning model only has to be fed examples of a task to start generating useful results on new data.

350 pages, Paperback

First published November 30, 2017


About the author

François Chollet

17 books · 102 followers
François Chollet is a French engineer and researcher in artificial intelligence.

Ratings & Reviews



Community Reviews

5 stars: 848 (66%)
4 stars: 321 (25%)
3 stars: 78 (6%)
2 stars: 12 (<1%)
1 star: 7 (<1%)
Displaying 1 - 30 of 129 reviews
Nicholas Teague
69 reviews · 15 followers
July 30, 2017
Based on the simple metric of frequency/density of highlighted passages, this text is by far the best book I have ever read on the subject of machine learning (or most subjects, for that matter). The language is clear and easy to follow, even the code, for that matter - Keras, the machine learning platform of which the author is the architect, is IMO a much better entry point to deep learning for coders than TensorFlow, based on the complexity of the reviewed coding tutorials. The text somehow manages to balance covering basics and foundational concepts even while addressing considerations for the advanced practitioner or researcher. For the time being, this book will be the one I recommend first for anyone looking to gain an understanding of current practice / state of the art for machine learning. I believe this will prove to be an important work which opens doors to a whole new class of machine learning engineers. Now if you'll excuse me, I think I am finally ready to start experimenting with Kaggle competitions.
Kenta Suzuki
25 reviews · 3 followers
August 14, 2017
This book, written by the author of Keras, takes a hands-on approach to deep learning. You'll need another book for theory, such as Deep Learning (Goodfellow, Bengio, and Courville), if you want to study further (whether good or not, Keras abstracts away the internal functions of neural networks). This book covers pretty much the basics of what is happening in neural networks. After reading a few chapters, readers (with some programming experience) will be able to apply the techniques easily to real-world datasets or to participate in Kaggle (the API is much simpler than that of TensorFlow). Although there are some tutorials online about the usage of Keras, it is convenient to have a book which covers most of the API with examples. I think this book is much more like 'deep learning with Keras'.
Romet Aidla
17 reviews · 5 followers
April 14, 2018
The best practical introduction to deep learning, by the author of the Keras framework and a Google researcher.

The first part of the book gives the fundamental understanding and mathematical building blocks needed.

The second part introduces different practical applications of deep learning networks:
1. Computer vision with convolutional neural networks (CNNs)
2. Text and sequences (time series) with recurrent neural networks (RNNs)
3. Generating text and images using variational autoencoders (VAEs) and generative adversarial networks (GANs)

Code is provided as Jupyter notebooks and runs without problems. All examples are done using the Keras framework, which is very intuitive for a beginner, but all the topics are covered from a general perspective and the knowledge can easily be applied with other frameworks.

Book does not cover deep reinforcement learning.
James Foster
158 reviews · 15 followers
May 10, 2018
Deep learning is the newest fad in the computer science world. There are really just two reasons for its popularity: it has solved some difficult problems, and there is a lot of money in it. It has worked pretty well for some problems that looked really hard before, like distinguishing pictures of cats from dogs, recognizing faces (like in your photos), recognizing spoken words (hello, Alexa), interpreting text (as in google queries), and driving cars (without killing too many pedestrians). As it happens, companies with lots of money are actually interested in these things: I’m looking at you, Google, Facebook, and Uber.

As an aside, some people think that science is "objective", just following the facts, and that engineers use the pencils in their pocket protectors to follow known rules for building things. Neither is actually true. Computer scientists sometimes latch onto a new technology just because it works on a popular, sometimes actually uninteresting, problem. For example, classifying irises was a popular, but actually easy, problem back when machine learning was first taking off. When the next algorithm comes around, it gets tested on the same, sometimes easy, problem, succeeding (of course), and then everyone switches to the new algorithm. You still find many computer scientists classifying irises (I read a paper doing that just today). Today, the popular, relatively easy, problem is recognizing hand-written numbers from the MNIST dataset. Anyway, a few years ago the hot thing was Support Vector Machines. Then everyone fell in love with Random Forests. And now it's Deep Learning, which is more like engineering, where the engineers are just trying stuff out, making it up as they go along. We computer scientists/engineers sometimes give up on old technologies when shiny new ones come around.

End of digression. Back to the book review.

This is not a “popular” book. It’s more about how these deep nets work, and how to write programs to create them. This is a great book to read if you’re a computer geek interested in coding deep learning programs. If you’re not, then this isn’t the book for you.

But if you ARE, then this is a very clear introduction to what deep nets are, how they seem to work (we don’t really know), and how to build them for several different types of applications. The book uses Keras, which is a nice high-level net description language built on top of TensorFlow (all the non-geeks just stopped reading). It’s pretty easy to follow along and understand the programming, though there is still some unexplained “magic” hidden in the examples.

All in all, of the three deep learning books I have read, this is the clearest for a programmer trying to jumpstart actual deep learning.
Terran M
78 reviews · 101 followers
April 3, 2018
This book is good, provided you do not believe the author's facile claim that it is the only book you need. It explains almost nothing about how deep learning actually works; it is more like a user manual for Keras. Provided you want an instruction manual for Keras, it's an excellent book. If you want to *understand* something about deep learning, go read the book by Goodfellow et al. They make a nice set, in either order or alternating between the two.
Philipp
644 reviews · 201 followers
August 26, 2018

Always remember that when it comes to markets, past performance is not a good predictor of future returns—looking in the rear-view mirror is a bad way to drive. Machine learning, on the other hand, is applicable to datasets where the past is a good predictor of the future.


Really good general intro to machine learning using the Keras library, from the Keras creator himself.

It goes over all the 'standard' approaches with copious code examples - classification, regression, and the main architectures Keras supports: Convolutional Neural Networks, Recurrent Neural Networks, LSTMs, and going above these now-standard models like multi-input/output models, ensembling, etc.

The best part of the book is that it serves as a collection of DL 'folk wisdom'. It's a fast-moving field that builds massive models, which have some attributes that nobody really understands. Over time people have found many 'tricks' which are not explainable, but they just make your model work better. Some of that 'folk wisdom' here is a bit explainable - for example, there's a great summary of which activation and loss function to use when, and when to use which architecture - some is less explainable, and that's what makes this book so useful. For example, normalise your input values to between 0 and 1, which makes the distribution of learned weights more regular.
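The normalization trick mentioned above can be sketched in a few lines of plain Python (a minimal illustration of the idea; the helper name `scale_to_unit` is my own, not from the book):

```python
# Scale raw 8-bit pixel values (0-255) into the [0, 1] range -
# the kind of input normalization the review describes as
# deep-learning 'folk wisdom'.
def scale_to_unit(pixels):
    """Map a list of 0-255 integer values to floats in [0, 1]."""
    return [p / 255.0 for p in pixels]

raw = [0, 64, 128, 255]
scaled = scale_to_unit(raw)
print(scaled[0], scaled[-1])  # 0.0 1.0
```

In practice you would apply the same division to a whole image tensor (e.g. `x_train / 255.0` on a NumPy array) before feeding it to the network.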

If you've done the fast.ai MOOC like I have you won't learn that much in the CNN part, but the other 80% of the book would be truly novel to you.

An interesting prediction towards the end I haven't heard before - 'regular' convolutions should soon be replaced by depthwise separable convolutions (SeparableConv2D). They are cheaper and faster to train, apparently. Has this taken off yet?
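A back-of-the-envelope way to see why depthwise separable convolutions are cheaper is to compare parameter counts; this is standard arithmetic for the two layer types (function names are my own), not code from the book:

```python
def conv2d_params(k, c_in, c_out):
    # A regular convolution learns one k x k x c_in filter
    # per output channel.
    return k * k * c_in * c_out

def separable_conv2d_params(k, c_in, c_out):
    # Depthwise step: one k x k filter per input channel,
    # then a 1x1 pointwise convolution mixes the channels.
    return k * k * c_in + c_in * c_out

# Typical mid-network layer: 3x3 kernel, 64 -> 128 channels.
regular = conv2d_params(3, 64, 128)              # 73728 weights
separable = separable_conv2d_params(3, 64, 128)  # 8768 weights
print(regular, separable)  # roughly an 8x reduction (biases ignored)
```

The same substitution in Keras is a one-line change: `SeparableConv2D` is a drop-in replacement for `Conv2D` with matching arguments.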

For a taste, the author put parts of the final 'outlook' chapters on his blog - the future and the limitations of deep learning. Be aware that the rest of the book is much more technical with heaps of code examples.



Here’s what you should remember: the only real success of deep learning so far has been the ability to map space X to space Y using a continuous geometric transform, given large amounts of human-annotated data. Doing this well is a game-changer for essentially every industry, but it’s still a long way from human-level.
Sergii Khomenko
19 reviews · 11 followers
February 11, 2019
A great introductory book that offers pretty clear practical explanations of deep learning and its limitations.

As a seasoned ML practitioner, you can still find a couple of interesting tricks.

The book also touches on more advanced topics like building Variational Autoencoders and Generative Adversarial Networks from scratch, while providing practical advice on how to make them work.
27 reviews · 1 follower
April 26, 2020
The perfect practical introduction to DL: it doesn't claim to be a mathematical explanation of DL, and it doesn't get bogged down in maths. Instead it gives lots of examples and explains the implementation in intuitive ways where necessary.
Mèo lười
192 reviews · 232 followers
September 7, 2020
This book is the "Dế mèn phiêu lưu kí" (Diary of a Cricket) of deep learning =,= gentle, easy to understand, easy to imagine.
Hiuhiu.
Mohamed Nijadi
2 reviews · 1 follower
September 27, 2023
Provides a very clear and easy-to-understand walkthrough of the Keras library. The later chapters are harder to read since, as the author says, those topics are more advanced, but the overall introduction serves as a very good starting point for anyone looking for practical hands-on experience.
20 reviews
January 12, 2020
Nowadays most deep learning resources are full of exaggeration -- even though it is true that deep learning has many impressive applications, people rarely talk about its shortcomings. This book is certainly not one of those resources, and the author does not hesitate to criticize and explain the limitations of deep learning while giving a very good practical introduction using Python.

The book is technically well written, and I like that Chollet tries to explain the intuition behind all the models he builds. One thing I didn't like is that he argues that it's not necessary to explain the technical concepts using mathematical notation, as programmers with no mathematical background can still understand those concepts when presented as snippets. Hence, the book does not contain any mathematical notation, and at times this makes concepts difficult to decode from the Python snippets alone. Still, since the book mentions "Python" in its title, it's probably okay not to expect any mathematical notation from it.
f1yegor
9 reviews · 1 follower
July 26, 2018
An excellent and clear explanation of the main architectures and extensive techniques of deep learning from the creator of Keras.
Read the Conclusion chapter as a sneak peek of the content and also a good refresher on the topic.
The book has a supplementary GitHub repo with notebooks, but I wasn't reviewing it thoroughly during the first read. Will now.
Alex Ott
Author · 3 books · 205 followers
October 15, 2017
(maybe close to 9/10)

A very good practical introduction to deep learning based on Keras. The author explains all the major architectures of neural networks without digging into the mathematics, using practical examples that are easy to adapt to your own tasks.
June 30, 2020
This is the guide to deep learning using Keras, the most popular tool, from the creator himself.

It provides an advanced introduction without dealing with complicated stuff (math, TensorFlow, etc.). Although it is also good for starters, it is not the best choice to jump in without some prior knowledge. As he says in the book, starters may be tempted to think that "all you have is [a powerful, easy, black-boxed] deep-learning hammer, and every problem starts to look like a nail."
Joe
106 reviews · 1 follower
April 14, 2020
Fantastic read!! The explanations are clear, the examples are very illustrative, and the Jupyter notebooks make following along with those examples very easy.
Bilge
227 reviews · 21 followers
February 11, 2021
I recommend the English version, not the Turkish one.
Chris Esposo
677 reviews · 50 followers
June 19, 2021
It's hard to get a book "closer-to-the-metal" in applied deep learning than one written by the author of Keras, but that's exactly what this book is, and it delivers the content extremely well. I listened to the book in audio format and have probably gone through 40-50% of it via physical reading, and found the audio delivery was a better channel than I expected, given that this is effectively a programming text, though it still was no match for an actual reading.

The better-than-expected delivery via audio probably has much to do with how much of the book is dedicated not to a software or technical treatment of instrumenting the neural networks, but to the history of machine learning in general, as well as a conceptual overview of the basic machine learning pipeline. Roughly 50 pages of the first 4 chapters (or about half of the page count for those chapters) are dedicated to this conceptual treatment of machine learning: the first chapter is a history of the shallow learning that led up to the advent of the first neural networks, their failure in the early 90s, the emergence of both kernel methods and ensembling as the predominant paradigm, and then finally the reemergence of neural networks within the framework of GPU-backed deep neural networks (DNNs). The 4th chapter goes through the standard ML pipeline (model estimation, train/test splits, tuning and performance tracking via cross-validation, etc.), which is almost entirely independent of DNNs as a conceptual framework.

These two chapters are easily digestible via audio. Chapters 2 and 3, which discuss the rudiments of neural networks conceptually and how to navigate Keras's internal API, are also pretty lightweight in terms of technical content and easily digestible via audio. Thus, slightly above a third of the text is well suited for audio. Where the text becomes troublesome for the audio reader is from chapter 5 onward, where step-by-step code templates for DNNs written in Keras are presented for the two major domain areas: computer vision and text/language. These chapters, in my opinion, are where the real value of the text lies, as they include not only the cookie-cutter basic nets for a set of use cases within each domain, but also a lot of commentary on how design features a person may customize in their own net (adding/subtracting hidden layers, where to regularize/drop out, where to pool, etc.) may impact performance on that use case in the domain. Going through these chapters carefully can quickly get someone to a working implementation (at least with decent performance).

That being said, the templates are all fairly simple, and more recent innovations like multi-task and other bifurcating network topologies do not make it into the material in any substantial manner (or at all). Though, it should be remembered this is a first-pass book, written in 2015-16 (published in '17/'18 depending on the format), so this can be excused. Overall, the level of content is decidedly more from the practitioner's standpoint, which is appropriate for this subfield of machine learning: since a full rigorous treatment of the mathematical properties of NNs is still mostly elusive, with the theory currently being constructed, this field should be more of a software engineering domain. In fact, Andrew Ng himself has often stated that the best way to learn NNs is to do NNs, and this book will have you implementing NNs within the first 2-3 hours of reading.

This was not the first treatment of the material I've gone through - I've done most of Ng's Coursera courses and other academic material, attended some seminars on the theoretical treatment of the subject in an academic context (mostly, understanding NNs within the context of universal approximation), and deployed some networks at work - but I still found the book to be useful, and will reference it at least for a little while longer. There's a chapter on best practices within the Keras API, and I suspect there is some value for even intermediate users in sections of the text like this, as well as the aforementioned code-template sections.

Overall, I enjoyed this book, and recommend it highly (especially for newer practitioners or students of the subject matter). It seems like it would pair well with Goodfellow et al., and I've seen both of these books required as textbooks in tandem in some syllabi for standard graduate courses on the subject.
897 reviews · 19 followers
December 29, 2022
Not only is Chollet a top researcher in the field and the mastermind behind Keras (which is used throughout the book), but he can also explain the ideas, which makes this one of the few useful additions to the growing number of books on deep learning. (Geron's book is also very worthwhile.) The book has many strengths: code samples, fun examples, distillation of experience. There were one or two places where a bit more detail might have been desirable (e.g. autodiff, batch normalization, ...). By putting the balance heavily on fun practical examples, the book becomes a bit superficial in these places. Nevertheless, a very useful resource.

Additional review of 2nd edition:
Better, if possible, than the first edition. An outstanding introduction; highly recommended.
Boris
58 reviews · 5 followers
May 4, 2018
The FASTEST way to become a machine learning developer.

This is surely the best introduction to Machine Learning there is! You get a clear, concise description of how the technology works, and code that you can follow line-by-line to build a variety of models.

I can't praise this book highly enough. Rather than scavenging the internet trying to piece together the ideas behind machine learning, get this book for a coherent package!
Ferhat Culfaz
243 reviews · 13 followers
August 1, 2018
An absolutely phenomenal book. A very practical and to-the-point book on deep learning techniques in Python by the guru who created the Keras library; hence all the examples in the book are in Keras. I highly recommend anyone who wants to get into the field to start with this first, write their own code and tinker, and then go through the more theoretical books such as Deep Learning by Goodfellow et al., which is very theoretical, broad, and academic.
Ovo Dibie
2 reviews
October 9, 2019
The best book on practical deep learning by a mile. Francois offers a good balance between theory and practice. As an ML/AI PhD, I found his descriptions of the various areas of deep learning to be thorough yet gentle. Furthermore, the code samples in the book are very easy to follow and implement. I recommend this book to everyone serious about deep learning. Can't wait for the second edition of the book.
Kevin
11 reviews
January 2, 2018
Great book overall! Some things took a while to fully process, but that's expected with the depth the author gets into. The author provided easy-to-follow GitHub repositories to actually work on the projects he walked through in the book. I plan to use this book as a reference whenever I do my deep learning projects.
José Roberto
9 reviews
July 29, 2017
Beautiful book written by the creator of Keras. It's full of valuable information for creating your own deep learning models.
Helen Mary
178 reviews · 15 followers
October 27, 2019
An excellent resource. Thorough, practical with code examples, and passionately written by a real deep learning expert.
4 reviews
January 18, 2019
In François Chollet’s technical book Deep Learning with Python, Chollet presents basic theory and implementation of deep neural networks. Chollet describes the mathematical building blocks of neural networks, starting off with the various types of tensors, including scalars, vectors, and multi-dimensional matrices. He then goes on to describe the various components needed to implement a neural network, including input data, layers of neurons, activation functions, optimization algorithms, and backpropagation. In Deep Learning with Python, Chollet presents conceptual ideas in an approachable way, explaining concepts through code and plain English rather than arcane mathematical notation, as is found in other books on the same subject.
The book's chapters build on top of each other, allowing the reader to develop a solid understanding of the concepts from first principles. In the first part, Chollet introduces neural networks and their related data structures and concepts, explaining each step in code and in words, allowing the reader to follow along and run the code on their computer. He describes the anatomy of neural networks, explaining how data flows within their structure, eventually developing a brain-like web of artificial "neurons" which connect to each other with various strengths (called "weights"). Training a neural network requires large amounts of data, and Chollet describes how to collect, annotate, and split up a dataset into smaller training, test, and validation sets, which are used to train, evaluate performance, and eliminate overfitting, respectively. Additionally, the author discusses the importance of data manipulation and feature engineering, which are important for reducing the amount of training time needed. Since much of the effort needed to develop machine learning systems goes to cleaning up and preprocessing the data, understanding how to mean-normalize, one-hot encode, and apply PCA to data is important for developing quality deep learning models that can properly generalize and predict new outputs. In the second part, Chollet describes the various structures of neural networks that can be applied to different machine learning problems, and how to implement them in code. For example, Convolutional Neural Networks, currently in use at Facebook and Google, recognize faces, transcribe text, and help steer self-driving cars. Perhaps the most exciting structure the author discusses is the Recurrent Neural Network, which allows computers to process and learn from text and time-series data.
Combined with a generative model training method called Generative Adversarial Networks, these architectures allow us to generate synthetic images and text that are often indistinguishable from those produced by humans. Overall, Chollet presents a compelling, easy-to-understand introduction to the field of deep learning and its various uses.
While I enjoyed deepening my conceptual understanding and running the provided code, this book is intended for a fairly narrow audience and should not be read by the general public. Its technical nature lends no value to people without programming experience or the introductory machine learning background needed to understand the concepts and implement the code. However, those with the necessary introductory experience will find this book to be a gentle introduction to one of the hottest fields in computer science today, which is why I give Deep Learning with Python my highest recommendation. While it lacks the rigor of something like Deep Learning by Goodfellow et al. (which is necessary to read the latest DL papers), it does provide a practitioner with enough knowledge to implement small-scale models for either personal interest or business use.
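The preprocessing steps this review highlights (one-hot encoding and mean normalization) can be sketched in plain Python; this is an illustrative snippet under my own naming, not code from the book:

```python
def one_hot(index, num_classes):
    """Encode a class index as a one-hot vector, e.g. for label targets."""
    vec = [0.0] * num_classes
    vec[index] = 1.0
    return vec

def mean_normalize(values):
    """Center values on zero by subtracting their mean."""
    mu = sum(values) / len(values)
    return [v - mu for v in values]

print(one_hot(2, 4))                       # [0.0, 0.0, 1.0, 0.0]
print(mean_normalize([1.0, 2.0, 3.0]))     # [-1.0, 0.0, 1.0]
```

In real pipelines these operations are vectorized (e.g. `keras.utils.to_categorical` for one-hot labels, or subtracting a per-feature mean from a NumPy array), but the underlying transformations are exactly these.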
