Python is one of the top/growing programming languages for doing data science. If you are interested in learning Data Science with Python, there are a number of fantastic books and resources available online for free from top data scientists.
Here is a list of best books for learning Python for data science. Current list contains 6 fantastic books. Get started learning Data Science with Python.
Automate the Boring Stuff with Python
Automate the Boring Stuff with Python
The free version of the Python book is available online at https://automatetheboringstuff.com/.
Python Data Science Handbook
Python Data Science Handbook: Essential Tools for Working with Data
Jake VanderPlas, the author of the book and well known data scientist has made the book available for free. The free version of book is available at https://jakevdp.github.io/PythonDataScienceHandbook/ as Jupyter notebooks.
It is also worth to have the print edition of the Python Data Science Handbook. Most of the times you might find it for half its original price at Amazon.
Machine Learning with Python Cookbook
Machine Learning with Python Cookbook
Machine Learning with Python Cookbook is not freely available. However, the book came out of Chris Albon’s, the author of the book, fantastic website chrisalbon.com. The website’s “Technical Notes On Using Data Science & Artificial Intelligence” with loads of recipes for common Machine Learning challenges in using Python.
Probabilistic Programming and Bayesian Methods for Hackers:
Bayesian Methods for Hackers: Probabilistic Programming and Bayesian Inference is great book for anyone who wants to Bayesian analysis in Python to their data science tool kit. Traditionally, Bayesian analysis has been taught Math-first approach, but this book turns it around and teaches one to learn Bayesian inference with computing-first approach. The book showcases the use of PyMC3, the python library for Bayesian computing. Yes, PyMC3 is a great addition for practicing data scientists.
Bayesian Methods for Hackers is available online for free at http://camdavidsonpilon.github.io/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/. The online version has all the chapters that one needs to learn Bayesian inference. To swetten the deal, recently the print version is available at amazon with additional chapters on Bayesian A/B testing and more.
The next two books from the fantastic “Think” series of books by Allen Downey.
Think Stats
Think Stats: Exploratory Data Analysis
EDA is probably the best way to learn probability and statistics for anyone doing data science and Think Stats offers you that for free at https://greenteapress.com/wp/think-stats-2e/.
Think Bayes: Bayesian Statistics in Python
Think Bayes is a great free book from the Think series to learn Bayesian statistics with Python. As the book introduction says
If you know how to program with Python and also know a little about probability, you’re ready to tackle Bayesian statistics. With this book, you’ll learn how to solve statistical problems with Python code instead of mathematical notation, and use discrete probability distributions instead of continuous mathematics. Once you get the math out of the way, the Bayesian fundamentals will become clearer, and you’ll begin to apply these techniques to real-world problems.
And yes, like other books from Think series, Allen Downey has made this book available free online at http://greenteapress.com/wp/think-bayes/. But if are already a Data Scientist looking to add Bayesian computing to armour, show some love by getting the print version Think Bayes: Bayesian Statistics in Python