A collection of learning resources on computer programming and data science.

Computer Programming in Python

For free Python books, check out my post. They’re all excellent learning options.

Python and SQL Books (non-free)


Python for Data Analysis
by Wes McKinney
2017, O’Reilly
—–
The pandas library is what makes Python a data analysis powerhouse, and this title is written by the creator of pandas himself.
I used the second edition of this book. However, in my opinion, it’s already outdated: that’s how fast the library has evolved since the book’s publication. Even so, I have yet to find a better resource for learning pandas in detail. A third edition is on the way, and I would wait for that if I had to purchase a copy now.

Learning SQL: Generate, Manipulate, and Retrieve Data
by Alan Beaulieu
2020, O’Reilly
—–
Data scientists are often required to use SQL to handle large datasets.
This text is not for absolute beginners, but it’s still one of the best resources for mastering SQL. I used the second edition of this book to get to grips with SQL years ago, and I keep going back to it when I need to refresh my knowledge.

Python MOOCs


There are several online courses that can help you lay the foundations of coding with Python if you, like me, don’t come from a computing background. Here are the ones I liked most.

The next two courses are part of the Computational Thinking using Python program by MIT on edX:

Another good one:


Statistics

Free Books

Introduction to Modern Statistics
by Mine Çetinkaya-Rundel and Johanna Hardin

OpenIntro Statistics
by David Diez, Mine Çetinkaya-Rundel and Christopher Barr
—–
Both books can be downloaded for free from OpenIntro.


Mathematics

Free Books

  • Mathematics for Machine Learning
    by Marc Peter Deisenroth, A. Aldo Faisal, and Cheng Soon Ong
    2020, Cambridge University Press
    —–
    Not a text for beginners, since it requires a solid foundation in Calculus and Linear Algebra. It can be downloaded for free from mml-book.github.io/.

  • Introduction to Probability for Data Science
    by Stanley Chan
    2021, Michigan Publishing
    —–
    Read for free on probability4datascience.com/, or download the free PDF from Michigan Publishing.


Machine Learning

Free Books

An Introduction to Statistical Learning (with applications in R)
by G. James, D. Witten, T. Hastie, R. Tibshirani
Springer Verlag
—–
One of the best books around on Statistical Learning. It can be downloaded for free from its web page www.statlearning.com or from trevorhastie.github.io/ISLR/.

Other Books (non-free)

Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow
by Aurélien Géron
2019, O’Reilly
—–
At the time of writing, I have just started with this title: I wish I had done this earlier! Beautifully and clearly written, it covers the most relevant Machine Learning topics from an applied point of view with just the right amount of theory.

Free Online Learning


Data Science Online Learning Platforms

  • DataCamp

  • Dataquest
    So far, my favourite learning platform. It does not use videos, just clearly written text and exercises, which I prefer. The link above is a referral link; it should give you $15 off if you sign up.

  • 365DataScience