While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

From both sides now: the math of linear regression

Katherine Bailey

A journey starting from the standard formulation of linear regression, moving on to the probabilistic approach, and then progressing to Bayesian linear regression.

Read it!

Do data scientists spend 80% of their time cleaning data? Turns out, no?

Leigh Dodds

"Data scientists do a whole range of different types of task. If you arbitrary label some of these as analysis and others not, then you can make them add up to 80%."

Read it!

20 ideas for better data visualization

Taras Bakusevych

A list of 20 tips for great data visualization, including "Always start a bar chart at a 0 baseline" and ""Pick a color palette that matches the nature of your data".

Read it!