While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Data Cleaning IS Analysis, Not Grunt Work

Randy Au

"The act of cleaning data is the act of preferentially transforming data so that your chosen analysis algorithm produces interpretable results. That is also the act of data analysis."

Read it!

Unit Testing in Data Science

Jason Ash

A practical example of unit testing a function used to clean data before analysis.

Read it!

Do data scientists spend 80% of their time cleaning data? Turns out, no?

Leigh Dodds

"Data scientists do a whole range of different types of task. If you arbitrary label some of these as analysis and others not, then you can make them add up to 80%."

Read it!