While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Data Cleaning IS Analysis, Not Grunt Work

Randy Au

"The act of cleaning data is the act of preferentially transforming data so that your chosen analysis algorithm produces interpretable results. That is also the act of data analysis."

Read it!

Variance after scaling and summing: One of the most useful facts from statistics

Chris Said

"What do R², laboratory error analysis, ensemble learning, meta-analysis, and financial portfolio risk all have in common? The answer is that they all depend on a fundamental principle of statistics that is not as widely known as it should be. Once this principle is understood, a lot of stuff starts to make more sense."

Read it!

Why Open-Source a Model?

Matt Rickard

A compilation of examples that illustrate the motivations for open-sourcing a machine learning model.

Read it!