While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

How Dangerous Is Biking in New York?

Gregory Gundersen

Estimating the probability of a fatal injury occurring during bike commuting to work in New York.

Read it!

Variance after scaling and summing: One of the most useful facts from statistics

Chris Said

"What do R2, laboratory error analysis, ensemble learning, meta-analysis, and financial portfolio risk all have in common? The answer is that they all depend on a fundamental principle of statistics that is not as widely known as it should be. Once this principle is understood, a lot of stuff starts to make more sense."

Read it!

In Praise of Small Data

Evan Miller

Unless you are training a model with thousands of parameters, big data should not be seen as a source of value, but rather as a source of cost.

Read it!