While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

How Dangerous Is Biking in New York?

Gregory Gundersen

Estimating the probability of a fatal injury occurring during bike commuting to work in New York.

Read it!

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.

Read it!

Why is machine learning 'hard'?

S. Zayd Enam

"The difficulty is that machine learning is a fundamentally hard debugging problem. Debugging for machine learning happens in two cases: 1) your algorithm doesn't work or 2) your algorithm doesn't work well enough."

Read it!