While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Song Lyrics Across the United States

Julia Silge

An analysis of the frequency of US state names in song lyrics of Billboard's Year-End Hot 100 from 1958 to 2015.

Read it!

The First Rule of Machine Learning: Start without Machine Learning

Eugene Yan

Before applying machine learning, you should get to know the data and build heuristics.

Read it!

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.

Read it!