While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Song Lyrics Across the United States

Julia Silge

An analysis of how often US state names appear in the lyrics of songs on Billboard's Year-End Hot 100 from 1958 to 2015.

Read it!

Understanding the beta distribution (using baseball statistics)

David Robinson

"The beta distribution is best for representing a probabilistic distribution of probabilities: the case where we don’t know what a probability is in advance, but we have some reasonable guesses."

Read it!

Finding bad flamingo drawings with recurrent neural networks

Colin Morris

Using Sketch-RNN as a probability estimator to identify the worst sketches of flamingos in 'Quick, Draw!'.

Read it!