While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.

Read it!

The overengineered Solution to my Pigeon Problem

Max Nagy

Engaging in a battle against pigeons using OpenCV and a water gun. Spoiler: No pigeons were harmed.

Read it!

20 ideas for better data visualization

Taras Bakusevych

A list of 20 tips for great data visualization, including "Always start a bar chart at a 0 baseline" and ""Pick a color palette that matches the nature of your data".

Read it!