While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Open letter to journal editors: dynamite plots must die

Rafael Irizarry

A critique of dynamite plots and suggestions for better alternatives.

Read it!

Data scientists work alone and that's bad

Ethan Rosenthal

"The norm is that of a lonely life for the data scientist. Whether they lie near analytics, machine learning, or elsewhere in the large latent space that spans this ill-defined role, just like in the curse of high-dimensionality, they are likely alone."

Read it!

How much data should you allocate to training and validation?

Francesco Pochetti

To avoid responding with "that's what Andrew NG said" when asked about the reason behind choosing an 80% training and 20% validation split, consider this explanation.

Read it!