While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Understanding the beta distribution (using baseball statistics)

David Robinson

"The beta distribution is best for representing a probabilistic distribution of probabilities- the case where we don’t know what a probability is in advance, but we have some reasonable guesses."

Read it!

Building AI Trading Systems

Denny Britz

"Machine Learning can give you an edge, but probably not an edge so huge that it allows you to ignore other factors. You still need to build good infrastructure, collect good data, and have low latencies."

Read it!

Data Analysis at the Command Line

@lucytalksdata

Using csvkit, grep, gnuplot, and other command-line tools to perform data analysis directly from the command line.

Read it!