While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Why is machine learning 'hard'?

S. Zayd Enam

"The difficulty is that machine learning is a fundamentally hard debugging problem. Debugging for machine learning happens in two cases: 1) your algorithm doesn't work or 2) your algorithm doesn't work well enough."

Read it!

Optimizing Slow SQL Queries

Miguel Grinberg

How to fix the most common database performance problems, which are caused by improper indexing, along with examples for Postgres, MySQL, and SQLite.

Read it!

Data Cleaning IS Analysis, Not Grunt Work

Randy Au

"The act of cleaning data is the act of preferentially transforming data so that your chosen analysis algorithm produces interpretable results. That is also the act of data analysis."

Read it!