While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

3 levels of complexity: How I approach data science storytelling

Susan Shu Chang

A framework for crafting storylines in talks: help the audience understand real life impacts, bridge the context to abstract or technical, technical deep dives exist to do something.

Read it!

Optimizing Slow SQL Queries

Miguel Grinberg

How to fix the most common database performance problems, which are caused by improper indexing, along with examples for Postgres, MySQL, and SQLite.

Read it!

Prediction intervals for Random Forests

Ando Saabas

Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.

Read it!