While Model Trains

Read data blog posts.
Carefully handpicked.
Presented 3 at a time.

Evolution of ML Fact Store

Vivek Kaushal

How Netflix designed Axion, their fact store utilized for computing ML features online, and the lessons learned throughout the process.

Read it!

In Praise of Small Data

Evan Miller

Unless you are training a model with thousands of parameters, big data should not be seen as a source of value, but rather as a source of cost.

Read it!

Do data scientists spend 80% of their time cleaning data? Turns out, no?

Leigh Dodds

"Data scientists do a whole range of different types of task. If you arbitrary label some of these as analysis and others not, then you can make them add up to 80%."

Read it!