An analysis of the frequency of US state names in song lyrics of Billboard's Year-End Hot 100 from 1958 to 2015.
Read it!Before applying machine learning, you should get to know the data and build heuristics.
Read it!Prediction intervals are commonly used for linear models but are often underused for random forests. Leveraging the fact that a random forest can provide a conditional distribution instead of just the conditional mean makes prediction intervals relatively straightforward to use in this context.
Read it!