DATA ANALYSIS · MACHINE LEARNING
2021 – 2025
Spotify gives you a cute annual recap. I downloaded five years of raw play-by-play data and actually looked at it: themes, energy, skip patterns, and a model that predicts whether I'll skip a song before it ends.




MODEL RESULT
Short answer: kind of. An RF + XGBoost soft-voting ensemble lands around 70–75% accuracy without data leakage. The two features that matter most are daily average energy (your mood that day) and whether the song is in English. Genre alone is basically useless. Including ms_played inflates accuracy to 92%, but that's cheating since play-time is downstream of the skip decision itself.
TRY IT ON YOUR OWN DATA