#66DaysOfData Resources Datasets

The datasets for the #66daysofdata challenge

The core of the #66daysofdata with KNIME project draws on three Spotify datasets freely available on Kaggle (sign in to download them). As the Kaggle descriptions don't provide too much information about the different columns - check out this brief overview.

The tracks.csv dataset contains about 600k tracks from the period 1900-2021 and is described by 20 columns

The artist-uris.csv dataset contains data on roughly 81k artists and is described by 2 columns (header names are not provided)

The artist.csv dataset is very similar to the tracks.csv dataset but also includes a popularity metric for the artists.

P.S. What is the #66DaysOfData Challenge?

The idea is to spend around 5-10 minutes on a specific data science project each day for 66 days and share your progress on your favorite social media platform with #66daysofdata. Ken Jee is the original instigator of #66daysofdata. Why 66 days? Because that's the average time it takes us to get practiced at doing something. In this case, data science with KNIME. Find the full roadmap here.

#66DaysOfData Resources Datasets

P.S. What is the #66DaysOfData Challenge?

You might also like