The following data sets and ideas are only meant to give you a starting point. You are free to propose a data set or project idea not listed here.
Google’s Dataset Search, Kaggle, and the “Awesome Public Datasets” GitHub repository are good places to look for data sets.
Data & Statistics resources from James B. Duke Library
Example Data Sets
Movies: i) Scripts data ii) Subtitles data iii) IMDB Dataset
Music: i) Million Song Dataset ii) Last.fm Dataset iii) Spotify Dataset iv) Lyrics data
TV series: i) TV Series Dataset ii) Subtitles data iii) IMDB Dataset
Books: i) Goodreads Dataset ii) Book Reviews Dataset iii) Book Summaries Dataset
Socio-Economic: i) S&P 500 ii) World Development Indicators
Environment: i) Earth Surface Temperature ii) US Pollution Data
Sports: i) College Basketball ii) FIFA Soccer Rankings iii) Cricket