Dataset would be created by web-scraping IMDb for all episodes to see their ratings
Note: Also get information from the movie
Columns from web-scraping:
- Episode Number
- Season Number
- Rating
Columns to add:
- Holiday Episode (column would be binary, yes/no)
- Holiday (Halloween, Thanksgiving, Christmas, Valentine's Day)
- Musical_Number (column would be binary, yes/no)
- Song_Name (name of song in episode; multiple rows of same episode would be required for episodes with more than one song in them)
- Song_Popularity (pull number of plays from Spotify, and
- Main_Characters (which character the episode is mainly centered on. When it's all three children, == "Belcher Kids"; when it's truly centered on the whole family, == "Belcher Family". Allow for multiple MCs; useful when there's a conflict or pairing between two characters, such as Bob vs Hugo or Bob & Teddy)
- Episode_Characters (Number of named characters per episode)
- Characters_in_Episode ?
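
To make the "multiple rows of same episode" idea concrete, here is a minimal sketch of one possible row layout, assuming Python and the standard library only. The class name EpisodeRow, the function expand_episode, the snake_case field names, and the sample values are all hypothetical placeholders, not real scraped data. Song_Popularity, Episode_Characters, and Characters_in_Episode are left out because their definitions are still open.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class EpisodeRow:
    """One row of the planned dataset (hypothetical layout)."""
    season_number: int
    episode_number: int
    rating: float
    holiday_episode: bool        # binary yes/no column
    holiday: Optional[str]       # e.g. "Halloween", or None
    musical_number: bool         # binary yes/no column
    song_name: Optional[str]     # None when the episode has no song
    main_characters: List[str]   # allows multiple MCs


def expand_episode(season_number, episode_number, rating,
                   holiday, songs, main_characters):
    """Return one row per song, or a single song-less row if there are none."""
    song_names = songs if songs else [None]
    return [
        EpisodeRow(
            season_number=season_number,
            episode_number=episode_number,
            rating=rating,
            holiday_episode=holiday is not None,
            holiday=holiday,
            musical_number=song is not None,
            song_name=song,
            main_characters=list(main_characters),
        )
        for song in song_names
    ]


# Placeholder values only; not real episode data.
rows = expand_episode(1, 1, 8.0, None,
                      ["Placeholder Song A", "Placeholder Song B"],
                      ["Bob", "Hugo"])
no_song = expand_episode(1, 2, 7.5, "Halloween", [], ["Belcher Kids"])
```

One consequence of this layout: an episode with two songs appears twice, so any per-episode statistic (such as an average rating) would need to drop duplicate (season, episode) pairs first.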
Questions to ask/visualize:
- Ratings over time
- Average rating per season
- Favorite musical numbers (gathered from Spotify; number of plays), facet-wrapped by seasons
- Best musical number from each season
- Does the number of characters per episode predict the rating?
- Does the number of musical numbers predict the episode rating, outside of the finales? Including the finales?
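
Two of the questions above (average rating per season, and whether character count tracks rating) could be roughed out as follows. This is only a sketch, assuming Python with the standard library; the tuple layout, the function names, and every number in it are made-up placeholders rather than IMDb data. It uses a plain Pearson correlation as a first descriptive check, which is an assumption on my part and not a full predictive model.

```python
from collections import defaultdict
from math import sqrt
from statistics import mean

# Placeholder rows: (season, episode, rating, named_characters).
# These values are invented for illustration, NOT real IMDb data.
rows = [
    (1, 1, 7.0, 6),
    (1, 1, 7.0, 6),   # repeated row, as for an episode with two songs
    (1, 2, 8.0, 8),
    (2, 1, 8.0, 8),
    (2, 2, 9.0, 10),
]


def unique_episodes(rows):
    """Keep one row per (season, episode) so multi-song episodes count once."""
    seen = {}
    for season, episode, rating, n_chars in rows:
        seen[(season, episode)] = (season, episode, rating, n_chars)
    return list(seen.values())


def average_rating_per_season(rows):
    """Map each season to the mean rating of its distinct episodes."""
    by_season = defaultdict(list)
    for season, _episode, rating, _n_chars in unique_episodes(rows):
        by_season[season].append(rating)
    return {season: mean(r) for season, r in sorted(by_season.items())}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


episodes = unique_episodes(rows)
season_means = average_rating_per_season(rows)
r = pearson([e[3] for e in episodes], [e[2] for e in episodes])
```

The finale questions could reuse the same helpers by filtering the episode list before calling pearson, once a finale flag exists in the dataset.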