Goal: To perform an exploratory data analysis (EDA) on the Netflix Movies and TV Shows dataset to extract meaningful patterns and insights about its content. Stack: Python • SQL • JupyterLab
This analysis involves a complete data wrangling and exploration workflow on the specified dataset. My process includes the following steps:
- Ingest: Load the raw data from the provided CSV file.
- Clean: Handle data types, missing values, duplicates, and parse text and date fields to ensure data quality.
- Transform: Reshape, join, and encode data as needed to prepare it for analysis.
- Export: Create a tidy, documented version of the dataset for reproducible analysis.
- Analyze & Visualize (EDA): Generate visual and statistical summaries to identify trends, relationships, and answer questions about the content.
- Source: Kaggle — “Netflix Movies and TV Shows”
- Link: https://www.kaggle.com/datasets/shivamb/netflix-shows