Netflix Content: An Exploratory Data Analysis

Goal: To perform an exploratory data analysis (EDA) on the Netflix Movies and TV Shows dataset to extract meaningful patterns and insights about its content. Stack: Python • SQL • JupyterLab

Process

This analysis involves a complete data wrangling and exploration workflow on the specified dataset. My process includes the following steps:

Ingest: Load the raw data from the provided CSV file.
Clean: Handle data types, missing values, duplicates, and parse text and date fields to ensure data quality.
Transform: Reshape, join, and encode data as needed to prepare it for analysis.
Export: Create a tidy, documented version of the dataset for reproducible analysis.
Analyze & Visualize (EDA): Generate visual and statistical summaries to identify trends, relationships, and answer questions about the content.

Dataset

Source: Kaggle — “Netflix Movies and TV Shows”
Link: https://www.kaggle.com/datasets/shivamb/netflix-shows

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.ipynb_checkpoints		.ipynb_checkpoints
images		images
README.md		README.md
for_flashcards_learn.txt		for_flashcards_learn.txt
netflix.db		netflix.db
netflix_explore.ipynb		netflix_explore.ipynb
netflix_titles.csv		netflix_titles.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Netflix Content: An Exploratory Data Analysis

Process

Dataset

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Netflix Content: An Exploratory Data Analysis

Process

Dataset

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages