Exploratory data analysis of Instagram post performance to understand the relationship between reach, engagement quality, and follower growth.
On social media platforms, reach is often treated as a primary success metric. However, high reach does not always translate into meaningful audience growth.
This project performs an end-to-end exploratory data analysis (EDA) on an Instagram post performance dataset to examine how reach, engagement quality, content strategy, and timing influence follower growth and post performance.
The analysis is focused on business decision-making, not prediction.
To answer a key growth question:
Does higher reach automatically lead to higher follower growth, or does engagement quality play a bigger role?
- Platform: Instagram (anonymised/simulated performance data)
- Granularity: Post-level data
- Each row represents a single Instagram post
- Python
- Pandas — data cleaning, aggregation, and analysis
- Matplotlib & Seaborn — data visualisation
🔹 Reach vs Follower Growth 🔹 Engagement Quality 🔹 Performance Bucket Analysis 🔹 Content & Format Strategy 🔹 Timing & Discovery
- 🚫 High reach does not guarantee follower growth
- ⭐ Engagement quality is a stronger growth driver than reach
- 📉 Some high-reach posts generate shallow engagement
- 📈 Moderately reached posts with strong engagement outperform
- 🎯 Engagement rate is a more actionable optimisation metric
- 🔁 Virality is influenced more by content and format than timing
- Optimise for engagement quality, not just exposure
- Use reach as a discovery metric, not a success metric
- Content strategy should prioritise relevance and interaction
- Growth decisions should be data-driven, not assumption-based
Day-2. Instagram_analytics/ │ ├── data/ │ └── instagram.csv │ ├── notebooks/ │ └── IG.ipynb │ ├── visuals/ │ └── IG_Visuals.ipynb │ ├── README.md