This project provides a comprehensive analysis of the Indian Premier League (IPL) Player Auctions from 2013 to 2022.
The workflow integrates Data Engineering, Database Management, and Visual Analytics to uncover trends in franchise spending, player valuations, and auction dynamics over a decade.
| File Name | Description |
|---|---|
IPLPlayerAuctionData.csv |
Raw Dataset: Original auction records with player names, prices, and teams. |
Cleaned_IPL_Auction_Data.csv |
Processed Dataset: Cleaned data with normalized columns and new features. |
Main_SQL.ipynb |
ETL Pipeline: Python notebook that cleans raw data and loads it into a MySQL database. |
Main_NoSQL.ipynb |
Data Cleaning: Standalone notebook for cleaning data (CSV output only, no SQL). |
cleaned_ipl_import.sql |
SQL Script: Database schema definition and 10+ analytical queries. |
IPL Visualization.pbix |
Dashboard: Interactive Power BI report visualizing the insights. |
The dataset tracks the financial history of IPL auctions. Key variables include:
- Player: Name of the cricketer.
- Role: Playing category (Batsman, Bowler, All-Rounder, Wicket Keeper).
- Amount: Auction price in INR.
- Team: Purchasing franchise.
- Year: Auction year (2013-2022).
- Player_Origin: Domestic (Indian) vs. International (Overseas).
- Amount_Cr: Calculated field representing price in Crores.
- Repeated: Boolean flag indicating if a player appeared in multiple auctions.
To run this project locally, you will need:
- Python 3.x (with
pandasandmysql-connector-pythonlibraries) - MySQL Server (local instance)
- Microsoft Power BI Desktop (to view the .pbix file)
- Open your SQL client (e.g., MySQL Workbench).
- Run the
cleaned_ipl_import.sqlscript. This will:- Create the
ipl_auctiondatabase. - Create the
auction_datatable schema.
- Create the
- Open
Main_SQL.ipynbin Jupyter Notebook. - Update the database connection settings (host, user, password) in the notebook.
- Run all cells to:
- Load and clean
IPLPlayerAuctionData.csv. - Export
Cleaned_IPL_Auction_Data.csv. - Insert records directly into your MySQL database.
- Load and clean
- SQL Analysis: Use the queries provided at the bottom of
cleaned_ipl_import.sqlto generate insights like "Top 10 Most Expensive Players" or "Team-wise Spend". - Dashboard: Open
IPL Visualization.pbixin PowerBI to explore the data visually.
The analysis answers critical questions such as:
- Spending Power: Which teams spend the most on average?
- Role Valuation: Do All-Rounders command higher prices than purely Batsmen or Bowlers?
- Inflation: How has the total auction purse increased from 2013 to 2022?
- Player Demand: Who are the most frequently auctioned players?
Rishabh
