The Amazon Sales Rapport Data Warehouse (DW) project aims to create a comprehensive data infrastructure for analyzing and reporting on sales data from Amazon. It involves building a robust data warehouse architecture, implementing Extract, Transform, Load (ETL) processes, and enabling visualization and forecasting capabilities.
-
ETL Process
- Extraction: Data is extracted from various sources, including CSV and Excel files, using Python scripts.
- Transformation: Extracted data undergoes transformation steps to ensure consistency, handle missing values, and prepare it for loading into the data warehouse.
- Loading: Transformed data is loaded into the data warehouse tables, including the sales fact table and dimension tables for customers, products, dates, and geography.
-
Data Warehouse Architecture
-
Update Process
-
Visualization and Reporting
-
Sales Forecasting
- Clone the Repository: Clone this GitHub repository to your local machine.
- Install Dependencies: Ensure you have Python, MySQL, and necessary Python packages installed. Additionally, Power BI Desktop is required.
- Install Required Packages: Use
pip install -r requirements.txtto install the necessary Python packages listed in the requirements.txt file. - Run ETL Process: Execute the Python scripts to perform the ETL process and update the data warehouse.
- Explore Data: Utilize Power BI to explore and visualize the sales data stored in the data warehouse.
- Forecast Sales: Utilize R for sales forecasting based on historical data.
This project is made by Farouk Daboussi .
Contributions to the project are welcome! Feel free to submit bug reports, feature requests, or pull requests.
This project is licensed under the MIT License.








