This is a project done as a part of the course MTH416A (Regression Analysis) at IITK.
Link to dataset used : https://www.kaggle.com/c/house-prices-advanced-regression-techniques/data
We use a subset (in terms of number of columns) of this dataset to do our analysis -> house_price.csv
The following Regression Analysis Techniques have been implemented:
- Exploratory Data Analysis
- Calculation of various parameters of the OLS fit
- Test statistics and Hypothesis Testing
- Model Adequacy Checks (Normality Assumption check and Residual Analysis)
- Transformation of variables
- Outlier Detection and Treatment
- Handling Categorical variables
- Multi-collinearity check
- Variable Selection using Backward Elimination
Python Notebook -> MTH416.ipynb
Project Report -> project report.pdf
Plots and Tables present in respective folders.