Data 301 Github Analysis
by: John Bradbury and Anirudh Venkatesh
Visualization and Analysis of Pull request + Pull review comment request event data of Github Archive from December 2016. Project was done with the goal of understanding Github user activity across holidays - a way of representing the population that codes outside of work and in the holiday season (for fun or otherwise). ML models used for attribute analysis:
- KNN
- Decision Trees
- Lasso
- Linear Regression