I'm an 18-year-old data scientist studying at the University of Hertfordshire, graduating in November 2026 with a B.Sc. in Data Science. I build end-to-end ML systems that are explainable, deployable, and business-focused β from feature engineering to SHAP insights to production-ready APIs.
What I do:
- π§ͺ Feature Engineering β Turn raw, messy data into predictive signals
- π Data Viz & Explainability β Uncover hidden patterns & demystify models with SHAP
- β‘ Scalable ML β XGBoost & LightGBM for high-performance tabular data
- π MLOps-lite β FastAPI for real-time predictions + Streamlit for interactive dashboards
- π Production Alerts β Discord webhooks for real-time fraud alerts
Here's what I've been building β each project is a complete story from raw data to working app.
| Project | What I Built | Key Insight | Stack |
|---|---|---|---|
| EβCommerce Churn Prediction | Predict if a customer will ever make a second purchase. Live Streamlit dashboard with SHAP explanations. | Delivery delay, not price, is the #1 driver of churn. | LightGBM, Optuna, SHAP, Streamlit |
| Fraud Detection API | Real-time credit card fraud API. Catches 72% of fraud with 50% precision. FastAPI + Discord alerts. | Imbalanced classification works better with optimized thresholds & features. | XGBoost, FastAPI, Discord Webhooks, Streamlit |
| CPU Benchmark Predictor | Predict PassMark score from CPU specs. XGBoost model with RΒ² = 0.983. | threadMark & cores matter more than clock speed or price. |
XGBoost, SHAP, Streamlit |
- Becoming a better data scientist. Learning new skills. Building things that work. Every single day. π
β From raw data to real decisions β explainable, deployable, impactful.