Skip to content
View shahdab089's full-sized avatar

Block or report shahdab089

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shahdab089/README.md

Hi, I'm Shadab Akhtar

Data Scientist at T-Mobile | Fraud & Credit Risk | GenAI & LLM Systems

6+ years building end-to-end ML pipelines that catch fraud, optimize credit risk, and save real money — $1M+ in monthly loss prevention from models I designed, deployed, and monitor in production. I work across the full stack: from feature engineering on multi-terabyte data (PySpark, Snowflake, BigQuery) to deploying LightGBM models on Databricks, to building LLM-powered tools with LangGraph and RAG.

Previously: Data Scientist at DEPT (media mix modeling, $30M+ ad spend reallocation), Data Analyst at Tekion (recommendation engines), Graduate Research Assistant at Georgia State University (NLP classifiers).


What I Build

  • Production ML — fraud detection models scoring 500K+ transactions/month, cascaded scoring pipelines, champion/challenger A/B testing frameworks
  • LLM Applications — enterprise RAG agents, multi-agent systems with LangGraph, hybrid LLM + deterministic pipelines with structured outputs
  • Causal Inference — GeoLift incrementality testing, treatment-assignment frameworks, causal guardrails for experimentation

Featured Projects

Project What It Does Stack
Hirelens AI resume analyzer — scores resume-to-JD fit, diagnoses rejection stage, generates fixes. Hybrid LLM + deterministic pipeline with eval harness. FastAPI, Groq (Llama 3.1), Pydantic, Chart.js, Docker
Collections Intelligence Agent Multi-agent system that translates plain-English strategy questions into SQL, evaluates performance, delivers executive reports. LangGraph, Streamlit, SQLite, Docker, GitHub Actions CI/CD

Tech Stack

Languages & Data: Python, PySpark, SQL, R, Pandas, NumPy, Scikit-learn

ML & Modeling: LightGBM, XGBoost, PyTorch, TensorFlow, SHAP, Causal Inference, A/B Testing

GenAI & LLMs: LangGraph, LangChain, RAG, Prompt Engineering, Groq API, Anthropic Claude API, Hugging Face

Cloud & MLOps: Databricks, AWS, GCP, Azure, Docker, MLflow, GitHub Actions CI/CD

Data Platforms: Snowflake, BigQuery, PostgreSQL, MongoDB, Delta Lake

Visualization: Tableau, Power BI, Plotly, Streamlit


GitHub Stats

GitHub Stats Top Languages


Connect

LinkedIn Email Hirelens

Pinned Loading

  1. hirelens hirelens Public

    Python

  2. talkdb talkdb Public

    Turn any database into a conversational AI analyst. Ask questions in plain English, get answers with charts.

    Python

  3. telecom-customer-intelligence-agent telecom-customer-intelligence-agent Public

    Multi-agent AI system for telecom customer analytics, collection strategy optimization & executive reporting

    Python

  4. Predict-future-sales Predict-future-sales Public

    Jupyter Notebook

  5. Store-Sales-Time-Series-Forecasting Store-Sales-Time-Series-Forecasting Public

    Jupyter Notebook

  6. Wamart-Forecasting-Data-Analysis Wamart-Forecasting-Data-Analysis Public

    Jupyter Notebook