ML Engineer · LLM Inference · NLP · Multi-Agent Systems · Data Engineering
MS Data Science | Building production AI systems across LLM serving, NLP pipelines, and agentic frameworks.
| Area | Stack |
|---|---|
| LLM Inference & MLOps | vLLM · Prometheus · KV cache profiling · GPU memory optimisation |
| NLP & Language Models | HuggingFace Transformers · BERT · LSTM · LangChain · LangServe |
| Agentic Systems | Multi-agent orchestration · Google ADK · MCP · Autonomous agents |
| Data Engineering | Apache Airflow · PostgreSQL · ETL pipelines · Real-time dashboards |
| ML & Deep Learning | PyTorch · XGBoost · Scikit-learn · Computer vision · Anomaly detection |
- LLM Inference Benchmarking Dashboard — Real-time vLLM benchmarking with TTFT, TPOT, ITL, E2EL metrics and Prometheus integration
- KV Cache Profiler — Live KV cache hit rate, eviction, and GPU memory pressure profiler for LLM inference
- Multi-Agent AI System — Autonomous multi-agent orchestration with task decomposition and LangChain tool use
- EmotiCare — Crisis Detection — BERT-based emotional crisis detection and contextual empathy engine
- NASA APOD ETL Pipeline — Production Airflow + PostgreSQL data pipeline for NASA astronomy data
- CyberSentinel — ML anomaly detection for network intrusion, DDoS, and threat identification
Python PyTorch vLLM HuggingFace LangChain LangGraph Apache Airflow PostgreSQL
XGBoost Scikit-learn BERT Transformers Prometheus FastAPI Streamlit Docker
Multi-agent systems RAG NLP Computer Vision Time-series Anomaly detection ETL
📍 San Francisco Bay Area | 🎓 MS Data Science | 🤖 Open to ML Engineer / AI Engineer roles
