Skip to content
View Theepankumargandhi's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Theepankumargandhi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Theepankumargandhi/README.md

Hi, I’m Theepan

I build machine learning systems that actually run in production not just models in notebooks.

Right now I’m finishing my Master’s in Data Science at Illinois Institute of Technology, and most of my work sits at the intersection of:

  • Retrieval-augmented generation (RAG)
  • Multi-agent LLM systems
  • Recommender systems and ranking
  • MLOps and deployment pipelines
  • Real-time inference and monitoring

I enjoy building full systems end-to-end: training, evaluation, deployment, and observability.


What I’ve Been Building Recently

Here are a few projects that represent the kind of work I like doing.

AutoML Agents with LangGraph

I built a multi-agent AutoML system that can generate pipelines, track experiments, and deploy models automatically.
It uses LangGraph for orchestration, MLflow for tracking, and DVC for versioning, with everything containerized and deployed on AWS.

What I cared about here wasn’t just accuracy — it was reproducibility and automation.


Multimodal RAG Assistant

This project handles text, images, audio, and video in one retrieval pipeline.
It combines CLIP embeddings, Whisper transcription, vector search, and graph memory with Neo4j.

The interesting part was improving retrieval quality using hybrid ranking and reranking instead of relying on embeddings alone.


Finance Document Assistant (Deployed on Kubernetes)

I built a hybrid RAG pipeline for financial documents and deployed it to AWS EKS.
The system uses BM25 + dense retrieval, agent workflows, and evaluation dashboards to track retrieval quality and latency.

This project was mostly about infrastructure and evaluation, not just building the model.


Two-Tower Recommender System

A full recommendation pipeline with TensorFlow Recommenders, FAISS retrieval, ranking models, and monitoring.

I focused on:

  • Candidate generation vs ranking separation
  • Online inference APIs
  • Metrics like NDCG and Recall@K

Multi-Agent Orchestration Platform

A LangGraph-based system coordinating multiple agents through a FastAPI backend with PostgreSQL persistence.

This was where I spent time designing routing logic, memory, and observability rather than just chaining prompts.


QLoRA Notebook Assistant

Fine-tuned a Mistral-7B model with dual adapters and routing logic to switch between explanation and code generation modes.

The most interesting challenge here was making a 7B model run reliably on limited GPU memory.


Other Things I’ve Worked On

Some additional areas I’ve explored:

  • Graph neural networks for fraud detection
  • OCR and document AI pipelines
  • YOLOv8 optimization with TensorRT
  • Healthcare ML (ECG anomaly detection, imaging models)
  • Causal inference and experimentation
  • ETL and data pipelines

How I Like to Work

I’m interested in problems where:

  • Models are part of a larger system
  • Deployment and monitoring matter
  • Performance and latency are real constraints
  • Engineering decisions matter as much as model choice

Currently Looking For

Entry-level roles in:

  • Machine Learning Engineering
  • Applied AI / GenAI
  • Data Science (applied / product-focused)

I’m authorized to work in the U.S. on F-1 OPT.


Contact

Email: tgandhi1107@gmail.com
LinkedIn: https://www.linkedin.com/in/theepankumar
Portfolio: https://theepan-portfolio.netlify.app


Outside of work, I like trekking and baking. One clears the head, the other feeds it.

💻 Tech Stack:

R Python Windows Terminal LaTeX AWS Azure Netlify Anaconda Elasticsearch nVIDIA Django FastAPI Jinja OpenCV Apache Airflow MongoDB MySQL Neo4J Redis SQLite Postgres Keras Matplotlib mlflow NumPy Pandas Plotly PyTorch scikit-learn Scipy TensorFlow GitHub Actions GitLab CI GitHub Grafana ElasticSearch Docker Kubernetes Power Bi Postman Prometheus Riot Games nVIDIA

Popular repositories Loading

  1. ECG-Anomaly-Detection-using-LSTM-AutoEncoder ECG-Anomaly-Detection-using-LSTM-AutoEncoder Public

    Built an LSTM AutoEncoder to detect anomalies in ECG time series data with 97.93% accuracy. Trained on normal signals, the model uses reconstruction error to identify anomalies. Implemented with Te…

    Jupyter Notebook 2 1

  2. Multi-Agent-Orchestration Multi-Agent-Orchestration Public

    Production ready LangGraph multi-agent orchestration template with supervisor routing, FastAPI + Streamlit chat UI, MCP tool calling, local RAG + knowledge graph retrieval, PostgreSQL memory, per-u…

    Python 1

  3. Theepankumargandhi Theepankumargandhi Public

    Config files for my GitHub profile.

  4. brain-tumor-segmentation brain-tumor-segmentation Public

    Brain tumor segmentation app using a ResNeXt50-UNet model trained on LGG MRI data. Built with PyTorch and deployed via Streamlit, allowing users to upload MRI images and view tumor masks in real-ti…

    Jupyter Notebook

  5. creditcard_risk_prediction creditcard_risk_prediction Public

    Jupyter Notebook

  6. Diabetes-Disease-Progression-Prediction Diabetes-Disease-Progression-Prediction Public

    Built regression models to predict diabetes progression using clinical features. Compared Linear, Ridge, and XGBoost regressors. Applied SHAP and permutation importance to interpret feature impact.…

    Jupyter Notebook