A web-based leukemia subtype prediction tool using ML and DAA. Features include gene selection using mRMR and classification via KNN, SVM, and Random Forest. Built with Vite, React, TypeScript, Tailwind CSS (UI) and Flask (backend). Includes Knapsack for optimal feature or model selection
bio-daa.1.mp4
LeucoPredic is an AI-powered web platform that predicts subtypes of Acute Lymphoblastic Leukemia (ALL) based on gene expression input. Built with a modern tech stack, it integrates machine learning algorithms and discrete algorithmic approaches (DAA) like Knapsack and mRMR for optimal feature selection and classification accuracy.
- 🧪 Predict ALL subtypes using gene expression data
- 🧠 Machine Learning models: KNN, SVM, Random Forest, Logistic Regression
- 🎯 Feature selection using mRMR (Minimum Redundancy Maximum Relevance)
- 📦 Discrete Algorithmic Approaches: Knapsack for optimal model/feature optimization
- 🌐 Modern, responsive UI built with Vite + React + TypeScript + Tailwind CSS + shadcn/ui
- 🔗 Flask backend for model integration and inference
- 📊 Displays prediction results with confidence scores and treatment guidance
Accurate subtyping of leukemia is critical for determining the right treatment plan. This tool aids medical researchers and physicians by automating subtype prediction and reducing diagnostic errors through high-accuracy machine learning.
- Vite
- React
- TypeScript
- Tailwind CSS
- shadcn/ui
- Python (Flask)
- Scikit-learn (ML models)
- Pandas, NumPy
- Feature selection using
skfeatureor custom mRMR
- Clone the repository
git clone https://github.com/your-username/leucopredic.git
cd leucopredic- Start the frontend
cd frontend
npm install
npm run dev- Run the backend
cd backend
pip install -r requirements.txt
python app.py- Navigate to
localhost:5173to use the app.
| Model | Purpose |
|---|---|
| K-Nearest Neighbors (KNN) | Fast, intuitive classification |
| Support Vector Machine (SVM) | Robust decision boundary |
| Random Forest | High-accuracy ensemble learning |
| Logistic Regression | Baseline interpretability model |
- User uploads gene expression data
- Feature selection via mRMR or Knapsack algorithm
- Prediction using trained ML models
- Result visualization with subtype classification & confidence
- Public leukemia datasets from NCBI GEO
- Gene expression profiles from microarray/RNA-seq studies
- Subtype: B-ALL with ETV6-RUNX1
- Confidence: 92.7%
- Suggested Therapy: Standard chemo with excellent prognosis
Harish S.S. Machine Learning & Bioinformatics Enthusiast Email: harishdeepikassdeepikass@gmail.com
This project is licensed under the MIT License. See LICENSE file for details.