Metabolic BioTwin

Personal Metabolic Decision Engine — glucose & prediabetes focus

Transform your health data into actionable, personalized insights for metabolic and glucose decisions. This is decision support only, not medical diagnosis or treatment.

Quick Start

Prerequisites

Python 3.8 or higher
pip package manager

Installation & Setup

Clone the repository

git clone https://github.com/pkumar1025/Metabolic-BioTwin.git
cd Metabolic-BioTwin

Install dependencies
```
pip install -r requirements.txt
```
Run the application
```
python -m uvicorn app.main:app --reload
```
Or make run. Then open http://localhost:8000 (redirects to the dashboard at /app/).
Open your browser Navigate to http://localhost:8000

Optional: LLM-generated insight text (OpenAI)

Insight cards can show personalized intervention and success text.

Get an API key from OpenAI.
Set the environment variable before running the app:
```
export OPENAI_API_KEY=your_key_here
```
Or add it to a .env file if you use one.
Optional: override the model with OPENAI_MODEL (default: gpt-4o-mini; e.g. gpt-4o for higher quality).

If OPENAI_API_KEY is not set, the app uses fallback text and works as before.

Environment variables (optional):

Variable	Default	Description
`OPENAI_API_KEY`	—	When set, insight cards use OpenAI for intervention/success text.
`OPENAI_MODEL`	`gpt-4o-mini`	Model for insight text (e.g. `gpt-4o` for higher quality).

Session data is in-memory; no database or other env vars are required.

How to Use

Option 1: Demo Data (Recommended for first-time users)

Click "Get Started with Demo Data" on the homepage
Explore the 4 dashboard tabs with pre-loaded sample data
Navigate between Health Trends, Meals, AI Insights, and Predictions tabs

Option 2: Upload Your Own Data

Prepare CSV files with your health data (see supported formats below)
Click "Upload Your Data" and select your CSV files
The system will automatically process and analyze your data
Access the same 4 dashboard tabs with your personal data

Additional Features

Health Score API: Available via /api/health-score endpoint
Correlations API: Available via /api/correlations endpoint
Note: These features have complete backend implementations but are not yet integrated into the dashboard UI

Supported Data Formats

Required CSV Files

Upload any combination of these data types:

Meals Data (meals.csv)

date,time,carbs_g,protein_g,fat_g,fiber_g,calories
2024-01-01,08:00,45,20,15,8,350
2024-01-01,13:00,60,25,20,12,450

Sleep Data (sleep.csv)

date,sleep_hours,hrv,rhr
2024-01-01,7.5,45,65
2024-01-02,8.0,50,60

Activity Data (activity.csv)

date,steps,workout_min,hydration_l
2024-01-01,8500,30,2.5
2024-01-02,9200,45,3.0

Vitals Data (vitals.csv)

date,fg_fast_mgdl,weight,bp_systolic,bp_diastolic
2024-01-01,95,70.5,120,80
2024-01-02,92,70.2,118,78

Flexible Column Names

The system automatically recognizes various column naming conventions:

date, Date, DATE, meal_date, timestamp
carbs_g, carbs, carbohydrates, carbs_grams
sleep_hours, sleep_duration, total_sleep, hours_slept
And many more...

Dashboard Features

1. Health Trends

Interactive time-series visualizations
Correlated view of sleep, activity, nutrition, and vitals over time
Zoom/pan functionality for detailed analysis

2. Meals

Detailed nutrition analysis with streamlined data table
Meal-by-meal breakdown with glucose response predictions
Color-coded status indicators and AI-driven insights
Export functionality for meal data

3. AI Insights

Data-driven actionable recommendations (decision support only, not diagnosis)
Causal relationships and correlation discoveries
Statistical analysis with confidence intervals
Personalized insights for glucose and metabolic decisions

4. Predictions

ML-powered forecasting for glucose response
Scenario modeling for different health choices
Risk assessment and recommendations
Interactive prediction charts

Additional API Features

Note: The following features have backend APIs available but are not yet integrated into the dashboard UI:

Health Score API

Multi-dimensional health assessment across glucose, sleep, recovery, nutrition, and activity
Trend analysis and personalized recommendations
Component breakdown and scoring

Correlations API

Hidden relationship discovery between health metrics
Time-lagged correlation analysis
Statistical significance testing

Technical Details

Architecture

Backend: FastAPI with Python 3.8+
Frontend: Plotly Dash for interactive visualizations
ML: Scikit-learn for machine learning models
Data Processing: Pandas for data manipulation

AI/ML Implementation

Machine Learning: Scikit-learn for predictive modeling (Random Forest, Linear Regression)
Statistical Analysis: SciPy for correlation analysis and statistical significance testing
Data Processing: Pandas and NumPy for data manipulation and feature engineering
Causal Inference: Custom implementation of doubly robust estimation for treatment effects (runs locally)
Insight card text (optional): With OPENAI_API_KEY set, intervention and success copy for the four AI Insight cards is generated by OpenAI from your analysis results; otherwise fixed fallback text is used. All numbers and card logic are computed locally; only the short narrative text is optionally sent to OpenAI.

Key AI Features

Causal Inference: Doubly robust estimation for treatment effects
Correlation Discovery: Statistical analysis of health metric relationships
Predictive Modeling: Random Forest for glucose response prediction
Anomaly Detection: Rolling median + MAD for outlier identification
Health Scoring: Multi-dimensional health assessment

Roadmap

Phase 1 – Honest prototype ✓ Predictions use real API data; upload path handles missing meal metrics (warnings, no placeholders); README + UI position as "Personal Metabolic Decision Engine," glucose/prediabetes, decision support not diagnosis.
Phase 2 – Advocate + compare Backend: rank top levers, compare-scenarios endpoint. Dashboard: "Your best next step" and "Compare two options."
Phase 3 – Outcomes A1c proxy and trajectory from fasting glucose (and later CGM); disclaimers.
Phase 4 – CGM Ingest CGM CSV; derive meal AUC/peak from CGM + meal times; optional time-in-range.
Phase 5 – Forecasting Baseline model, counterfactuals, optional simulated future curve.
Phase 6 – Polish Disclaimers, "we don't know yet" when data is low, validation, tests/runbook.

Troubleshooting

Port already in use

python -m uvicorn app.main:app --reload --port 8001

Missing dependencies

pip install -r requirements.txt --force-reinstall

Data not loading

Ensure CSV files have proper headers
Check that date columns are in a recognized format
Verify file size is under 10MB per file

"No session data" on tabs

Load demo data or upload CSVs and click PROCESS UPLOADED DATA.

Predictions show "Not enough data"

Need 14+ days of daily data (and meals with glucose metrics for meal-based predictions). Use demo data to see full predictions.

Health score missing nutrition

Add meal CSVs and re-process; nutrition score appears when meal data is present.

Import errors

Run pip install -r requirements.txt (e.g. openai, dash, fastapi).

Other commands

# Generate synthetic demo data
python scripts/generate_synthetic.py

Getting help: Check the console for errors; try demo data first; ensure required CSV columns are present. The repo does not yet include automated tests; a quick smoke check is: run the app, load demo data, and open each tab to confirm no 500s.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.venv/lib/python3.12/site-packages/requests-2.31.0.dist-info		.venv/lib/python3.12/site-packages/requests-2.31.0.dist-info
app		app
scripts		scripts
.gitignore		.gitignore
DESIGN.md		DESIGN.md
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Metabolic BioTwin

Quick Start

Prerequisites

Installation & Setup

Optional: LLM-generated insight text (OpenAI)

How to Use

Option 1: Demo Data (Recommended for first-time users)

Option 2: Upload Your Own Data

Additional Features

Supported Data Formats

Required CSV Files

Flexible Column Names

Dashboard Features

1. Health Trends

2. Meals

3. AI Insights

4. Predictions

Additional API Features

Health Score API

Correlations API

Technical Details

Architecture

AI/ML Implementation

Key AI Features

Roadmap

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Metabolic BioTwin

Quick Start

Prerequisites

Installation & Setup

Optional: LLM-generated insight text (OpenAI)

How to Use

Option 1: Demo Data (Recommended for first-time users)

Option 2: Upload Your Own Data

Additional Features

Supported Data Formats

Required CSV Files

Flexible Column Names

Dashboard Features

1. Health Trends

2. Meals

3. AI Insights

4. Predictions

Additional API Features

Health Score API

Correlations API

Technical Details

Architecture

AI/ML Implementation

Key AI Features

Roadmap

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages