DocAI: Agentic Medical Consultation Assistant

DocAI is a sophisticated, privacy-focused medical consultation system. It replaces naive static RAG (Retrieval-Augmented Generation) with a dynamic ReAct Agent architecture, utilizing Google's medically-tuned MedGemma-4b model to interact with patients and autonomously query their historical health records.

🚀 Key Features

Autonomous ReAct LLM Agent: The core consultation loop is agentic. The AI evaluates the conversation natively and decides whether to [SEARCH] the patient's medical history for context, or [ANSWER] the user directly, minimizing hallucinations.
Semantic Health Record Search: Integrates pgvector and a locally hosted BioBERT embedding model to perform high-speed, semantic similarity searches across a user's consultation summaries and clinical notes.
Event-Driven Memory Pipeline: Automates background insight extraction, summarization, and active condition detection asynchronously when a consultation ends, keeping the UI blazing fast.
Decoupled Backend Architecture: Runs the heavy 4-Billion parameter MedGemma model on Kaggle's free GPU tier via an automated Gradio tunnel, keeping the local Flask server lightweight.
Modern React Frontend: Built with Vite and TailwindCSS for a responsive, dynamic user experience.
One-Click Startup Automation: Features a robust Powershell script (start.ps1) to spin up PostgreSQL, apply database schemas, launch the Flask backend, and the Vite frontend simultaneously.

🏗️ Architecture Stack

Frontend Framework: React (Vite) + TailwindCSS
Backend Framework: Python / Flask
Database: PostgreSQL with pgvector extension
Embedding Model: pritamdeka/BioBERT-mnli-snli-scinli-scitail-mednli-stsb (768D)
Agentic Reasoning LLM: google/medgemma-4b-it (Hosted off-device via Kaggle+Gradio)
Background Processing LLM: Qwen/Qwen2.5-7B-Instruct (via HuggingFace Inference API)

📁 Project Structure

DocAI/
├── frontend/               # React (Vite) User Interface
├── ai/                     # Core Agentic Intelligence Logic
│   ├── ai.py               # Orchestrates the ReAct [SEARCH]/[ANSWER] loops
│   ├── embedding.py        # Generates BioBERT vector embeddings
│   ├── LLM_module.py       # Interfaces with HuggingFace/Gradio & houses strict Prompts
│   ├── post_processing.py  # Background memory & summarization pipeline
│   ├── UserConditionManager.py # Autonomous diagnosis state machine
│   └── MemoryManager.py    # Standardizes Message arrays for history tracking
├── app/                    # Flask Web Application
│   ├── routes.py           # API Endpoints (/consult, /end_consultation)
│   └── __init__.py         # App factory & config
├── db/                     # Database schemas and CRUD operations
├── .env                    # Secrets (HF Token, Ngrok Token, etc)
├── start.ps1               # Automated Docker, Flask & React bootstrapper
└── run.py                  # Backend Entry point

⚙️ Setup and Installation

1. Requirements

Node.js and npm (for the React frontend)
Docker Desktop (for Postgres/pgvector)
Python 3.10+
A free Ngrok Account
A free HuggingFace Account (with user access token)
A Kaggle Account (for free GPU hosting)

2. Environment Configuration

Create a .env file in the root directory:

# Database Configuration
DB_HOST=127.0.0.1
DB_PORT=5432
DB_USER=root
DB_PASSWORD=root
DB_NAME=docbase

# HuggingFace & Kaggle Configuration
HF_API_TOKEN=your_hf_access_token
URL_UPDATE_SECRET=docai-url-push-secret

3. Launch the Application

Simply run the robust startup script from PowerShell. It will boot Docker, initialize pgvector, install python/npm dependencies, and launch both Flask and React servers.

.\start.ps1

4. Boot the AI Brain (Kaggle)

Upload a hosted Kaggle Notebook running the DocAI Inference server.
Ensure the "T4 x2" (or better) GPU accelerator is active.
Add your HF_TOKEN, DOCAI_SERVER_URL (the ngrok url printed out by start.ps1), and DOCAI_SECRET (matching your .env) to the Kaggle Secrets tab.
Run all cells. The notebook will host MedGemma-4b on Kaggle's GPU and automatically inform your local Flask server of the secure tunnel address.

5. Chat!

Your DocAI system is now fully synced, agentic, and ready to respond to complex medical queries while searching semantic history.

Disclaimer: DocAI is a theoretical software project. It is not a certified medical device and should not be used to replace professional clinical judgment or offer definitive medical diagnoses.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
ai		ai
app		app
db		db
frontend		frontend
.gitignore		.gitignore
README.md		README.md
render.yaml		render.yaml
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocAI: Agentic Medical Consultation Assistant

🚀 Key Features

🏗️ Architecture Stack

📁 Project Structure

⚙️ Setup and Installation

1. Requirements

2. Environment Configuration

3. Launch the Application

4. Boot the AI Brain (Kaggle)

5. Chat!

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DocAI: Agentic Medical Consultation Assistant

🚀 Key Features

🏗️ Architecture Stack

📁 Project Structure

⚙️ Setup and Installation

1. Requirements

2. Environment Configuration

3. Launch the Application

4. Boot the AI Brain (Kaggle)

5. Chat!

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages