GovAI Secure Intelligence Assistant (G-SIA)

Overview

GovAI Secure Intelligence Assistant (G-SIA) is a compliance-aware AI chatbot that enables secure, policy-driven access to de-identified patient data. It uses multi-agent workflows orchestrated with LangGraph, Retrieval-Augmented Generation (RAG) for regulatory policy enforcement, and LangSmith for observability. This project demonstrates how AI can safely operate in regulated environments like healthcare and public services.

Objectives

Enforce data privacy and regulatory compliance (HIPAA, GDPR, CCPA).
Enable secure structured database access through AI agents.
Provide explainable, policy-backed responses to user queries.
Demonstrate robust agent orchestration and monitoring using the LangChain ecosystem.

Key Features

Policy-Aware Reasoning: Validates every query against policies using RAG.
SQL Query Agent: Generates and executes parameterized queries.
Query Rewriting: Modifies unsafe queries to meet regulations.
Audit Logging: Tracks queries, decisions, and database interactions.
Agent Observability: LangSmith for debugging and tracing.
Scalability: Modular agents that can extend to new compliance rules.

System Architecture

+---------------------------------------------------+
|                User Interface / API               |
+---------------------------------------------------+
                       │
                       ▼
+---------------------------------------------------+
|         LangGraph Orchestration Controller        |
|  (manages workflow between agents and tools)      |
+---------------------------------------------------+
       │                       │                   │
       ▼                       ▼                   ▼
+--------------+      +-----------------+   +----------------+
| Policy Agent |----->| Query Rewriter  |   | Audit Logger   |
|  (RAG over   |      | (only if needed)|   | (logs decisions|
| HIPAA/GDPR)  |      +-----------------+   | and executions)|
+--------------+              │             +----------------+
       │                     ▼
       │            +------------------+
       │            | SQL Query Agent  |
       │            | (LangChain SQL   |
       │            | toolkit)         |
       │            +------------------+
       │                     │
       ▼                     ▼
+----------------------------------------------------+
| Secure PostgreSQL (De-identified Patient Database) |
+----------------------------------------------------+
                       │
                       ▼
+---------------------------------------------------+
| Response Formatter → Sends Compliant Response     |
+---------------------------------------------------+
                       │
                       ▼
+---------------------------------------------------+
|    Monitoring & Observability (LangSmith & Azure) |
+---------------------------------------------------+

Agent Descriptions

1. Policy Agent

Purpose: Determines if a user query complies with regulations.
Functionality:
- Uses RAG to retrieve and interpret policies from HIPAA, GDPR, and CCPA.
- Classifies queries as Allowed, Partially Allowed, or Blocked.
- Provides reasons for blocking or modifying queries.
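
The three-way classification can be illustrated with a minimal sketch. It is not the project's implementation: the real agent retrieves policy text through RAG, whereas this example substitutes two hypothetical field lists (DIRECT_IDENTIFIERS, QUASI_IDENTIFIERS) so that the decision logic runs on its own. The names Decision, PolicyVerdict, and classify are assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(Enum):
    ALLOWED = "allowed"
    PARTIAL = "partially_allowed"
    BLOCKED = "blocked"


@dataclass(frozen=True)
class PolicyVerdict:
    decision: Decision
    reason: str  # explanation returned to the user or the rewriter


# Hypothetical field lists; the real agent would derive these from retrieved policy text.
DIRECT_IDENTIFIERS = {"ssn", "name", "address", "phone"}
QUASI_IDENTIFIERS = {"birthdate", "zip"}


def classify(requested_fields: set[str]) -> PolicyVerdict:
    """Map the fields a query touches to one of the three decision classes."""
    direct = sorted(requested_fields & DIRECT_IDENTIFIERS)
    if direct:
        return PolicyVerdict(Decision.BLOCKED, f"direct identifiers requested: {direct}")
    quasi = sorted(requested_fields & QUASI_IDENTIFIERS)
    if quasi:
        return PolicyVerdict(Decision.PARTIAL, f"quasi-identifiers need rewriting: {quasi}")
    return PolicyVerdict(Decision.ALLOWED, "no identifying fields requested")
```

Returning the reason together with the decision keeps the explanation available to both the Query Rewriter and the final response.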

2. Query Rewriter

Purpose: Adjusts non-compliant queries to make them compliant.
Functionality:
- Removes sensitive fields or replaces them with aggregated metrics.
- Ensures the rewritten query still provides useful information without violating policies.
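
As a rough sketch of this rewriting step, the function below drops sensitive columns from a requested column list and replaces the row-level query with a grouped count. The column sets, the patient_count alias, and the function name rewrite_to_aggregate are hypothetical; the project's rewriter may work quite differently (for example, by prompting an LLM).

```python
# Hypothetical column classification for the patients table.
ALLOWED_COLUMNS = {"city", "gender", "race"}
SENSITIVE_COLUMNS = {"ssn", "first_name", "last_name"}
KNOWN_COLUMNS = ALLOWED_COLUMNS | SENSITIVE_COLUMNS


def rewrite_to_aggregate(columns: list[str]) -> tuple[str, list[str]]:
    """Return (sql, removed_columns) for a requested column list.

    Column names are checked against a fixed whitelist because SQL
    identifiers cannot be passed as bound parameters.
    """
    unknown = [c for c in columns if c not in KNOWN_COLUMNS]
    if not columns or unknown:
        raise ValueError(f"empty or unknown column list: {unknown}")

    removed = [c for c in columns if c in SENSITIVE_COLUMNS]
    kept = [c for c in columns if c in ALLOWED_COLUMNS]

    if not removed:
        # Nothing sensitive was requested, so no rewrite is needed.
        return f"SELECT {', '.join(kept)} FROM patients", removed
    if not kept:
        # Only sensitive columns were requested: fall back to a plain count.
        return "SELECT COUNT(*) AS patient_count FROM patients", removed

    group_by = ", ".join(kept)
    sql = f"SELECT {group_by}, COUNT(*) AS patient_count FROM patients GROUP BY {group_by}"
    return sql, removed
```

For example, a request for city and ssn becomes a count of patients per city, and the removed column list can be passed to the Audit Logger as the reason for the rewrite.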

3. SQL Query Agent

Purpose: Executes secure data retrieval from the PostgreSQL database.
Functionality:
- Converts natural language into parameterized SQL using LangChain SQL Toolkit.
- Prevents direct access to identifiers and enforces aggregation thresholds.
- Only executes queries approved by the Policy Agent.
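
The sketch below shows what a parameterized query with an aggregation threshold might look like. To stay self-contained it uses an in-memory SQLite database from the Python standard library as a stand-in for PostgreSQL; the table layout, the MIN_GROUP_SIZE value, and the function names are assumptions, not the project's code.

```python
import sqlite3
from typing import Optional

MIN_GROUP_SIZE = 5  # hypothetical threshold; small counts are suppressed


def count_patients_with_condition(conn: sqlite3.Connection, description: str) -> Optional[int]:
    """Return the distinct patient count, or None when it is below the threshold."""
    row = conn.execute(
        # The value is bound as a parameter, never interpolated into the SQL text.
        "SELECT COUNT(DISTINCT patient) FROM conditions WHERE description = ?",
        (description,),
    ).fetchone()
    count = row[0]
    return count if count >= MIN_GROUP_SIZE else None


def demo_connection() -> sqlite3.Connection:
    """Build a tiny in-memory table with made-up rows for the demonstration."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE conditions (patient TEXT, description TEXT)")
    rows = [(f"p{i}", "Hypertension") for i in range(6)] + [("p9", "Asthma")]
    conn.executemany("INSERT INTO conditions VALUES (?, ?)", rows)
    return conn
```

With the demo data, the hypertension count is returned because six patients meet the threshold, while the single asthma patient is suppressed. PostgreSQL drivers such as psycopg use %s placeholders instead of ?, but the binding principle is the same.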

4. Audit Logger

Purpose: Provides complete traceability of system actions.
Functionality:
- Logs every query, decision, SQL command, and response metadata.
- Integrates with Azure Monitor & SIEM for compliance-friendly storage.
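
One possible shape for an audit record is sketched below as a JSON line that a log shipper could forward to a SIEM. The field names and both function names are hypothetical, and the Azure Monitor integration itself is not shown.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def build_audit_record(
    user_id: str,
    query: str,
    decision: str,
    sql: Optional[str] = None,
    row_count: Optional[int] = None,
) -> dict:
    """Collect the fields worth tracing for a single interaction."""
    return {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "user_id": user_id,
        "query": query,
        "decision": decision,
        "sql": sql,              # None when the query was blocked
        "row_count": row_count,  # response metadata only, never the rows themselves
    }


def to_json_line(record: dict) -> str:
    """Serialize one record per line with stable key order."""
    return json.dumps(record, sort_keys=True) + "\n"
```

Logging only metadata such as the row count, rather than the result rows, keeps the audit trail itself from becoming a second copy of the data.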

Workflow

User Query: User asks a question (e.g., patient statistics).
Policy Agent: Checks query legality using policy embeddings.
Decision:
- ✅ Allowed: Forward to SQL Agent.
- ⚠ Partial: Query Rewriter modifies it.
- ❌ Blocked: Returns explanation of violated policy.
SQL Agent: Generates secure SQL, queries PostgreSQL.
Audit Logger: Records the complete interaction.
Response Formatter: Returns answer with compliance notes.
LangSmith & Azure Monitor: Capture reasoning and security logs.
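
The routing in the steps above can be sketched in plain Python. The project orchestrates this with LangGraph; the version below deliberately avoids that dependency and passes each agent in as a callable, so every name here (run_workflow and its parameters) is a stand-in for illustration rather than the actual graph definition.

```python
from typing import Any, Callable, Tuple


def run_workflow(
    query: str,
    check_policy: Callable[[str], Tuple[str, str]],  # returns (decision, reason)
    rewrite: Callable[[str], str],
    run_sql: Callable[[str], Any],
    audit: Callable[[str, str, Any], None],
) -> str:
    """Route a query through policy check, optional rewrite, SQL, and audit."""
    decision, reason = check_policy(query)

    if decision == "blocked":
        # Blocked queries never reach the database but are still audited.
        audit(query, decision, None)
        return f"Blocked: {reason}"

    if decision == "partial":
        query = rewrite(query)

    result = run_sql(query)
    audit(query, decision, result)
    return f"{result} (policy decision: {decision})"
```

In a LangGraph version, each callable would become a node and the branch on the decision would become a conditional edge.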

Data Layer

Dataset: Synthea synthetic patient dataset (synthetically generated, so it contains no real patient records).

Core Tables:

patients – demographics
conditions – diagnoses
encounters – hospital visits
medications – prescriptions
organizations – hospital details
audit_logs – query history & policy decisions

Tech Stack

LLM: OpenAI models
RAG: Any vector database + LangChain retriever
Agents: LangChain + LangGraph for orchestration
Observability: LangSmith
Database: PostgreSQL
Backend: FastAPI
Security: RBAC, TLS, PII masking
Logging: SIEM

Project Structure

govai-secure-assistant/
│── data/             # Synthea CSV files
│── db/               # SQL schema & migration scripts
│── agents/           # PolicyAgent, SQLAgent, QueryRewriter
│── retrievers/       # Policy document loaders
│── logs/             # Dev logs
│── app/              # FastAPI backend
│── notebooks/        # Jupyter tests
│── README.md         # Documentation
│── requirements.txt  # Dependencies

Installation

git clone https://github.com/your-username/govai-secure-assistant.git
cd govai-secure-assistant
python -m venv venv
source venv/bin/activate   # Windows: venv\Scripts\activate
pip install -r requirements.txt

Running the Project

# Load Synthea CSVs into PostgreSQL
python data/load_data.py

# Start FastAPI backend
uvicorn app.main:app --reload

Monitoring & Logging

LangSmith: Traces agent reasoning for debugging.
Azure Monitor + SIEM: Stores immutable logs for compliance.

Example Query Flow

User Query: "How many patients with hypertension were treated in Denver hospitals?"

Policy Agent: ✅ Allowed (aggregated query)
SQL Agent: Generates secure SQL → Executes
Audit Logger: Logs full trace
Response: "There were 184 patients diagnosed with hypertension treated in Denver hospitals. (HIPAA compliant)."
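
To make the "Generates secure SQL" step concrete, here is a sketch of the kind of aggregate query the example question might translate to. The table and column names are assumptions loosely modeled on the core tables listed earlier, the rows are made up, and SQLite again stands in for PostgreSQL, so the count does not reproduce the figure quoted in the response above.

```python
import sqlite3

# Aggregate-only query: it returns a single count and no patient-level rows.
HYPERTENSION_BY_CITY = """
SELECT COUNT(DISTINCT c.patient)
FROM conditions AS c
JOIN encounters AS e ON e.id = c.encounter
JOIN organizations AS o ON o.id = e.organization
WHERE c.description = ? AND o.city = ?
"""


def build_demo_db() -> sqlite3.Connection:
    """Create a toy schema with made-up rows (hypothetical column names)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE organizations (id TEXT PRIMARY KEY, city TEXT);
        CREATE TABLE encounters (id TEXT PRIMARY KEY, patient TEXT, organization TEXT);
        CREATE TABLE conditions (patient TEXT, encounter TEXT, description TEXT);
        """
    )
    conn.executemany(
        "INSERT INTO organizations VALUES (?, ?)",
        [("o1", "Denver"), ("o2", "Boulder")],
    )
    conn.executemany(
        "INSERT INTO encounters VALUES (?, ?, ?)",
        [("e1", "p1", "o1"), ("e2", "p2", "o1"), ("e3", "p3", "o2"), ("e4", "p1", "o1")],
    )
    conn.executemany(
        "INSERT INTO conditions VALUES (?, ?, ?)",
        [("p1", "e1", "Hypertension"), ("p2", "e2", "Hypertension"),
         ("p3", "e3", "Hypertension"), ("p1", "e4", "Hypertension")],
    )
    return conn


def count_hypertension(conn: sqlite3.Connection, city: str) -> int:
    """Count distinct patients with hypertension treated in the given city."""
    return conn.execute(HYPERTENSION_BY_CITY, ("Hypertension", city)).fetchone()[0]
```

COUNT(DISTINCT ...) matters here: a patient with several encounters at Denver organizations is counted once rather than once per visit.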

Compliance Considerations

Uses only synthetic Synthea data, so no real patient records are exposed.
Policy Agent enforces regulatory checks before any data access.
Audit logs provide full traceability for compliance audits.

Future Enhancements

Add differential privacy for aggregate queries.
Add FHIR API support.
Extend to other regulated domains.

Why This Project Stands Out

✅ Advanced multi-agent orchestration with LangGraph.
✅ Implements policy-driven AI reasoning.
✅ Provides a secure and auditable AI solution.
✅ Ideal for showcasing AI Solution Architect expertise.

Author

Sai Pratheek Kerthi Venkata
AI/ML Engineer | Cloud & Data Security Enthusiast
LinkedIn | GitHub
