LLM Sentinel: AI Security Gateway

Hey! I'm Aman. I built Sentinel to demonstrate how organizations can safely integrate Large Language Models (LLMs) without compromising user privacy or system security. Instead of connecting end-users directly to an AI, I engineered a locally-hosted middleware that acts as a secure proxy, dynamically filtering out sensitive data and blocking malicious inputs in real-time.

🧩 The Problem & Solution

The Problem: Connecting raw user inputs directly to LLMs introduces two massive attack vectors:

Data Exfiltration: Users accidentally sharing Personally Identifiable Information (PII) like credit cards, Aadhaar numbers, or addresses.
Prompt Injection: Attackers using malicious instructions to bypass system guardrails, hijack the AI's logic, or extract backend secrets.

The Solution: A lightweight, high-performance security gateway built with FastAPI. It intercepts every request, runs a multi-pass sanitization engine (Regex + Contextual checks) to redact PII, actively blocks prompt injection signatures, and logs every event into an immutable audit trail — all before the data ever reaches the target LLM, ensuring a secure, zero-dependency defense layer.

🏗️ System Architecture & Data Flow

I designed this gateway with a "Privacy by Design" architecture. Processing happens 100% locally, ensuring zero data leaves the host machine.

       [Web UI / Client] ──(HTTP POST)──> [FastAPI Security Gateway]
                                                     │
                                                     ▼
                                     [Multi-Pass Sanitization Engine]
                                     ├─ 1. PII Redaction (Regex/Context)
                                     └─ 2. Injection Signature Scanning
                                                     │
                            ┌────────────────────────┴────────────────────────┐
                            ▼                                                 ▼
                    [SQLite Audit DB]                                 [Ollama AI Engine]
                   (Immutable Logging) ◄──(AI Response / Block Status)── (tinyllama)

📸 System in Action

Note: These are live captures from my development environment demonstrating the gateway's active defense capabilities under testing.

1. Secure Access Portal

A JWT-secured authentication layer ensuring only authorized administrators can access the gateway's telemetry and audit logs.

2. Real-Time Telemetry Dashboard

The central command center. It visualizes intercepted PII types, system health, and features a dynamic control panel to toggle sanitization filters (Name, Phone, Email, Address, Cards). Note: Government IDs are permanently locked ON as a mandatory security protocol. Turning off a filter explicitly flags leaked data as UNMASKED in the risk assessment.

3. Active Threat Mitigation

Demonstrating the gateway intercepting a malicious prompt injection (forget all instructions...). The system blocks the attack before it reaches the LLM and instantly updates the real-time heatmap and threat metrics. It also enforces a strict context guardrail, preventing the AI from leaking previously provided user data.

4. Immutable Audit Trail

The compliance log. It records the original input (blurred for privacy), the dynamically secured output (showing <TAGS>), the computed risk level (SECURE, MEDIUM, HIGH), and the timestamp. You can see the MEDIUM (UNMASKED) flag actively catching instances where toggles were disabled but sensitive data was still detected. Includes a one-click Export feature to generate CSV/PDF reports for security audits.

📂 Project Structure

llm-sentinel/
├── assets/                    # Live system screenshots for documentation
├── static/
│   ├── script.js              # Frontend logic, charts, and API communication
│   └── style.css              # UI styling and dark theme
├── database.py                # Database connection and table setup
├── index.html                 # Main interface and dashboard UI
├── main.py                    # FastAPI application, auth, and API routes
├── sanitizer.py               # Core logic for PII masking & injection detection
├── .env.example               # Template for environment variables
├── requirements.txt           # Backend dependencies
├── LICENSE                    # MIT License
└── README.md                  # Project documentation

⚖️ Architecture Tradeoffs & Design Choices

Local LLM (Ollama) vs. Cloud Providers: I chose Ollama to guarantee 100% data sovereignty. Tradeoff: Bounded by local compute limits compared to accessing massive cloud APIs like OpenAI, but entirely eliminates external API data-sharing risks.
Regex/Heuristics vs. NLP Models for PII: The sanitization engine uses optimized regular expressions and contextual visibility checks instead of heavy ML models. Tradeoff: Incredibly fast (sub-millisecond latency) making it viable as a real-time gateway, but may lack the deep semantic understanding of a dedicated NLP classification model.
Dynamic Risk Scoring: Implemented an intelligent detection system that differentiates between harmless conversational fillers ("I live in") and actual data exposure. Tradeoff: Requires stricter pattern maintenance, but drastically reduces false positives for the end-user.
SQLite vs. PostgreSQL: Used SQLite for the audit and history databases. Tradeoff: Perfect for a lightweight, zero-config local gateway demonstration, but would require migrating to PostgreSQL for distributed, high-concurrency enterprise deployments.

🚀 Get it Running Locally

1. Clone & Setup

git clone https://github.com/iamanpathak/llm-sentinel.git
cd llm-sentinel

# Create and activate virtual environment
python -m venv .venv

# Windows
.\.venv\Scripts\activate 
# Mac/Linux
# source .venv/bin/activate 

# Install required packages
pip install -r requirements.txt

2. Start the AI Engine

Ensure Ollama is installed and running on your machine, then pull the required model:

ollama run tinyllama

3. Launch the Gateway

Start the FastAPI server:

python main.py

Dashboard/Gateway: Navigate to http://127.0.0.1:8000
Default Admin Credentials: admin / admin123

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

Made with ❤️ by Aman Pathak

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Sentinel: AI Security Gateway

🧩 The Problem & Solution

🏗️ System Architecture & Data Flow

📸 System in Action

1. Secure Access Portal

2. Real-Time Telemetry Dashboard

3. Active Threat Mitigation

4. Immutable Audit Trail

📂 Project Structure

⚖️ Architecture Tradeoffs & Design Choices

🚀 Get it Running Locally

1. Clone & Setup

2. Start the AI Engine

3. Launch the Gateway

📄 License

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
assets		assets
static		static
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
database.py		database.py
index.html		index.html
main.py		main.py
requirements.txt		requirements.txt
sanitizer.py		sanitizer.py

Folders and files

Latest commit

History

Repository files navigation

LLM Sentinel: AI Security Gateway

🧩 The Problem & Solution

🏗️ System Architecture & Data Flow

📸 System in Action

1. Secure Access Portal

2. Real-Time Telemetry Dashboard

3. Active Threat Mitigation

4. Immutable Audit Trail

📂 Project Structure

⚖️ Architecture Tradeoffs & Design Choices

🚀 Get it Running Locally

1. Clone & Setup

2. Start the AI Engine

3. Launch the Gateway

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages