RAG System Backend

Role-based RAG system with LangChain and ChromaDB for the BadCompany game.

Quick Start

1. Install Dependencies

pip install -r requirements.txt

2. Configure Environment

Copy and configure .env file:

copy .env.example .env

Template for .env file:

OPENROUTER_API_KEY=your_api_key
OPENROUTER_BASE_URL=your_project_url
TAVILY_API_KEY=''
HF_TOKEN=your_api_key
USE_HF_EMBEDDINGS=true
HF_LOGGING=true
HF_API_URL=https://api-inference.huggingface.co/models/BAAI/bge-small-en-v1.5
USER_AGENT=RAG-System/1.0

See SETUP_GUIDE.md for detailed configuration instructions.

3. Run Server

uvicorn server:app --reload --port 8000

4. Verify Setup

curl http://127.0.0.1:8000/debug/status

Server will be available at http://localhost:8000

Features

HuggingFace Embeddings Integration
- Automatic endpoint discovery and fallback
- Batch processing for efficiency
- Support for multiple embedding models
- Fallback to local sentence-transformers
Role-Based Access Control
- Public, worker, and admin document access
- Separate vector stores per role
- Prevents privilege escalation
Robust Error Handling
- Clear error messages for missing credentials
- Automatic retry with endpoint variations
- Graceful degradation

API Endpoints

GET / - Health check
GET /debug/status - Diagnostic information (embeddings, docs, config)
POST /session/start - Initialize session
POST /agent/chat - Chat with RAG system
POST /judge/evaluate - Evaluate attack attempts
GET /scenarios - List available scenarios

Project Structure

server.py - FastAPI server with RAG endpoints
core/ - RAG pipeline (embeddings, retrieval, LLM, vectorstore)
- embeddings.py - HuggingFace embeddings wrapper with auto-discovery
- retrieval.py - Document loading and RAG chain
- vectorstore.py - ChromaDB integration
data/ - Documents organized by access level (public, worker, admin)
config/ - Settings and credentials
vector_store_*/ - ChromaDB vector stores per role

Role-Based Access

public: Access to public documents only
worker: Access to public + worker documents
admin: Access to all documents

Documents in data/admin/ contain sensitive information that attackers try to extract.

HuggingFace Embeddings

The system uses HuggingFace Inference API by default with intelligent endpoint discovery:

Automatic Format Detection: Tries array and string payload formats
Batch Processing: Embeds multiple texts in single API call when possible
Smart Fallbacks: Tests multiple endpoint variations automatically
Local Fallback: Can use sentence-transformers if HF unavailable

Supported models:

sentence-transformers/all-MiniLM-L6-v2 (default, fast)
BAAI/bge-small-en-v1.5 (better quality)
BAAI/bge-base-en-v1.5 (best quality)

See SETUP_GUIDE.md for configuration options.

Troubleshooting

No embeddings / token errors:

Ensure HF_TOKEN is set in .env
Get token from https://huggingface.co/settings/tokens

Slow first request:

HF models have 30-60s cold start on first request
Subsequent requests are fast (model stays loaded)

Document loading issues:

Check /debug/status endpoint for diagnostics
Verify files exist in data/ subdirectories

Full troubleshooting guide: See SETUP_GUIDE.md

Development

# Install dev dependencies
pip install -r requirements.txt

# Run with auto-reload
uvicorn server:app --reload --port 8000

# Check logs for HF embedding calls
# (Set HF_LOGGING=true in .env)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
agent		agent
config		config
core		core
data		data
platform_logic		platform_logic
scenarios		scenarios
tools		tools
users		users
utils		utils
.dockerignore		.dockerignore
.gitignore		.gitignore
Caddyfile		Caddyfile
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
server.py		server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG System Backend

Quick Start

1. Install Dependencies

2. Configure Environment

3. Run Server

4. Verify Setup

Features

API Endpoints

Project Structure

Role-Based Access

HuggingFace Embeddings

Troubleshooting

Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG System Backend

Quick Start

1. Install Dependencies

2. Configure Environment

3. Run Server

4. Verify Setup

Features

API Endpoints

Project Structure

Role-Based Access

HuggingFace Embeddings

Troubleshooting

Development

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages