🧠 EEG-RAG

Production-grade Retrieval-Augmented Generation for EEG research literature — multi-agent, medically cited, instantly queryable.

Important

Research/Clinical Disclaimer: EEG-RAG is designed for research and educational purposes. All retrieved citations must be independently verified before clinical decision-making. This system is not a substitute for professional medical advice.

Tip

Get started in 5 minutes: pip install -e . && uvicorn eeg_rag.api.main:app --reload then visit http://localhost:8080/docs

Overview
Key Features
- Feature Table
Architecture
Agent Roster
- New Agents
Quick Start
- Installation
- Configuration
- API Endpoints
Usage
- Python SDK
- Web UI
Paper Database
Technology Stack
EEG Domain Knowledge
Advanced Retrieval
Systematic Review
Bibliometrics
Enterprise Features
Project Roadmap
Development Status
Development
Claim Verification
Contributing
Changelog
License & Acknowledgements

🎯 Overview

EEG-RAG is an enterprise-ready, multi-agent RAG system built specifically for electroencephalography (EEG) research and clinical applications. It processes scientific literature from PubMed (35M+ papers), Semantic Scholar, arXiv, OpenAlex, ClinicalTrials.gov, and Europe PMC, then answers natural-language queries with verified, PMID-cited responses in under 2 seconds.

The problem it solves: EEG researchers spend 40-60% of their time searching literature. PubMed holds 150,000+ EEG papers, but there is no unified way to query that knowledge semantically, verify citations, or synthesize findings across studies.

Who it is for: Clinical EEG researchers, epileptologists, BCI engineers, cognitive neuroscientists, ML engineers working on neural data, and graduate students entering the field.

In Plain Language — Benefits for EEG Professionals

⏱️ Spend less time digging through papers. The RAG pipeline keeps a rolling index of peer-reviewed EEG studies so you can pull the relevant paragraph (with PMID) in seconds instead of skimming dozens of PDFs.¹
🧩 See patient-matched precedents. By linking EEG waveforms, clinical context, and prior cases (replicating the EEG-MedRAG methodology that beat other retrieval methods by 5–20 F1 points across seven disorders), you can quickly sanity-check seizure patterns or cognitive task responses against similar cohorts.²
📑 Trust the answer because the evidence is attached. Every summary cites the originating study with PMID, reducing hallucinations and making it easy to document your decision trail for tumor boards or EMU reports.¹
🔄 Stay aligned across the care team. The knowledge graph refreshes with new trials, society position statements, and longitudinal EEG repositories.¹²

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
.copilot		.copilot
.github		.github
.vscode		.vscode
assets		assets
data		data
docker		docker
docs		docs
examples		examples
k8s		k8s
memory-bank		memory-bank
schemas		schemas
scripts		scripts
src/eeg_rag		src/eeg_rag
tests		tests
.editorconfig		.editorconfig
.env.example		.env.example
.gitignore		.gitignore
.pylintrc		.pylintrc
=1.28.0		=1.28.0
CHANGELOG.md		CHANGELOG.md
INSTALL.md		INSTALL.md
LICENSE		LICENSE
Makefile		Makefile
QUICKSTART.md		QUICKSTART.md
README.md		README.md
README.md.bak		README.md.bak
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

_Icon	_Feature	_Description	_Impact	_Status
_🤖	_{Multi-Agent System}	_{12 specialized AI agents work in parallel — see full agent table below}	_High	_{✅ Stable}
_🔍	_{Hybrid Retrieval}	_{BM25 + Dense vectors + SPLADE learned sparse + Cross-encoder reranking with RRF fusion}	_High	_{✅ Stable}
_📡	_{FastAPI Web Service}	_{REST API with 10 endpoints + Server-Sent Events (SSE) for real-time streaming progress}	_High	_{✅ Stable}
_✅	_{Citation Verification}	_{Medical-grade PMID validation, hallucination detection, retraction checking}	_Critical	_{✅ Stable}
_🧠	_{PubMedBERT Embeddings}	_{768-dim domain embeddings pre-trained on 14M PubMed abstracts; selectable via model_preset}	_High	_{✅ Stable}
_📥	_{Multi-Source Ingestion}	_{PubMed, Semantic Scholar, arXiv, OpenAlex, ClinicalTrials.gov, Europe PMC (120K+ papers)}	_High	_{✅ Stable}
_🏥	_{ClinicalTrials.gov}	_{EEG clinical trial data (epilepsy, BCI, neurofeedback, sleep) via REST v2 API with EEG relevance scoring}	_High	_{✅ New}
_🌍	_{Europe PMC}	_{Open-access EEG literature via cursor-based pagination with full-text XML retrieval}	_High	_{✅ New}
_🔬	_{ResearchAgent}	_{Parallel multi-source coordinator — PubMed + Semantic Scholar + Local in one call with dedup & evidence ranking}	_High	_{✅ New}
_🗂️	_{SystematicReviewAgent}	_{Fully automated PRISMA-compliant systematic reviews: dedup → screen → grade → themes → gaps}	_High	_{✅ New}
_🩺	_{ClinicalMatchingAgent}	_{Maps EEG patterns to clinical diagnoses using ACNS terminology, ICD-10 codes, evidence PMIDs and drug effect lookup}	_High	_{✅ New}
_📋	_{CitationAgent}	_{Batch citation validation: impact scoring, retraction detection, ORCID linking, cross-reference checking, open-access status}	_Critical	_{✅ Stable}
_📊	_{Bibliometrics Dashboard}	_{pyBiblioNet integration: trends, citation networks, KeyBERT NLP, Scopus export}	_Medium	_{✅ Stable}
_🔬	_{NER System}	_{EEG Named Entity Recognition: 400+ terms across 12 categories (electrodes, bands, ERPs, conditions)}	_Medium	_{✅ Stable}
_🗂️	_{Systematic Review (YAML)}	_{YAML-schema extraction, reproducibility scoring, temporal comparison vs Roy et al. 2019}	_Medium	_{✅ Stable}
_🏢	_{Enterprise Security}	_{SVG/PDF malware scanning, prompt injection detection, SHA-256 audit trail, OpenTimestamps}	_Medium	_{🔄 Beta}
_🗄️	_{Knowledge Graph}	_{Neo4j with Cypher queries: multi-hop reasoning across entities (PAPER, BIOMARKER, CONDITION, OUTCOME)}	_Medium	_{🔄 Beta}
_🚀	_{Adaptive Query Routing}	_{Intelligent routing to optimal agents based on query complexity, 30% latency reduction}	_Medium	_{🟡 Planned}

_#	_Agent	_Type	_Focus	_{How It Works}
₁	_{OrchestratorAgent _{agents/orchestrator/}}	_ORCH	_{Central coordinator}	_{Receives a user query, builds a plan with QueryPlanner, fans out to sub-agents in parallel via asyncio.gather, merges ranked results}
₂	_{LocalDataAgent _{agents/local_agent/}}	_LOCAL	_{Fast in-corpus retrieval}	_{Hybrid BM25 + FAISS dense search over the 120K-paper corpus; <100 ms for 10K docs via RRF fusion}
₃	_{PubMedAgent _{agents/pubmed_agent/}}	_CLOUD	_{Peer-reviewed literature}	_{NCBI E-utilities with MeSH expansion, rate-limited to 3 req/s (10 req/s with API key), returns PMID-annotated results}
₄	_{SemanticScholarAgent _{agents/semantic_scholar_agent/}}	_CLOUD	_{Citation graphs & influence}	_{Queries the S2 Graph API for papers + citation counts + influential-citation flags; re-ranks by citation velocity}
₅	_{WebSearchAgent _{agents/web_agent/}}	_WEB	_{Web / preprint search}	_{Falls back to web search for topics not covered by academic DBs; handles arXiv and bioRxiv preprints}
₆	_{GraphAgent _{agents/graph_agent/}}	_CLOUD	_{Multi-hop reasoning}	_{Runs Cypher queries on Neo4j linking PAPER → BIOMARKER → CONDITION → OUTCOME nodes}
₇	_{CitationAgent _{agents/citation_agent/}}	_AGG	_{Citation validation & impact}	_{Validates PMIDs/DOIs; computes impact score (citations + journal IF + recency); detects retractions; batch-validates 100+ papers with caching}
₈	_{SynthesisAgent _{agents/synthesis_agent/}}	_AGG	_{Multi-LLM answer generation}	_{Feeds ranked context chunks to a configurable LLM ensemble; includes EvidenceRanker (1a–5 OCEBM) and hallucination detection}
₉	_{MCPAgent _{agents/mcp_agent/}}	_MCP	_{MCP protocol bridge}	_{Exposes all agents as callable tools via Model Context Protocol; enables Claude Desktop and other MCP client integrations}
₁₀	_{ResearchAgent _{agents/research_agent/}}	_CLOUD	_{Multi-source coordinator ✨}	_{Runs PubMed + SemanticScholar + LocalData in parallel; isolates per-source errors; deduplicates by PMID/DOI/title; applies 13-group EEG synonym expansion}
₁₁	_{SystematicReviewAgent _{agents/systematic_review_agent/}}	_AGG	_{PRISMA automation ✨}	_{Full PRISMA pipeline: dedup → abstract screening → OCEBM grading → thematic grouping (freq bands, methods, conditions) → gap detection}
₁₂	_{ClinicalMatchingAgent _{agents/clinical_matching_agent/}}	_LOCAL	_{EEG → diagnosis ✨}	_{13-entry ACNS pattern KB (spike-wave, hypsarrhythmia, LPDs, GRDA, LRDA, sleep stages, BCI); age modifiers + drug-EEG lookup; returns ICD-10 codes + PMIDs}

_Endpoint	_Method	_Description
_/health	_GET	_{Health check with per-agent status}
_/metrics	_GET	_{Performance metrics (latency, cache rate)}
_/search	_POST	_{Standard search with AI synthesis}
_{/search/stream}	_POST	_{SSE streaming — real-time progress}
_{/paper/details}	_POST	_{Fetch full paper metadata}
_{/paper/citations}	_POST	_{Citation network analysis}
_/suggest	_GET	_{Query autocomplete}
_/query-types	_GET	_{Available query categories}
_/docs	_GET	_{Swagger UI}
_/redoc	_GET	_{ReDoc documentation}

_Agent	_Role	_{What It Does}
_{🎯 Orchestrator}	_{Central Coordinator}	_{Routes queries, manages workflow}
_{📋 Query Planner}	_{Query Analyst}	_{Decomposes complexity, identifies entities}
_{💾 Local Search}	_{Fast Retrieval}	_{FAISS hybrid BM25+vector search (<100ms)}
_{🏥 PubMed Search}	_{Literature Gateway}	_{MeSH-expanded queries, NCBI-compliant rates}
_{🔬 Semantic Scholar}	_{Citation Analysis}	_{Influence scoring, citation network}
_{🕸️ Knowledge Graph}	_{Relationship Mapper}	_{Neo4j entity resolution}
_{✅ Citation Agent}	_{Quality Assurance}	_{PMID verification, retraction + impact scoring}
_{🧪 Synthesis}	_{Answer Generator}	_{Multi-LLM ensemble summaries}
_{📡 MCP Agent}	_{Tool Bridge}	_{Exposes agents via Model Context Protocol}
_{🔗 Research Agent}	_{Multi-Source Search}	_{Parallel PubMed + S2 + Local with dedup}
_{📋 Systematic Review}	_{PRISMA Automation}	_{Structured review pipeline with evidence grading}
_{🩺 Clinical Matching}	_{EEG → Diagnosis}	_{Pattern→diagnosis with ICD-10 + drug effects}

_Source	_{ID Types}	_{Best For}	_{Rate (no key)}	_{Rate (with key)}
_{✅ PubMed}	_PMID	_{Medical / life sciences}	_{3 req/sec}	_{10 req/sec}
_{✅ Semantic Scholar}	_{DOI, PMID, arXiv}	_{Citation data, CS/neuro}	_{20 req/min}	_{100 req/min}
_{✅ arXiv}	_{arXiv ID}	_{Physics, CS, math preprints}	_{~20 papers/min}	_—
_{✅ OpenAlex}	_{DOI, OpenAlex ID}	_{Open metadata, broad coverage}	_100K/day	_—
_{✅ CrossRef}	_DOI	_{Authoritative DOI metadata}	_{50 req/sec}	_—
_{✅ bioRxiv / medRxiv}	_{DOI (10.1101/*)}	_{Life science preprints}	_{2 req/sec}	_—
_{✅ ClinicalTrials.gov}	_{NCT ID}	_{EEG clinical trials (epilepsy, BCI, sleep, neurofeedback)}	_{REST v2, unlimited}	_—
_{✅ Europe PMC}	_{PMID, PMCID}	_{Open-access EEG literature with full-text XML}	_{cursor-based, unlimited}	_—
_{⚠️ IEEE Xplore}	_—	_{Engineering (requires API key)}	_—	_—

_Technology	_Purpose	_{Why Chosen}	_{Alternatives Considered}
_{Python 3.9+}	_{Core runtime}	_{Rich ML/NLP ecosystem, async support, type hints}	_{Node.js (lacks NLP maturity)}
_FastAPI	_{REST API framework}	_{Async-native, auto OpenAPI docs, SSE support}	_{Flask (no async), Django (heavier)}
_FAISS	_{Vector similarity search}	_{<10ms for 1M vectors, GPU support, free}	_{Pinecone (cloud/paid), Weaviate (heavier)}
_PubMedBERT	_{Biomedical embeddings}	_{Pre-trained on 14M PubMed papers, 87% NER F1; selectable via model_preset parameter}	_{BioBERT (older), SciBERT (general science)}
_{BM25 (rank-bm25)}	_{Sparse keyword retrieval}	_{Fast, no GPU, strong baseline for EEG terms}	_{TF-IDF (less nuanced), Elasticsearch}
_SPLADE	_{Learned sparse retrieval}	_{+10-15% recall over BM25, domain-aware}	_{ANSERINI (less flexible)}
_Streamlit	_{Web UI}	_{Rapid data science UI, no frontend expertise needed}	_{React (more complex), Gradio}
_Neo4j	_{Knowledge graph}	_{Cypher queries, multi-hop reasoning, visualization}	_{ArangoDB (steeper curve), TigerGraph}
_Redis	_{Query cache}	_{Sub-ms latency, TTL support, LRU eviction}	_{Memcached (no persistence), DynamoDB}
_{Pydantic v2}	_{Data validation}	_{Type-safe models, fast validation at I/O boundaries}	_{dataclasses (no validation), marshmallow}
_{pytest + asyncio}	_Testing	_{Async test support, parametrize, 294+ tests passing}	_{unittest (verbose), nose (deprecated)}
_Docker	_{Containerization}	_{Reproducible builds, isolation, K8s-ready}	_{Conda (Python-only), venv (no system deps)}

_Method	_Latency	_Recall@10	_{When to Use}
_{BM25 baseline}	_~20ms	_78%	_{Fast, exact-term queries}
_{SPLADE learned sparse}	_~40ms	_88%	_{Better quality needed}
_{Dense (PubMedBERT)}	_~30ms	_82%	_{Semantic / conceptual queries}
_{Hybrid BM25 + Dense (RRF)}	_~60ms	_91%	_{Best general baseline}
_{Hybrid + Reranking}	_~160ms	_95%	_{High-precision tasks}

_Scenario	_{Without Cache}	_{With Cache}	_Speedup
_{Repeated query}	_1.8s	_0.05s	_36x
_{Similar query}	_1.8s	_1.8s	_1x
_{Popular EEG terms}	_1.8s	_0.05s	_36x

_Model	_{PubMed NER F1}	_{EEG Term Recall}
_BERT-base	_0.78	_72%
_BioBERT	_0.84	_81%
_PubMedBERT	_0.87	_89%
_SciBERT	_0.82	_75%

_Band	_Frequency	_{Cognitive State}	_{Clinical Relevance}
_{Delta (δ)}	_{0.5–4 Hz}	_{Deep sleep, unconsciousness}	_{Tumor detection, encephalopathy}
_{Theta (θ)}	_{4–8 Hz}	_{Drowsiness, meditation}	_{Memory encoding, ADHD markers}
_{Alpha (α)}	_{8–13 Hz}	_{Relaxed wakefulness}	_{Eyes-closed resting state}
_{Beta (β)}	_{13–30 Hz}	_{Active thinking, focus}	_{Anxiety, motor planning}
_{Gamma (γ)}	_{30–100 Hz}	_{Cognitive processing, binding}	_{Attention, consciousness}

_Component	_Latency	_Paradigm	_{Clinical Use}
_P300	_~300ms	_{Oddball (target detection)}	_{Working memory, BCI spellers}
_N400	_~400ms	_{Semantic violation}	_{Language disorders}
_N170	_~170ms	_{Face stimulus}	_{Face processing research}
_MMN	_150–250ms	_{Deviant auditory stimulus}	_{Pre-attentive processing, schizophrenia}
_ERN	_50–100ms	_{Error response}	_{Error monitoring, OCD}

_{Entity Type}	_{Term Count}	_Examples
_{Frequency Bands}	₁₄	_{delta (0.5-4Hz), theta, alpha, beta, gamma}
_{Brain Regions}	₄₀₊	_{frontal cortex, hippocampus, amygdala}
_Electrodes	₆₀₊	_{Fp1, Fz, Cz, Pz, O1, O2 (10-20 system)}
_{Clinical Conditions}	₅₀₊	_{epilepsy, Alzheimer's, depression, ADHD}
_Biomarkers	₄₀₊	_{P300, alpha asymmetry, theta-beta ratio}
_{Measurement Units}	₁₀₊	_{Hz, μV, ms, amplitude, power}
_{Signal Features}	₂₀₊	_{artifacts, epochs, phase, waveforms}
_{Experimental Tasks}	₃₀₊	_{resting state, oddball, motor imagery}
_{Processing Methods}	₃₅₊	_{ICA, FFT, bandpass filter}
_{EEG Phenomena}	₂₅₊	_{alpha blocking, sleep spindles}
_{Cognitive States}	₂₀₊	_{attention, drowsiness, meditation}
_Hardware	₁₅₊	_{EEG cap, amplifier, BioSemi}

_Preset	_Model	_{Best For}
_{general (default)}	_{sentence-transformers/all-MiniLM-L6-v2}	_{Fast baseline, general text}
_pubmedbert	_{microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract-fulltext}	_{Clinical and biomedical EEG text}
_biobert	_{dmis-lab/biobert-base-cased-v1.2}	_{Biomedical NLP tasks}
_mpnet	_{sentence-transformers/all-mpnet-base-v2}	_{High-quality general retrieval}
_multilingual	_{sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2}	_{Non-English EEG literature}

_Criterion	_Score	_Example
_{Public GitHub repo}	₁₀	_{https://github.com/author/repo}
_{Code on request}	₅	_{"Available upon reasonable request"}
_{Public dataset}	₈	_{CHB-MIT, PhysioNet, DEAP, TUSZ}
_{Private/clinical dataset}	₄	_{Hospital EEG (ethics-approved)}
_Maximum	₁₈	_{Fully reproducible research}

_Standard	_Status	_Notes
_HIPAA	_{✅ Ready}	_{Healthcare data protection (US)}
_GDPR	_{✅ Ready}	_{Data protection (EU)}
_{FDA 510(k)}	_{🟡 Partial}	_{Medical device clearance — documentation ready}
_{CE Mark}	_{🟡 Partial}	_{European conformity — documentation ready}

_Phase	_Goals	_Status
_{Phase 1 — Foundation}	_{Architecture, BaseAgent, QueryPlanner, Memory, Orchestrator}	_{✅ 100%}
_{Phase 2 — Agents}	_{LocalSearch, PubMed, GraphAgent, CitationValidator}	_{✅ 100%}
_{Phase 3 — Pipeline}	_{Chunking, NER, Corpus, Embeddings, FinalAggregator}	_{✅ 100%}
_{Phase 4 — Ingestion}	_{Multi-source 120K papers, Streamlit UI, FastAPI}	_{✅ 100%}
_{Phase 5 — Advanced}	_{SPLADE, Reranker, IR Metrics, Bibliometrics, Systematic Review}	_{✅ 100%}
_{Phase 6 — New Agents & Sources}	_{ClinicalTrials.gov, Europe PMC, ResearchAgent, SystematicReviewAgent, ClinicalMatchingAgent, PubMedBERT presets}	_{✅ 100%}
_{Phase 7 — Production}	_{Full LLM, <2s p95 target, Docker prod, K8s}	_{🟡 10%}

_Metric	_Target	_Current
_{Unit tests}	_{>85% coverage}	_{330+ passing (100% pass rate)}
_{Query latency p95}	_{< 2s}	_{~1.8s (local FAISS, no LLM)}
_{Cache hit rate}	_{> 60%}	_{TBD (Redis optional)}
_{Retrieval Recall@10}	_{> 90%}	_{~91% (Hybrid+RRF)}
_{Citation precision}	_{> 95%}	_{99%+ (PMID regex + PubMed validation)}
_{System uptime}	_{> 99.5%}	_Target
_{Data sources}	₄	_{6 (+ ClinicalTrials.gov + Europe PMC)}
_{Agent count}	₈	_{12 (+ Research + SystematicReview + ClinicalMatching + Citation)}

_Command	_Description
_eeg-rag	_{Main query interface, health check, stats}
_{eeg-rag-history}	_{Browse and replay search history}
_{eeg-rag-stats}	_{Detailed corpus + system stats dashboard}

_Metric	_{What it measures}
_Faithfulness	_{Fraction of answer claims supported by retrieved context (hallucination score)}
_{Answer Relevance}	_{Semantic similarity between the query and the answer}
_{Context Precision}	_{Average precision of the retrieved chunk ranking}
_{Context Recall}	_{Coverage of ground-truth documents or sentences}

_Resource	_Contribution
_{Microsoft Research}	_{PubMedBERT — biomedical embeddings pre-trained on 14M PubMed abstracts}
_{Facebook AI Research}	_{FAISS — billion-scale vector similarity search}
_{NCBI / NIH}	_{PubMed E-utilities API — unrestricted access to 35M+ citations}
_{Semantic Scholar (AI2)}	_{Citation graph API — influence scores and citation networks}
_{EEG Research Community}	_{Domain expertise, test corpora, and validation of terminology}
_{Cormack et al. 2009}	_{Reciprocal Rank Fusion algorithm underlying hybrid retrieval}
_{Wang et al. 2025}	_{EEG-MedRAG methodology — hypergraph retrieval for clinical EEG}

Folders and files

Latest commit

History

Repository files navigation

🧠 EEG-RAG

Table of Contents

🎯 Overview

In Plain Language — Benefits for EEG Professionals

✨ Key Features

Feature Status Table

🏗️ Architecture

System Overview

Query Lifecycle

EEG Domain Taxonomy

🤖 Agent Roster

Complete Agent Table

New Agents Added

🔗 ResearchAgent — Multi-Source Literature Coordinator

🗂️ SystematicReviewAgent — PRISMA Review Automation

🩺 ClinicalMatchingAgent — EEG Pattern → Diagnosis

✅ CitationAgent — Batch Citation Validation

🚀 Quick Start

Installation

Configuration

Start the API Server

API Endpoints

💻 Usage

Python SDK

Web UI — 12 AI Agents

Ingest Research Papers

📚 Paper Database

Supported Sources

New Sources — ClinicalTrials.gov and Europe PMC

🔧 Technology Stack

Retrieval Stage Comparison

Cache Impact

PubMedBERT vs Alternatives

🔬 EEG Domain Knowledge

Frequency Bands

ERP Components

NER System — 400+ Terms, 12 Categories

⚡ Advanced Retrieval

Multi-Stage Pipeline

Embedding Model Presets

IR Evaluation Metrics (Built-in)

🗂️ Systematic Review Automation

Reproducibility Scoring

📈 Bibliometrics & Research Analytics

🏢 Enterprise Features

Citation Provenance Tracking

Dataset Security Scanner

Regulatory Compliance

📅 Project Roadmap

Milestone Summary

📊 Development Status

🛠️ Development

Setup Dev Environment

Run Tests

Code Quality

Code Standards

Claim Verification

🤝 Contributing

PR Requirements

Reporting Bugs

📋 Changelog

v0.4.1 — April 2026

PyPI Packaging

Structured Code Comments

Diagram Fixes

v0.4.0 — April 2026

Agentic RAG Loop

RAGAS Evaluation Metrics

Stub Code Filled

Test Coverage

📜 License & Acknowledgements

Acknowledgements

Footnotes

About

Topics

Resources

Packages