# bam-rag

A toy RAG (Retrieval-Augmented Generation) system for learning and experimentation.
Scrapes documentation → enriches with LLM → indexes with hybrid search → serves via MCP.
## Architecture

```mermaid
flowchart LR
    subgraph scrape["① Scrape"]
        URL[🌐 URL] --> S[Colly]
        S --> MD{".md?"}
        MD -->|yes| RAW[Markdown]
        MD -->|no| HTML[HTML] --> RAW
    end
    subgraph store["② Store"]
        MINIO[(MinIO)]
    end
    subgraph enrich["③ Enrich"]
        ENG[Engine]
        LLM["🤖 Gemma3<br/>tags + summary"]
        EMB["🧮 qwen3<br/>embeddings"]
        ENG <--> LLM
        ENG <--> EMB
    end
    subgraph index["④ Index"]
        ES[("Elasticsearch<br/>BM25 + KNN")]
    end
    subgraph query["⑤ Query"]
        MCP[MCP Server] <--> CLAUDE[Claude]
    end
    RAW --> MINIO --> ENG --> ES <--> MCP
```
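
The scrape stage is built on Colly. Below is a minimal sketch of a crawler in that style; the domain, link selector, and storage hook are illustrative only, not the project's actual code:

```go
// Sketch of the scrape stage: crawl a docs site with Colly and hand the
// raw bodies off for storage. Domain, selector, and storage logic here
// are illustrative assumptions.
package main

import (
	"fmt"
	"strings"

	"github.com/gocolly/colly/v2"
)

func main() {
	c := colly.NewCollector(colly.AllowedDomains("go.dev"))

	// Follow in-domain links.
	c.OnHTML("a[href]", func(e *colly.HTMLElement) {
		e.Request.Visit(e.Attr("href"))
	})

	c.OnResponse(func(r *colly.Response) {
		if strings.HasSuffix(r.Request.URL.Path, ".md") {
			// Already Markdown: store as-is.
			fmt.Printf("store %s (%d bytes)\n", r.Request.URL, len(r.Body))
			return
		}
		// HTML: convert to Markdown first, then store (omitted here).
	})

	c.Visit("https://go.dev/doc/tutorial/getting-started")
}
```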
## Prerequisites

- Docker Desktop 4.40+ with Model Runner enabled
- Go 1.21+
- Make

To enable Model Runner:

- Open Docker Desktop → Settings → Features in development
- Enable "Docker Model Runner"
- Restart Docker Desktop
## Quick Start

```bash
# 1. Full setup (pulls models + starts infrastructure)
make setup

# 2. Update config with your Docker socket path
#    Edit config/config.yaml - set socket_path
#    Mac:   ~/.docker/run/docker.sock
#    Linux: /var/run/docker.sock

# 3. Scrape some docs
make scrape URL=https://go.dev/doc/tutorial/getting-started

# 4. Search
make search Q="getting started"

# 5. Start MCP server (for Claude Desktop)
make serve
```

## Commands

```bash
make help # Show all commands
make setup # Full setup: models + infrastructure
make models # Pull AI models only
make infra # Start Elasticsearch + MinIO only
make scrape URL=x # Scrape and index a URL
make search Q="x" # Search indexed docs
make serve # Start MCP server
make test # Run tests
make build # Build binary
```

## Tech Stack

- Go - single binary, fast
- Elasticsearch - hybrid search (BM25 + vectors with RRF)
- MinIO - S3-compatible storage between scraper and indexer
- Docker Model Runner - local LLM (Gemma3) and embeddings (qwen3)
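
The engine reaches Model Runner through its OpenAI-compatible HTTP API on the Docker socket (hence `socket_path` in the config). A minimal sketch of one chat call follows; the `/exp/vDD4.40` endpoint path and the `ai/gemma3` model tag are assumptions based on Docker Desktop 4.40's experimental API, so check your installation for the exact values:

```go
// Sketch: one chat-completion call to Docker Model Runner over the
// Docker unix socket. Endpoint path and model tag are assumptions.
package main

import (
	"bytes"
	"context"
	"fmt"
	"io"
	"net"
	"net/http"
)

func main() {
	socketPath := "/var/run/docker.sock" // socket_path from config/config.yaml

	client := &http.Client{Transport: &http.Transport{
		// Route every request through the Docker unix socket.
		DialContext: func(ctx context.Context, _, _ string) (net.Conn, error) {
			return (&net.Dialer{}).DialContext(ctx, "unix", socketPath)
		},
	}}

	payload := []byte(`{
		"model": "ai/gemma3",
		"messages": [{"role": "user", "content": "Give three tags for this Go tutorial."}]
	}`)

	resp, err := client.Post(
		"http://localhost/exp/vDD4.40/engines/llama.cpp/v1/chat/completions",
		"application/json", bytes.NewReader(payload))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out))
}
```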
## Data Flow

Event-driven choreography with Go channels; no central orchestrator. A minimal sketch follows the diagram.
```mermaid
sequenceDiagram
    participant CLI as bam-rag
    participant Scraper
    participant S3 as MinIO
    participant Engine as Ingestion
    participant LLM as Gemma3
    participant Embed as qwen3
    participant ES as Elasticsearch

    CLI->>Scraper: scrape URL
    loop each page
        Scraper->>S3: store .md
    end
    Scraper-->>Engine: ScrapeComplete
    loop each doc
        Engine->>S3: read content
        Engine->>LLM: generate tags/summary
        Engine->>Embed: generate vector
        Engine->>ES: index
    end
    Note over ES: Ready for hybrid search!
```
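
Here is the scraper-to-engine handoff as a runnable sketch; the type and function names are hypothetical, not the project's actual API:

```go
// Minimal sketch of the channel choreography: the scraper announces
// completion on a channel and the engine reacts, with no orchestrator.
package main

import "fmt"

// ScrapeComplete announces that a scrape run finished and lists the
// object keys written to MinIO.
type ScrapeComplete struct{ Keys []string }

func scraper(urls []string, done chan<- ScrapeComplete) {
	var keys []string
	for _, u := range urls {
		// ...fetch page, convert to Markdown, store in MinIO...
		keys = append(keys, "docs/"+u+".md")
	}
	done <- ScrapeComplete{Keys: keys} // event handoff, no orchestrator
}

func engine(done <-chan ScrapeComplete) {
	ev := <-done // wake up when the scraper announces completion
	for _, key := range ev.Keys {
		// ...read from MinIO, enrich via LLM/embeddings (skipped if
		// unavailable), index into Elasticsearch...
		fmt.Println("indexed", key)
	}
}

func main() {
	done := make(chan ScrapeComplete, 1)
	go scraper([]string{"go.dev/doc"}, done)
	engine(done)
}
```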
Key design choices:
- S3 checkpoint — Re-run ingestion without re-scraping
- Optional enrichment — Works without LLM/embeddings (graceful degradation)
- Hybrid search — BM25 + KNN combined via Reciprocal Rank Fusion (RRF); see the query sketch below
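
For reference, the hybrid query roughly takes this shape, using Elasticsearch's `rrf` retriever (available since 8.14); the `docs` index and the `content`/`embedding` field names are assumptions for illustration:

```go
// Sketch of a hybrid search request (Elasticsearch "rrf" retriever):
// one BM25 leg and one KNN leg, fused server-side with RRF.
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

func main() {
	queryVector := []float32{} // fill with the qwen3 embedding of the query text

	body := map[string]any{
		"retriever": map[string]any{
			"rrf": map[string]any{
				"retrievers": []any{
					// Lexical leg: BM25 match on the document text.
					map[string]any{"standard": map[string]any{
						"query": map[string]any{
							"match": map[string]any{"content": "getting started"},
						},
					}},
					// Vector leg: KNN on the embedding field.
					map[string]any{"knn": map[string]any{
						"field":          "embedding",
						"query_vector":   queryVector,
						"k":              10,
						"num_candidates": 50,
					}},
				},
			},
		},
	}

	buf, _ := json.Marshal(body)
	resp, err := http.Post("http://localhost:9200/docs/_search",
		"application/json", bytes.NewReader(buf))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()
	fmt.Println(resp.Status)
}
```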
## Configuration

Edit `config/config.yaml`:

```yaml
embeddings:
  socket_path: ~/.docker/run/docker.sock  # Your Docker socket
llm:
  socket_path: ~/.docker/run/docker.sock
```
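
On the Go side, loading this file could look like the sketch below, assuming `gopkg.in/yaml.v3`; the struct and function names are hypothetical:

```go
// Sketch of parsing config/config.yaml; struct and field names are
// assumptions based on the YAML shape above.
package config

import (
	"os"

	"gopkg.in/yaml.v3"
)

type Endpoint struct {
	SocketPath string `yaml:"socket_path"` // note: "~" is not expanded automatically
}

type Config struct {
	Embeddings Endpoint `yaml:"embeddings"`
	LLM        Endpoint `yaml:"llm"`
}

// Load reads and parses the YAML config file.
func Load(path string) (*Config, error) {
	raw, err := os.ReadFile(path)
	if err != nil {
		return nil, err
	}
	var cfg Config
	if err := yaml.Unmarshal(raw, &cfg); err != nil {
		return nil, err
	}
	return &cfg, nil
}
```

## License

MIT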