Memory-MCP

MongoDB Atlas-backed MCP server for AI memory management with semantic caching, hybrid search, and background enrichment.

Installation

Install directly from the git repository with pip (requires git):

pip install git+https://github.com/mongodb-partners/memory-mcp.git

Or install using uv:

uv add git+https://github.com/mongodb-partners/memory-mcp.git

Quick Start

Copy the example environment file and fill in your credentials:

cp .env.example .env

Set the required variables in .env:

MONGODB_CONNECTION_STRING=mongodb+srv://user:password@cluster.mongodb.net/memory_mcp
AWS_ACCESS_KEY_ID=your-access-key
AWS_SECRET_ACCESS_KEY=your-secret-key
AWS_REGION=us-east-1

Start the server:

memory-mcp

The server listens on http://0.0.0.0:8000 by default using the Streamable HTTP transport.

To run in stdio mode instead (for local use with Claude Code, Cursor, etc.):

TRANSPORT=stdio memory-mcp

Client Configuration

Add one of the following to your MCP client's mcp.json (see mcp.json.example).

HTTP: connect to a running server:

{
  "servers": {
    "memory-mcp": {
      "url": "http://localhost:8000/mcp",
      "type": "http"
    }
  }
}

Stdio: launch as a subprocess:

{
  "servers": {
    "memory-mcp": {
      "command": "memory-mcp",
      "env": {
        "TRANSPORT": "stdio",
        "MONGODB_CONNECTION_STRING": "mongodb+srv://user:pass@cluster.mongodb.net/memory_mcp"
      }
    }
  }
}

Features

Memory storage and recall: Store conversation messages as short-term memories (STM) with automatic long-term memory (LTM) candidate creation
Semantic search: Vector similarity search over stored memories using configurable embedding providers (AWS Bedrock, Voyage AI)
Hybrid search: Combined vector and full-text search using MongoDB $rankFusion
Semantic caching: Cache query-response pairs and retrieve them by similarity threshold
Decision stickiness: Key-value store for persistent user preferences and decisions with optional TTL
Background enrichment: Async worker that assesses memory importance and generates summaries via LLM
Memory consolidation: Background worker for STM compression, low-importance forgetting, and STM-to-LTM promotion
Memory evolution: Automatic reinforcement of similar memories and merge detection
Auto-capture: Transparent middleware that automatically stores tool interactions as STM memories, ensuring the memory store is populated even when the LLM does not call store_memory
Calibrated ranking: Three-component scoring combining recency, importance, and relevance
Startup seeding: Governance profiles, prompt templates, and system decisions are seeded to the database at startup, so no LLM intervention is required for the system to be fully functional
Authentication: API key and HS256 JWT authentication with optional governance and rate limiting
Governance-aware rate limiting: Per-role request quotas derived from governance profiles (admin, power_user, end_user), with automatic fallback to global defaults
Audit logging: Buffered audit trail for all operations with periodic background flush and configurable flush strategies
Admin tools: Memory health monitoring, user data wiping, and cache invalidation
Web search: Optional Tavily API integration for web search
Multi-tenant isolation: All queries scoped by user_id

MCP Tools

Memory-MCP exposes 12 tools over the Model Context Protocol:

Tool	Description
`store_memory`	Store conversation messages as short-term memories
`recall_memory`	Semantically search stored memories with ranked results
`delete_memory`	Soft-delete memories by ID, tags, or time range
`check_cache`	Check semantic cache for a similar previous query
`store_cache`	Cache a query-response pair for future lookups
`hybrid_search`	Combined vector + full-text search with RRF ranking
`search_web`	Web search via Tavily API
`memory_health`	Health statistics for a user's memory store
`wipe_user_data`	Permanently delete all data for a user
`cache_invalidate`	Invalidate cached entries by pattern or all
`store_decision`	Store a keyed decision with configurable TTL
`recall_decision`	Recall a previously stored decision by key

See docs/api-reference.md for full tool signatures and parameters.

Architecture

Memory-MCP runs as a single FastMCP process with the following internal services:

MemoryService: STM/LTM storage, recall with calibrated ranking, soft-delete
CacheService: Semantic cache with vector similarity lookup
AuditService: Buffered audit logging with MongoDB persistence and local file fallback
AuditFlushWorker: Periodic background task that flushes audit entries, preventing data loss on crash
EnrichmentWorker: Background async task for LTM importance assessment, summarization, and memory evolution
ConsolidationWorker: Background task for STM compression, forgetting, and STM-to-LTM promotion
AutoCaptureMiddleware: Wraps MCP tools to automatically store interactions as STM memories
DecisionService: Keyed key-value store for sticky decisions with optional TTL; seeds system defaults at startup
GovernanceService: Role-based access policies (optional, via GOVERNANCE_ENABLED); seeds default profiles at startup
RateLimiter: Per-user sliding-window rate limiting with governance-aware per-role limits (optional, via RATE_LIMIT_ENABLED)
PromptLibrary: Versioned prompt templates for enrichment prompts; seeds default templates at startup

Embedding and LLM operations are handled by pluggable providers (AWS Bedrock, Voyage AI). MongoDB Atlas provides vector search, full-text search, and TTL-based document expiration. Authentication supports static API keys and HS256 JWTs (optional, via AUTH_ENABLED).

At startup, the server seeds governance profiles, prompt templates, and system decisions to the database (idempotent, best-effort). This ensures the system is fully functional without requiring the LLM to call any setup tools.

See docs/architecture.md for data flow diagrams and design decisions.

Configuration

Configuration is managed through environment variables loaded from a .env file. Key variables:

Variable	Required	Default	Description
`MONGODB_CONNECTION_STRING`	Yes	—	MongoDB Atlas connection URI
`AWS_ACCESS_KEY_ID`	Yes	—	AWS credentials for Bedrock
`AWS_SECRET_ACCESS_KEY`	Yes	—	AWS credentials for Bedrock
`AWS_REGION`	No	`us-east-1`	AWS region for Bedrock
`TRANSPORT`	No	`streamable-http`	`streamable-http` or `stdio`
`EMBEDDING_PROVIDER`	No	`bedrock`	`bedrock` or `voyage`
`TAVILY_API_KEY`	No	—	Enables `search_web` tool
`AUTH_ENABLED`	No	`false`	Enable API key / JWT authentication

See docs/configuration.md for the complete reference of 50+ configuration variables.

Development

git clone https://github.com/mongodb-partners/memory-mcp.git
cd memory-mcp
uv sync --extra dev
uv run pytest

See docs/developer-guide.md for project structure, conventions, and common tasks.

Deployment

docker compose up -d

See docs/deployment.md for Docker configuration, health checks, and production settings.

Agent Instructions

To configure LLM agents (Claude Code, Cursor, Copilot, Gemini) to use memory-mcp, see docs/agent-instructions.md for ready-to-use templates for CLAUDE.md, .cursorrules, copilot-instructions.md, and GEMINI.md.

License

See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
auth		auth
core		core
docs		docs
providers		providers
services		services
tests		tests
tools		tools
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
__main__.py		__main__.py
docker-compose.yml		docker-compose.yml
mcp.json.example		mcp.json.example
pyproject.toml		pyproject.toml
server.py		server.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Memory-MCP

Installation

Quick Start

Client Configuration

Features

MCP Tools

Architecture

Configuration

Development

Deployment

Agent Instructions

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Memory-MCP

Installation

Quick Start

Client Configuration

Features

MCP Tools

Architecture

Configuration

Development

Deployment

Agent Instructions

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages