API Reference

Memory-MCP exposes tools over the Model Context Protocol (MCP) using Streamable HTTP transport on port 8000. Tools are called by MCP clients, not via REST endpoints.

Transport

Protocol: MCP (Model Context Protocol)
Transport: Streamable HTTP
Host: 0.0.0.0
Port: 8000

Memory Tools

Defined in tools/memory_tools.py.

`store_memory`

Store conversation messages as short-term memories. For human messages longer than 30 characters, also creates long-term memory candidates queued for background enrichment.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier for multi-tenant isolation
`conversation_id`	string	Yes	—	Conversation identifier for grouping messages
`messages`	list[dict]	Yes	—	List of message objects to store

Each message in messages should contain:

Field	Type	Description
`role`	string	Message role: `"human"` or `"ai"`
`content`	string	Message text content

Returns:

{
  "stm_ids": ["67a1b2c3d4e5f6a7b8c9d0e1", "67a1b2c3d4e5f6a7b8c9d0e2"],
  "count": 2
}

Field	Type	Description
`stm_ids`	list[string]	MongoDB ObjectId strings for created STM documents
`count`	integer	Number of STM documents created

Behavior:

Creates one STM document per message with a 24-hour TTL (configurable via STM_TTL_HOURS)
Human messages >30 characters also produce an LTM candidate with enrichment_status: "pending"
LTM candidate IDs are not returned (they are internal)
Each message is embedded using the configured embedding provider

`recall_memory`

Semantically search stored memories. Returns results ranked by a calibrated formula combining recency, importance, and vector similarity.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`query`	string	Yes	—	Natural language search query
`memory_type`	string \| null	No	`null`	Filter by memory type classification
`tags`	list[string] \| null	No	`null`	Filter by tags (all must match)
`limit`	integer	No	`10`	Maximum results to return (capped at `MAX_RESULTS_PER_QUERY`)
`tier`	list[string] \| null	No	`null`	Filter by tier: `["stm"]`, `["ltm"]`, or `["stm", "ltm"]`

Returns:

{
  "results": [
    {
      "_id": "67a1b2c3d4e5f6a7b8c9d0e1",
      "user_id": "user-123",
      "tier": "ltm",
      "content": "The user prefers dark mode interfaces.",
      "summary": "User preference for dark mode UI.",
      "importance": 0.7,
      "access_count": 3,
      "created_at": "2025-01-15T10:30:00",
      "final_score": 0.82
    }
  ],
  "count": 1
}

Field	Type	Description
`results`	list[dict]	Ranked memory documents (embeddings stripped)
`count`	integer	Number of results returned

Behavior:

Generates an embedding for the query and runs a vector search on the memories collection
Deduplicates STM/LTM pairs (keeps the higher-scoring result)
Applies ranking: score = α·recency + β·importance_boost + γ·relevance
Increments access_count and updates last_accessed on returned documents
Excludes soft-deleted documents

`delete_memory`

Soft-delete memories by ID, tags, or time range. Bulk deletes require explicit confirmation. Supports dry-run mode.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`memory_id`	string \| null	No	`null`	Specific memory ID to delete
`tags`	list[string] \| null	No	`null`	Delete memories matching all specified tags
`time_range`	dict \| null	No	`null`	Delete memories within time range (`{"start": "ISO8601", "end": "ISO8601"}`)
`confirm`	boolean	No	`false`	Required for bulk deletes (by tags or time range)
`dry_run`	boolean	No	`false`	Preview deletion count without modifying data

Returns:

{
  "deleted_count": 5,
  "dry_run": false
}

Field	Type	Description
`deleted_count`	integer	Number of documents (soft-)deleted or that would be deleted
`dry_run`	boolean	Present when `dry_run=true`

Behavior:

Single delete by memory_id does not require confirm
Bulk deletes (by tags or time_range) require confirm=true or the operation is rejected
Sets deleted_at timestamp and is_deleted=true on matched documents
Soft-deleted documents are purged after SOFT_DELETE_PURGE_DAYS (default: 30)

Cache Tools

Defined in tools/cache_tools.py.

`check_cache`

Check the semantic cache for a previously cached response to a similar query.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`query`	string	Yes	—	Query to check against cached entries
`similarity_threshold`	float \| null	No	`null`	Minimum similarity for a cache hit (defaults to `CACHE_SIMILARITY_THRESHOLD`: 0.95)

Returns (cache hit):

{
  "cache_hit": true,
  "query": "What is the project deadline?",
  "response": "The project deadline is March 31, 2025.",
  "score": 0.97
}

Returns (cache miss):

{
  "cache_hit": false
}

Field	Type	Description
`cache_hit`	boolean	Whether a cached response was found
`query`	string	Original cached query (on hit)
`response`	string	Cached response text (on hit)
`score`	float	Similarity score (on hit)

`store_cache`

Cache a query-response pair for future similarity lookups.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`query`	string	Yes	—	Query text to cache
`response`	string	Yes	—	Response text to cache

Returns:

{
  "cache_id": "67a1b2c3d4e5f6a7b8c9d0e3"
}

Field	Type	Description
`cache_id`	string	MongoDB ObjectId of the created cache entry

Behavior:

Generates an embedding for the query
Stores with a TTL of CACHE_TTL_SECONDS (default: 3600)
Expired entries are automatically purged by MongoDB TTL index

Search Tools

Defined in tools/search_tools.py.

`hybrid_search`

Combined vector and full-text search over memories using MongoDB $rankFusion for Reciprocal Rank Fusion (RRF). Requires MongoDB Atlas with both memories_vector_index and memories_fts_index configured.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`query`	string	Yes	—	Search query
`tier`	list[string] \| null	No	`null`	Filter by tier (defaults to `["stm", "ltm"]`)
`limit`	integer	No	`10`	Maximum results (capped at `MAX_RESULTS_PER_QUERY`)
`memory_type`	string \| null	No	`null`	Filter by memory type
`tags`	list[string] \| null	No	`null`	Filter by tags (all must match)

Returns:

{
  "results": [
    {
      "_id": "67a1b2c3d4e5f6a7b8c9d0e1",
      "user_id": "user-123",
      "tier": "ltm",
      "content": "Discussed project architecture decisions.",
      "importance": 0.8,
      "rrf_score": 0.034
    }
  ],
  "count": 1
}

Field	Type	Description
`results`	list[dict]	Merged and ranked memory documents (embeddings stripped)
`count`	integer	Number of results

Behavior:

Executes a single MongoDB $rankFusion aggregation with two sub-pipelines:
1. vectorPipeline: $vectorSearch on the embedding field
2. fullTextPipeline: $search on content and summary fields
MongoDB merges results server-side using Reciprocal Rank Fusion
Pipeline weights configurable via RRF_VECTOR_WEIGHT and RRF_TEXT_WEIGHT
Excludes soft-deleted documents

`search_web`

Web search via the Tavily API. Requires TAVILY_API_KEY to be configured.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier (used for audit logging)
`query`	string	Yes	—	Search query

Returns (success):

{
  "results": [
    {
      "title": "Example Result",
      "url": "https://example.com",
      "content": "Result snippet..."
    }
  ],
  "query": "search terms"
}

Returns (no API key):

{
  "error": "Web search service unavailable: Tavily API key not configured"
}

Field	Type	Description
`results`	list[dict]	Tavily search results
`query`	string	Original query
`error`	string	Error message (when Tavily is not configured)

Admin Tools

Defined in tools/admin_tools.py.

`memory_health`

Get health statistics for a user's memory store.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier

Returns:

{
  "user_id": "user-123",
  "total_memories": 42,
  "tier_stats": {"stm": 12, "ltm": 30},
  "enrichment_stats": {"completed": 28, "pending": 2}
}

Field	Type	Description
`user_id`	string	The queried user
`total_memories`	integer	Total non-deleted memories
`tier_stats`	dict	Memory count per tier (`stm`, `ltm`)
`enrichment_stats`	dict	Memory count per enrichment status (`pending`, `completed`)

`wipe_user_data`

Permanently delete ALL data for a user (memories, cache, audit log). This action is irreversible.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`confirm`	boolean	No	`false`	Must be `true` to proceed

Returns (without confirm):

{
  "error": "wipe_user_data requires confirm=true. This will permanently delete ALL user data."
}

Returns (success):

{
  "user_id": "user-123",
  "memories_deleted": 42,
  "cache_deleted": 5,
  "audit_deleted": 100
}

Field	Type	Description
`user_id`	string	The wiped user
`memories_deleted`	integer	Memories hard-deleted
`cache_deleted`	integer	Cache entries hard-deleted
`audit_deleted`	integer	Audit log entries hard-deleted

Behavior:

Hard-deletes from memories, cache, and audit_log collections
Requires confirm=true; returns an error otherwise
Irreversible: data cannot be recovered

`cache_invalidate`

Invalidate cached entries for a user. Use invalidate_all=true to clear all, or pattern to match queries.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`pattern`	string \| null	No	`null`	Pattern to match cached queries
`invalidate_all`	boolean	No	`false`	Clear all cache entries if `true`

Returns:

{
  "user_id": "user-123",
  "deleted_count": 3
}

Field	Type	Description
`user_id`	string	The user whose cache was invalidated
`deleted_count`	integer	Number of cache entries deleted

Decision Tools

Defined in tools/decision_tools.py.

`store_decision`

Store a keyed decision for a user. Decisions persist across conversations with configurable TTL. Use for preferences, choices, and sticky settings.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`key`	string	Yes	—	Decision key identifier
`value`	string	Yes	—	Decision value
`ttl_days`	integer \| null	No	`null`	Time-to-live in days (omit for no expiration)

Returns:

{
  "key": "preferred_language",
  "action": "created",
  "user_id": "user-123"
}

Field	Type	Description
`key`	string	The decision key
`action`	string	`"created"` or `"updated"`
`user_id`	string	The user who stored the decision

Behavior:

Upserts by (user_id, key): storing the same key again overwrites the value
action indicates whether the decision was newly created or updated
When ttl_days is set, the decision auto-expires after that many days via MongoDB TTL index

`recall_decision`

Recall a previously stored decision by key for a user. Returns the decision value and metadata, or not_found.

Parameters:

Name	Type	Required	Default	Description
`user_id`	string	Yes	—	User identifier
`key`	string	Yes	—	Decision key to retrieve

Returns (found):

{
  "key": "preferred_language",
  "found": true,
  "decision": {
    "key": "preferred_language",
    "value": "Python",
    "user_id": "user-123",
    "created_at": "2025-01-15T10:30:00",
    "updated_at": "2025-01-15T10:30:00"
  }
}

Returns (not found):

{
  "key": "preferred_language",
  "found": false
}

Field	Type	Description
`key`	string	The queried decision key
`found`	boolean	Whether a non-expired decision exists
`decision`	dict \| absent	Decision document (present only when `found=true`)

Behavior:

Returns found: false if the key doesn't exist or has expired
Does not modify the decision (read-only)

Audit Logging

Every tool call generates an audit log entry with:

Field	Description
`user_id`	User who made the call
`operation`	Operation type (e.g., `memory:write`, `cache:read`, `search`)
`tool_name`	Tool function name (e.g., `store_memory`, `check_cache`)
`status`	`success` or `error`
`duration_ms`	Execution time in milliseconds
`timestamp`	ISO 8601 timestamp
`metadata`	Tool-specific context (query, result count, error message, etc.)

Audit entries are buffered and flushed to the audit_log MongoDB collection. See configuration.md for audit buffer settings.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API Reference

Transport

Memory Tools

`store_memory`

`recall_memory`

`delete_memory`

Cache Tools

`check_cache`

`store_cache`

Search Tools

`hybrid_search`

`search_web`

Admin Tools

`memory_health`

`wipe_user_data`

`cache_invalidate`

Decision Tools

`store_decision`

`recall_decision`

Audit Logging

FilesExpand file tree

api-reference.md

Latest commit

History

api-reference.md

File metadata and controls

API Reference

Transport

Memory Tools

store_memory

recall_memory

delete_memory

Cache Tools

check_cache

store_cache

Search Tools

hybrid_search

search_web

Admin Tools

memory_health

wipe_user_data

cache_invalidate

Decision Tools

store_decision

recall_decision

Audit Logging

`store_memory`

`recall_memory`

`delete_memory`

`check_cache`

`store_cache`

`hybrid_search`

`search_web`

`memory_health`

`wipe_user_data`

`cache_invalidate`

`store_decision`

`recall_decision`