Skip to content

Cursor/eval methodology guard provenance#1

Merged
vornicx merged 2 commits into
mainfrom
cursor/eval-methodology-guard-provenance
Jun 10, 2026
Merged

Cursor/eval methodology guard provenance#1
vornicx merged 2 commits into
mainfrom
cursor/eval-methodology-guard-provenance

Conversation

@vornicx

@vornicx vornicx commented Jun 10, 2026

Copy link
Copy Markdown
Owner

No description provided.

vornicx and others added 2 commits June 9, 2026 23:46
Publish anti-cheating eval docs, a deterministic dumb-reader ablation,
conflicts-v1 stress tests, and retention traces so retrieval quality is
separable from reader recovery; add Guard/Armorer with provenance tags and
check_memory_use for external-action boundaries.

Co-authored-by: Cursor <cursoragent@cursor.com>
Add read-only inspect_memory and richer recall evidence over MCP, align LongMemEval benchmark commands/defaults on n=40, and include MCPB packaging plus privacy policy updates.

Tests: uv run --extra mcp --extra dev pytest tests/test_mcp_server.py tests/test_guard.py tests/test_capture.py tests/test_memory_supersede.py tests/test_sqlite_store.py

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
@vornicx vornicx merged commit abe5606 into main Jun 10, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant