Fix #90: render identity-threat framing in persona/reasoning context by DeveshParagiri · Pull Request #104 · exaforge/extropy

DeveshParagiri · 2026-02-17T19:03:50Z

Summary

Extend ReasoningContext with identity_threat_summary
Add deterministic identity-threat detection in engine context building using scenario text plus agent identity attributes (political, religious, race/ethnicity, gender/sexual identity, parental role, citizenship)
Inject a dedicated Identity Relevance prompt section so agents can explicitly reason when an issue feels identity-relevant
Add tests covering context construction and prompt inclusion

Testing

pytest -q tests/test_engine.py::TestTokenAccumulation::test_build_reasoning_context_adds_identity_threat_summary tests/test_reasoning_prompts.py::TestPhaseAPromptFeatures::test_identity_relevance_included

Closes #90

Recreated from closed PR #97 (base branch was deleted)

⚠️ CHANGES REQUESTED - DO NOT MERGE

This PR uses hardcoded scenario keywords for identity detection:

if political_value and scenario_mentions(
    ("liberal", "conservative", "book ban", "school board", ...)
):

Problems:

Not configurable - can't add/remove keywords without code changes
Scenario-specific leakage - book ban, school board are from test scenario
False positives - men/man matches management, manual, etc.
Not extensible - new identity dimensions require code changes

Suggested fix: Add identity_dimensions field to scenario spec:

identity_dimensions:
  - dimension: political_orientation
    reason: "The policy is framed along partisan lines"

See full review: #97 (review)

DeveshParagiri · 2026-02-17T19:16:49Z

Superseded by #106 which implements a proper LLM-driven approach for identity_dimensions detection.

RandomOscillations added 2 commits February 17, 2026 03:10

Fix #89 persist and classify THINK vs SAY divergence

26d1df3

Fix #90 add identity-threat framing to reasoning context

1563e0d

DeveshParagiri closed this Feb 17, 2026

DeveshParagiri deleted the codex/issue-90-identity-threat-framing branch February 23, 2026 01:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix #90: render identity-threat framing in persona/reasoning context#104

Fix #90: render identity-threat framing in persona/reasoning context#104
DeveshParagiri wants to merge 2 commits intomainfrom
codex/issue-90-identity-threat-framing

DeveshParagiri commented Feb 17, 2026

Uh oh!

DeveshParagiri commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

DeveshParagiri commented Feb 17, 2026

Summary

Testing

⚠️ CHANGES REQUESTED - DO NOT MERGE

Uh oh!

DeveshParagiri commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants