Pinned Loading
-
agentshield
agentshield PublicSecurity audit framework for agentic AI systems: STRIDE threat modeling, 100-scenario attack suite, 4-component detection pipeline (96% detection, 1.0% FPR)
Python
-
ambiguity-casebook
ambiguity-casebook PublicDual-Use Ambiguity Casebook: 30 structured cases at the AI-era biology dual-use decision boundary
Python
-
bio-constitution-rules
bio-constitution-rules Public30 machine-readable constitutional rules for biological dual-use content across 6 bio domains. JSON format for Constitutional Classifier pipeline integration.
Python
-
bio-overrefusal-v0.1
bio-overrefusal-v0.1 PublicDomain-expert-authored benchmark for LLM over-refusal on legitimate biology research queries.
Python
-
constitutional-bioguard
constitutional-bioguard PublicBiological dual-use content classifier using Constitutional Classifiers methodology — biosafety constitution, synthetic data pipeline, DeBERTa-v3-base classifier
Python
-
narrow-model-safety-eval
narrow-model-safety-eval PublicEmpirical dual-use risk assessment of protein language models (ESM-2) and structure-based design tools (ProteinMPNN)
Python
If the problem persists, check the GitHub status page or contact support.

