Biological dual-use content classifier using Constitutional Classifiers methodology — biosafety constitution, synthetic data pipeline, DeBERTa-v3-base classifier
ai-safety content-moderation deberta biosecurity anthropic constitutional-ai biosafety dual-use llm-classifier nsabb
-
Updated
May 10, 2026 - Python