Anthony Maio

Independent AI safety researcher. Former Staff Software Engineer (20 years shipping enterprise systems). Took a sabbatical to work on problems I think matter more.

Focus areas: scalable oversight, agentic system evaluation, audit-shielding detection, inter-agent coordination security.


Research

Scalable oversight & verification failure

How weak verifiers (humans, smaller models) fail to catch persuasive but wrong reasoning. The CMED benchmark + HDCS swarm architecture.

Agentic safety & audit-shielding

How systems behave differently under benchmark-shaped prompts vs. realistic high-trust contexts. Model organisms of misalignment.

Cognitive architecture & continuity

Long-horizon agent coherence, memory systems, epistemic stress detection.

Inter-agent communication

Protocol design targeting tokenization economics. Efficiency + detectability for coordination channels.
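As a rough illustration of what "tokenization economics" means here (a hypothetical sketch, not SLIPCore's actual wire format): the same message costs far fewer tokens when agents share a compact schema, which is also what makes such channels both efficient and harder to monitor.

```python
# Hypothetical sketch of the tokenization-economics tradeoff:
# a compact, pre-agreed message format costs far fewer tokens than verbose
# JSON, which also changes what a monitor watching the channel has to detect.
# (Illustrative only; field names and encoding are not SLIPCore's real format.)
import json

def rough_token_count(text: str) -> int:
    """Crude proxy for a BPE tokenizer: roughly 4 characters per token."""
    return max(1, len(text) // 4)

verbose_msg = json.dumps({
    "sender": "planner",
    "recipient": "executor",
    "intent": "request_tool_call",
    "tool": "search",
    "arguments": {"query": "quarterly revenue 2024"},
})

# Compact form: field positions carry meaning because both agents share the schema.
compact_msg = "p>e|tc|search|quarterly revenue 2024"

print(f"verbose JSON: ~{rough_token_count(verbose_msg)} tokens")
print(f"compact form: ~{rough_token_count(compact_msg)} tokens")
```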


Current work

  • Red-team → blue-team pipelines for agentic deployments (prompt evolution + heterogeneous verification)
  • Protocol security for semantic quantization channels
  • Reproducible oversight failure evaluations (CMED-style trap suites)
  • EAP contribution to Bloom

Contact

Building agentic systems and want help with evaluation harnesses, oversight swarms, or agent communication protocols? I'm currently looking for a full-time role where I can bring 20 years of shipping production code to innovative AI use cases or research.

anthony@making-minds.ai

Pinned repositories

  1. stop-llm-bullshit

     Three system prompts to stop LLMs from agreeing with you when you're wrong

  2. slipcore (Python)

     SLIPCore: Streamlined Interagent Protocol for LLM agent communication

  3. argos-swarm (Python)

     Automated LLM red/blue teaming using evolutionary robustness for multi-turn social engineering

  4. pv-eat (Python)

     PV-EAT: Activation-Measured Adversarial Testing for Audit-Shielding Detection

  5. slipstream-control-plane (JavaScript)

     Source code for the Slipstream Control Plane

  6. slipstream-governance-env (Jupyter Notebook)

     OpenEnv RL environment for training AI agents to use inter-agent protocols safely