The Reproducibility & Provenance Layer for Bayesian Scientific Analysis
BioAnchor is an autonomous scientific agent that executes Bayesian MCMC analyses, quantifies uncertainty, and permanently archives verifiable evidence on decentralized storage β creating a cryptographic chain-of-trust for scientific discovery.
Science has a reproducibility crisis. Over 70% of researchers have failed to reproduce another scientist's experiments (Nature, 2016). The root causes: lost data, opaque methods, and no verifiable provenance chain.
BioAnchor solves this by combining three capabilities that no other tool offers together:
- Autonomous Bayesian Analysis β Runs PyMC 5 / NUTS MCMC models with full posterior sampling and convergence diagnostics
- Uncertainty Quantification β Computes coefficient of variation (CV), R-hat, ESS, and credible intervals for every parameter
- Permanent Decentralized Archiving β Stores complete MCMC metadata (posteriors, diagnostics, model specs) on Arweave for immutable, censorship-resistant provenance
Raw Data β BioAnchor Agent β MCMC Analysis β Uncertainty Scores β Arweave Archive
β
Permanent TX ID
(verifiable by anyone)
- Ingest β Feed dose-response, genomic, or any biological dataset
- Analyze β BioAnchor autonomously configures and runs Bayesian MCMC (PyMC 5, NUTS sampler)
- Quantify β Generates uncertainty metrics (CV, posterior distributions, convergence diagnostics)
- Archive β Permanently stores analysis metadata on Arweave with a unique Transaction ID
- Verify β Anyone can retrieve and reproduce the analysis using the TX ID
pip install bioanchorfrom bioanchor import BioAnchor
# Initialize with Arweave wallet
anchor = BioAnchor(wallet_path="arweave-wallet.json")
# Run MCMC analysis and archive results
result = anchor.analyze_and_archive(
data=my_dataset,
model_type="hill_equation", # IC50 dose-response model
n_samples=2000,
n_chains=4
)
# Get permanent Arweave TX ID
print(f"Archived: {result.tx_id}")
print(f"Verify at: https://ar-io.dev/{result.tx_id}")BioAnchor has been validated with real pharmaceutical data:
- IC50 dose-response modeling using Hill equation with PyMC 5
- 4-chain NUTS sampling with full convergence diagnostics
- Live Arweave archive:
2lIZtjsu120ERqPKu6XbITpUnAveWeH7hI4owBfNCdY
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β BioAnchor Agent β
βββββββββββββββββββββββββββββββββββββββββββββββββββ€
β βββββββββββββ ββββββββββββ ββββββββββββββββ β
β β PyMC 5 β β ArviZ β β Uncertainty β β
β β Engine ββ βDiagnosticsβββ Scoring β β
β β (NUTS) β β (R-hat, β β (CV, CI, β β
β β β β ESS) β β posterior) β β
β βββββββββββββ ββββββββββββ ββββββββ¬ββββββββ β
β β β
β ββββββββββββββββββββββββββββββββββββββΌββββββββ β
β β Decentralized Storage Layer β β
β β ββββββββββββ ββββββββββββ ββββββββββββ β β
β β β Arweave β β Filecoin β β IPFS β β β
β β β (live) β β (planned)β β(planned) β β β
β β ββββββββββββ ββββββββββββ ββββββββββββ β β
β ββββββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
BioAnchor is designed to be the scientific truth verification layer for the decentralized science (DeSci) ecosystem:
- β Arweave permanent archiving (live)
- π Filecoin/IPFS bridge (in development)
- π Multi-chain verification endpoints
- Uncertainty-weighted reliability scoring for IP-NFTs
- Integration with Molecule / BIO Protocol IP framework
- CV-based data quality oracle for BioDAOs (VitaDAO, AthenaDAO, etc.)
- AI-driven hypothesis generation from literature
- Automated MCMC model selection and execution
- Self-archiving results with provenance chain
- Integration with BIO Protocol Scientific AI Agent platform
- Community-governed research prioritization
- Staking on reproducibility verification
- Token-incentivized peer review of MCMC analyses
BioAnchor provides a trust layer that any BioDAO can plug into:
| Use Case | How BioAnchor Helps |
|---|---|
| Drug Discovery | Verify IC50/EC50 dose-response analyses with full uncertainty quantification |
| IP-NFT Valuation | Score data quality using CV metrics before tokenizing research IP |
| Reproducibility Audits | Retrieve and re-run any archived MCMC analysis from its Arweave TX ID |
| Grant Accountability | Prove research outputs are real and verifiable on-chain |
BioAnchor includes a MockUploader for development and testing without Arweave wallet:
from bioanchor import BioAnchor, MockUploader
anchor = BioAnchor(uploader=MockUploader())
result = anchor.analyze_and_archive(data=test_data)
# Returns simulated TX ID for testing- Python 3.11+ with PyMC 5, ArviZ, SciPy
- Arweave via ar-io.dev gateway (Node.js uploader for production)
- Zenodo DOI: 10.5281/zenodo.19709077
If you use BioAnchor in your research, please cite:
@software{kim2026bioanchor,
author = {Kim, Dohoon},
title = {BioAnchor: Bridging Bayesian Biological Data Analysis with Decentralized Storage},
year = {2026},
publisher = {Zenodo},
doi = {10.5281/zenodo.19709077},
url = {https://doi.org/10.5281/zenodo.19709077}
}BioAnchor is developed by Dohoon Kim at Promptgenix LLC, building open-source scientific infrastructure for verifiable, reproducible research.
- π¦ PyPI:
pip install bioanchor - π GitHub: github.com/kdh4win4/bioanchor
- π Paper: Submitted to Bioinformatics (Oxford) β BIOINF-2026-1271
MIT License β see LICENSE for details.