Change the repository type filter
All
Repositories list
2 repositories
refusalbench
PublicReproducible, evergreen benchmark for LLM refusal on biological research prompts — 19 models, 141 prompts, 13,389 adjudicated trialsCardioSafe-benchmark
PublicCurated data deposit for the CardioSafe cardiac ion channel benchmark: labels, Tanimoto-controlled splits (tan70 / tan60), and supplementary artifacts for hERG,…
ProTip! When viewing an organization's repositories, you can use the
props. filter to filter by custom property.