You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Pre-screening lithium-ion battery cathode candidate materials before expensive DFT (Density Functional Theory) calculations. The model predicts energy above hull (E_hull) with calibrated uncertainty to classify candidates as KEEP (send to DFT), MAYBE (manual review), or KILL (discard).
Captures: Inherent noise and ambiguity in the training data
Epistemic Uncertainty (Reducible)
Source: Inter-model disagreement (standard deviation of q50 across 5 members)
Captures: Model ignorance, especially on novel chemistries
Conformal Calibration (Finite-Sample Valid)
Method: Symmetric delta adjustment to quantile intervals
Guarantee: 90% marginal coverage on calibration set distribution
Per-cluster: Cluster-conditional calibration when cluster size ≥ 50
Out-of-Distribution Detection
Three independent gates with sigmoid combination:
Composition distance: Jaccard distance from training compositions
Embedding kNN: Distance to k-nearest neighbors in learned feature space
Ensemble disagreement: Normalized inter-model standard deviation
Known Limitations
Critical Limitations
LOCO performance is poor: Spearman ρ ≈ 0 on structurally novel material families. The model cannot reliably rank materials from unseen crystal structure types.
Calibration degrades OOD: Coverage drops to 72% on LOCO splits (below 90% target). Conformal guarantees only hold for the calibration distribution.
Oxide-centric training: Limited to transition metal oxide frameworks. Performance on sulfides, phosphates, or mixed-anion systems is unvalidated.
Operational Limitations
CPU inference latency: ~2-3 seconds per structure on CPU. Not suitable for real-time screening of >10K candidates without GPU acceleration.
CIF input only: Requires crystallographic information file. No composition-only fast triage mode.
Single-chemistry scope: Li-ion cathodes only. Extension to Na-ion, solid-state, etc. requires retraining.
Bias & Fairness Considerations
Materials Project bias: Training data inherits the compositional and structural biases of the Materials Project database (over-representation of common crystal structures, under-representation of metastable phases)
Stability bias: The 0.05 eV/atom stability threshold is a convention, not a physical law. Some materials marginally above this threshold may be synthesizable.
Ethical Considerations
Not a substitute for experimental validation: Screening decisions should always be verified by DFT and, ultimately, by experimental synthesis and characterization.
Environmental impact: Model training required ~200 GPU-hours on RTX 2060. Inference is CPU-only.
Dual-use: While designed for battery materials, the ensemble methodology could be applied to other materials screening tasks.
Recommendations
For Users
Always verify KEEP decisions with DFT before experimental synthesis
Check OOD flags — high OOD scores indicate the model is uncertain about a novel chemistry
Use MAYBE decisions as candidates for active learning, not as rejections
Monitor drift metrics when deploying on new material libraries
For Developers
Retrain when Materials Project database updates significantly
Expand SOAP-LOCO evaluation when adding new crystal families
Consider multi-fidelity approaches for LOCO-weak regions
@software{cathodescreen2025,
title = {CathodeScreen: High-Throughput ML Screening of Li-Ion Battery Cathodes},
year = {2025},
url = {https://github.com/your-org/cathode-screening}
}
References
Batatia, I., et al. (2023). MACE-MP-0: A Foundation Model for Materials Science. arXiv:2401.00096.
Jain, A., et al. (2013). The Materials Project. APL Materials.
Lakshminarayanan, B., et al. (2017). Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles. NeurIPS.
Vovk, V., et al. (2005). Algorithmic Learning in a Random World. Springer.
Mitchell, M., et al. (2019). Model Cards for Model Reporting. FAT Conference*.
This model card follows the framework proposed by Mitchell et al. (2019) for transparent ML model documentation.Last updated: 2025-01-15 | Model version: 1.0.0