This repository contains code for predicting the reduction potentials of molecular species in various solvents. This approach enables simultaneous learning of electron affinity (EA) and solvent-dependent corrections for redox potentials and can generalise to previously unseen solvents.
This implementation supports two training modes, with and without explicit conditioning on the solvent-independent term (-EA). Select the mode in `run_experiment.py` or `train.py` by setting `explicit_ea = False` (or `True`).
Place your CSV file (e.g., `ReSolvedData.csv`) in the project folder. It should contain columns for:
- `smiles` (string)
- `EA` (float)
- `RP_ACN`, `RP_H2O`, `RP_THF`, `RP_DMSO`, `RP_DMF` (float): the solvent-dependent properties.
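Before training, it can be useful to check that the file actually has the expected header. A minimal sketch using only the standard library (the file name `ReSolvedData_demo.csv` and the helper `check_columns` are illustrative, not part of the codebase):

```python
import csv

REQUIRED = ["smiles", "EA", "RP_ACN", "RP_H2O", "RP_THF", "RP_DMSO", "RP_DMF"]

def check_columns(path):
    """Return the list of required columns missing from the CSV header."""
    with open(path, newline="") as f:
        header = next(csv.reader(f))
    return [c for c in REQUIRED if c not in header]

# Tiny demo file with the expected header and one row:
with open("ReSolvedData_demo.csv", "w", newline="") as f:
    f.write("smiles,EA,RP_ACN,RP_H2O,RP_THF,RP_DMSO,RP_DMF\n"
            "CCO,0.12,-2.1,-2.0,-2.3,-2.2,-2.1\n")

print(check_columns("ReSolvedData_demo.csv"))  # → []
```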
- Modify hyperparameters in `run_experiment.py` or `train.py` if desired (e.g., `num_layers`, `emb_dim`, `epochs`).
- Run:
python run_experiment.py
- `best_model.pth`: the best-scoring model checkpoint.
- `loss_curve.png`: training vs. validation loss plot.
- Scatter plots of predicted vs. reference values for each solvent.
- CSV files (e.g., `train_data.csv`, `test_data.csv`) with ground-truth and predicted values.
- Provide the path to the weights in `eval_trained.py`, or use the provided weights in `/weights`.
- Run:
python eval_trained.py
We estimate the solution-phase free energy change for the reduction

$$A + e^- \rightarrow A^-$$

using a thermodynamic cycle:

$$\Delta G_{\text{solution}} = \Delta G_{\text{gas}} + \Delta G_{\text{solv},A^-} - \Delta G_{\text{solv},A}$$

Here:

- $\Delta G_{\text{gas}}$ is the gas-phase electron attachment free energy (approximately $-\mathrm{EA}$).
- $\Delta G_{\text{solv},A}$ and $\Delta G_{\text{solv},A^-}$ are the solvation free energies of the neutral $A$ and the anion $A^-$, respectively.

The electrode potential is then

$$E = -\frac{\Delta G_{\text{solution}}}{nF} - E_{\text{abs}}(\text{ref})$$

where $n$ is the number of electrons transferred, $F$ is the Faraday constant, and $E_{\text{abs}}(\text{ref})$ is the absolute potential of the reference electrode.
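As a worked numerical example of the thermodynamic cycle, here is a minimal sketch. The numbers are purely illustrative (not from the dataset), and the reference value of ≈4.44 V is one commonly quoted absolute potential for the standard hydrogen electrode:

```python
# Thermodynamic cycle: dG_solution = dG_gas + dG_solv(A-) - dG_solv(A).
# All energies in eV; for a one-electron reduction, E in volts is numerically
# -dG_solution (in eV) shifted to the chosen reference electrode's scale.

def reduction_potential(ea_eV, dG_solv_neutral, dG_solv_anion, e_ref_abs=4.44):
    """Illustrative one-electron reduction potential vs. a reference electrode.

    ea_eV           : electron affinity, so dG_gas ≈ -EA
    dG_solv_neutral : solvation free energy of A (eV)
    dG_solv_anion   : solvation free energy of A- (eV)
    e_ref_abs       : absolute potential of the reference electrode (V)
    """
    dG_gas = -ea_eV
    dG_solution = dG_gas + dG_solv_anion - dG_solv_neutral
    return -dG_solution - e_ref_abs

# Illustrative inputs: EA = 1.0 eV; the anion is solvated much more strongly.
print(round(reduction_potential(1.0, -0.10, -2.20), 2))  # → -1.34
```

The anion's stronger solvation makes the solution-phase reduction more favourable than the gas-phase electron attachment alone, which is exactly the solvent-dependent correction the model learns on top of -EA.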
In this codebase, node (atom) and edge (bond) features are updated using multiple rounds of message passing. Each MPNN layer:
- Computes messages from neighboring nodes and edges.
- Produces updated node features, which we combine residually with the previous layer’s node features.
- Similarly, the edge states are updated in a residual fashion.
After the final layer, we concatenate the final node and edge embeddings.
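The update scheme above can be sketched in plain NumPy. This is a toy stand-in for the actual PyG layer in `model/mpnn_layer.py`: the dimensions, weight shapes, ReLU nonlinearities, and sum aggregation are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8                                    # node/edge feature dimension (illustrative)

def relu(x):
    return np.maximum(x, 0.0)

def mpnn_layer(h, e, edge_index, W_msg, W_node, W_edge):
    """One round of message passing with residual node and edge updates."""
    src, dst = edge_index                                     # directed edges j -> i
    msgs = relu(np.concatenate([h[src], e], axis=1) @ W_msg)  # messages from neighbours + edges
    agg = np.zeros_like(h)
    np.add.at(agg, dst, msgs)                                 # sum-aggregate per target node
    h_new = h + relu(np.concatenate([h, agg], axis=1) @ W_node)                     # residual node update
    e_new = e + relu(np.concatenate([h_new[src], h_new[dst], e], axis=1) @ W_edge)  # residual edge update
    return h_new, e_new

# Toy graph: 3 nodes, 2 directed edges (0->1, 1->2)
h = rng.normal(size=(3, D))
e = rng.normal(size=(2, D))
edge_index = (np.array([0, 1]), np.array([1, 2]))
W_msg  = rng.normal(size=(2 * D, D)) * 0.1
W_node = rng.normal(size=(2 * D, D)) * 0.1
W_edge = rng.normal(size=(3 * D, D)) * 0.1

h, e = mpnn_layer(h, e, edge_index, W_msg, W_node, W_edge)
print(h.shape, e.shape)  # → (3, 8) (2, 8)
```

The residual additions (`h + ...`, `e + ...`) keep feature dimensions fixed across layers, so layers can be stacked and early features are preserved through deep stacks.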
We pass the concatenated node-edge feature set through two parallel Set Transformer aggregations:
- One dedicated to predicting electron affinity (EA).
- Another to incorporate solvent information (via learnable embeddings of each solvent’s dielectric constant and refractive index) and generate the solvent-dependent correction.
By concatenating these aggregated representations with the solvent embeddings, the model predicts:
- The negative of the EA.
- The combined (-EA + solvent contribution) for each solvent.
Thus, each forward pass yields a multi-output prediction vector:

$$\hat{y} = \left[\,-\widehat{\mathrm{EA}},\ \widehat{RP}_{\text{ACN}},\ \widehat{RP}_{\text{H2O}},\ \widehat{RP}_{\text{THF}},\ \widehat{RP}_{\text{DMSO}},\ \widehat{RP}_{\text{DMF}}\,\right]$$

with one entry for the negative EA and one combined prediction per solvent.
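At the shape level, the readout can be sketched as follows. This is not the repository's `model/readout.py`: mean pooling stands in for the Set Transformer aggregations, the weight matrices are random, and the (dielectric constant, refractive index) pairs are illustrative descriptor values.

```python
import numpy as np

rng = np.random.default_rng(1)
D = 8                                     # embedding dimension (illustrative)
solvents = ["ACN", "H2O", "THF", "DMSO", "DMF"]
# (dielectric constant, refractive index) pairs per solvent -- illustrative values
descriptors = np.array([[37.5, 1.344], [78.4, 1.333], [7.6, 1.407],
                        [46.7, 1.479], [36.7, 1.430]])

def forward(node_edge_feats, W_ea, W_solv, W_emb):
    """Toy multi-output head: mean pooling stands in for the Set Transformer."""
    g_ea = node_edge_feats.mean(axis=0) @ W_ea      # pooled branch for -EA
    g_solv = node_edge_feats.mean(axis=0) @ W_solv  # pooled branch for corrections
    s_emb = descriptors @ W_emb                     # solvent embeddings from descriptors
    neg_ea = g_ea.sum()                             # scalar: predicted -EA
    corrections = s_emb @ g_solv                    # one correction per solvent
    return np.concatenate([[neg_ea], neg_ea + corrections])

x = rng.normal(size=(10, D))                        # concatenated node+edge features
out = forward(x, rng.normal(size=(D, D)), rng.normal(size=(D, D)),
              rng.normal(size=(2, D)) * 0.01)
print(out.shape)  # → (6,)  i.e. [-EA, -EA + correction per solvent]
```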
resolved_project/
├── Evolving_EA.ipynb # Example of generating new molecules in a target range of EA (evomol needed)
├── data_utils.py # Reading CSV, dataset creation, SMILES->PyG conversion
├── features.py # RDKit-based atom and bond feature extraction
├── model/
│ ├── mpnn_layer.py # Custom MPNN layer (MessagePassing in PyG)
│ ├── readout.py # Set Transformer-based readout + solvent embeddings
│ └── mpnn_model.py # Full model combining MPNN layers + readout
├── train.py # Setting up the model & hyperparameters, evaluating the training
├── train_loop.py # Main training loop
├── evaluate.py # Evaluation loop, metrics and plotting tools
├── eval_trained.py # Loading the weights, evaluating and plotting
└── run_experiment.py # Script orchestrating data loading, training, and evaluation
