Self-Supervised Pretraining of ECG Signals Using Contrastive Learning

This repository implements self-supervised contrastive learning for 12-lead ECG signals using several augmentation strategies and encoder architectures. After pretraining on a large unlabeled ECG dataset, the learned encoder is fine-tuned for a downstream binary classification task.

🧪 Pretraining Phase

Self-supervised contrastive learning is performed using positive pairs from augmented views of the same ECG signal.
Training runs on a large unlabeled ECG dataset with an NT-Xent (or similar) contrastive loss, which pulls the representations of views of the same signal together and pushes the representations of different signals apart.
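
To make the objective concrete, here is a minimal NumPy sketch of an NT-Xent loss. It is an illustration only, not the code in train.py: the function name `nt_xent_loss`, the `temperature` default, and the use of NumPy rather than a deep learning framework are all assumptions.

```python
import numpy as np

def nt_xent_loss(z1, z2, temperature=0.1):
    """NT-Xent loss for paired embeddings (illustrative sketch).

    z1, z2: arrays of shape (N, D). Row i of z1 and row i of z2 are
    embeddings of two augmented views of the same ECG.
    """
    n = z1.shape[0]
    z = np.concatenate([z1, z2], axis=0)                       # (2N, D)
    z = z / (np.linalg.norm(z, axis=1, keepdims=True) + 1e-12)  # unit norm
    sim = (z @ z.T) / temperature                              # scaled cosine similarity
    np.fill_diagonal(sim, -np.inf)                             # a view is never its own negative

    # Row i's positive partner is row i + N (and vice versa).
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])

    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - row_max - np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True))

    # Cross-entropy with the positive partner as the target class.
    return -log_prob[np.arange(2 * n), pos].mean()
```

The loss is low when each embedding is most similar to its own partner view and high when partners are no closer than the other signals in the batch.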

🔁 Contrastive Augmentation Strategies

Defined in CL_augmentations.py, these augmentations provide diverse views of the same ECG signal:

Time Warping: Alternating segments of the ECG are stretched or compressed to simulate temporal warping.
Permutation: ECG signals are split into m segments and randomly shuffled.
Zero Masking: Consecutive portions of the ECG are set to zero.
Dropout Masking: Randomly zeros out 10% of signal values per lead in each batch.
Gaussian Noise: Adds noise scaled to signal magnitude for robustness.
CLOCS Augmentation: Implements spatial, temporal, and patient-level contrast based on CLOCS (Kiyasseh et al., ICML 2021).
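
Four of the simpler augmentations above could look roughly like the NumPy sketch below. It is not the code in CL_augmentations.py: the function names, the `(leads, samples)` array layout, and every default (`m=4`, `mask_fraction=0.2`, `scale=0.05`) are assumptions chosen for illustration. Time warping and the CLOCS views are left out for brevity.

```python
import numpy as np

def permute_segments(x, m=4, rng=None):
    """Split the time axis into m segments and shuffle their order."""
    rng = np.random.default_rng() if rng is None else rng
    segments = np.array_split(x, m, axis=-1)
    order = rng.permutation(m)
    return np.concatenate([segments[i] for i in order], axis=-1)

def zero_mask(x, mask_fraction=0.2, rng=None):
    """Set one consecutive window of samples to zero in every lead."""
    rng = np.random.default_rng() if rng is None else rng
    length = x.shape[-1]
    width = int(mask_fraction * length)
    start = rng.integers(0, length - width + 1)
    out = x.copy()
    out[..., start:start + width] = 0.0
    return out

def dropout_mask(x, p=0.1, rng=None):
    """Zero each sample independently with probability p (about 10% per lead)."""
    rng = np.random.default_rng() if rng is None else rng
    keep = rng.random(x.shape) >= p
    return x * keep

def gaussian_noise(x, scale=0.05, rng=None):
    """Add zero-mean noise whose std is a fraction of each lead's std."""
    rng = np.random.default_rng() if rng is None else rng
    sigma = scale * x.std(axis=-1, keepdims=True)
    return x + rng.normal(size=x.shape) * sigma

# Two different views of the same (synthetic) 12-lead recording.
rng = np.random.default_rng(0)
ecg = rng.normal(size=(12, 5000))  # placeholder data, not a real ECG
view_a = gaussian_noise(permute_segments(ecg, rng=rng), rng=rng)
view_b = zero_mask(dropout_mask(ecg, rng=rng), rng=rng)
```

Passing an explicit `rng` keeps each view reproducible, which helps when comparing augmentation strategies across seeds.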

🧠 Encoder Architectures

Implemented in models.py, multiple encoders are supported to extract meaningful ECG representations:

CNN – Temporal filters for local pattern learning.
CNN-LSTM – Combines convolution with temporal memory.
CNN-Attention-LSTM – Adds attention over LSTM outputs.
CNN-Transformer – Combines convolutional front-end with self-attention layers.

🔄 Fine-Tuning Phase

After pretraining:

A classifier is added on top of the pretrained encoder.
The full model is fine-tuned end-to-end using a limited labeled ECG dataset.

📊 Experimentation Strategy

Implemented in train.py, all experiments follow a repeated random sub-sampling protocol:

Randomly split patients into train, validation, and test sets.
Train the model on the training set.
Use validation performance to select the best checkpoint.
Evaluate on the held-out test set.
Repeat the full process K times with different seeds.
Report mean ± confidence interval for test performance.
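
The protocol above can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the code in train.py: the 70/15/15 split fractions, the normal-approximation 95% interval, and the `train_and_evaluate` callback (a hypothetical function that trains, selects the best checkpoint on validation data, and returns the test metric) are all made up for the example.

```python
import random
import statistics

def split_patients(patient_ids, seed, val_frac=0.15, test_frac=0.15):
    """Shuffle unique patient IDs and cut them into train/val/test lists."""
    ids = sorted(set(patient_ids))       # split by patient, not by recording
    random.Random(seed).shuffle(ids)     # seeded so each repeat is reproducible
    n_test = round(len(ids) * test_frac)
    n_val = round(len(ids) * val_frac)
    test = ids[:n_test]
    val = ids[n_test:n_test + n_val]
    train = ids[n_test + n_val:]
    return train, val, test

def mean_and_ci(scores, z=1.96):
    """Mean and half-width of a normal-approximation 95% interval (assumed)."""
    mean = statistics.mean(scores)
    half_width = z * statistics.stdev(scores) / len(scores) ** 0.5
    return mean, half_width

def run_protocol(patient_ids, train_and_evaluate, k=5):
    """Repeat split, train, select, and test K times with seeds 0..K-1."""
    scores = []
    for seed in range(k):
        train, val, test = split_patients(patient_ids, seed)
        # Hypothetical callback: returns the test metric of the checkpoint
        # that scored best on the validation set.
        scores.append(train_and_evaluate(train, val, test, seed))
    return mean_and_ci(scores)
```

Splitting on patient IDs rather than on individual recordings keeps all ECGs from one patient in a single partition, so test performance is not inflated by patient overlap.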
