🏥 CURL: Contrastive Ultrasound Video Representation Learning

A novel self-supervised framework for fetal movement detection from extended ultrasound video recordings

🎯 Overview Paper

CURL (Contrastive Ultrasound Video Representation Learning) is a cutting-edge self-supervised framework designed specifically for fetal movement assessment from ultrasound videos. Our method employs a dual-contrastive loss that captures both spatial (anatomical) and temporal (motion-based) features, enabling robust representation learning for fetal movement dynamics.

🔬 Key Innovations

🎭 Dual-Contrastive Learning: Combines spatial (SimCLR-style NT-Xent) and temporal contrastive objectives
🎯 Task-Specific Sampling: Intelligent sampling strategy for movement vs. non-movement segments
🔄 Flexible Inference: Supports ultrasound recordings of arbitrary length through probabilistic fine-tuning
🏗️ Modular Architecture: Support for both SlowFast and Vision Transformer (ViT) backbones

Pipeline Overview: Starting from expertly annotated ultrasound videos (A), CURL splits clips into spatiotemporal patches (B), uses transformer backbones with dual-contrastive learning to extract robust features, fine-tunes with lightweight classifiers (C), and delivers clinically reliable fetal movement detection (D).

🚀 Quick Start

📋 Prerequisites

Python 3.8+
CUDA-capable GPU (recommended)
16GB+ RAM for video processing

🔧 Installation

Clone the repository

git clone https://github.com/Mr-TalhaIlyas/CURL.git
cd CURL

Create virtual environment

# Using conda (recommended)
conda create -n curl python=3.8
conda activate curl

# Or using pip
python -m venv curl
source curl/bin/activate  # Linux/Mac
curl\Scripts\activate     # Windows

Install dependencies

# Using pip
pip install -r requirements.txt

# Or using conda
conda create --name curl --file requirements.txt

📁 Data Preparation

Organize your data structure:

data/
├── videos/           # Raw ultrasound videos (.mp4)
├── optical_flow/     # Optical flow videos (.mp4) 
├── labels/           # Label files (.npy)
└── folds/           # Train/test split files
    ├── train_fold_1.txt
    ├── test_fold_1.txt
    └── ...

Update configuration:

# In configs/config.py
config = dict(
    vid_dir = "path/to/videos/",
    flow_dir = "path/to/optical_flow/", 
    lbl_dir = "path/to/labels/",
    folds = "path/to/folds/"
)

🎓 Training

1. 🔄 Self-Supervised Pre-training

Choose between two backbone architectures:

SlowFast + Dual Contrastive Loss

# Train with both spatial and temporal losses
python dual_contrastive_main.py \
    --enable_temporal_loss \
    --spatial_loss_weight 1.0 \
    --temporal_loss_weight 0.5 \
    --dual_loss_mode both \
    --epochs 100

# Spatial-only training
python dual_contrastive_main.py \
    --spatial_loss_weight 1.0 \
    --dual_loss_mode spatial_only

Vision Transformer (ViT) + Dual Contrastive Loss

# MAE-style contrastive learning
python run_mae_contrastive.py \
    --enable_temporal_loss \
    --spatial_loss_weight 1.0 \
    --temporal_loss_weight 0.7 \
    --embed_dim 1024 \
    --depth 24

2. 🎯 Fine-tuning for Classification

# Fine-tune pre-trained contrastive model
python run_finetune.py \
    --model_type contrastive_mae \
    --checkpoint_path /path/to/pretrained_model.pth \
    --epochs 30 \
    --lr 2e-4 \
    --loss_type focal

# Fine-tune standard MAE model
python run_finetune.py \
    --model_type standard_mae \
    --checkpoint_path /path/to/mae_model.pth \
    --epochs 30

🏗️ Architecture

🎭 Dual-Contrastive Loss Framework

# Spatial Contrastive Loss (NT-Xent)
spatial_loss = NT_XentLoss(spatial_features_i, spatial_features_j)

# Temporal Contrastive Loss (TC)
temporal_loss = temporal_contrastive_loss(
    temporal_features_i, 
    temporal_features_j, 
    temperature, 
    clusters=8
)

# Combined Loss
total_loss = α * spatial_loss + β * temporal_loss

🔧 Supported Architectures

Model	Backbone	Key Features
SimCLR + SlowFast	SlowFast ResNet	Two-stream processing for spatial-temporal features
Contrastive MAE	Vision Transformer	Patch-based processing with attention mechanisms
Hybrid Models	Custom	Combine benefits of both approaches

⚙️ Configuration

🔧 Key Parameters

# Dual contrastive learning
enable_temporal_loss = True
spatial_loss_weight = 1.0
temporal_loss_weight = 0.5
temperature_spatial = 0.5
temperature_temporal = 0.1

# Temporal contrastive loss
tc_clusters = 8
tc_num_iters = 10
tc_do_entro = True  # Enable IID regularization

# Model architecture  
mae_contrastive = dict(
    embed_dim = 1024,
    depth = 24,
    num_heads = 16,
    projection_dim = 256,
    temporal_projection_dim = 128
)

📁 Project Structure

CURL/
├── 📄 README.md
├── 📋 requirements.txt
├── 📂 scripts/
│   ├── 🔧 configs/
│   │   └── config.py
│   ├── 📊 data/
│   │   ├── simclr_loader.py
│   │   ├── dataloader.py
│   │   └── utils.py
│   ├── 🏗️ models/
│   │   ├── mae/
│   │   ├── slowfast/
│   │   └── contrastive_mae.py
│   ├── 🛠️ tools/
│   │   ├── nt_xnet.py           # Spatial contrastive loss
│   │   ├── tc_loss.py           # Temporal contrastive loss  
│   │   └── simclr_training.py
│   ├── 🎓 Training Scripts
│   │   ├── main_simclr.py
│   │   ├── main_mae_contrastive.py
│   │   └── finetune_contrastive_mae.py
│   └── 🚀 Run Scripts
│       ├── dual_contrastive_main.py
│       ├── run_mae_contrastive.py
│       └── run_finetune.py
└── 📸 screens/
    └── summary.jpg

🔬 Technical Details

🎯 Loss Functions

Spatial Contrastive Loss (NT-Xent)

Based on SimCLR framework
Learns anatomical feature representations
Temperature-scaled InfoNCE loss

Temporal Contrastive Loss (TC)

Novel clustering-based approach
Learns motion dynamics
Combines Cross-Level Distillation (CLD) and IID regularization

🎭 Data Augmentation

Spatial: Random cropping, color jittering, Gaussian blur
Temporal: Frame dropping, temporal jittering
Domain-specific: Ultrasound-aware transformations

📚 Citation

If you find this work useful, please cite our paper:

Paper is currently under review.

🐛 Issues

Found a bug? Please open an issue with:

Detailed description
Steps to reproduce
Environment details
Expected vs actual behavior

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Thanks to the medical imaging community for inspiration
Built upon excellent work in self-supervised learning
Special thanks to SimCLR and MAE teams

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
configs		configs
data		data
dataset		dataset
models		models
screens		screens
tests		tests
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
finetune_mae.py		finetune_mae.py
finetuner.py		finetuner.py
main_mae_cont.py		main_mae_cont.py
main_simclr.py		main_simclr.py
main_supervised.py		main_supervised.py
requirements.txt		requirements.txt
run_exp.py		run_exp.py
run_exp_mae.py		run_exp_mae.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏥 CURL: Contrastive Ultrasound Video Representation Learning

🎯 Overview Paper

🔬 Key Innovations

🚀 Quick Start

📋 Prerequisites

🔧 Installation

📁 Data Preparation

🎓 Training

1. 🔄 Self-Supervised Pre-training

SlowFast + Dual Contrastive Loss

Vision Transformer (ViT) + Dual Contrastive Loss

2. 🎯 Fine-tuning for Classification

🏗️ Architecture

🎭 Dual-Contrastive Loss Framework

🔧 Supported Architectures

⚙️ Configuration

🔧 Key Parameters

📁 Project Structure

🔬 Technical Details

🎯 Loss Functions

Spatial Contrastive Loss (NT-Xent)

Temporal Contrastive Loss (TC)

🎭 Data Augmentation

📚 Citation

🐛 Issues

📄 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🏥 CURL: Contrastive Ultrasound Video Representation Learning

🎯 Overview Paper

🔬 Key Innovations

🚀 Quick Start

📋 Prerequisites

🔧 Installation

📁 Data Preparation

🎓 Training

1. 🔄 Self-Supervised Pre-training

SlowFast + Dual Contrastive Loss

Vision Transformer (ViT) + Dual Contrastive Loss

2. 🎯 Fine-tuning for Classification

🏗️ Architecture

🎭 Dual-Contrastive Loss Framework

🔧 Supported Architectures

⚙️ Configuration

🔧 Key Parameters

📁 Project Structure

🔬 Technical Details

🎯 Loss Functions

Spatial Contrastive Loss (NT-Xent)

Temporal Contrastive Loss (TC)

🎭 Data Augmentation

📚 Citation

🐛 Issues

📄 License

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages