Joint Imbalance Adaptation for Radiology Report Generation (JIMA)

Source code for our paper "Joint Imbalance Adaptation for Radiology Report Generation", which addresses the data imbalance challenge in medical report generation.

Citation

Please cite our work as:

@article{li2025joint,
  title={Joint Imbalance Adaptation for Radiology Report Generation},
  author={Li, Wang and Han, Guangzeng and Wu, Yuexin and Huang, I.-Chan and Huang, Xiaolei},
  journal={Journal of Healthcare Informatics Research},
  pages={1--23},
  year={2025},
  publisher={Springer},
  doi={10.1007/s41666-025-00205-9},
  url={https://link.springer.com/article/10.1007/s41666-025-00205-9}
}

Data Imbalance Challenge

Radiology report generation faces two critical imbalance challenges:

- Token Imbalance: Medical tokens appear less frequently than regular tokens but contain crucial clinical information.
- Label Imbalance: Normal cases dominate datasets (>85% in MIMIC-CXR), leading to poor performance on abnormal cases.

Together, these imbalances cause models to overfit to frequent patterns and underperform on rare but clinically important cases.
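Both kinds of imbalance can be measured directly on a corpus before training. The snippet below is a minimal sketch using only the standard library; the function names, the `rare_threshold` parameter, and the toy reports are assumptions made for illustration and are not part of this repository.

```python
from collections import Counter


def token_frequency_split(reports, rare_threshold=2):
    """Count whitespace tokens and split the vocabulary into frequent and rare sets.

    rare_threshold is a hypothetical cut-off: tokens seen fewer times are 'rare'.
    """
    counts = Counter(tok for report in reports for tok in report.lower().split())
    rare = {tok for tok, c in counts.items() if c < rare_threshold}
    frequent = set(counts) - rare
    return counts, frequent, rare


def abnormal_ratio(labels):
    """Fraction of samples labelled abnormal (1) rather than normal (0)."""
    return sum(labels) / len(labels)


# Toy corpus, invented for illustration only.
reports = [
    "the lungs are clear",
    "the heart size is normal",
    "the lungs are clear",
    "small left pleural effusion",
]
labels = [0, 0, 0, 1]

counts, frequent, rare = token_frequency_split(reports)
print(sorted(frequent))        # tokens seen at least twice
print(sorted(rare))            # tokens seen once, including "effusion"
print(abnormal_ratio(labels))  # share of abnormal cases
```

In this toy corpus the clinically informative tokens ("pleural", "effusion") fall into the rare set, and only one report in four is abnormal, which mirrors the two imbalances described above on a small scale.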

JIMA: A Joint Imbalance Adaptation Approach

We propose Joint Imbalance Adaptation (JIMA), a curriculum learning-based approach that adapts training to both token and label imbalance.

Method Overview

JIMA employs a two-stage curriculum learning approach built from the following components:

- Entity Distribution Prediction: Extracts clinical entities to guide report generation.
- Joint Feature Fusion: Combines image and entity features through cross-concatenation and element-wise multiplication.
- Adaptive Training: Selects training samples dynamically based on a difficulty assessment.
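The fusion and sample-selection ideas can be illustrated with a small sketch. This is not the code in `models/` or `modules/`: the function names, the use of plain Python lists instead of tensors, the exact layout of the fused vector, and the easiest-first selection rule are all assumptions chosen to keep the example self-contained.

```python
def fuse_features(image_feat, entity_feat):
    """Concatenate both feature vectors with their element-wise product.

    Assumed layout: [image, entity, image * entity]. The README names the
    operations but not the layout, so this is only one plausible reading.
    """
    if len(image_feat) != len(entity_feat):
        raise ValueError("feature vectors must have the same length")
    product = [i * e for i, e in zip(image_feat, entity_feat)]
    return list(image_feat) + list(entity_feat) + product


def select_by_difficulty(losses, keep_fraction):
    """Return indices of the easiest samples (lowest loss), in original order.

    Assumption: difficulty is approximated by the per-sample loss, and the
    curriculum starts with easy samples. keep_fraction is hypothetical.
    """
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    n_keep = max(1, int(keep_fraction * len(losses)))
    ranked = sorted(range(len(losses)), key=lambda i: losses[i])
    return sorted(ranked[:n_keep])


fused = fuse_features([1.0, 2.0], [3.0, 4.0])
print(fused)  # image values, entity values, then their products

losses = [0.9, 0.1, 0.5, 0.3]
print(select_by_difficulty(losses, keep_fraction=0.5))  # the two lowest-loss samples
print(select_by_difficulty(losses, keep_fraction=1.0))  # all samples
```

Raising `keep_fraction` over the course of training would gradually expose the model to harder samples, which is the general pattern of curriculum learning; the schedule used by JIMA is described in the paper.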

Experimental Results

Key Performance Gains

- IU X-ray: 16.75%-50.50% average improvement, 72.10% clinical F1 improvement
- MIMIC-CXR: 9.59%-16.26% average improvement, 31.29% clinical F1 improvement
- Imbalance Handling: Significant improvements on low-frequency tokens and abnormal cases
- Human Evaluation: Medical experts prefer JIMA for clinical accuracy (32 vs 21 votes overall)
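For readers unfamiliar with clinical F1, the sketch below shows one common way such a score is computed: binary clinical labels are extracted from the reference and generated reports by an automatic labeler (not shown), and a micro-averaged F1 is taken over all labels. Whether `compute_ce.py` follows exactly this recipe is not stated here, so treat the function name, the averaging choice, and the toy label vectors as assumptions.

```python
def micro_prf(reference_labels, generated_labels):
    """Micro-averaged precision, recall, and F1 over binary label vectors.

    Each argument is a list of reports; each report is a list of 0/1 flags,
    one per clinical observation, in the same order for both arguments.
    """
    tp = fp = fn = 0
    for ref, gen in zip(reference_labels, generated_labels):
        for r, g in zip(ref, gen):
            if r and g:
                tp += 1  # observation present in both reports
            elif g and not r:
                fp += 1  # generated report asserts something absent
            elif r and not g:
                fn += 1  # generated report misses a finding
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0, 0.0, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy labels for two reports and three observations, invented for illustration.
reference = [[1, 0, 1], [0, 0, 1]]
generated = [[1, 0, 0], [0, 1, 1]]

precision, recall, f1 = micro_prf(reference, generated)
print(precision, recall, f1)
```

Because the score depends only on which findings are asserted, it rewards clinically correct reports even when their wording differs from the reference, which is why it complements the token-overlap metrics.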

Test Platform

Python 3.10, PyTorch 2.6, CUDA-enabled GPU recommended

Experiment Preparation

Environment Setup:

See environment.yml.

Data Preprocessing:
- Download the IU X-ray dataset from OpenI.
- Download the MIMIC-CXR dataset from PhysioNet.
- Add entities to the datasets using RadGraph-preprocess.py.

Model Training:

Note: Replace file_path or dataset_name in script/train_iu_xray.slurm with your actual path or dataset name.
```
# Joint training (recommended)
cd script/
sbatch train_iu_xray.slurm
```
Evaluation:
```
cd script/
sbatch plot_iu_xray.slurm
```

License

This project is licensed under the MIT License - see the LICENSE file for details.
