PyTorch implementation of our paper *Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery*.
Our code is based on Hierarchical Multi-Scale Attention for Semantic Segmentation [github].
- The code is tested with PyTorch 1.3 and Python 3.6.
- Create a directory where you can keep large files. Ideally, not in this directory.

  > mkdir <large_asset_dir>

- Update `__C.ASSETS_PATH` in `config.py` to point at that directory:

  __C.ASSETS_PATH = <large_asset_dir>
- Download pretrained weights from Google Drive and put them into `<large_asset_dir>/seg_weights`.
- Download the UAVid dataset, then update `config.py` to set the path:

  __C.DATASET.UAVID_DIR = <path_to_uavid>

The instructions below make use of a tool called runx, which we find useful to help automate experiment running and summarization. For more information about this tool, please see runx.
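For orientation, a runx experiment file is a small yml that describes the command to run and the hyperparameters to sweep. The sketch below is hypothetical (field values invented for illustration); the repository's actual `scripts/*.yml` files are the authoritative versions.

```yml
# Hypothetical runx experiment file, for illustration only.
# See the repository's scripts/*.yml for the real configurations.
CMD: 'python train.py'
HPARAMS:
  dataset: uavid
  lr: [0.01, 0.02]   # a list makes runx launch one run per value
```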
In general, you can either use the runx-style command lines shown below, or call `python train.py <args ...>` directly if you like.
Train on uavid2020, using DeepLabV3+ + WRN-38 + bidirectional multi-scale attention with a pretrained model:

> python -m runx.runx scripts/train_uavid_deepv3MS_bimsa.yml -i

Before running inference, the path to the snapshot model needs to be configured in the cfg file "eval_uavid_deepv3MS_bimsa.yml":

snapshot: <path_to_file.pth>

Then run inference:

> python -m runx.runx scripts/eval_uavid_deepv3MS_bimsa.yml -i
The inference output will be saved in the directory:
logs/eval_uavid_deepv3MS_bimsa/<random folder name>/submit
The output label images from the model are 8-bit and need to be converted to 24-bit for evaluation. A simple conversion script is provided; it converts all images in the specified folder, recursively and in place.
> python utils/img_conversion.py -d ./logs/eval_uavid_deepv3MS_bimsa/<random folder name>/submit/

You can then simply zip the subfolders for online benchmark submission.
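For reference, the in-place 8-bit to 24-bit conversion could be sketched as below. This is a minimal stand-in assuming Pillow is installed; the repository's `utils/img_conversion.py` is the authoritative implementation and may differ in detail.

```python
# Minimal sketch of the 8-bit -> 24-bit label conversion (hypothetical;
# the repository's utils/img_conversion.py is the real script).
import os
import sys

from PIL import Image


def convert_to_24bit(folder):
    """Recursively convert all 8-bit PNG label images under `folder`
    to 24-bit RGB, overwriting each file in place."""
    for root, _dirs, files in os.walk(folder):
        for name in files:
            if not name.lower().endswith(".png"):
                continue
            path = os.path.join(root, name)
            with Image.open(path) as img:
                # "P" is 8-bit palette mode, "L" is 8-bit grayscale.
                if img.mode in ("P", "L"):
                    img.convert("RGB").save(path)


if __name__ == "__main__" and len(sys.argv) > 1:
    convert_to_24bit(sys.argv[1])
```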
An example of the blended output is shown below:

Our trained weights, achieving a 70.8% mIoU score, are available from Google Drive:
https://drive.google.com/file/d/1jMVOHfHtO-z_eIq9cmA2GTo_MPF02ars/view?usp=sharing
If you use this toolbox or benchmark in your research, please cite this project.
@article{lyu2021bidirectional,
title = {Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery},
author = {Ye Lyu and George Vosselman and Gui-Song Xia and Michael Ying Yang},
journal = {arXiv preprint arXiv:2102.03099},
year = {2021}
}