Multistage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images

Rui Li*, Shunyi Zheng, Chenxi Duan, Jianlin Su, Ce Zhang

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

145 Citations (Scopus)

Abstract

The attention mechanism can refine the extracted feature maps and boost the classification performance of the deep network, which has become an essential technique in computer vision and natural language processing. However, the memory and computational costs of the dot-product attention mechanism increase quadratically with the spatiotemporal size of the input. Such growth hinders the usage of attention mechanisms considerably in application scenarios with large-scale inputs. In this letter, we propose a linear attention mechanism (LAM) to address this issue, which is approximately equivalent to dot-product attention with computational efficiency. Such a design makes the incorporation between attention mechanisms and deep networks much more flexible and versatile. Based on the proposed LAM, we refactor the skip connections in the raw U-Net and design a multistage attention ResU-Net (MAResU-Net) for semantic segmentation from fine-resolution remote sensing images. Experiments conducted on the Vaihingen data set demonstrated the effectiveness and efficiency of our MAResU-Net. Our code is available at <uri>https://github.com/lironui/MAResU-Net</uri>.

Original languageEnglish
Article number8009205
Number of pages5
JournalIEEE Geoscience and Remote Sensing Letters
Volume19
DOIs
Publication statusPublished - 15 Mar 2021

Bibliographical note

Publisher Copyright:
1558-0571 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.

Keywords

  • Complexity theory
  • Decoding
  • Feature extraction
  • Image segmentation
  • Remote sensing
  • Semantics
  • Task analysis

Fingerprint

Dive into the research topics of 'Multistage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images'. Together they form a unique fingerprint.

Cite this