PlumX Metrics
Embed PlumX Metrics

A spatial-temporal attention-based method and a new dataset for remote sensing image change detection

Remote Sensing, ISSN: 2072-4292, Vol: 12, Issue: 10
2020
  • 1,163
    Citations
  • 0
    Usage
  • 317
    Captures
  • 1
    Mentions
  • 1
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

  • Citations
    1,163
    • Citation Indexes
      1,160
    • Policy Citations
      2
      • 2
    • Patent Family Citations
      1
      • 1
  • Captures
    317
  • Mentions
    1
    • Blog Mentions
      1
      • 1
  • Social Media
    1
    • Shares, Likes & Comments
      1
      • Facebook
        1

Most Recent Blog

2022-12-31: Paper Summary: "Beyond Classifiers: Remote Sensing Change Detection with Metric Learning" Zhang et al.

Semantic mapping of changes between images using Triplet Loss Metric Learning, Fig 8. from Zhang et al. I talked about two kinds of trust in my previous two posts, Evaluating Trust in User-Data Networks: What Can We Learn from Waze? and Trust Management in Multi-Agent Systems via Deep Reinforcement Learning. In the former, we looked at trust as a measure of the accuracy of data provided by user an

Article Description

Remote sensing image change detection (CD) is done to identify desired significant changes between bitemporal images. Given two co-registered images taken at different times, the illumination variations and misregistration errors overwhelm the real object changes. Exploring the relationships among different spatial-temporal pixels may improve the performances of CD methods. In our work, we propose a novel Siamese-based spatial-temporal attention neural network. In contrast to previous methods that separately encode the bitemporal images without referring to any useful spatial-temporal dependency, we design a CD self-attention mechanism to model the spatial-temporal relationships. We integrate a new CD self-attention module in the procedure of feature extraction. Our self-attention module calculates the attention weights between any two pixels at different times and positions and uses them to generate more discriminative features. Considering that the object may have different scales, we partition the image into multi-scale subregions and introduce the self-attention in each subregion. In this way, we could capture spatial-temporal dependencies at various scales, thereby generating better representations to accommodate objects of various sizes. We also introduce a CD dataset LEVIR-CD, which is two orders of magnitude larger than other public datasets of this field. LEVIR-CD consists of a large set of bitemporal Google Earth images, with 637 image pairs (1024 x 1024) and over 31 k independently labeled change instances. Our proposed attention module improves the F1-score of our baseline model from 83.9 to 87.3 with acceptable computational overhead. Experimental results on a public remote sensing image CD dataset show our method outperforms several other state-of-the-art methods.

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know