Skip to main content

ORIGINAL RESEARCH article

Front. Comput. Sci.
Sec. Computer Graphics and Visualization
Volume 6 - 2024 | doi: 10.3389/fcomp.2024.1455963

DMPNet: Dual-Path and Multi-Scale Pansharpening Network

Provisionally accepted
Gurpreet Kaur Gurpreet Kaur 1Manisha Malhotra Manisha Malhotra 1Dilbag Singh Dilbag Singh 2Sunita Singhal Sunita Singhal 3*
  • 1 Chandigarh University, Mohali, Punjab, India
  • 2 Lovely Professional University, Phagwara, Punjab, India
  • 3 Manipal University Jaipur, Jaipur, Rajasthan, India

The final, formatted version of the article will be published soon.

    Pansharpening is an important remote sensing task that attempts to produce high-resolution multispectral (MS) images by combining low-resolution MS images with high-resolution panchromatic (PAN) images. Though deep learning-based pansharpening has shown impressive results, majority of these models frequently struggle to balance spatial and spectral information, resulting in artifacts and a loss of detail in pansharpened images. Furthermore, these models may fail to properly integrate spatial and spectral information, resulting in poor performance in complex scenarios. These models also encounter issues including gradient vanishing and overfitting. Therefore, this paper proposes a dual-path and multi-scale pansharpening network (DMPNet). It consists three modules such as the feature extraction module (FEM), the multi-scale adaptive attention fusion module (MSAAF), and the image reconstruction module (IRM). The FEM is designed using two paths such as primary and secondary path. The primary path captures global spatial and spectral information using dilated convolutions. The secondary path focuses on fine-grained details using shallow convolutions and attention-guided feature extraction. The MSAAF module adaptively combines spatial and spectral data across different scales, employing a self-calibrated attention (SCA) mechanism for dynamic weighting of local and global contexts and a spectral alignment network (SAN) to assure spectral consistency. Finally, to achieve optimal spatial and spectral reconstruction, the IRM decomposes the fused features into low-and high-frequency components using discrete wavelet transform (DWT). Extensive experimental results and evaluations reveal that the DMPNet is more efficient and robust than competing pansharpening models.

    Keywords: Pansharpening, remote sensing, deep learning, image reconstruction, Spatial and Spectral Fidelity

    Received: 27 Jun 2024; Accepted: 26 Dec 2024.

    Copyright: © 2024 Kaur, Malhotra, Singh and Singhal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Sunita Singhal, Manipal University Jaipur, Jaipur, 303007, Rajasthan, India

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.