Skip to main content

ORIGINAL RESEARCH article

Front. Mar. Sci.
Sec. Physical Oceanography
Volume 11 - 2024 | doi: 10.3389/fmars.2024.1492572
This article is part of the Research Topic Prediction Models and Disaster Assessment of Ocean Waves, and the Coupling Effects of Ocean Waves in Various Ocean-Air Processes View all 5 articles

Numerical investigation of the effective receptive field and its relationship with convolutional kernels and layers in convolutional neural network

Provisionally accepted
Longyu Jiang Longyu Jiang 1Quan Jin Quan Jin 1*Feng Hua Feng Hua 1Xingjie Jiang Xingjie Jiang 2ZEYU WANG ZEYU WANG 1,2Wei Gao Wei Gao 1Fuhua Huang Fuhua Huang 1Can Fang Can Fang 1Yongzeng Yang Yongzeng Yang 2
  • 1 Shantou University, Shantou, China
  • 2 First Institute of Oceanography, Ministry of Natural Resources, Qingdao, Shandong Province, China

The final, formatted version of the article will be published soon.

    The receptive field (RF) plays a crucial role in convolutional neural networks (CNNs) because it determines the amount of input information that each neuron in a CNN can perceive, which directly affects the feature extraction ability. As the number of convolutional layers in CNNs increases, there is a decay of the RF according to the two-dimensional Gaussian distribution. Thus, an effective receptive field (ERF) can be used to characterize the available part of the RF. The ERF is calculated by the kernel size and layer number within the neural network architecture. Currently, ERF calculation methods are typically applied to single-channel input data that are both independent and identically distributed. However, such methods may result in a loss of effective information if they are applied to more general (i.e., multi-channel) datasets. Therefore, we proposed a multi-channel ERF calculation method. By conducting a series of numerical experiments, we determined the relationship between the ERF and the convolutional kernel size in conjunction with the layer number.To validate the new method, we used the recently published global wave surrogate model for climate simulation (GWSM4C) and its accompanying dataset. According to the newly established relationship, we refined the kernel size and layer number in each neural network of the GWSM4C to produce the same ERF but lower RF attenuation rates than those of the original version. By visualizing the gradient map at several points in West African and East Pacific areas, the high gradient value regions confirmed the known swell sources, which indicated effective feature extraction in these areas. Furthermore, the new version of the GWSM4C yielded better prediction accuracy for significant wave height in global swell pools. The root mean square errors in the West African and East Pacific regions reduced from approximately 0.3 m, in the original model to about 0.15 m, in the new model. Moreover, these improvements were attributed to the higher efficiency of the newly modified neural network structure that allows the inclusion of more historical winds while maintaining acceptable computational consumption.

    Keywords: Convolutional Neural Network, Effective Receptive Field, multi-channel samples, Gradient map, significant wave height, GWSM4C

    Received: 07 Sep 2024; Accepted: 14 Oct 2024.

    Copyright: © 2024 Jiang, Jin, Hua, Jiang, WANG, Gao, Huang, Fang and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Quan Jin, Shantou University, Shantou, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.