Skip to main content

HYPOTHESIS AND THEORY article

Front. Mech. Eng.
Sec. Fluid Mechanics
Volume 10 - 2024 | doi: 10.3389/fmech.2024.1397131
This article is part of the Research Topic Hybrid Modeling - Blending Physics with Data View all 3 articles

On the Generalization Discrepancy of Spatiotemporal Dynamics-informed Graph Convolutional Networks

Provisionally accepted
Yue Sun Yue Sun 1Chao Chen Chao Chen 1Yuesheng Xu Yuesheng Xu 1*Sihong Xie Sihong Xie 2Rick S. Blum Rick S. Blum 1*Parv Venkitasubramaniam Parv Venkitasubramaniam 1*
  • 1 Lehigh University, Bethlehem, United States
  • 2 Hongkong University of Science and Technology(Guangzhou), Guangzhou, China

The final, formatted version of the article will be published soon.

    Graph Neural Networks (GNNs) have gained significant attention in diverse domains, ranging from urban planning to pandemic management. Ensuring both accuracy and robustness in GNNs remains a challenge, due to insufficient quality data containing sufficient features. With sufficient training data where all spatiotemporal patterns are well-represented, existing GNN models can make reasonably accurate predictions. However, existing methods fail when the training data are drawn from different circumstances (e.g., traffic patterns on regular days) compared to test data (e.g., traffic patterns after a natural disaster). Such challenges are usually classified under domain generalization. In this work, we show that one way to address this challenge in the context of spatiotemporal prediction is by incorporating domain differential equations into Graph Convolutional Networks (GCNs). We theoretically derive conditions where GCNs incorporating such domain differential equations are robust to mismatched training and testing data compared to baseline domain agnostic models. To support our theory, we propose two domain-differentialequation-informed networks called Reaction-Diffusion Graph Convolutional Network (RDGCN), which incorporates differential equations for traffic speed evolution, and Susceptible-Infectious-Recovered Graph Convolutional Network (SIRGCN), which incorporates a disease propagation model. Both RDGCN and SIRGCN are based on reliable and interpretable domain differential equations that allows the models to generalize to unseen patterns. We experimentally show that RDGCN and SIRGCN are more robust with mismatched testing data than the state-of-the-art deep learning methods.

    Keywords: ODE-based computation model, graph convolutional networks, out-of-distribution generalization, spatiotemporal prediction, reaction-diffusion equation, time series

    Received: 06 Mar 2024; Accepted: 16 May 2024.

    Copyright: © 2024 Sun, Chen, Xu, Xie, Blum and Venkitasubramaniam. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Yuesheng Xu, Lehigh University, Bethlehem, United States
    Rick S. Blum, Lehigh University, Bethlehem, United States
    Parv Venkitasubramaniam, Lehigh University, Bethlehem, United States

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.