The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Genet.
Sec. Computational Genomics
Volume 15 - 2024 |
doi: 10.3389/fgene.2024.1436947
This article is part of the Research Topic Advancements in AI for the Analysis and Interpretation of Large-scale Data by Omics Techniques View all articles
AnchorFCI: Harnessing Genetic Anchors for Enhanced Causal Discovery of Cardiometabolic Disease Pathways
Provisionally accepted- 1 Institute of Medical Informatics, Faculty of Medicine, University of Münster, Münster, North Rhine-Westphalia, Germany
- 2 Department of Statistics, Center for Exact Sciences and Technology, Federal University of São Carlos, São Carlos, São Paulo, Brazil
- 3 Department of Nutrition, Faculty of Public Health, University of São Paulo, São Paulo, São Paulo, Brazil
- 4 Schoool of Arts, Sciences and Humanities, University of Sao Paulo, São Paulo, São Paulo, Brazil
Cardiometabolic diseases, a leading global health concern, arise from a complex interplay of lifestyle choices, genetic predispositions, and biochemical markers. Although extensive research has uncovered strong associations among various risk factors and these diseases, grasping their causal relationships is vital for gaining deeper mechanistic insights and designing effective prevention and intervention strategies. We address this gap by introducing anchorFCI, a novel adaptation of the conservative Really Fast Causal Inference (RFCI) algorithm designed to enhance the discovery of causal relationships by strategically selecting and integrating reliable anchor variables from an additional set known not to be caused by the variables of interest. This approach is particularly well-suited for learning causal networks involving phenotypic, clinical, and sociodemographic factors, leveraging genetic variables recognized as not being influenced by these factors. By integrating these anchor variables along with knowledge of their non-ancestral relationships, anchorFCI effectively handles latent confounding while enhancing both robustness and discovery power. We demonstrate its effectiveness using from the 2015 ISA-Nutrition study in S ão Paulo, Brazil, and further estimate the effect sizes of the uncovered causal relationships with state-of-the-art tools from Judea Pearl's framework, presenting a fully data-driven causal inference pipeline. The results not only support many established causal relationships but also elucidate their interconnections within a complex network, enhancing our understanding of the broader dynamics and the multifaceted nature of cardiometabolic risk.
Keywords: causal discovery, Explainability, RFCI, genetic anchors, Unfaithfulness, partial ancestral graphs, causal effect identification, Cardiometabolic risk factors
Received: 22 May 2024; Accepted: 20 Nov 2024.
Copyright: © 2024 Ribeiro, Crnkovic, Pereira, Fisberg, Sarti, ROGERO, Heider and Cerqueira. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Milena Crnkovic, Department of Statistics, Center for Exact Sciences and Technology, Federal University of São Carlos, São Carlos, 13565-905, São Paulo, Brazil
Dominik Heider, Institute of Medical Informatics, Faculty of Medicine, University of Münster, Münster, 48149, North Rhine-Westphalia, Germany
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.