Skip to main content

EDITORIAL article

Front. Mol. Biosci., 15 January 2024
Sec. Protein Biochemistry for Basic and Applied Sciences
This article is part of the Research Topic Computational and Experimental Protein Variant Interpretation in the Era of Precision Medicine View all 10 articles

Editorial: Computational and experimental protein variant interpretation in the era of precision medicine

  • 1Department of Medical Sciences, Computational Biomedicine Unit, University of Torino, Torino, Italy
  • 2Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
  • 3Department of Physics, University of Roma Tor Vergata, Roma, Italy
  • 4Istituto Nazionale di Fisica Nucleare, University of Roma Tor Vergata, Roma, Italy
  • 5Department of Biochemical Sciences “A. Rossi Fanelli”, Sapienza University of Roma, Roma, Italy
  • 6Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, United States
  • 7Department of Plant and Microbial Biology and Center for Computational Biology, University of California, Berkeley, Berkeley, CA, United States
  • 8Department of Neurosciences, Biomedicine and Movement Sciences, Section of Biological Chemistry, University of Verona, Verona, Italy

Most traits of the human phenotype depend on the combination of various genetic factors with environmental influences, and a major challenge is the understanding of the relationship among genetic and phenotype variations (Casadio et al., 2011). In the last years, both advancements in human genome sequencing technologies and the creation of databases collecting information on human variations at the gene and protein levels have hugely enhanced the investigations on the role of these variations in determining health and disease (Austin-Tse et al., 2022). At the same time, the increasing amount of data generated by these resources are requiring new accurate and reliable computer-aided tools to predict phenotype–genotype associations (Brandes et al., 2023; Cheng et al., 2023).

Efficient and powerful analytical methods are necessary for the discovery of unknown etiologies, which is important for rare diseases (Greene et al., 2023). Licata et al. highlighted the most relevant online resources and computational tools for single-nucleotide variant interpretation that can enhance the diagnosis, clinical management, and development of treatments for rare disorders.

A large number of computational methods have been developed for the identification of potentially pathogenic missense mutations. An example provided in this Research Topic is PON-All, a machine learning tool that exploits features of evolutionary conservation, changes in physicochemical properties of amino acids, and biological function annotations from Gene Ontology. The novelty, introduced by Yang et al. with this tool, was to improve the variant interpretation through the inclusion of non-human variants in the learning process, achieving high accuracy on a blind test set.

A complementary approach to understand the effects of missense mutations is by computational predictors of stability. Protein stability perturbations have already been associated with pathogenic missense variants, and they have been shown to significantly contribute to the loss of function in haploinsufficient genes (Birolo et al., 2021). The effects of these variants on protein stability can be measured as the difference in the free energy change of unfolding (ΔΔG) between the mutated protein and its wild-type form. Predicting protein stability changes upon genetic variations is still an open challenge (Rollo et al., 2023). Current tools, which can either require the knowledge of the protein tertiary structure or rely on protein sequences only (Pancotti et al., 2021), are less accurate in predicting stabilizing variations than destabilizing ones (Pancotti et al., 2022). Benevenuta et al. investigated possible reasons for such a difference by focusing on the relationship between experimentally measured ΔΔG and some protein properties (protein structural information, different physical properties, and statistical potentials). The results highlighted both the need to design predictive methods able to exploit input features highly correlated with the stabilizing variants and the importance of evaluating these tools on stabilizing, neutral, and destabilizing variants separately. Since this classification is associated with the sign of the protein melting temperature variation, Nobili et al. proposed a full atomistic protein description able to improve the estimation of free energy by modeling its change as a function of the number of hydrogen bonds computed using well-tempered metadynamics and maximal constrained entropy. The authors found a good agreement in the sign of representative values of ΔΔG upon unfolding and the sign of the shift in the melting temperature compared to experimental results.

Comparing experimental characterization and computational predictions, Pacheco-Garcia et al. investigated naturally occurring variants of NAD(P)H:quinone oxidoreductase 1 (NQO1), a multifactorial protein associated with an increased risk of developing cancer and neurological disorders. The authors used computational tools to probe 5,187 variants, and the effects of the clinically relevant missense NQO1 variants were then experimentally characterized in terms of protein levels during bacterial expression, solubility, thermal stability, and coenzyme binding.

Disease-causing variants are supposed to directly affect experimentally measurable features, such as protein function and stability, and the kinetics and thermodynamics of protein–protein recognition, interaction, and binding. Morante et al. used circular dichroism, fluorescence spectroscopy, and melting temperature measurements to investigate key structural aspects of the interaction between wild-type frataxin and some of its variants found in cancer tissues upon Co2+ binding, highlighting the peculiar role of the N-terminal disordered tail in modulating the protein ability to interact with the metal. Dal Cortivo et al. provided a comprehensive biophysical investigation of calmodulin (CaM) by assessing structural, thermodynamic, and kinetic properties of protein–peptide interactions, involving two protein variants associated with congenital arrhythmia (N97I and Q135P) and a highly conserved CaM-binding region in ryanodine receptors RyR1 and RyR2. Specifically, the integration of spectroscopic investigation with molecular dynamics (MD) simulations and protein structure network analysis showed that these disease-associated CaM mutations alter CaM selectivity for the specific RyR channel.

The impact of MD simulations in molecular biology and drug discovery has expanded dramatically in recent years (Hollingsworth and Dror, 2018) since they capture the behavior of proteins and other biomolecules in full atomic detail and at a very fine temporal resolution. Shinwari et al. applied MD to characterize the structural and functional impacts of high-risk non-synonymous single-nucleotide polymorphisms on the TCIRG1 protein, causing congenital neutropenia and osteopetrosis. The analysis identified 15 variants that are likely to be highly deleterious, significantly destabilizing the wild-type protein structure and function. Hashimi et al. systematically investigated the dynamic properties involved in the double-stranded DNA (dsDNA) recognition by the EBNA1 protein, a key nuclear antigen of Epstein–Barr virus (EBV). Stability, flexibility, structural compactness, hydrogen bonding frequency, and binding affinity were altered by disrupting the native protein–DNA contacts, thereby decreasing the binding affinity. Their results revealed hotspot residues (arginine substitutions R521A and R522A), which are likely to become crucial in designing structure-based drugs against EBV infections.

In conclusion, this Research Topic has provided an overview of the current progress in both computational and experimental research and of their interplay in the annotation and interpretation of protein variants to detect pathogenic variations, analyzing their effects at the molecular level. These studies may help to predict the risk of developing specific diseases, the susceptibility to environmental factors, and the personal response to specific drugs.

Author contributions

TS: Writing–original draft, Writing–review and editing. PT: Writing–review and editing. SM: Writing–review and editing. VC: Writing–review and editing. AL: Writing–review and editing. CB: Writing–review and editing. DDO: Writing–review and editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work has received financial support from the Italian Ministry of University and Research (PRIN 2017, 201744NR8S).

Acknowledgments

TS thanks the EU projects GenoMed4All (H2020 RIA n.101017549) and BRAINTEASER (H2020 RIA n.101017598).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Austin-Tse, C. A., Jobanputra, V., Perry, D. L., Bick, D., Taft, R. J., Venner, E., et al. (2022). Best practices for the interpretation and reporting of clinical whole genome sequencing. npj Genomic Med. 7, 27–13. doi:10.1038/s41525-022-00295-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Birolo, G., Benevenuta, S., Fariselli, P., Capriotti, E., Giorgio, E., and Sanavia, T. (2021). Protein stability perturbation contributes to the loss of function in haploinsufficient genes. Front. Mol. Biosci. 8, 620793. doi:10.3389/fmolb.2021.620793

PubMed Abstract | CrossRef Full Text | Google Scholar

Brandes, N., Goldman, G., Wang, C. H., Ye, C. J., and Ntranos, V. (2023). Genome-wide prediction of disease variant effects with a deep protein language model. Nat. Genet. 55, 1512–1522. doi:10.1038/s41588-023-01465-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Casadio, R., Vassura, M., Tiwari, S., Fariselli, P., and Luigi Martelli, P. (2011). Correlating disease-related mutations to their effect on protein stability: a large-scale analysis of the human proteome. Hum. Mutat. 32, 1161–1170. doi:10.1002/humu.21555

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, J., Novati, G., Pan, J., Bycroft, C., Žemgulytė, A., Applebaum, T., et al. (2023). Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science 381. eadg7492. doi:10.1126/science.adg7492

CrossRef Full Text | Google Scholar

Greene, D., Pirri, D., Frudd, K., Sackey, E., Al-Owain, M., Giese, A. P. J., et al. (2023). Genetic association analysis of 77,539 genomes reveals rare disease etiologies. Nat. Med. 29, 679–688. doi:10.1038/s41591-023-02211-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Hollingsworth, S. A., and Dror, R. O. (2018). Molecular dynamics simulation for all. Neuron 99, 1129–1143. doi:10.1016/j.neuron.2018.08.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Pancotti, C., Benevenuta, S., Birolo, G., Alberini, V., Repetto, V., Sanavia, T., et al. (2022). Predicting protein stability changes upon single-point mutation: a thorough comparison of the available tools on a new dataset. Brief. Bioinform. 23, bbab555. bbab555. doi:10.1093/bib/bbab555

PubMed Abstract | CrossRef Full Text | Google Scholar

Pancotti, C., Benevenuta, S., Repetto, V., Birolo, G., Capriotti, E., Sanavia, T., et al. (2021). A deep-learning sequence-based method to predict protein stability changes upon genetic variations. Genes 12, 911. doi:10.3390/genes12060911

PubMed Abstract | CrossRef Full Text | Google Scholar

Rollo, C., Pancotti, C., Birolo, G., Rossi, I., Sanavia, T., and Fariselli, P. (2023). Influence of model structures on predictors of protein stability changes from single-point mutations. Genes 14, 2228. doi:10.3390/genes14122228

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: single-point mutation, protein stability, genotype-phenotype correlation, molecular dynamics simulation, kinetics, pathogenicity, variant interpretation, machine learning

Citation: Sanavia T, Turina P, Morante S, Consalvi V, Lesk AM, Bakolitsa C and Dell'Orco D (2024) Editorial: Computational and experimental protein variant interpretation in the era of precision medicine. Front. Mol. Biosci. 11:1363813. doi: 10.3389/fmolb.2024.1363813

Received: 31 December 2023; Accepted: 04 January 2024;
Published: 15 January 2024.

Edited and reviewed by:

Andrea Mozzarelli, University of Parma, Italy

Copyright © 2024 Sanavia, Turina, Morante, Consalvi, Lesk, Bakolitsa and Dell'Orco. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tiziana Sanavia, tiziana.sanavia@unito.it

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.