Erratum: Integrative genomic analyses combined with molecular dynamics simulations reveal the impact of deleterious mutations of Bcl-2 gene on the apoptotic machinery and implications in carcinogenesis
- 1Molecular Bio-Computation and Drug Design Laboratory, School of Health Sciences, University of KwaZulu-Natal, Westville Campus, Durban, South Africa
- 2Department of Pharmaceutical Chemistry, College of Pharmacy, Karary University, Khartoum, Sudan
- 3School of Chemistry, Dalian University of Technology, Dalian, Liaoning, China
- 4Ezintsha, Faculty of Health Sciences, University of Witwatersrand, Johannesburg, South Africa
- 5Biotechnology and Food Science, Durban University of Technology, Durban, South Africa
- 6HIV Pathogenesis Programme, School of Laboratory Medicine and Medical Science, The Doris Duke Medical Research Institute, Nelson R. Mandela School of Medicine, University of KwaZulu-Natal, Durban, South Africa
- 7School of Laboratory Medicine and Medical Sciences, University of KwaZulu-Natal, Durban, South Africa
Objectives: Unlike other diseases, cancer is not just a genome disease but should broadly be viewed as a disease of the cellular machinery. Therefore, integrative multifaceted approaches are crucial to understanding the complex nature of cancer biology. Bcl-2 (B-cell lymphoma 2), encoded by the human BCL-2 gene, is an anti-apoptotic molecule that plays a key role in apoptosis and genetic variation of Bcl-2 proteins and is vital in disrupting the apoptotic machinery. Single nucleotide polymorphisms (SNPs) are considered viable diagnostic and therapeutic biomarkers for various cancers. Therefore, this study explores the association between SNPs in Bcl-2 and the structural, functional, protein-protein interactions (PPIs), drug binding and dynamic characteristics.
Methods: Comprehensive cross-validated bioinformatics tools and molecular dynamics (MD) simulations. Multiple sequence, genetic, structural and disease phenotype analyses were applied in this study.
Results: Analysis revealed that out of 130 mutations, approximately 8.5% of these mutations were classified as pathogenic. Furthermore, two particular variants, namely, Bcl-2G101V and Bcl-2F104L, were found to be the most deleterious across all analyses. Following 500 ns, MD simulations showed that these mutations caused a significant distortion in the protein conformational, protein-protein interactions (PPIs), and drug binding landscape compared to Bcl-2WT.
Conclusion: Despite being a predictive study, the findings presented in this report would offer a perspective insight for further experimental investigation, rational drug design, and cancer gene therapy.
1 Introduction
The complex nature of cancer biology imposes a major challenge in cancer research and, consequently, the development of effective treatment regimes. The World Health Organization (WHO) estimated that 10 million patients globally died from various forms of cancer in 2020 alone. Many clinical trials do not provide significant success despite significant advances in diagnosis and innovative therapy methods (Karimi et al., 2022). Even though targeted therapy has been a successful approach in treating cancer, heterogeneous cancer still has a variety of clinical profiles and molecular alterations. Certain genetic alterations in cancer targets can make drugs more effective or, more often, cause them to become resistant to treatment (Jin et al., 2019). Drug resistance caused by mutations is a common occurrence in cancer. Thus, the mutation profile of patient malignancies plays a major role in determining the effectiveness of targeted therapy. Accurate molecular and genetic profiling of tumour cells is becoming a crucial step before implementing targeted therapy in patients (Jin et al., 2019).
The oncoprotein B-cell lymphoma-2 (Bcl-2) family proteins control apoptosis and are implicated in various tumour progressions (Rosser et al., 2003; Goff et al., 2013). This gene was the first to promote prolonged cell survival and growth rather than boost proliferation, demonstrating the importance of inhibiting cell death in tumorigenesis (Cory and Adams, 2002). Bcl-2 inhibits cytochrome c (cyt-c) release from the mitochondria, preventing caspases involved in apoptosis from activating (Yin et al., 1994). Bcl-2 overexpression or aberrant expression has been associated with many cancers’ emergence, progression, and relapse (Delbridge et al., 2016; Kitada et al., 2002). Consequently, Bcl-2 activity and protein levels have emerged as essential measures for determining the success or failure of clinical treatment and predicting patient outcomes (Delbridge et al., 2016). The sensitivity of malignant tumor cells to apoptosis can be efficiently boosted by either lowering Bcl-2 protein levels or suppressing Bcl-2 function (Qian et al., 2022). Multidrug resistance (MDR) in cancer cells can be overcome by selectively inhibiting Bcl-2, resulting in cell cycle arrest, senescence, and eventual cell death in response to radiotherapy and chemotherapy (Tang et al., 2020; Wang et al., 2020). Therefore, inhibition of Bcl-2 inactivation has become a highly attractive strategy in the battle against cancer, and BH3 mimetics are the main category of promising therapeutic agents (Perini et al., 2018; Delbridge and Strasser, 2015). BH3 mimetics inhibit Bcl-2 activity by competing with its physiological ligands, BH3 domain-containing pro-apoptotic proteins, at the hydrophobic (binding) groove (Czabotar et al., 2014).
Despite the promising initial clinical effectiveness of BH3 mimetic agents in various cancers, the mutation is a common way cancer cells evade therapies (Roberts et al., 2016; Stilgenbauer et al., 2018). The most common mutation is a change from glycine to valine at amino acid position 101 (G101V), which substantially decreases Bcl-2 affinity towards the BH3 mimetics agent (Venetoclax) and prevents the drug from displacing pro-apoptotic mediators from Bcl-2 in the cells (Blombery et al., 2019; Blombery et al., 2020). Most human genetic variations are attributable to single nucleotide polymorphisms (SNPs) (Dakal et al., 2017). This genetic variation generated by SNPs in genetic codons influences the translation outcome, resulting in a mutant protein with a different structure and function. Nevertheless, not all SNPs impact protein function and structure; a few are harmful, but many are not (Kucukkal et al., 2015).
Bioinformatics offers enormous array of databases and techniques that are necessary for the analysis, integration, and interpretation of cancer multi-omics data (Jiménez-Santos et al., 2022). It is noteworthy that in silico techniques have recently emerged as valuable tool to assess the distinct genomic alterations and transcriptome profiles of tumors, as well as understanding the underlying mechanisms of cancer (Yalcin-Ozkat, 2021; Edelman et al., 2010; Elamin et al., 2024).
Herein, combined in silico, bioinformatic approaches and molecular dynamics simulations were employed to comprehensively analyze the genomic and proteomic changes in Bcl-2 (Figure 1) and their implications on carcinogenicity. In order to do cross-validation and ensure the validity of the generated data, we choose to employ a variety of bioinformatics algorithms for each type of analysis we carried out in this work. Several mutations have been analyzed for their potential impact on the genesis and progression of cancer, and their deleterious effects on Bcl-2 structure and function have been described. Subsequently, the most deleterious mutations, Bcl-2G101V and Bcl-2F104L, were selected for further dynamic analysis to probe their impact on the protein conformational landscape using molecular dynamics (MD) simulations and post-dynamic analyses.
We believe that the extensive and multifaceted analyses provided in this study will offer a thorough grasp of the effects of deleterious Bcl-2 gene mutations on the apoptotic machinery and their implications for carcinogenesis. This understanding will then inform future directions in drug design and the development of anti-cancer therapeutics.
2 Methods
2.1 Generation of the datasets
The Bcl-2 FASTA sequence was obtained from UniProt (UniProt ID: P10415) (https://www.uniprot.org/) (Bateman et al., 2017). The dbSNP (https://www.ncbi.nlm.nih.gov/snp/) and Ensembl (https://www.ensembl.org/) databases and an extensive literature search were used to compile the list of mutations (Sherry et al., 2001; Hubbard et al., 2002). Gene synonyms (Bcl-2, PPP1R50) (transcript ID: ENST00000333681.5) of the Bcl-2 protein were selected for this study. Duplicate variants and other redundant data were excluded from the analysis. High-resolution crystal structures of the Bcl-2 protein, both wild-type and mutated (G101V and F104L) (PDB ID:6O0K, 6O0L, and 6O0M), were obtained from the Protein Data Bank (https://www.rcsb.org/) (Birkinshaw et al., 2019).
2.2 Sequence-based analyses for point mutation
We utilised eight different bioinformatics tools to obtain a reliable cross-validated sequence-based analysis to determine the deleterious effects of residue mutations on the protein. These are, the Sorting Intolerant From Tolerant (SIFT) algorithm (https://sift.bii.a-star.edu.sg) which determines the deleterious effects of residue mutations on proteins (Kumar et al., 2009); Polymorphism Phenotyping 2 (PolyPhen-2) (http://genetics.bwh.harvard.edu/pph2/) (Adzhubei et al., 2013), which is tailored to the study of high-throughput Next-Generation Sequencing (NGS) data and features multiple sequence alignments and classifiers based on machine learning; Combined Annotation Dependent Depletion (CADD) (https://cadd.gs.washington.edu/) that is designed to estimate the deleterious effect of residue variation on protein sequences (Rentzsch et al., 2019); Rare Exome Variant Ensemble Learner (REVEL) (https://sites.google.com/site/revelgenomics/) (Ioannidis et al., 2016); MetaLR (https://sites.google.com/site/jpopgen/dbNSFP) which predicts the deleteriousness of missense variants using logistic regression, which incorporates nine independent variant deleteriousness scores and allele frequency information (Liu et al., 2016); Mutation Assessor (http://mutationassessor.org/r3/) uses the evolutionary conservation of the impacted residues in protein homologs to speculate on the functional consequences of residue changes in proteins (Reva et al., 2011); Functional Analysis Through Hidden Markov Models (FATHMM) which is a high-throughput web server capable of predicting the functional consequences of both coding variants, that is, non-synonymous single nucleotide variants (nsSNVs) and non-coding variants in the human genome (http://fathmm.biocompute.org.uk/); and Predict-SNP (https://loschmidt.chemi.muni.cz/predictsnp1/) (Bendl et al., 2014).
2.3 Structure-based analyses for point mutation
Various algorithms were employed to predict the effect of missense mutations on the protein stability. These include, mCSM (https://biosig.lab.uq.edu.au/mcsm/) which uses various residues atomic distance patterns to train the predictive models (Pires et al., 2014a); Site-directed mutator2 (SDM2) (http://marid.bioc.cam.ac.uk/sdm2) which can also estimate the relative stability of wild-type and mutated protein structures by comparing them to known homologous 3D structures; DUET (http://biosig.unimelb.edu.au/duet/) which uses Support Vector Machines (SVM) to produce a consensual estimate (Pires et al., 2014b); PremPS (https://lilab.jysw.suda.edu.cn/research/PremPS/) which estimates changes in the Gibbs free energy of protein unfolding to assess the impact of single mutations on protein stability (Chen et al., 2020); CUPSAT (http://cupsat.tu-bs.de/) (Parthiban et al., 2006); ENCoM (https://labworm.com/tool/encom) (Frappier et al., 2015); MutPred2 (http://mutpred.mutdb.org/) (Pejaver et al., 2020); and DynaMut (https://biosig.lab.uq.edu.au/dynamut/) which takes the changes in vibrational entropy into account (Rodrigues et al., 2018).
2.4 Disease phenotype prediction analysis
Several machine learning and neural network algorithms were employed for disease phenotype prediction. These include, PhD-SNP (https://bio.tools/phd-snp) which uses neural networks that have been trained on a large library of standard and pathogenic mutations (Capriotti and Fariselli, 2017); Protein ANalysis THrough Evolutionary Relationships (PANTHER) (http://www.pantherdb.org/) which is designed to estimate the likelihood of a particular non-synonymous (residue changing) coding SNP that causes a functional impact on the protein (Thomas et al., 2022); SNPs and GO (https://snps.biofold.org/snps-and-go/) is another a precise technique that uses the associated protein functional annotation to determine whether or not a variation is associated with a disease based on a protein sequence (Capriotti et al., 2013); PMut (http://mmb.irbbarcelona.org/PMut/) which identifies pathogenic protein variants with up to 80% predictive accuracy in humans (López-Ferrando et al., 2017); and Meta-SNP (https://snps.biofold.org/meta-snp/) which is a randomised forest-based classification algorithm that distinguishes between polymorphic non-synonymous SNVs and disease-related one.
2.5 Post-transcriptional modification (PTM) sites prediction
PTM site predictions comprised several rearranged residues that produced many proteins. Ubiquitination, phosphorylation, and methylation are some of the PTM sites that have been characterised. These sites are essential in vital cellular organising processes such as pathological signaling cascades and protein-protein interactions. Thus, PTM prediction assisted in elucidating whether genetic variants were associated with or contributed to disease pathogenesis. We used four tools for this purpose, namely,; NetPhos 3.1 (https://services.healthtech.dtu.dk/service.php?NetPhos-3.1); Group-based Prediction System (GPS) 6.0 (http://gps.biocuckoo.cn/) (Xue et al., 2005); BDM-PUB (http://bdmpub.biocuckoo.org/) which is for protein ubiquitination site prediction using the Bayesian Discriminant Method; and UbPred (http://www.ubpred.org/).
2.6 Gene-gene interaction network analysis
The gene function can be better understood by studying the genes with which it interacts. The GeneMANIA and STRING databases were used to investigate the relationship between the Bcl-2 gene and other genes and to predict the effect of Bcl-2 nsSNPs on other associated genes. GeneMANIA (https://genemania.org/) is a database for identifying genes related to input genes using an extensive set of functional association data (Warde-Farley et al., 2010). These association data included co-expression, colocalisation, pathways, protein domain similarity, and interactions between proteins and genes. GeneMANIA can identify novel pathway members or complex members, genes missed during the screening process, or genes that perform a specific function, such as protein kinases. STRING (https://string-db.org/) is a database of both experimentally verified and theoretically predicted interactions between proteins (Szklarczyk et al., 2021). These interactions occur through computational prediction, inter-organism information transmission, and aggregation of interactions from other (primary) databases, and they can be either direct (physical) or indirect (functional).
2.7 Effect of point mutation on the structural and functional integrity of the protein
The formation of a protein complex is critical in controlling many biological activities. Therefore, different algorithms were employed to investigate the effect of Bcl-2G101V and Bcl-2F104L structural and functional properties. mCSM-PPI2 (http://biosig.unimelb.edu.au/mcsm_ppi2/) was used to predict the effects of missense mutations on protein-protein affinity (Rodrigues et al., 2019). mCSM-PPI2 uses graph-based structural signatures to model the effects of variations on the inter-residue interaction network, evolutionary information, complex network metrics, and energy terms to generate an optimised predictor. ConSurf (https://consurf.tau.ac.il/) is another tool we employed to estimate the evolutionary conservation of residue positions in a protein molecule based on the phylogenetic relationships between homologous sequences (Ashkenazy et al., 2016). The degree to which the residue position is evolutionarily conserved strongly depends on its structural and functional importance. The ConSurf value varied from 1 to 9, with one denoting residues with the least conservation and nine denoting residues with the most conservation. Other tools such as FTSite (https://ftsite.bu.edu/) (Ngan et al., 2012), HOPE (https://www3.cmbi.umcn.nl/hope/) and Stride (http://webclu.bio.wzw.tum.de/stride/) (Heinig and Frishman, 2004), were also used to provide deeper insight on the structural and functional integrity of the protein upon mutation.
2.8 Molecular dynamics (MD) simulations
2.8.1 Systems preparation
The Protein Data Bank Repository (RCSB PDB) (https://www.rcsb.org/) provided a crystallized X-ray structure of the Bcl-2WT, Bcl-2G101V, and Bcl-2F104L with PDB entries of 6O0K, 6O0L, and 6O0M, respectively. The water molecules in the crystal structure were removed, and the missing hydrogen atoms were substituted for them, with the correct charges assigned at neutral pH. The Schrödinger suite’s Protein Preparation Wizard was employed for initial structure processing and energy minimization. To further reduce steric clashes between residues, we used the OPLS-2005 force field to minimize energy while setting the RMSD threshold to 0.30 for all structures (Shivakumar et al., 2012).
2.8.2 Molecular dynamics simulations and post-dynamic analysis
MD simulations were carried out using AMBER18 software and its Particle Mesh Ewald Molecular Dynamics (PMEMD) module (Case et al., 2024; Darden et al., 1993). Protein systems were modelled, and atomic charges were assigned state using the standard Amber (FF14SB) force field within the Amber package. An in-house pdb4amber script was used to modify, rename, and protonate (histidine) Bcl-2 (Maier et al., 2015). The LEAP module was employed to generate Bcl-2 parameters and topology files. This was also used for system neutralization. Molecular minimisation was carried out using a constraint potential of 500 kcal/mol, with partial minimisation for 2,500 steps and full minimization taking 5,000 steps. Furthermore, a gradual heating from 0 to 310 K was implemented in the system. The unconstrained equilibration was performed for 5 ns while the atmospheric pressure was maintained at 1 bar with the help of a Berendsen barostat (Berendsen et al., 1984). Subsequently, production stages were conducted over 500 ns to understand the structural consequences of the mutations on Bcl-2.
The enzyme coordinates of Bcl-2WT, Bcl-2G101V, and Bcl-2F104L were saved every 1 ps, and their resultant trajectories were analysed using the AMBER18 integrated CPPTRAJ module (Roe and Cheatham, 2013). Post-MD analyses included root-mean-square deviation (RMSD), root-mean-square fluctuations (RMSF), radius of gyration (Rg), solvent accessible surface area (SASA), intramolecular hydrogen bonding, and dynamic cross-correlation matrix (DCCM). Furthermore, principal component analysis (PCA) was calculated to unravel the protein’s atomic displacement extent. The generated data and subsequent complexes were visualized using Microcal Origin analytical software (www.originlab.com), NMWiz implemented in Visual Molecular Dynamics (VMD) (https://www.ks.uiuc.edu/Research/vmd/) (Seifert, 2014; Humphrey et al., 1996).
3 Results
The Bcl-2 SNP dataset was obtained from the dbSNP and Ensembl databases. Approximately 52,619 variations in Bcl-2 have been identified, with 49,593 SNPs located in the intronic region, 163 SNPs classified as missense variants, 1,401 SNPs located in the 3′UTR area, 832 SNPs located in the 5′UTR region, and 115 synonymous variants, as reported by dbSNP and Ensembl. Missense mutations in the coding region were the current target of this study. As a result of further filtering to remove duplicate variations, 130 variants were selected for further investigation.
3.1 Sequence-based analysis of point mutation
Eight tools, namely, SIFT, PolyPhen2, CADD, REVEL, MetaLR, Mutation Assessor, FATHMM, and Predict-SNP were used to conduct sequence-based prediction and analyze the potential effects of Bcl-2 mutations. These eight tools separated deleterious mutations from tolerated ones (Supplementary Table S1). Out of 130 variants, SIFT and PolyPhen2 estimated 45 (∼35%) to be deleterious while CADD, REVEL, Mutation Assessor, FATHMM, and Predict-SNP predicted 19 (∼15%), 6 (∼5%), 30 (∼23%), 26 (∼20%), and 38 (∼29%) mutations as deleterious, respectively. However, the MetaLR algorithm predicted that all 130 (100%) variants were tolerated (Figure 2).
Figure 2. Deleterious and tolerated variations in Bcl-2 predicted through sequence-based algorithms.
3.2 Structure-based analysis
Multiple computational algorithms, including mCSM, SDM2, DUET, PremPS, CUPSAT, ENCoM, MutPred-2, and DynaMut were used to provide structure-based predictions of the effect of mutations. These tools distinguished between destabilizing and stabilizing mutations (Supplementary Table S2). The analysis concluded that out of 130 mutations, mCSM: 120 (∼92%), SDM2: 85 (∼65%), DUET: 97 (∼75%), PremPS: 94 (∼72%), CUPSAT: 84 (∼65%), ENCoM: 60 (∼46%), MutPred: 2–27 (∼21%), and DynaMut: 61 (∼47%) mutations were estimated to be destabilizing the structure of the protein (Figure 3).
Figure 3. Destabilizing and stabilizing variations in Bcl-2 predicted through structure-based algorithms.
3.3 Disease phenotype analysis
The pathogenicity of the targeted mutations was assessed utilizing PhD-SNP, PANTHER, SNPs and GO, PMut, and Meta-SNP. These algorithms use their prediction values to determine whether a specific mutation is disease-causing or neutral. From the 130 mutations, PhD-SNP predicted 27 (∼21%) mutations to be pathogenic, while PANTHER, SNPs and GO, PMut, and Meta-SNP predicted 40 (∼31%), 20 (∼15%), 45 (∼35%), and 23 (∼18%) mutations associated with the disease, respectively (Figure 4). However, only 11 of these mutations were predicted to be disease-causing across all the prediction algorithms (R12G, V15L, H94P, L97P, R98L, R129P, G141E, V142G, N143S, M166T, and G193R) (Supplementary Table S3).
Figure 4. Disease and neutral variations in Bcl-2 predicted through disease phenotype prediction algorithms.
3.4 Post-transcriptional modification (PTM) sites prediction
GPS-MSP 6.0 was used for methylation and determined the number of Bcl-2 sites that would be modified. However, GPS-MSP 6.0 predicted that phosphorylation would occur at 35 residues [Ser:15 (43%), Thr:12 (34%), and Tyr:8 (23%)]. In contrast, it was predicted by Netphose 3.1 those 20 different residues could be phosphorylated [Ser:11 (55%), Thr:7 (35%), and Tyr:2 (10%)].
Ubiquitination was predicted using BDMPUB and UbPred. BDMPUB anticipated that two lysine residues would be ubiquitinated, whereas UbPred projected those four lysine residues would be ubiquitinated.
3.5 Gene interaction network
The interaction between Bcl-2 and other genes was evaluated using the GeneMANIA and STRING web servers. GeneMANIA analysis showed that Bcl-2 physically interacted with all ten genes and has no co-localization or genetic interaction with any other gene. However, Bcl-2 was co-expressed with BAX, BCL2L1, NLRP1, BBC3, and BID. Moreover, Bcl-2 shared protein domains with BCL2L1, BAX, BIK, and BID (Figure 5A).
The STRING database offers an integrated and comprehensive evaluation of indirect (functional) and direct (physical) protein-protein interactions. The network analysis revealed that Bcl-2 interacted directly with BECN1, BAX, TP53, BAD, BCL2L11, BIK, BBC3, BID, BCL2L1, and FKBP8 (Figure 5B).
3.6 Effect of mutations on the structural and functional integrity of Bcl-2
3.6.1 Estimation of impact of mutation on protein-protein interactions (PPIs)
The effect of mutations on the binding affinity of protein interactions was evaluated using mCSM-PPI2, which evaluates the effect of mutation by simulating the impact of variations on the network of non-covalent interactions between residues utilizing graph kernels, energetic terms, complex network metrics, and evolutionary data. The decreased binding affinity of protein-protein interaction was observed at the active site residues of the mCSM-PPI2-predicted Bcl-2 interaction, with a change in affinity (ΔΔGaffinity) of −0.559 kcal/mol for the G101V variant and −1.053 kcal/mol for the F104L variant. The interaction network revealed that the wild-type protein residue Gly101 established hydrogen bonds with Tyr18, Leu97, Arg98, Phe104, and Ser105, as well as van der Waals interactions with Gln99 and Glu152; however, in the mutant, Val101 established a hydrogen bond with Leu97, Arg98, Phe104, Ser105, and Glu152 (Figure 6). Likewise, the Phe104 in the wild-type generated hydrogen bonds with Ala100, Gly101, and Tyr108, and van der Waals interactions with Ala100, Asp102, Arg106, Tyr108, and Phe123, while in the mutant, leucine formed hydrogen bonds with the same residues in the wild-type (Figure 6).
Figure 6. G101 and F104 residue interactions network of Bcl-2; (A) wild G101, (A) G101V variant, (B) wild F104, and (B) F104L variant as predicted by mCSM-PPI2.
3.6.2 Conservation analysis of Bcl-2
The conservation of residues is the primary factor that ensures the structural integrity of proteins. The Bcl-2 structure’s conservation of residues was investigated using the ConSurf web server to comprehend its significance and localized evolution. The arrangement of residues and their degree of conservation was uncovered utilizing the ConSurf analysis. Several residues in Bcl-2 were shown to be relatively conserved using ConSurf, with particular emphasis on G101 and F104, suggesting that genetic variations at these positions might substantially impact Bcl-2 (Figure 7).
3.6.3 Mapping ligand binding sites of Bcl-2
The FT-site web server was used to identify Bcl-2 binding sites based on experimental evidence. The FT-site server depicted three ligand sites in Bcl-2. The ligand sites in Bcl-2 were represented by three different mesh-like structures on the FT-site server (pink, green, and purple), with corresponding residues that are within 5.0 Å of the binding site represented by ball and stick in these sites (Figure 8). The position of the F104 residue is detected in the first and second ligand-binding sites, while G101 is detected in the second ligand-binding site (Table 1). Consequently, mutations G101V and F104L may be more deleterious, as they potentially impact the Bcl-2 ligand-binding affinity.
Figure 8. FT-site server prediction of the Bcl-2 protein ligand binding sites represented in mesh-like structure: pink (binding site 1), green (binding site 2), and purple (binding site 3).
The HOPE project PDB viewer was used to visualize the structural features of the Bcl-2WT, Bcl-2G101V, and Bcl-2F104L (Figure 9). Each residue demonstrated a unique size, charge, and hydrophobicity. These values frequently varied between the original wild-type and the newly introduced mutant residues. For the Bcl-2G101V, the mutant residue was bigger and more hydrophobic than the Bcl-2WT residue. Although the mutated residue is not directly involved in ligand binding, it may indirectly affect ligand interactions made by other residues due to changes in local stability. The mutated residue is located within a special BH3 motif. Therefore, the different properties of residues caused the motif to become disrupted and consequently impair its function. Glycine had the highest degree of flexibility compared to other residues, which may be necessary for protein function. This function can be abolished by mutating this glycine. For Bcl-2F104L, the mutant residue was smaller than the Bcl-2WT residue. The Bcl-2WT residue interacted with Venetoclax, and the difference in properties between the Bcl-2WT and mutant can easily cause a loss of interactions with the ligand. Protein function was frequently dependent on ligand binding, and this mutation may impair this function. The mutated residue was located within a special BH3 motif near a highly conserved position. Consequently, the motif was disturbed owing to the different properties of the residues, which would impede its function.
Figure 9. Close-ups (different angles) of the mutant and wild system; (A) Bcl-2G101V and (B) Bcl-2F104L.
3.6.4 Investigating the effect of the mutations on the protein secondary structure
MD trajectories of 500 ns were used to investigate the dynamics of secondary structural elements in Bcl-2WT, Bcl-2G101V, and Bcl-2F104L. This study contributed to a better understanding of the effects of genetic variations on the Bcl-2’s secondary structure through simulations. The STRIDE web server was used to detect the change in secondary structure at 10, 100, 200, 300, 400, and 500 ns (Figure 10). The secondary structural components in Bcl-2, such as α-helix, 3–10 helix, and turns, were divided into specific residues at each time interval. The Bcl-2G101V and Bcl-2F104L were observed to switch from a helix to a turn configuration at these residues.
Figure 10. The secondary structural analysis of the Bcl-2WT, Bcl-2G101V, and Bcl-2F104L at 10, 100, 200, 300, 400, and 500 ns using the STRIDE web server.
3.7 Dynamic and conformational stability and fluctuations
The inherent behavior of a protein is associated with conformational changes and structural aberrations. Modifying a protein’s structure can significantly affect its function]. Therefore, understanding mutation-induced structural changes requires a more in-depth investigation of the conformational dynamics of proteins. For this reason, the effects of Bcl-2 mutations (G101V and F104L) were investigated over 500 ns MD simulations. The dynamics and stability of Bcl-2WT, Bcl-2G101V, and Bcl-2F104L were determined by evaluating the time variable considering the RMSD of Cα atoms from computed trajectories. All systems reached convergence after 100 ns of the simulation period (Figure 11A). The Bcl-2WT exhibited the lowest deviated RMSD value, 1.14 Å, while the Bcl-2G101V and Bcl-2F104L revealed higher RMSD values, 1.43 and 1.62 Å, respectively. The Bcl-2G101V disrupted the RMSD pattern of Bcl-2WT and caused it to fluctuate more than the Bcl-2WT and Bcl-2F104L during the simulation. The findings showed that Bcl-2WT and Bcl-2F104L displayed the least deviation of Cα atoms compared to Bcl-2G101V, indicating that the mutation of Gly to Val reduced the structural stability of Bcl-2. Furthermore, no significant variations in structural snaps were noticed, excluding the α3-α4 helices (hydrophobic groove) of superimposed Bcl-2WT, Bcl-2G101V, and Bcl-2F104L every 100 ns during the simulation (Supplementary Figure S1). Here, α3-α4 helices become more dynamic and flexible as the simulation progresses, thus inducing expansion or shrinking in the hydrophobic groove, which appears most effectively in the Bcl-2G101V.
Figure 11. (A) RMSD, (B) RMSF, (C) Rg, and (D) SASA values across Cα of Bcl-2WT (gray), Bcl-2F104L (orange), and Bcl-2G101V (green) over 500 ns MD simulations.
The relative rigidity and flexibility of residues determined protein conformational changes and their associated functions. Consequently, the RMSF values of Bcl-2WT, Bcl-2G101V, and Bcl-2F104L can be computed and analyzed to see how Bcl-2’s residual fluctuations change due to mutations (Figure 11B). Bcl-2WT demonstrated the least fluctuations of the residues with an average RMSF value of 1.10 Å when compared to 1.13 and 1.20 Å for the Bcl-2F104L and Bcl-2G101V, respectively. The calculated trajectory showed a slightly higher pattern of fluctuations, especially for the Bcl-2G101V variant. As a result of these mutations, the regions surrounding the various sites become more dynamic and internally disturbed, reflecting higher fluctuations in Bcl-2. The RMSF distribution correlated with the RMSD pattern, with mutated systems exhibiting more significant fluctuations. The substantial variations in the mutants’ residual fluctuations could be attributed to Bcl-2 structural inactivation.
Furthermore, the Rg values of all three systems were analyzed to determine the folding behavior and overall conformational changes in the Bcl-2 structure before and after mutation induction. The compactness, stability, and folding of a protein can be determined from the change in Rg values over time. The Rg values of the Bcl-2WT, Bcl-2G101V, and Bcl-2F104L were estimated from the MD trajectories and plotted (Figure 11C). Bcl-2WT had the lowest Rg value (14.54 Å, while the Bcl-2F104L and Bcl-2G101V showed slight increases at 14.59 and 14.63 Å, respectively. Altogether, Rg analysis of Bcl-2 revealed that the mutants were less stable, more flexible, and less compact than the native protein.
Moreover, the Bcl-2 structure’s hydrophilic and hydrophobic residues were analyzed using SASA. The SASA values for the Bcl-2WT, Bcl-2G101V, and Bcl-2F104L were obtained and plotted throughout the 500 ns of MD simulation (Figure 11D). Following exposing the system to the solvent, Bcl-2WT had a median SASA value of 7,824 Å2. The Bcl-2G101V exhibited a higher SASA value of 8,049 Å2 than that of the Bcl-2F104L, which displayed a value of 7,985 Å2. The SASA values of all three systems agreed with the Rg results. The differences in the SASA values for the three systems throughout the simulation reflect Bcl-2 unfolding and folding. The overall SASA values for Bcl-2WT and Bcl-2F104L were slightly different, suggesting that the structural mutation from Phenylalanine to Leucine at position 104 in Bcl-2 provides better exposure to solvent compared with Bcl-2G101V and, thus, favors the enhanced activity of the Bcl-2F104L over that of the Bcl-2G101V.
3.7.1 Hydrogen bonding analysis
Analysis of intramolecular hydrogen bonds primarily assists in evaluating the overall conformation and stability of the protein structure through MD simulations. Time-dependent intramolecular hydrogen bond analysis was performed and plotted to evaluate the effect of mutations on the structure of Bcl-2 (Figure 12). The average values of intramolecular hydrogen bonds in Bcl-2WT, Bcl-2G101V, and Bcl-2F104L ranged from about (43–100), (41–98), and (40–96), respectively, indicating a slight change before and after mutation formation. The Bcl-2F104L and Bcl-2WT models were more compact and stable than the Bcl-2G101V model, and the results maintained a roughly similar trajectory pattern.
Figure 12. Intramolecular hydrogen bonding in Bcl-2WT (gray), Bcl-2F104L (orange), and Bcl-2G101V (green) over 500 ns MD simulations.
3.7.2 Dynamic cross-correlation matrix (DCCM)
To examine the differences in the dynamics of Bcl-2WT, Bcl-2G101V, and Bcl-2F104L, DCCM plots were generated for anti-correlated and correlated protein structural motions. The residues’ motion values range from −1 to +1. Positive values indicate positively correlated motions (brown colour), whereas negative values indicate anticorrelated motions (black colour) between residues (Figure 13). The scatter plots revealed that motion modes between residues of Bcl-2F104L are similar to those of Bcl-2WT, whereas the Bcl-2G101V showed a slightly different pattern, mutation obviously enhances the positively correlated motions occurring in the Bcl-2.
3.7.3 Principal component analysis (PCA)
Intensive movements in Bcl-2WT, Bcl-2G101V, and Bcl-2F104L were evaluated using PC analysis with the first two eigenvectors (EVs) to qualitatively examine the influence of induced mutations on the major conformational movements of each residue (Kumalo et al., 2016). The eigenvectors illustrate the directions of Bcl-2 motion, and the eigenvalues represent the overall motion strength; these are obtained by diagonalizing the covariance matrix (Chen et al., 2021; Chen et al., 2022). The conformational changes of Bcl-2 and its variants were shown in a 2D scatter plot (Figure 14), indicating a significant change in Bcl-2 overall movements after acquiring the mutations, especially Bcl-2G101V. Moreover, Figure 14 shows that the Bcl-2G101V and Bcl-2F104L with the trace covariance matric of 12.46 and 22.46 Å2, respectively, imposed highly fluctuated anti-correlated effects as the negative values of 2D scatter point into the protein. In the case of Bcl-2WT, the trace covariance matrices were 24.09 Å2, indicating the presence of prominent correlated motions with minimal system fluctuations. Consequently, the findings demonstrated that the Bcl-2G101V caused substantial fluctuations in the simulated Bcl-2 dynamics.
Figure 14. PCA for Bcl-2WT (gray), Bcl-2F104L (orange), and Bcl-2G101V (green) over the 500ns MD simulations.
4 Discussions
4.1 Sequence, structure, phenotype-mutational analysis and gene interactions
To ascertain the deleterious effect of residue mutation on the protein, we employed various sequence-based point mutation algorithms. Out of 130 mutations, SIFT and PolyPhen2 algorithms displayed the highest estimation, deeming 45 mutations (∼35%) deleterious. With the exception of the MetaLR algorithm, which predicted that all 130 (100%) variants were tolerated, other algorithms displayed results ranging from around 5 to 23 percent (Figure 2). We hypothesize that the inclusion of machine learning and high-throughput Next-Generation Sequencing (NGS) data in the PolyPhen2 method broadened the search field, contributing to the high prediction rate. Similarly, various algorithms were adopted to predict the effect of missense mutations on the protein stability, and to distinguish between destabilizing and stabilizing mutations. Out of 130 mutations, 3 algorithms (ENCoM, MutPred, and DynaMut) assessed between 21% and 46% of mutations are destabilizing, while 4 predictive tools (mCSM, SDM2, DUET, PremPS, and CUPSAT) estimated between 65% and 92% of mutations are destabilizing (Figure 3). We believe that the analysis adopted here is robust and reliable as we opted to combine various algorithms that take into account critical structural features such as protein folding and Gibbs’s free energy (PremPS), site-directed mutations relative to wild type (SDM2), vibrational entropy (DynaMut) and consensual estimation (DUET). A number of machine learning and neural network techniques were used to predict disease phenotypes (Figure 4; Supplementary Table S3), yet only 11 mutations were shown to be disease-causing by all prediction algorithms. These mutations are R12G, V15L, H94P, L97P, R98L, R129P, G141E, V142G, N143S, M166T, and G193R. GeneMANIA and STRING database offer an integrated and comprehensive evaluation of indirect (functional) and direct (physical) protein-protein interactions. The network analysis revealed that Bcl-2, Bcl-2 shared protein domains with BCL2L1, BAX, BIK, and BID (Figure 5A), and interacted directly with BECN1, BAX, TP53, BAD, BCL2L11, BIK, BBC3, BID, BCL2L1, and FKBP8 (Figure 5).
4.2 Impact of mutations on protein-protein interactions
To explore the impact of Bcl-2G101V and Bcl-2F104L on their structural and functional characteristics, we utilized various techniques. The mCSM-PPI2 algorithm predicted a reduction in the binding affinity of protein-protein interaction, G101V variant change affinity (ΔΔGaffinity) with −0.559 kcal/mol, compared with −1.053 kcal/mol for F104L variant. According to the interaction network, Gly101in the wild-type protein, generated hydrogen bonds with Tyr18, Leu97, Arg98, Phe104, and Ser105, and exhibited van der Waals interactions with Gln99 and Glu152. However, in the mutant, Val101 established hydrogen bonds with Leu97, Arg98, Phe104, Ser105, and Glu152 (Figure 6). Furthermore, in the wild-type, Phe104 established hydrogen bonds with Ala100, Gly101, and Tyr108 and van der Waals interactions with Ala100, Asp102, Arg106, Tyr108, and Phe123. While in the mutant, leucine established hydrogen bonds with the same residues (Figure 6). The ConSurf web server was utilized to confirm the structural integrity of the Bcl-2 protein. Several residues in the Bcl-2 protein were shown to be relatively conserved, with a specific focus on G101 and F104., suggesting that genetic variations at these positions might substantially impact Bcl-2 (Figure 7). Additionally, FTSite, HOPE, and Stride were employed to gain further understanding of the structural and functional integrity of the Bcl-2 protein following mutation. The FT-site server depicted three ligand sites in Bcl-2 (Figure 8). According to Table 1, the first and second ligand-binding sites detect the position of the F104 residue, while the second ligand-binding site detects the G101 residue. Considering they may affect the Bcl-2 ligand-binding affinity, mutations G101V and F104L may thus be more deleterious. The Bcl-2WT, Bcl-2G101V, and Bcl-2F104L structural characteristics were visualized using the HOPE project PDB viewer (Figure 9). The Bcl-2G101V mutant residue exhibited a bigger size and greater hydrophobicity compared to the Bcl-2WT residue. The mutant residue of Bcl-2F104L was smaller than the residue of Bcl-2WT. Venetoclax was bound to the Bcl-2WT residue, and because the two amino acids had different characteristics, the mutant form of Bcl-2WT can readily lose its binding affinity for the ligand. Finally, the STRIDE web server was utilized to identify alterations in the secondary structure at specific time points: 10, 100, 200, 300, 400, and 500 ns (Figure 10). The conformational switch from a helix to a turn was seen in Bcl-2G101V and Bcl-2F104L at these residues.
4.3 Effect of mutations on the structural and dynamic landscape of the protein
We employed the MD simulations to conduct a comprehensive analysis of the conformational dynamics of proteins to understand the structural alterations caused by mutations. These mutations affected Bcl-2’s stability, flexibility, solvent-accessible surface area, and rigidity, as demonstrated by 500 ns MD simulations (Figure 11). Moreover, mutations impacted Bcl2’s hydrogen bond formation, and the Bcl-2F104L and Bcl-2WT models exhibited greater compactness and stability compared to the Bcl-2G101V model (Figure 12). To explore mutation-induced effect on conformational alterations of Bcl-2, DCCMs and PCA are estimated. The results showed that the Bcl-2G101V mutation clearly affects the positively correlated motions occurring in the Bcl-2 and causes substantial fluctuations in the simulated Bcl-2 dynamics (Figures 13, 14).
Overall, the findings of this study hold several biological significances, for instance having information on SNPs in the Bcl-2 gene would help identify potential biomarkers for cancer diagnosis and treatment. Furthermore, by examining the structural and functional effects of SNPs in Bcl-2, our finding may pinpoint novel targets for cancer therapy. Treatments that specifically target genetic variants or protein interactions linked to Bcl-2 SNPs may be able to return cancer cells to normal apoptotic pathways, which would ultimately result in their elimination. Information presented here on how SNPs in Bcl-2 influence protein-protein interactions can provide insights into the molecular mechanisms underlying cancer development and progression. Modulating apoptotic pathways through the disruption or enhancement of certain protein interactions linked to Bcl-2 SNPs may have therapeutic benefits.
5 Conclusion
The impact of SNPs on the structure and function of Bcl-2 was investigated using state-of-the-art bioinformatics approaches and molecular dynamics simulations. Disease phenotype analysis indicated that 11 mutations of Bcl-2 were found to be pathogenic. Furthermore, Bcl-2G101V and Bcl-2F104L variants were found to be the most deleterious and impacted negatively on the binding affinity of protein-protein interaction. These mutations have also altered van der Waals and hydrogen bonds interactions, conservation scores, ligand-binding affinity, residues size, residues charge, and residues hydrophobicity. As a result, a significant conformational deviation in the Bcl-2 structure and a slight change the secondary structure of Bcl-2 throughout the entire MD trajectory were observed. The findings from this study would serve useful for future in vitro and population genetics research and would pave the way for further rational drug design of anti-cancer therapy.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Ethics statement
Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.
Author contributions
GE: Formal Analysis, Investigation, Methodology, Writing–original draft. ZZ: Writing–review and editing. DD: Writing–review and editing. KK: Writing–review and editing, Software. JM: Writing–review and editing, Investigation, Resources. PM: Writing–review and editing. NM: Writing–review and editing, Supervision. MS: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was funded by the University of KwaZulu-Natal Research Office.
Acknowledgments
The authors thank the Centre for High-Performance Computing (www.chpc.ac.za), Cape Town, South Africa, for the computational resources.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1502152/full#supplementary-material
References
Adzhubei, I., Jordan, D. M., and Sunyaev, S. R. (2013). Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. 76 (1), 20. doi:10.1002/0471142905.hg0720s76
Ashkenazy, H., Abadi, S., Martz, E., Chay, O., Mayrose, I., Pupko, T., et al. (2016). ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 44 (W1), W344–W350. doi:10.1093/nar/gkw408
Bateman, A., Martin, M. J., O’Donovan, C., et al. (2017). UniProt: the universal protein knowledgebase. Nucleic Acids Res. 45 (D1), D158–D169. doi:10.1093/NAR/GKW1099
Bendl, J., Stourac, J., Salanda, O., Pavelka, A., Wieben, E. D., Zendulka, J., et al. (2014). PredictSNP: robust and accurate consensus classifier for prediction of disease-related mutations. PLoS Comput. Biol. 10 (1), e1003440. doi:10.1371/journal.pcbi.1003440
Berendsen, H. J. C., Postma, J. P. M., van Gunsteren, W. F., DiNola, A., and Haak, J. R. (1984). Molecular dynamics with coupling to an external bath. J. Chem. Phys. 81 (8), 3684–3690. doi:10.1063/1.448118
Birkinshaw, R. W., Gong, J. nan, Luo, C. S., Lio, D., White, C. A., Anderson, M. A., et al. (2019). Structures of BCL-2 in complex with venetoclax reveal the molecular basis of resistance mutations. Nat. Commun. 10 (1), 2385. doi:10.1038/s41467-019-10363-1
Blombery, P., Anderson, M. A., Gong, J. N., Thijssen, R., Birkinshaw, R. W., Thompson, E. R., et al. (2019). Acquisition of the recurrent Gly101Val mutation in BCL2 confers resistance to venetoclax in patients with progressive chronic lymphocytic leukemia. Cancer Discov. 9 (3), 342–353. doi:10.1158/2159-8290.CD-18-1119
Blombery, P., Thompson, E. R., Nguyen, T., Birkinshaw, R. W., Gong, J. N., Chen, X., et al. (2020). Multiple BCL2 mutations cooccurring with Gly101Val emerge in chronic lymphocytic leukemia progression on venetoclax. Blood 135 (10), 773–777. doi:10.1182/BLOOD.2019004205
Capriotti, E., Calabrese, R., Fariselli, P., Martelli, P., Altman, R. B., and Casadio, R. (2013). WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation. BMC Genomics 14 (Suppl. 3), S6. doi:10.1186/1471-2164-14-S3-S6
Capriotti, E., and Fariselli, P. (2017). PhD-SNPg: a webserver and lightweight tool for scoring single nucleotide variants. Nucleic Acids Res. 45 (W1), W247–W252. doi:10.1093/NAR/GKX369
Case, D. A., Walker, R. C., Cheatham, T. E., Case, D. A., Walker, R. C., et al. (2024). Amber 2018. Univ. Calif. San. Fr. 2018.
Chen, J., Zeng, Q., Wang, W., Sun, H., and Hu, G. (2022). Decoding the identification mechanism of an SAM-III riboswitch on ligands through multiple independent Gaussian-accelerated molecular dynamics simulations. J. Chem. Inf. Model. 62 (23), 6118–6132. doi:10.1021/acs.jcim.2c00961
Chen, J., Zhang, S., Wang, W., Pang, L., Zhang, Q., and Liu, X. (2021). Mutation-induced impacts on the switch transformations of the GDP- and GTP-bound K-ras: insights from multiple replica Gaussian accelerated molecular dynamics and free energy analysis. J. Chem. Inf. Model. 61 (4), 1954–1969. doi:10.1021/acs.jcim.0c01470
Chen, Y., Lu, H., Zhang, N., Zhu, Z., Wang, S., and Li, M. (2020). PremPS: predicting the impact of missense mutations on protein stability. PLoS Comput. Biol. 16 (12), e1008543. doi:10.1371/journal.pcbi.1008543
Cory, S., and Adams, J. M. (2002). The Bcl2 family: regulators of the cellular life-or-death switch. Nat. Rev. Cancer 2 (9), 647–656. doi:10.1038/nrc883
Czabotar, P. E., Lessene, G., Strasser, A., and Adams, J. M. (2014). Control of apoptosis by the BCL-2 protein family: implications for physiology and therapy. Nat. Rev. Mol. Cell. Biol. 15 (1), 49–63. doi:10.1038/NRM3722
Dakal, T. C., Kala, D., Dhiman, G., Yadav, V., Krokhotin, A., and Dokholyan, N. V. (2017). Predicting the functional consequences of non-synonymous single nucleotide polymorphisms in IL8 gene. Sci. Rep. 7 (1), 6525–6618. doi:10.1038/s41598-017-06575-4
Darden, T., York, D., and Pedersen, L. (1993). Particle mesh Ewald: an N ⋅log(N) method for Ewald sums in large systems. J. Chem. Phys. 98 (12), 10089–10092. doi:10.1063/1.464397
Delbridge, A. R. D., and Strasser, A. (2015). The BCL-2 protein family, BH3-mimetics and cancer therapy. Cell. Death Differ. 22 (7), 1071–1080. doi:10.1038/cdd.2015.50
Delbridge, ARDD, Grabow, S., Strasser, A., and Vaux, D. L. (2016). Thirty years of BCL-2: translating cell death discoveries into novel cancer therapies. Nat. Rev. Cancer 16 (2), 99–109. doi:10.1038/nrc.2015.17
Edelman, L. B., Eddy, J. A., and Price, N. D. (2010). In silico models of cancer. WIREs Syst. Biol. Med. 2 (4), 438–459. doi:10.1002/wsbm.75
Elamin, G., Aljoundi, A. E. S., and Soliman, M. (2024). From biological activity to stereoselectivity: a portrait of molecular and mechanistic profiles of the therapeutic potential of G-1 and LNS8801 as GPER-1 activator in the treatment of waldenström’s macroglobulinemia. Innov. Discov. 1, 7. doi:10.53964/id.2024007
Frappier, V., Chartier, M., and Najmanovich, R. J. (2015). ENCoM server: exploring protein conformational space and the effect of mutations on protein function and stability. Nucleic Acids Res. 43 (W1), W395–W400. doi:10.1093/nar/gkv343
Goff, D. J., Recart, A. C., Sadarangani, A., Chun, H. J., Barrett, C. L., Krajewska, M., et al. (2013). A pan-BCL2 inhibitor renders bone-marrow-resident human leukemia stem cells sensitive to tyrosine kinase inhibition. Cell. Stem Cell. 12 (3), 316–328. doi:10.1016/j.stem.2012.12.011
Heinig, M., and Frishman, D. (2004). STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 32 (Web Server), W500–W502. doi:10.1093/nar/gkh429
Hubbard, T., Barker, D., Birney, E., Cameron, G., Chen, Y., Clark, L., et al. (2002). The Ensembl genome database project. Nucleic Acids Res. 30 (1), 38–41. doi:10.1093/NAR/30.1.38
Humphrey, W., Dalke, A., and Schulten, K. (1996). VMD: Visual molecular dynamics. J. Mol. Graph 14 (1), 33–28. doi:10.1016/0263-7855(96)00018-5
Ioannidis, N. M., Rothstein, J. H., Pejaver, V., Middha, S., McDonnell, S. K., Baheti, S., et al. (2016). REVEL: an Ensemble method for predicting the pathogenicity of Rare missense variants. Am. J. Hum. Genet. 99 (4), 877–885. doi:10.1016/j.ajhg.2016.08.016
Jiménez-Santos, M. J., García-Martín, S., Fustero-Torre, C., Di Domenico, T., Gómez-López, G., and Al-Shahrour, F. (2022). Bioinformatics roadmap for therapy selection in cancer genomics. Mol. Oncol. 16 (21), 3881–3908. doi:10.1002/1878-0261.13286
Jin, J., Wu, X., Yin, J., Li, M., Shen, J., Li, J., et al. (2019). Identification of genetic mutations in cancer: challenge and opportunity in the new era of targeted therapy. Front. Oncol. 9, 263. doi:10.3389/fonc.2019.00263
Karimi, M. R., Karimi, A. H., Abolmaali, S., Sadeghi, M., and Schmitz, U. (2022). Prospects and challenges of cancer systems medicine: from genes to disease networks. Brief. Bioinform 23 (1), bbab343. doi:10.1093/bib/bbab343
Kitada, S., Pedersen, I. M., Schimmer, A. D., and Reed, J. C. (2002). Dysregulation of apoptosis genes in hematopoietic malignancies. Oncogene 21 (21), 3459–3474. doi:10.1038/sj.onc.1205327
Kucukkal, T. G., Petukh, M., Li, L., and Alexov, E. (2015). Structural and physico-chemical effects of disease and non-disease nsSNPs on proteins. Curr. Opin. Struct. Biol. 32, 18–24. doi:10.1016/j.sbi.2015.01.003
Kumalo, H. M., Bhakat, S., and Soliman, M. E. (2016). Investigation of flap flexibility of β-secretase using molecular dynamic simulations. J. Biomol. Struct. Dyn. 34 (5), 1008–1019. doi:10.1080/07391102.2015.1064831
Kumar, P., Henikoff, S., and Ng, P. C. (2009). Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4 (7), 1073–1081. doi:10.1038/NPROT.2009.86
Liu, X., Wu, C., Li, C., and Boerwinkle, E. (2016). dbNSFP v3.0: a one-stop database of functional predictions and annotations for human nonsynonymous and splice-site SNVs. Hum. Mutat. 37 (3), 235–241. doi:10.1002/humu.22932
López-Ferrando, V., Gazzo, A., de la Cruz, X., Orozco, M., and Gelpí, J. L. (2017). PMut: a web-based tool for the annotation of pathological variants on proteins, 2017 update. Nucleic Acids Res. 45 (W1), W222–W228. doi:10.1093/nar/gkx313
Maier, J. A., Martinez, C., Kasavajhala, K., Wickstrom, L., Hauser, K. E., and Simmerling, C. (2015). ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 11 (8), 3696–3713. doi:10.1021/acs.jctc.5b00255
Ngan, C. H., Hall, D. R., Zerbe, B., Grove, L. E., Kozakov, D., and Vajda, S. (2012). FTSite: high accuracy detection of ligand binding sites on unbound protein structures. Bioinformatics 28 (2), 286–287. doi:10.1093/bioinformatics/btr651
Parthiban, V., Gromiha, M. M., and Schomburg, D. (2006). CUPSAT: prediction of protein stability upon point mutations. Nucleic Acids Res. 34 (Web Server), W239–W242. doi:10.1093/nar/gkl190
Pejaver, V., Urresti, J., Lugo-Martinez, J., Pagel, K. A., Lin, G. N., Nam, H. J., et al. (2020). Inferring the molecular and phenotypic impact of amino acid variants with MutPred2. Nat. Commun. 11 (1), 5918–6013. doi:10.1038/s41467-020-19669-x
Perini, G. F., Ribeiro, G. N., Pinto Neto, J. V., Campos, L. T., and Hamerschlak, N. (2018). BCL-2 as therapeutic target for hematological malignancies. J. Hematol. Oncol. 11 (1), 65. doi:10.1186/s13045-018-0608-2
Pires, D. E. V., Ascher, D. B., and Blundell, T. L. (2014a). mCSM: predicting the effects of mutations in proteins using graph-based signatures. Bioinformatics 30 (3), 335–342. doi:10.1093/bioinformatics/btt691
Pires, D. E. V., Ascher, D. B., and Blundell, T. L. (2014b). DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach. Nucleic Acids Res. 42 (W1), W314–W319. doi:10.1093/nar/gku411
Qian, S., Wei, Z., Yang, W., Huang, J., Yang, Y., and Wang, J. (2022). The role of BCL-2 family proteins in regulating apoptosis and cancer therapy. Front. Oncol. 12, 985363. doi:10.3389/fonc.2022.985363
Rentzsch, P., Witten, D., Cooper, G. M., Shendure, J., and Kircher, M. (2019). CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 47 (D1), D886–D894. doi:10.1093/nar/gky1016
Reva, B., Antipin, Y., and Sander, C. (2011). Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 39 (17), e118. doi:10.1093/nar/gkr407
Roberts, A. W., Davids, M. S., Pagel, J. M., Kahl, B. S., Puvvada, S. D., Gerecitano, J. F., et al. (2016). Targeting BCL2 with venetoclax in relapsed chronic lymphocytic leukemia. N. Engl. J. Med. 374 (4), 311–322. doi:10.1056/NEJMoa1513257
Rodrigues, C. H., Pires, D. E., and Ascher, D. B. (2018). DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability. Nucleic Acids Res. 46 (W1), W350–W355. doi:10.1093/nar/gky300
Rodrigues, C. H. M., Myung, Y., Pires, D. E. V., and Ascher, D. B. (2019). mCSM-PPI2: predicting the effects of mutations on protein–protein interactions. Nucleic Acids Res. 47 (W1), W338–W344. doi:10.1093/nar/gkz383
Roe, D. R., and Cheatham, T. E. (2013). PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9 (7), 3084–3095. doi:10.1021/ct400341p
Rosser, C. J., Reyes, A. O., Vakar-Lopez, F., Levy, L. B., Kuban, D. A., Hoover, D. C., et al. (2003). Bcl-2 is significantly overexpressed in localized radio-recurrent prostate carcinoma, compared with localized radio-naive prostate carcinoma. Int. J. Radiat. Oncology*Biology*Physics 56 (1), 1–6. doi:10.1016/S0360-3016(02)04468-1
Seifert, E. (2014). OriginPro 9.1: scientific data analysis and graphing software-software review. J. Chem. Inf. Model. 54 (5), 1552. doi:10.1021/ci500161d
Sherry, S. T., Ward, M. H., Kholodov, M., Baker, J., Phan, L., Smigielski, E. M., et al. (2001). dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29 (1), 308–311. doi:10.1093/NAR/29.1.308
Shivakumar, D., Harder, E., Damm, W., Friesner, R. A., and Sherman, W. (2012). Improving the prediction of absolute solvation free energies using the next generation OPLS force field. J. Chem. Theory Comput. 8 (8), 2553–2558. doi:10.1021/ct300203w
Stilgenbauer, S., Eichhorst, B., Schetelig, J., Hillmen, P., Seymour, J. F., Coutre, S., et al. (2018). Venetoclax for patients with chronic lymphocytic leukemia with 17p deletion: results from the full population of a phase II pivotal trial. J. Clin. Oncol. 36 (19), 1973–1980. doi:10.1200/JCO.2017.76.6840
Szklarczyk, D., Gable, A. L., Nastou, K. C., Lyon, D., Kirsch, R., Pyysalo, S., et al. (2021). The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49 (D1), D605–D612. doi:10.1093/nar/gkaa1074
Tang, X., Wuest, M., Benesch, M. G. K., Dufour, J., Zhao, Y., Curtis, J. M., et al. (2020). Inhibition of autotaxin with GLPG1690 increases the efficacy of radiotherapy and chemotherapy in a mouse model of breast cancer. Mol. Cancer Ther. 19 (1), 63–74. doi:10.1158/1535-7163.MCT-19-0386
Thomas, P. D., Ebert, D., Muruganujan, A., Mushayahama, T., Albou, L. P., and Mi, H. (2022). PANTHER: making genome-scale phylogenetics accessible to all. Protein Sci. 31 (1), 8–22. doi:10.1002/pro.4218
Wang, J. Q., Li, J. Y., Teng, Q. X., Lei, Z. N., Ji, N., Cui, Q., et al. (2020). Venetoclax, a BCL-2 inhibitor, enhances the efficacy of chemotherapeutic agents in wild-type ABCG2-overexpression-mediated MDR cancer cells. Cancers (Basel) 12 (2), 466. doi:10.3390/CANCERS12020466
Warde-Farley, D., Donaldson, S. L., Comes, O., Zuberi, K., Badrawi, R., Chao, P., et al. (2010). The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, W214–W220. doi:10.1093/NAR/GKQ537
Xue, Y., Zhou, F., Zhu, M., Ahmed, K., Chen, G., and Yao, X. (2005). GPS: a comprehensive www server for phosphorylation sites prediction. Nucleic Acids Res. 33 (Web Server), W184–W187. doi:10.1093/nar/gki393
Yalcin-Ozkat, G. (2021). Molecular modeling strategies of cancer multidrug resistance. Drug Resist. Updat. 59, 100789. doi:10.1016/j.drup.2021.100789
Keywords: Bcl-2, nsSNPs, mutations, genomic analyses, molecular dynamics simulations
Citation: Elamin G, Zhang Z, Dwarka D, Kasumbwe K, Mellem J, Mkhwanazi NP, Madlala P and Soliman MES (2025) Integrative genomic analyses combined with molecular dynamics simulations reveal the impact of deleterious mutations of Bcl-2 gene on the apoptotic machinery and implications in carcinogenesis. Front. Genet. 15:1502152. doi: 10.3389/fgene.2024.1502152
Received: 26 September 2024; Accepted: 11 December 2024;
Published: 07 January 2025.
Edited by:
Vladimir F. Niculescu, Other, GermanyReviewed by:
Dharmendra Kumar Yadav, Gachon University, Republic of KoreaOlga V. Anatskaya, Russian Academy of Sciences (RAS), Russia
Jiann-Ruey Hong, National Cheng Kung University, Taiwan
Copyright © 2025 Elamin, Zhang, Dwarka, Kasumbwe, Mellem, Mkhwanazi, Madlala and Soliman. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mahmoud E. S. Soliman, c29saW1hbkB1a3puLmFjLnph