Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 26 February 2024
Sec. Genomics of Plants and the Phytoecosystem
This article is part of the Research Topic Genomics Assisted Improvement Of Crop Plants For Adaptation To Marginal Environments View all 9 articles

Comparative genomics and bioinformatics approaches revealed the role of CC-NBS-LRR genes under multiple stresses in passion fruit

  • 1Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, Pakistan
  • 2College of Horticulture, Shanxi Agricultural University, Taigu, Shanxi, China
  • 3Department of Pharmacology and Toxicology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia

Passion fruit is widely cultivated in tropical, subtropical regions of the world. The attack of bacterial and fungal diseases, and environmental factors heavily affect the yield and productivity of the passion fruit. The CC-NBS-LRR (CNL) gene family being a subclass of R-genes protects the plant against the attack of pathogens and plays a major role in effector-triggered immunity (ETI). However, no information is available regarding this gene family in passion fruit. To address the underlying problem a total of 25 and 21 CNL genes have been identified in the genome of purple (Passiflora edulis Sims.) and yellow (Passiflora edulis f. flavicarpa) passion fruit respectively. Phylogenetic tree was divided into four groups with PeCNLs present in 3 groups only. Gene structure analysis revealed that number of exons ranged from 1 to 9 with 1 being most common. Most of the PeCNL genes were clustered at the chromosome 3 and underwent strong purifying selection, expanded through segmental (17 gene pairs) and tandem duplications (17 gene pairs). PeCNL genes contained cis-elements involved in plant growth, hormones, and stress response. Transcriptome data indicated that PeCNL3, PeCNL13, and PeCNL14 were found to be differentially expressed under Cucumber mosaic virus and cold stress. Three genes were validated to be multi-stress responsive by applying Random Forest model of machine learning. To comprehend the biological functions of PeCNL proteins, their 3D structure and gene ontology (GO) enrichment analysis were done. Our research analyzed the CNL gene family in passion fruit to understand stress regulation and improve resilience. This study lays the groundwork for future investigations aimed at enhancing the genetic composition of passion fruit to ensure robust growth and productivity in challenging environments.

1 Introduction

Fresh fruits are consumed all over the world as they are rich sources of vitamins and help boost the immune system to fight against diseases. Passion fruit (P. edulis) is also widely cultivated in countries across the globe due to its nutritional benefits and used in the production of juice, oil, jelly, etc. P. edulis belongs to the Passifloraceae family and is available in a variety of botanical forms including yellow passion fruit (P. edulis f. flavicarpa), water lemon (Passiflora laurifolia), purple passion fruit (P. edulis Sims.), fragrant granadilla (Passiflora alata), and others (Passiflora et al., 2021; Correia et al., 2022; Fonseca et al., 2022). A recent study involving comparative analysis of P. edulis Sims. and P. edulis f. flavicarpa demonstrated that the purple cultivar is more resistant to the pathogens than the yellow cultivar which highlights the importance of the purple cultivar (Rizwan et al., 2021). Apart from the uses of P. edulis in the food industry, it can also be useful for disease prevention due to the presence of antioxidants and phytochemicals in it. A well-known example in this regard is Passiflora incarnata, a plant with a well-established history in traditional herbal medicine, which has been utilized for its potential medicinal properties in alleviating hypertension, anxiety, and insomnia (Miroddi et al., 2013). Producers of passion fruit include Brazil, Asia, South Africa, and South America. The overall production of P. edulis gets reduced due to a variety of diseases including bacteriosis, anthracnose, fusarium wilt, and fruit woodiness which cause loss to the P. edulis producers (Joy P.P. and Sherin C.G, 1983; Xu et al., 2022).

To better interpret the defense mechanism of P. edulis towards these diseases there is a need to identify disease resistance genes in this fruit. Two defense mechanisms are utilized by plants when they undergo pathogen stress including immunity activated by pathogen-associated molecular pattern (PTI) and effector-triggered immunity (ETI) (Delplace et al., 2022). The PTI involves the recognition of pathogens by specific pathogen recognition receptors (PRRs) at the cell membrane thereby inducing immunity in plants. However, the pathogens can release effectors as a contradictory effect to PTI thus leading to the activation of the ETI that protects plants by resisting the invasion of pathogens. When the former defense mechanism is unable to protect the plant from pathogen invasion then in later stages of plant immune response the effector-triggered immunity becomes active that is where the whole NBS-LRR (NLR) gene family has a crucial function i.e., CC-NBS-LRR (CNL) and Toll interleukin-NBS-LRR (TNL) act as sensors to the pathogenic effectors thereby initiating signaling mechanism where RPW8-NBS-LRR (RNL) function in assisting the plant resistance towards pathogen as depicted in (Figure 1) (Kaur et al., 2022).

FIGURE 1
www.frontiersin.org

FIGURE 1. (A) Two major varieties of P. edulis and the associated diseases caused by the attack of pathogens. (B) Overview of how P. edulis responds to the attack of pathogens.

The NLR gene family represents the most extensive group of R genes responsible for providing disease resistance in plants. This gene family is characterized by the presence of nucleotide binding site (NBS) and leucine-rich repeat (LRR) domains. This gene family has been classified into two main subfamilies including CC-NBS-LRR (CNL) and TIR-NBS-LRR (TNL) by the presence of coiled-coil and toll interleukin receptor domains at the protein’s N terminal region (Bezerra-Neto et al., 2019). Passion fruit holds significant economic, agricultural, industrial, and ornamental value. Owing to its multifaceted importance, addressing the challenges posed by pathogenic attacks and environmental stress becomes imperative to ensure sustained passion fruit yield and mitigate global reductions in fruit productivity (Chavarría-Perez et al., 2020; Rizwan et al., 2021). The modern era holds promise to improve the breeding strategies of plants by employing artificial intelligence and machine learning-based approaches to facilitate multi-omics data analysis eventually moving into the era of precision agriculture (Zhang et al., 2022). Once the R genes are identified in the genome of passion fruit, it will become easier to develop plants with improved resistance to pathogens and environmental stresses, eventually leading to increased productivity and yield (Gururani et al., 2012).

The CNL subclass has been previously reported in several plants including 51 members in Arabidopsis thaliana (Meyers et al., 2003), 159 members in Oryza sativa (Zhou et al., 2004), 119 members in Populus trichocarpa (Kohler et al., 2008), 33 members in Cucumis sativus (Zhang et al., 2022), 40 members in Brassica rapa (Y. Liu et al., 2021), 361 in Solanum tuberosum (Jupe et al., 2012), 78 in Solanum pimpinellifolium (Wei et al., 2020), 14 in Lagenaria siceraria, 146 in Triticum urartu (Qian et al., 2021), 166 in Discorea rotundata, 103 in Glycine max (Afzal et al., 2022), 467 in Hordeum vulgare (Liu et al., 2017), 54 in Broussonetia papyrifera (X. Zhang et al., 2023), 95 in Elaeis guineensis (Rosli et al., 2018), 10 in Citrus sinensis (Yin et al., 2023), 47 in Alphonso, 27 in Hong Xiang Ya, and 36 in Tommy atkins (ul Qamar et al., 2023). Machine learning approaches have also been applied in studies reported previously for the elucidation of candidate genes implicated in multi-stress responsiveness in Oryza sativa and Sorghum bicolor (Woldesemayat et al., 2018; Ramkumar et al., 2022).

The identification of passion fruit CNLs sheds light on their role in plant defense mechanisms against environmental stresses. This study provides a comprehensive structural evaluation, encompassing gene structure, motif analysis, phylogenetics, chromosomal distribution, cis-elements, gene enrichment, and 3D structure prediction. Additionally, it investigates the differential expression of these genes under disease and cold conditions, identifying multi-stress-responsive genes. The involvement of these CNLs in multi-stress responsiveness is further validated using a machine learning classifier algorithm. This research significantly contributes to our understanding of the CNL gene family in passion fruit, highlighting their importance in conferring resistance against various environmental stresses. The insights gained from this study will be invaluable for future researchers in the field.

2 Methods

2.1 Identification and physiochemical characterization of CNL genes in P. edulis Sims. and P. edulis f. flavicarpa

The 51 CNL protein sequences of A. thaliana obtained from the Ensembl Plants database (https://plants.ensembl.org/index.html) were used as query sequences against P. edulis Sims. (https://ftp.cngb.org/pub/CNSA/data3/CNP0001287/CNS0275691/CNA0017758/). proteome database by utilizing the standalone version of BLASTp. The same 51 CNL protein sequences of A. thaliana were queried against the P. edulis f. flavicarpa proteome database by performing the BLASTp search at the Passion fruit genomic database (http://passionfruit.com.cn/). After manual verification, all the duplicates were removed and a list of all unique IDs was further processed. Subsequently, this list underwent further processing to check the presence of specific CNL domains, namely, the coiled-coil (CC) domain, NB-ARC domain, and LRR domain, utilizing the Pfam (https://pfam-legacy.xfam.org) (Mistry et al., 2021) CDD (https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) (Lu et al., 2020), HMMER (https://www.ebi.ac.uk/Tools/hmmer/search/hmmscan) (Potter et al., 2018), and Interpro (https://www.ebi.ac.uk/interpro/search/sequence/) (Hunter et al., 2009) databases for domain identification. The presence of the coiled-coil domain was validated through Paircoil2 (http://cb.csail.mit.edu/cb/paircoil2/paircoil2.html) (McDonnell et al., 2006). The IDs were selected for further analysis that contained CNL-specific domains.

To gain a more profound understanding of the properties of the identified PeCNL proteins, their length (aa), molecular weight (MW), isoelectric point (pI), aliphatic index (AI), and grand average of hydropathicity (GRAVY) values were calculated using EXPASY ProtParam tool (https://web.expasy.org/protparam/) (Artimo et al., 2012). The Plant-mPLoc web server (http://www.csbio.sjtu.edu.cn/cgi-bin/PlantmPLoc.cgi) (Chou and Shen, 2010) has been utilized to identify the subcellular location of PeCNL proteins.

2.2 Multiple sequence alignment and phylogenetic analysis

The multiple sequence alignment of the underlying protein sequences of P. edulis Sims. (25), A. thaliana (51), M. domestica (21), C. sativus (33), and B. oleracea (33) that belonged to CNL subclass were submitted to ClustalW at MEGA 7.0 software to identify the highly conserved amino acid residues (Kumar et al., 2016). To infer the evolutionary relationships of CNL proteins of P. edulis with other plants the aligned sequences were subjected to construct a phylogenetic tree based on the Neighbor-Joining (NJ) method with 1000 bootstrap using PAUP4 software (Wilgenbusch and Swofford, 2003), and iTOL V6 was utilized for the editing of the phylogenetic tree (https://itol.embl.de) (Letunic and Bork, 2021).

2.3 Conserved motifs and gene structures

The complete and accurate representation of genetic structures of identified PeCNL genes will be demonstrated by utilizing the CDS and gene sequences of P. edulis Sims. The CDS and gene sequences were retrieved from the CNSA resource. The retrieved sequences were submitted to the Gene Structure Display Server 2.0 (GSDS; https://gsds.gao-lab.org) web server (Hu et al., 2015) for visualizing the gene structures. The prediction of highly conserved motifs associated with the proper functioning of the PeCNL proteins, the protein sequences were submitted to MEME suite 5.4.1 (https://meme-suite.org/meme/tools/meme) (Bailey et al., 2009), with the maximum number of motifs set to 10 and the other parameters were set to the default.

2.4 Analysis of gene location, gene duplication, and cis-regulatory elements (CREs)

To check the tendency of how well the CNL genes tend to cluster together at the respective chromosomes, genes were mapped to their respective positions at chromosomes. The information related to chromosome number and position of each PeCNL gene was acquired by using the annotation file (.gff3) of P. edulis Sims. downloaded from the CNSA database (https://ftp.cngb.org/pub/CNSA/data3/CNP0001287/CNS0275691/CNA0017758/) (Guo et al., 2020). To visualize the distribution patterns of PeCNL genes at chromosomes TBtools software v1.116 (Chen et al., 2020) has been utilized. Also to get insights into the duplication type and its impact on the evolution gene duplication analysis has been conducted. Among all the identified PeCNL genes, the sequences that shared the sequence identity of ≥70% were considered to be duplicates. DnaSP v6 software (Librado and Rozas, 2009) was used to calculate the rate of both synonymous (Ks) and non-synonymous (Ka) substitutions. The Ka/Ks ratio was used to demonstrate the selection pressure that aided in the evolution of the CNL gene family in P. edulis Sims. Duplication time was calculated based on the formula: T = Ks/2x (where x = 6.56 × 10−9 for dicots) (Zameer et al., 2022; Zia et al., 2022; Sadaqat et al., 2023).

To decipher the transcription factors and their associated functions in the identified genes promoter regions were analyzed to find the appropriate cis-element present in each gene. The 1000bp upstream promoter sequences of the identified PeCNL genes were retrieved via TBtools v1.116 and submitted to the PlantCare database (https://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (Lescot et al., 2002) to find the potential cis-regulatory elements.

2.5 Protein-protein interaction (PPI) and miRNA target prediction

To check the interaction of PeCNL proteins with the interactions reported earlier in other plants the identified PeCNL protein sequences were subjected to the STRING database (https://string-db.org) (Szklarczyk et al., 2021). miRNAs were predicted to be able to particularly control the expression patterns of CNL genes after experimental validation because to silence or increase the gene expression the corresponding miRNA can be targeted in each case. A list of predicted miRNAs for P. edulis was downloaded from the already reported study (Paul et al., 2020) and the psRNATarget web server (https://www.zhaolab.org/psRNATarget/) (Dai et al., 2018) was utilized to determine how these miRNAs were regulating the expression of the target PeCNL genes. The miRNA target gene network and PPI network were visualized by Cytoscape v3.8.2 (Paul Shannon et al., 1971).

2.6 Expression profiling of PeCNL genes under multiple stresses

To elucidate the expression patterns of P. edulis under the influence of Cucumber Mosaic Virus (CMV) infection and cold stress conditions, RNA-seq data were retrieved from the NCBI-SRA database under BioProject: PRJNA633743 and PRJNA634206 respectively. The genome (.fa) and annotation files (.gff3) of P. edulis Sims. were retrieved from the CNSA database (Guo et al., 2020). Cleaned paired-end reads were aligned to the reference genome by using a fast and sensitive alignment tool HISAT2 (Wen, 2017). To quantify the expression of PeCNL genes, Featurecounts (Liao et al., 2014) were used. Based on count values circular heatmaps were generated to visually represent the differential expression patterns of genes through chiplot (https://www.chiplot.online). The process of analyzing expression patterns of each PeCNL gene will be increasingly helpful in identifying PeCNL genes that are differentially expressed in various stresses.

2.7 Validation of PeCNLs under multiple stresses via machine learning

To explore the potential impact of PeCNL genes under multiple stresses machine learning approaches have been applied to the cold and CMV stress dataset of P. edulis. Cleaned reads were subjected to HISAT2 for aligning reads to the reference genome. To obtain the counts dataset of PeCNL genes under both stress conditions Featurecounts was utilized. Then DESeq2 (Michael et al., 2023) was applied to analyze the differentially expressed genes and to normalize the read counts. A Random Forest classifier (Chaudhary et al., 2016) was trained over counts data under CMV conditions. Then a threshold of logFC >0.05 and Padj. Val <0.05 was specified for upregulated genes and logFC < −0.05 and Padj.val <0.05 was selected for downregulated genes to identify common genes in each case. The Common genes were then used to test the model performance in terms of accuracy, sensitivity, and specificity towards predicting the multi-stress responsive genes.

2.8 3D structure prediction and gene ontology (GO) enrichment analysis of PeCNL proteins

To get detailed information regarding the structural conformation of the multi-stress related proteins their 3d structures have been predicted. Analyzing the impact of expression patterns of PeCNL genes in CMV-infected condition and cold only those proteins were selected for 3D structure prediction that were responsible for multi-stress responsiveness. Protein sequences of PeCNL3, PeCNL13, and PeCNL14 were submitted to the trRosseta web server (https://yanglab.nankai.edu.cn/trRosetta/) (Du et al., 2021). For validation of the predicted structures of selected PeCNL proteins, the SAVES server (https://saves.mbi.ucla.edu) was utilized to select model with the most favorable structure conformation and stability. To visualize the predicted 3D structures, PyMOL software was used (Yuan et al., 2017). To comprehend the biological function of the PeCNLs, the GO analysis was done by using the Pannzer2 database (http://ekhidna2.biocenter.helsinki.fi/sanspanz/) (Törönen et al., 2018). The GO has been classified into three categories: Biological Processes (BP), Cellular Components (CCs), and Molecular Functions (MF).

3 Results

3.1 Identification and physiochemical characterization of CNL genes in P. edulis Sims and P. edulis f. flavicarpa

The presence of CNL-specific domains resulted in the successful identification of 25 PeCNL genes in Passiflora edulis Sims and 21 PeCNL genes in Passiflora edulis f. flavicarpa. The identified CNL genes in P. edulis Sims have been named according to the order in which they are present at chromosomes. The specific information regarding the properties of PeCNL proteins is given in (Supplementary Table S1). The conserved domains found in these proteins include Rx_N, NB-ARC, LRR_8, and RPW8. All the PeCNL proteins were predicted to have a CC domain. Most of the proteins contained Rx_N, NB-ARC, and LRR_8 domains. While the RPW8 domain was present only in PeCNL3. All the predicted domains were involved in disease resistance in P. edulis and other plants as mentioned in previous studies (Figure 2).

FIGURE 2
www.frontiersin.org

FIGURE 2. Sankey plot representing the variability in number of members identified in the NLR gene family and the number of members in the CNL gene family across different plants.

The length of PeCNL proteins ranges from 741 to 1541 aa, while their molecular weight (MW) ranges from 84156.4 to 175592 (Da). The majority of the PeCNL proteins were acidic and only a few were basic according to the isoelectric points i.e., 5.12 to 9.09. All of the identified PeCNL proteins were unstable because the instability index was found to be greater than 40. The GRAVY value was negative for 24 PeCNL proteins suggesting that these were hydrophilic while only PeCNL12 had a GRAVY value positive meaning that it was hydrophobic (Figure 3). The proteins that are present outside the cell membrane or at the cell surface are always hydrophobic while the proteins that are present inside the cell are hydrophilic.

FIGURE 3
www.frontiersin.org

FIGURE 3. Visual representation of PeCNL proteins in Passiflora edulis Sims. calculated by Expasy Protparam server. (A) Length of PeCNL proteins (B) Molecular weight of PeCNL proteins, (C) Isoelectric point of PeCNLs, (D) Instability index of PeCNL proteins, (E) Aliphatic index of PeCNL proteins, (F) GRAVY value for PeCNL proteins.

Most of the PeCNL proteins were predicted to be localized in the cytoplasm while some of the proteins were localized in the cell membrane (Supplementary Table S2). The conserved domains were the same in both cultivars while the properties of PeCNL proteins were variable in both cultivars and are given as follows. Characteristics for the yellow cultivar proteins were as follows. The length of PeCNL proteins ranged from 604 to 1478 (aa). PeCNL proteins have their molecular weights in the range from 90673 to 168831 (Da). Only 4 PeCNL proteins were basic and all of the remaining proteins were acidic, while all the proteins were unstable and hydrophilic (Figure 4). PeCNL proteins were predicted to be present in the cell membrane and cytoplasmic sections inside the cell.

FIGURE 4
www.frontiersin.org

FIGURE 4. Graphical representation of physical and chemical properties of PeCNL proteins in Passiflora edulis f. flavicarpa calculated by Expasy Protparam. (A) Length of PeCNL proteins (B) Molecular weight of PeCNL proteins, (C) Isoelectric point of PeCNLs, (D) Instability index of PeCNL proteins, (E) Aliphatic index of PeCNL proteins, (F) GRAVY value for PeCNL proteins.

3.2 Multiple sequence alignment and phylogenetic analysis

To analyze the evolutionary relationships of PeCNL proteins with other plants, the aligned protein sequences of A. thaliana, C. sativus, B. oleracea, P. edulis, and M. domestica were subjected to phylogenetic analysis. The resultant tree was divided into four groups namely, A to D. All of the PeCNL proteins were present in groups A to C while none of the PeCNL protein was present in group D (Figure 5). Group A contained 23 members (3 PeCNLs, 4 CsCNLs, 1 BoCNL, 5 AtCNLs, and 10 MdCNLs), group B contained 60 members (4 PeCNLs, 7 CsCNLs, 21 BoCNLs, 23 AtCNLs, and 5 MdCNLs), group C contained 60 members (18 PeCNLs, 22 CsCNLs, 6 BoCNLs, 8 AtCNLs, and 6 MdCNLs) and group D contained least number of members i.e., 20 members (5 BoCNLs and 15 AtCNLs). The distribution of members in each group was consistent with those in AtCNLs, CsCNLs, and BoCNLs indicating that similar evolution patterns were shared by other plants. A monophyletic clade was formed for all plants present in group B indicating that all members of a monophyletic clade share a common evolutionary history and are more closely related to each other than they are to any other group of organisms. Due to the presence of the same conserved domains in other plants, it can be inferred that they could be involved in similar functions in each plant given the mode of evolution was different i.e., some of the members could be the products of speciation event giving rise to orthologs while others could be the products of duplication event i.e., paralogs but each of them shared close homology. Based on the phylogenetic tree it can be inferred that P. edulis shared close evolutionary relationships with M. domestica suggesting that they share a common ancestor. Besides, the AtCNL members were close orthologs of PeCNL members.

FIGURE 5
www.frontiersin.org

FIGURE 5. A phylogenetic tree encompassing CNL proteins from diverse plant species, including A. thaliana, P. edulis Sims., C. sativus, B. oleracea, and M. domestica was constructed. The tree was constructed using PAUP4 software relying on Neighbor-Joining (NJ) method, with 1000 bootstraps replicates. Each distinct group in the phylogenetic tree is represented by different colors.

3.3 Conserved motifs and gene structures

The exon-intron patterns were roughly the same for nearly all genes as PeCNL21 contained 9 exons and 8 introns and PeCNL1, PeCNL3, and PeCNL9 had 5 exons and 4 introns. The number of exons in the remaining genes varied from 1 to 4, with 1 being the most prevalent among them. No intron was present in PeCNL genes with only one exon, and for the others, the number of introns ranged from 1 to 3 (Figures 6A, B).

FIGURE 6
www.frontiersin.org

FIGURE 6. (A) For visualization of phylogenetic tree of PeCNL proteins iTOL was utilized. (B) Gene structure of PeCNL genes constructed by GSDS2.0., and (C) Conserved motifs in PeCNL proteins that have been predicted by using MEME suite 5.4.1.

A total of 10 conserved motifs that were predicted to be present in PeCNL proteins including motif 1 that represented CNBS-1 and RNBS-D motifs, motif 3 represented the P-loop, motif 5 represented the RNBS-B motif, motif 6 represented the GLPL motif, and motif 9 represented the kinase-2 motif (Figure 6C). The conserved motifs associated with proper functioning of CNL proteins have been conserved in all three subgroups except PeCNL8, PeCNL15, PeCNL16, PeCNL18, and PeCNL20 that lack motif 9 i.e., kinase-2 and other conserved motifs responsible for unknown functions.

Motifs 1, 3, 5, 6, and 9 represent the motifs particularly responsible are crucial for the structural confirmation and functioning of CNL proteins. These motifs are conserved across all three sub-groups, except for motif 9, which is absent in some proteins, such as PeCNL8, PeCNL15, PeCNL16, PeCNL18, and PeCNL20 of group 3, possibly due to diversity in conservation patterns. Motifs often play crucial roles in protein folding, stability, and interactions with other molecules. The absence of motif 4 in Subgroup 3 may suggest indicate a distinct functional specialization or structural variation within this subgroup (Figure 6C).

3.4 Analysis of gene location, gene duplication, and cis-regulatory elements (CREs)

PeCNL genes followed uneven distribution patterns at 7 chromosomes. Among the 25 PeCNL genes identified none of the genes was present at chromosomes 6 and 7. Chromosome 5 and Chromosome 9 contained only one gene each namely, PeCNL21 and PeCNL25 and chromosome 3 had 7 genes present in the form of a cluster PeCNL8, PeCNL9, PeCNL10, PeCNL11, PeCNL12, PeCNL13, and PeCNL14 (Figure 7). Besides, there were different numbers of genes present on each chromosome including 5 at chromosome 1, 2 at chromosome 2, 6 at chromosome 4, and 3 at chromosome 8.

FIGURE 7
www.frontiersin.org

FIGURE 7. Distribution of 25 PeCNL genes at chromosomes based on their respective location. The vertical bar at left represents the size of chromosomes in Megabases. Tandem duplicates are indicated by red lines and segmental duplicates are indicated by dark blue colored lines. Different colors are used to represent the groups to which each gene belongs in the phylogenetic tree.

A total of 34 duplicated gene pairs have been identified as a result of gene duplication analysis (Supplementary Table S3). 17 gene pairs have been segmentally duplicated including PeCNL8/PeCNL15, PeCNL8/PeCNL16, PeCNL8/PeCNL18, PeCNL8/PeCNL19, PeCNL8/PeCNL20, PeCNL12/PeCNL17, PeCNL13/PeCNL15, PeCNL13/PeCNL16, PeCNL13/PeCNL17, PeCNL13/PeCNL19, PeCNL13/PeCNL20, PeCNL14/PeCNL15, PeCNL14/PeCNL16, PeCNL14/PeCNL17, PeCNL14/PeCNL18, PeCNL14/PeCNL19, PeCNL14/PeCNL20 while 17 gene pairs have been tandemly duplicated including PeCNL6/PeCNL7, PeCNL8/PeCNL13, PeCNL8/PeCNL14, PeCNL9/PeCNL10, PeCNL9/PeCNL11, PeCNL10/PeCNL11, PeCNL13/PeCNL14, PeCNL15/PeCNL16, PeCNL15/PeCNL18, PeCNL15/PeCNL19, PeCNL15/PeCNL20, PeCNL16/PeCNL18, PeCNL16/PeCNL19, PeCNL16/PeCNL20, PeCNL18/PeCNL19, PeCNL18/PeCNL20, PeCNL19/PeCNL20. It was noteworthy that among 34 duplicated gene pairs 33 gene pairs emerged as a product of purifying selection and only 1 gene pair PeCNL8/PeCNL18 was the product of positive selection as the Ka/Ks ratio was <1 for the former duplicated gene pairs and >1 for the later gene pair. The divergence time taken by genes to duplicate has ranged from 0.77 MYA to 51.09 MYA. Thus, segmental and tandem duplications have collectively contributed to the evolutionary dynamics of the PeCNL genes in P. edulis Sims.

Three different types of cis-regulatory elements were found in the promoter regions of PeCNL genes namely hormone-related, growth-related, and defense and stress-related. Hormone-related cis-elements belong to 10 different categories, growth-related cis-elements belong to 28 different types, and defense and stress-related cis-elements belong to 6 different types. Hormone-responsive cis-elements entailed following names AuxRR-core, TATC-box, ABRE, TGA-element, GARE-motif, P-box, CGTCA-motif, TGACG-motif, TCA-element, and TGA-box were involved in auxin, abscisic acid, methyl jasmonate, salicylic acid and gibberellin responsiveness. It is believed that hormone related cis-elements are responsible for pathogen induced immune response by mediating multiple signaling pathways. Hormone-related cis-elements, such as salicylic acid, jasmonic acid, and ethylene, along with other cis-elements like AS-1, G-box, GCC-box, and H-box, contribute to pathogen-induced immune responses in various plant species, enhancing resistance to pathogen attacks through signal transduction pathways activation. Growth and development-related cis-elements include light responsive, meristem expression, circadian control, endosperm expression, seed-specific regulation, zein metabolism regulation, differentiation of palisade mesophyll cells, anaerobic induction, and anoxic specific inducibility. Defense and stress-related cis-elements include low-temperature responsiveness, drought responsiveness, wound responsive element, and defense-related cis-element. Thus, CRE analysis revealed that the PeCNL genes are involved in the defense mechanism of P. edulis Sims. against a variety of pathogens and environmental stresses. The presence of TC-rich repeats, WUN-motif, ARE, GC-motif, LTR, and MBS has provided evidence for their involvement in the defense-related mechanism (Figure 8; Supplementary Table S4).

FIGURE 8
www.frontiersin.org

FIGURE 8. (A) Different categories for cis-elements present in promoter sequences of PeCNL genes. (B) Location of cis-element on each PeCNL gene.

3.5 PPI and miRNA target prediction

The interaction network was visualized at the second level of connection of PeCNL and other proteins. Among the identified potentially interacting proteins, the Toll/interleukin-1 receptor (TIR) exhibited the most significant degree of interaction, while PeCNL24 displayed the second-highest degree of connectivity. The interaction network consisted of seven proteins from P. edulis Sims, namely, PeCNL2, PeCNL3, PeCNL14, PeCNL21, PeCNL23, PeCNL24, and PeCNL25, that were interacting with 10 proteins of A. thaliana. (Figure 9A).

FIGURE 9
www.frontiersin.org

FIGURE 9. (A) Protein-Protein interaction network of PeCNL proteins with of A. thaliana’s proteins made by STRING database. (B) miRNA target gene network where the number of miRNAs that target each gene varies.

The Toll interleukin 1 protein of A. thaliana belongs to the TNL subclass of the NLR gene family and plays a significant role in the plant’s disease resistance mechanism. Additionally, the PeCNL24 protein is also involved in the plant’s disease resistance mechanism by recognizing RIN4 and conferring disease resistance. RIPK, RIN4, RIN1, PeCNL24, PeCNL25, SOBIR1, PeCNL3, AT3G57750, and PeCNL2 proteins were interacting with disease-resistant proteins in A. thaliana, and their mode of interaction was experimentally validated. Conversely, the remaining interactions were established through text-mining or other methods, yet their specific interactions have not been characterized experimentally.

A total of 15 miRNAs were found targeting 19 PeCNL genes having regulatory association with these miRNAs. Four miRNAs were targeting PeCNL3 and PeCNL12, whereas three miRNAs were targeting PeCNL17 respectively. The PeCNL6, PeCNL9, PeCNL11, and PeCNL15 were targeted by two miRNAs while all of the remaining PeCNL genes were targeted by only one miRNA (Figure 9B; Supplementary Table S5). PeCNL1, PeCNL2, PeCNL4, PeCNL23, PeCNL24, and PeCNL25 were not targeted by any of the miRNAs. The expectation score exhibited a range of 3.5 to 5. The prevailing function of most miRNAs is the inhibition of target transcript cleavage, while only three miRNAs perform the function of inhibiting target gene cleavage.

3.6 Expression profiling of PeCNL genes under multiple stresses

The expression patterns were validated for 25 PeCNL genes of P. edulis which was subjected to CMV infection and cold condition. The heatmap represented that PeCNL2, PeCNL4, PeCNL6, PeCNL7, PeCNL8, PeCNL10, PeCNL11, PeCNL12, PeCNL17, PeCNL18, PeCNL23 and PeCNL24 were upregulated and PeCNL3, PeCNL5, PeCNL13, PeCNL14, PeCNL16, PeCNL21, and PeCNL25 were downregulated under CMV infection (Figure 10A; Supplementary Table S6). The PeCNL9 had no change in expression level under the CMV condition. Under cold condition PeCNL3, PeCNL4, PeCNL6, PeCNL7, PeCNL16, PeCNL17, PeCNL19, PeCNL23, and PeCNL24 were slightly downregulated in both cultivars. While PeCNL9, PeCNL13, PeCNL14, PeCNL15, PeCNL18, PeCNL22, and PeCNL25 were upregulated under the cold conditions in both cultivars (Figure 10B). Whereas, gene expression levels for remaining PeCNL genes were variable i.e., upregulated in one cultivar and downregulated in the second cultivar. The genes that are differentially expressed were the potential targets of in-vitro studies in the future after experimental validation and these include PeCNL3, PeCNL13, PeCNL14, PeCNL23, and PeCNL21.

FIGURE 10
www.frontiersin.org

FIGURE 10. (A) The heatmap illustrates the expression levels of PeCNL genes under cucumber mosaic virus (CMV). (B) The heatmap depicts the expression levels of PeCNL genes by providing cold condition to the two cultivars of P. edulis Sims. namely, Tainong1hao and Huangjinguo. In the heatmap, dark cyan color indicates downregulated genes, white color represents no change in expression and red color signifies upregulated genes. The scale for the heatmap represents the log2 transformed count values.

3.7 Validation of PeCNLs under multiple stresses via machine learning

A total of 3 common genes namely, PeCNL3, PeCNL13, and PeCNL14 were found to be differentially expressed that satisfied the criteria. All the 3 genes were upregulated in CMV infected condition while these were downregulated in cold condition. These genes are potentially significant to be used for making stress-resistant P. edulis Sims. varieties. These genes were used to test the performance of the Random Forest classifier already trained on CMV infected condition. PeCNL3 yielded the best performance in terms of Accuracy, sensitivity, specificity, and AUC visualization (Supplementary Figure S1). Validating the expression of PeCNLs via machine learning would help explore the genes that are particularly responsible for multi-stress responsiveness. This can be used to improve P. edulis cultivar varieties soon which would have increased chances of survival by withstanding multiple stress conditions.

3.8 3D structure prediction and GO enrichment analysis of PeCNL proteins

Based on the machine learning evaluation, three-dimensional structures were predicted for 3 PeCNL proteins namely, PeCNL3, PeCNL13, and PeCNL14 that were responsible for multi-stress responsiveness. PeCNL3 was found to have 35 alpha helices and 22 beta sheets, and PeCNL13 comprised 46 alpha helices and 29 beta sheets. While PeCNL14 contained 53 alpha helices and 30 beta sheets (Figure 11A). The variability in the number of alpha helices and beta sheets suggest that proteins might have undergone structural and functional divergence during the process of evolution to manage the survival of plant under changing conditions and pathogenic attack.

FIGURE 11
www.frontiersin.org

FIGURE 11. (A) 3D structures of the three PeCNL proteins predicted by trRosseta. Cyan color represents the alpha helices, purple color represents beta sheets, and light pink color represents the loops. (B) GO enrichment analysis of PeCNL proteins determined by using Pannzer2.

The GO enrichment analysis demonstrated the potential functions, biological processes, and cellular components in which each of the PeCNL proteins was involved. The majority of the PeCNL proteins were involved in ADP binding, ATP binding, and myosin phosphatase activity. Fewer proteins tend to be involved in other processes including ATP hydrolysis, hydrolase activity, and carbohydrate-binding activity. Accordingly, most of the PeCNL proteins were predicted to be present in the membranous part of the cell as already confirmed by subcellular localization. Others were located in the cytoplasm, nuclear, plastid, plasma membrane, and chloroplast sections. The GO enrichment analysis confirmed the involvement of PeCNL genes in the defense mechanism of P. edulis Sims. towards a variety of pathogens and environmental stresses (Figure 11B; Supplementary Table S7).

4 Discussion

The present work has utilized the most recent genome assembly of P. edulis Sims. and P. edulis f. flavicarpa and identified 25 PeCNL genes in P. edulis Sims. and 21 PeCNL genes in P. edulis f. flavicarpa. The identified PeCNL genes are smaller than those in A. thaliana (56) (Meyers et al., 2003), Secale cereale (581) (Qian et al., 2021), Glycine max (188) (Nepal and Benson, 2015), Discorea rotundata (166) (Zhang et al., 2020), C. sativus (33) (Zhang et al., 2022), B. rapa (40) (Liu et al., 2021), Oryza sativa L. var. Nipponbare (159) (Zhou et al., 2004). The disparity in the number of identified CNL genes among other crops provides compelling evidence that this variation is a result of gene duplications or gene contraction events that likely occurred during evolution. The NLRs are immune receptors integral to the mechanism of ETI in plants. These receptors function as cytoplasmic proteins, responsible for discerning strain-specific effectors originating from pathogens. The localization of PeCNL proteins in cytoplasm and membrane gives evidence for the involvement of these proteins being a crucial part of the signaling pathway in targeting effectors released by pathogens.

The CNLs have been reported to be present in both monocots and dicots (Jacob et al., 2013) RCY1 (Sekine et al., 2008), HRT (Takahashi et al., 2002), RPP8/RPP13 (Bittner-eddy et al., 2000), RPM1 (El Kasmi et al., 2017), RPS2 (Ilag et al., 2000), and RPS5 (Qi et al., 2012) are Arabidopsis AtCNL genes that are validated through in vitro methods to be involved in conferring disease resistance in A. thaliana against various diseases including CMV, Turnip crinkle virus, Downy mildew of cucurbits, bacterial blight. However, the CNL genes in other plants have also been found to be validated experimentally for conferring disease resistance including five CNLs in Solanum lycopersicum, seven CNLs in Triticum aestivum, three CNLs in Hordeum vulgare, and eleven in O. sativa (Zhang et al., 2019). All of these findings suggest that as CNL genes have been proven to be involved in conferring disease resistance in other plants they will also be involved in conferring disease resistance in passion fruit. The characteristics of PeCNLs were consistent with the characteristics of CsCNL proteins (Zhang et al., 2022), where the majority of the proteins were acidic only 13 proteins were basic, and the majority were localized to cytoplasmic and nuclear sections. The BrCNL proteins (Liu et al., 2021) were also similar to the PeCNL members because the majority were acidic.

Phylogeny inference based on the NJ method allowed the analysis of how PeCNL proteins linked to other proteins in the course of evolution. AtCNL proteins were divided into four groups in the phylogenetic tree namely, groups A, B, C, and D where the clade for group B was largest with 26 AtCNL members, and the clade for group A was smallest with 6 members. A comprehensive analysis incorporating the Viridiplantae kingdom in an already reported study unveiled that the genes encoding plant NLR proteins emerged from a shared ancestor of green plants and subsequently underwent divergent evolution, giving rise to three distinct subclasses during the early stages of plant evolution (Shao et al., 2019). All the PeCNL proteins were found to have close evolutionary relationships with CNL proteins of A. thaliana and M. domestica. The tree was divided into four groups namely, groups A, B, C, and D with a varied number of members in each group. Surprisingly none of the member of P. edulis Sims. was present in group D which was quite similar to the trends observed for CNL proteins of C. sativus. Thus, it can be inferred that AtCNLs and MdCNLs tend to be the orthologs of PeCNL proteins which indicates that they share the same ancestor.

All of the conserved motifs linked with the proper functioning of the PeCNL genes were found to be conserved in PeCNL proteins namely, P-loop, GLPL, Kinase-2, RNBS-B, RNBS-D, (Shao et al., 2016). The group C accompanied some PeCNL proteins that lack Kinase-2 motif and other motifs of unknown function. The motifs account for structural conformation of PeCNL proteins. The AtCNL proteins contained RNBS-A, RNBS-C, RNBS-D, and MHDV in addition to other conserved motifs (Meyers et al., 2003). G. max, contained seven conserved motifs (RNBS-A and RNBS-C) and along with other CNL specific motifs (Nepal and Benson, 2015). Exactly same set of conserved motifs were present in Secale cereale as in P. edulis Sims. (Qian et al., 2021). Predicted conserved motifs in D. rotundata were same except for RSNB-D and P-loop (Zhang et al., 2020). However, the C. sativus contained additional motifs that were conserved in CsCNL proteins. In B. rapa the motifs 1, 5, 8, and 9 were responsible for unknown functions while other motifs were encoding NB-ARC and LRR domains. It can be concluded that the motifs are highly conserved across other plants because they are important for maintaining the structure and function of CNL proteins. The variability in motifs could be because each plant has undergone different environmental and selection pressures in the process of evolution.

Most of the PeCNL genes (11) had only one exon which represent 44% of the identified genes and 6 PeCNL genes had 3 exons that represent 24% of the total PeCNL genes. PeCNL8 had 9 exons and 8 introns. Group A had exons ranging from 2 to 3 and introns ranging from 1 to 2. Group B had exons ranging from 1 to 3 and introns ranging from 1 to 2. Whereas, Group C had exons ranging from 1 to 9 and introns ranging from 1 to 4, and only one gene had 8 introns. In C. sativus (Zhang et al., 2022) Group A had 1, 3, and 5 exons respectively. Group B had exons in the range of 1 to 7. Group C had exons in the range of 1 to 5 with 1 being the most frequent. Among BrCNL genes Group I had 1, 2, and 11 exons respectively with 1 being the most frequent. Group II had an exon range given as 1 to 2. Group III had exons ranging from 1 to 3. Group IV had exons ranging from 1 to 5. Group V had exons ranging from 1 to 4. AtCNL genes and their gene products were encoded by single exons (Meyers et al., 2003). The number of introns impacts the expression speed of genes, so genes with a smaller number of introns can be faster edited and translated (Yaghobi and Heidari, 2023; Zaman et al., 2023). The differences in the number of exons and introns indicate the diversity in genic and intergenic regions of CNL genes in other plants and the variability in a number of gene family members in each plant.

All the PeCNL genes were distributed unevenly at 7 chromosomes and were present in the form of clusters. The CNL gene family being a subclass of the NLR gene family also tends genes to be clustered together likewise in the case of NLR where the size of these clusters varies considerably, with certain species possessing large clusters that include over 10 NLRs (Cesari et al., 2013). PeCNL genes formed the largest gene cluster at chromosome 3 with 7 genes representing 28% of the identified genes. Amongst the identified PeCNL genes none of them was present at chromosomes 6 and 7 which could be possibly due to gene contraction or gene transposition or due to the impact of environmental factors. In the case of A. thaliana, a total of 56 AtCNL genes were also distributed in the form of gene clusters at the five chromosomes. Based on a 10 ORF sliding window approach 41 gene clusters have been identified in the genome of G. max. Chromosome 10 did not contain any of the CNL gene and 105 genes (56%) were present at 5 out of 20 chromosomes (Nepal and Benson, 2015). Out of 582 CNL genes identified in the genome of S. cereale 111 ScCNL genes were present at chromosome 4 and almost half of these genes were present at chromosome 2 (Qian et al., 2021). The largest gene cluster of 22 genes was present at chromosome 3 of D. rotundata and the smallest gene cluster was at chromosome 21 with 3 genes (Zhang et al., 2020). The BrCNL genes formed a gene cluster at chr-A09 with 11 genes and the second largest cluster of genes was at chr-A06 with 8 genes respectively. BrCNL genes were completely absent at chr-A04 and chr-A07 and only one gene at chr-A02. Interestingly, CsCNL genes formed the largest gene cluster at chr2 with 10 genes. All these findings suggest the conservation of presence of CNL gene clusters on chromosomes across species.

A total of 34 duplicated gene pairs were found with an equal number of duplication events for both segmental and tandem duplicates, leading to the conclusion that both these duplication events contributed to expansions of the CNL gene family in P. edulis Sims. All of the duplicated gene pairs underwent strong purifying selection except PeCNL8/PeCNL18 which was the product of positive selection. In A. thaliana a total of 149 NLR genes have been identified including CNL, TNL, and other subgroups. Out of the identified 149 NLR genes, 124 genes were the segmental duplication products indicating the association of gene duplication with the expansion of CNL, TNL, and other subgroups (Meyers et al., 2003). All of the identified CNL genes in G. max were the products of tandem duplication (Nepal and Benson, 2015). The dispersed, tandem and segmental duplications collectively accounted for the expansion of the CNL gene family in S. cereale with the dispersed playing the major counterpart (i.e., 60%) than the other two (i.e., 39% for tandem and 1% for segmental) (Qian et al., 2021). A total of 18 segmentally duplicated genes were found to be present in D. rotundata (Zhang et al., 2020). The Ka/Ks analysis of the NLR gene family in Lagneria siceraria (Wang et al., 2022) revealed that among 14 duplicated gene pairs, two gene pairs were segmentally duplicated, and the remaining were tandemly duplicated indicating that the tandem duplication was more favorable and all the duplicated gene pairs were products of negative selection. Ka/Ks analysis of the NLR gene family in C. sinensis (Yin et al., 2023) demonstrated that 16 duplicated gene pairs were tandemly duplicated and were a product of negative selection. The segmental and tandem duplications are equally contributing to the expansion of CNL gene family across the different plants.

The cis-elements were linked to growth and development, hormone response, and stress response. The majority of the cis-elements were involved in growth and development in comparison with CNL genes in B. rapa which contained mostly cis-elements for disease resistance. WBOX was the potential cis-element predicted to be present in the promoter regions of G. max thereby, regulating the defense-associated activity of CNL genes (Nepal and Benson, 2015). The hormone-related cis-elements are also responsible for pathogen-induced immune response where salicylic acid, Jasmonic acid (JA), and ethylene (ET), trigger signal transduction to activate PTI (Corina Vlot et al., 2009; Robert-Seilaniantz et al., 2011). In rice AS-1, G-box, GCC-box, and H-box are the potential cis-elements that induce pathogen defense (Kong et al., 2018). Similarly, Brassica juncea also has cis-elements associated with pathogen defense in the abiotic, biotic, and hormone related categories (Ali et al., 2017). By applying salicylic acid and Jasmonic acid treatment to plants the resistance of plants to pathogen attack gets increased or they promulgate pathogen-induced immune response (Argueso et al., 2012). The cis-element reported to confer pathogen resistance in A. thaliana include MYB, MYC, WRE3, W-box, STRE, and ARE (Saidi et al., 2024). The protein-protein interactions were found for the disease-resistance proteins of A. thaliana with PeCNLs. The TIR and PeCNL24 had a high degree of connectivity indicating that their function will be important for the survival of the plant in disease-related mechanisms.

The miRNAs are usually 18-20 nucleotides long and are responsible for regulating the function of PeCNL proteins. A total of 15 miRNAs targeted 19 PeCNL genes that further gained significant importance due to the way they regulate the functions of these proteins. The miRNAs offer a useful way for future disease management by targeting appropriate miRNAs.

The expression profiling of PeCNL genes was validated under CMV infection and cold stress. The PeCNL3, PeCNL13, and PeCNL14 have differentially expressed genes under CMV and cold stress condition. PeCNL3 and PeCNL14 were downregulated under CMV condition and upregulated under cold condition. Contrastingly, PeCNL13 was upregulated under CMV condition and downregulated in cold condition. PeCNL3, PeCNL13, and PeCNL14 genes can withstand multiple stresses in P. edulis Sims. thus, suitable for developing stress-tolerant varieties of P. edulis Sims.

The expression patterns of PeCNL genes have been demonstrated under multiple stresses to find out the potential genes that are responsible for multi-stress responsiveness and useful for the defense mechanism of P. edulis Sims. to accommodate the underlying conditions. PeCNL3, PeCNL13, and PeCNL14 were differentially expressed under multiple stresses. Machine learning approach i.e., the Random Forest model for regression has been applied to validate the expression of genes potentially involved in multi-stress responsiveness. PeCNL3, PeCNL13, and PeCNL14 were found to be having a significant role in the multi-stress responsiveness of PeCNL genes thus indicating that they can be utilized as potential targets for making transgenic P. edulis varieties. The expression patterns of CsCNL genes were observed in seven tissues i.e., leaf, stem, root, male flower, female flower, tendril, and ovary, and abiotic and biotic stresses including Powdery mildew, downy mildew, salt stress, and low-temperature stress at different stages (Yin et al., 2023). The heatmaps demonstrated that most of the CsCNL genes have their expression level upregulated under abiotic and biotic stresses leading to the conclusion that these are involved in abiotic and biotic stresses and only a few genes were not exhibiting any change in their expression levels (Zhang et al., 2022b). 3D structures were predicted for the aforementioned proteins i.e., PeCNL3, PeCNL13, and PeCNL14. The number of alpha helices and beta sheets varied for each protein. The GO analysis confirmed the involvement of the PeCNLs in the mechanism of disease resistance. Thus, the identification of PeCNL genes in the genome of P. edulis would be crucial for gaining insights into how the P. edulis genome has expanded or evolved in the course of evolution to cope with changing environments and pathogens. Based on our analysis, it can be concluded that CNL genes could play a significant role in improving the genetic makeup of Passion fruit. These genes can be incorporated into breeding or genetic manipulation programs to provide disease resistance and enhance tolerance to abiotic stresses. Furthermore, the multi-stress responsiveness of these genes makes them valuable candidates for further breeding programs seeking to develop mango varieties that are adaptable to diverse environmental conditions. By breeding for PeCNL gene-related traits, we could achieve healthier plants, reduced pesticide dependency, and improved sustainability in Passion fruit cultivation.

5 Conclusion

In this study, a total of 25 and 21 CNL genes were identified in P. edulis Sims. and Passiflora edulis f. flavicarpa, respectively. The PeCNL genes were validated by the presence of conserved domains and motifs associated with the function of CNL genes. Phylogenetic analysis classified PeCNLs into four groups. Gene structure was highly conserved across P. edulis and other plants. Most of the PeCNL genes were present on chromosomes in the form of clusters. Both segmental and tandem duplications have been involved in the expansion of the CNL gene family in P. edulis Sims. Cis-regulatory elements were also found to be involved in growth and development, defense and stress, and hormone response of PeCNL genes. All of the PeCNL proteins were interacting with defense-related proteins. miRNA target prediction showed the regulatory roles in the expression of the PeCNL proteins. The varied number of alpha helices and beta sheets were present in PeCNL proteins and GO enrichment analysis confirmed the involvement of PeCNL proteins in the defense of plants against pathogens. The PeCNL3, PeCNL13, and PeCNL14 were multi-stress responsive genes and were validated using machine learning approaches. Thus, the aforementioned genes could be crucial for the survival of plants underlying changing environmental conditions and pathogenic stress. After experimental validation, these genes could be increasingly helpful in making stress-tolerant varieties of P. edulis in the future.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Author contributions

KZ: Data curation, Formal Analysis, Investigation, Methodology, Visualization, Writing–original draft. MS: Data curation, Investigation, Methodology, Validation, Writing–review and editing. BD: Investigation, Methodology, Validation, Writing–review and editing. KF: Data curation, Investigation, Methodology, Validation, Writing–review and editing. NA: Funding acquisition, Methodology, Project administration, Resources, Software, Validation, Writing–review and editing. AA: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Validation, Writing–review and editing. MT: Conceptualization, Methodology, Project administration, Resources, Software, Supervision, Validation, Writing–original draft.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Acknowledgments

The authors extend their appreciation to the Deputyship for Research and Innovation, “Ministry of Education” in Saudi Arabia for funding this research IFKSUOR3-618-2.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1358134/full#supplementary-material

References

Afzal, M., Alghamdi, S. S., Nawaz, H., Migdadi, H. H., Altaf, M., El-Harty, E., et al. (2022). Genome-wide identification and expression analysis of CC-NB-ARC-LRR (NB-ARC) disease-resistant family members from soybean (Glycine max L.) reveal their response to biotic stress. J. King Saud Univ. - Sci. 34 (2), 101758. doi:10.1016/j.jksus.2021.101758

CrossRef Full Text | Google Scholar

Ali, S., Mir, Z. A., Tyagi, A., Bhat, J. A., Chandrashekar, N., Papolu, P. K., et al. (2017). Identification and comparative analysis of Brassica juncea pathogenesis-related genes in response to hormonal, biotic and abiotic stresses. Acta Physiol. Plant. 39 (12), 268–315. doi:10.1007/s11738-017-2565-8

CrossRef Full Text | Google Scholar

Argueso, C. T., Ferreira, F. J., Epple, P., To, J. P. C., Hutchison, C. E., Schaller, G. E., et al. (2012). Two-component elements mediate interactions between cytokinin and salicylic acid in plant immunity. PLoS Genet. 8 (1), e1002448. doi:10.1371/journal.pgen.1002448

PubMed Abstract | CrossRef Full Text | Google Scholar

Artimo, P., Jonnalagedda, M., Arnold, K., Baratin, D., Csardi, G., De Castro, E., et al. (2012). ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res. 40 (W1), 597–603. doi:10.1093/nar/gks400

PubMed Abstract | CrossRef Full Text | Google Scholar

Bailey, T. L., Boden, M., Buske, F. A., Frith, M., Grant, C. E., Clementi, L., et al. (2009). MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. 37 (Suppl. 2), 202–208. doi:10.1093/nar/gkp335

PubMed Abstract | CrossRef Full Text | Google Scholar

Bezerra-Neto, J. P., Araújo, F. C., Ferreira-Neto, J. R. C., Silva, R. L. O., Borges, A. N. C., Matos, M. K. S., et al. (2019). “NBS-LRR genes-Plant health sentinels: structure, roles, evolution and biotechnological applications,” in Applied plant biotechnology for improving resistance to biotic stress (Elsevier Inc). doi:10.1016/B978-0-12-816030-5.00004-5

CrossRef Full Text | Google Scholar

Bittner-eddy, P. D., Crute, I. R., Holub, E. B., and Beynon, J. L. (2000). RPP13 is a simple locus in Arabidopsis thaliana for alleles that specify downy mildew resistance to different avirulence determinants in Peronospora parasitica. Plant J. 21 (2), 177–188. doi:10.1046/j.1365-313x.2000.00664.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Cesari, S., Thilliez, G., Ribot, C., Chalvon, V., Michel, C., Jauneau, A., et al. (2013). The rice resistance protein pair RGA4/RGA5 recognizes the Magnaporthe oryzae effectors AVR-Pia and AVR1-CO39 by direct binding. Plant Cell 25 (4), 1463–1481. doi:10.1105/tpc.112.107201

PubMed Abstract | CrossRef Full Text | Google Scholar

Chaudhary, A., Kolhe, S., and Kamal, R. (2016). An improved random forest classifier for multi-class classification. Inf. Process. Agric. 3 (4), 215–222. doi:10.1016/j.inpa.2016.08.002

CrossRef Full Text | Google Scholar

Chavarría-Perez, L. M., Giordani, W., Dias, K. O. G., Costa, Z. P., Ribeiro, C. A. M., Benedetti, A. R., et al. (2020). Improving yield and fruit quality traits in sweet passion fruit: evidence for genotype by environment interaction and selection of promising genotypes. PLoS ONE 15 (5), e0232818. doi:10.1371/journal.pone.0232818

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, C., Chen, H., Zhang, Y., Thomas, H. R., Frank, M. H., He, Y., et al. (2020). TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13 (8), 1194–1202. doi:10.1016/j.molp.2020.06.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Chou, K. C., and Shen, H. B. (2010). Plant-mPLoc: a top-down strategy to augment the power for predicting plant protein subcellular localization. PLoS ONE 5 (6), e11335. doi:10.1371/journal.pone.0011335

PubMed Abstract | CrossRef Full Text | Google Scholar

Corina Vlot, A., Dempsey, D. A., and Klessig, D. F. (2009). Salicylic acid, a multifaceted hormone to combat disease. Annu. Rev. Phytopathology 47, 177–206. doi:10.1146/annurev.phyto.050908.135202

PubMed Abstract | CrossRef Full Text | Google Scholar

Correia, A. de O., Alexandre, R. S., Pfenning, L. H., Cabanez, P. A., Ferreira, A., Ferreira, M. F. D. S., et al. (2022). Passiflora mucronata, a passion fruit wild species resistant to fusariosis and a potential rootstock for commercial varieties. Sci. Hortic. 302, 111174. doi:10.1016/j.scienta.2022.111174

CrossRef Full Text | Google Scholar

Dai, X., Zhuang, Z., and Zhao, P. X. (2018). PsRNATarget: a plant small RNA target analysis server (2017 release). Nucleic Acids Res. 46 (W1), W49–W54. doi:10.1093/nar/gky316

PubMed Abstract | CrossRef Full Text | Google Scholar

Delplace, F., Huard-Chauveau, C., Berthomé, R., and Roby, D. (2022). Network organization of the plant immune system: from pathogen perception to robust defense induction. Plant J. 109 (2), 447–470. doi:10.1111/tpj.15462

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Z., Su, H., Wang, W., Ye, L., Wei, H., Peng, Z., et al. (2021). The trRosetta server for fast and accurate protein structure prediction. Nat. Protoc. 16 (12), 5634–5651. doi:10.1038/s41596-021-00628-9

PubMed Abstract | CrossRef Full Text | Google Scholar

El Kasmi, F., Chung, E. H., Anderson, R. G., Li, J., Wan, L., Eitas, T. K., et al. (2017). Signaling from the plasma-membrane localized plant immune receptor RPM1 requires self-association of the full-length protein. Proc. Natl. Acad. Sci. U. S. A. 114 (35), E7385–E7394. doi:10.1073/pnas.1708288114

PubMed Abstract | CrossRef Full Text | Google Scholar

Fonseca, A. M. A., Geraldi, M. V., Junior, M. R. M., Silvestre, A. J. D., and Rocha, S. M. (2022). Purple passion fruit (Passiflora edulis f. edulis): a comprehensive review on the nutritional value, phytochemical profile and associated health effects. Food Res. Int. 160, 111665. doi:10.1016/j.foodres.2022.111665

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, X., Chen, F., Gao, F., Li, L., Liu, K., You, L., et al. (2020). CNSA: a data repository for archiving omics data. Database 2020, baaa055–6. doi:10.1093/database/baaa055

PubMed Abstract | CrossRef Full Text | Google Scholar

Gururani, M. A., Venkatesh, J., Upadhyaya, C. P., Nookaraju, A., Pandey, S. K., and Park, S. W. (2012). Plant disease resistance genes: current status and future directions. Physiological Mol. Plant Pathology 78, 51–65. doi:10.1016/j.pmpp.2012.01.002

CrossRef Full Text | Google Scholar

Hu, B., Jin, J., Guo, A. Y., Zhang, H., Luo, J., and Gao, G. (2015). GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics 31 (8), 1296–1297. doi:10.1093/bioinformatics/btu817

PubMed Abstract | CrossRef Full Text | Google Scholar

Hunter, S., Apweiler, R., Attwood, T. K., Bairoch, A., Bateman, A., Binns, D., et al. (2009). InterPro: the integrative protein signature database. Nucleic Acids Res. 37 (1), D211–D215. doi:10.1093/nar/gkn785

PubMed Abstract | CrossRef Full Text | Google Scholar

Ilag, L. L., Yadav, R. C., Huang, N., Ronald, P. C., and Ausubel, F. M. (2000). Isolation and characterization of disease resistance gene homologues from rice cultivar IR64. Gene 255 (2), 245–255. doi:10.1016/S0378-1119(00)00333-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Jacob, F., Vernaldi, S., and Maekawa, T. (2013). Evolution and conservation of plant NLR functions. Front. Immunol. 4 (SEP), 297–316. doi:10.3389/fimmu.2013.00297

PubMed Abstract | CrossRef Full Text | Google Scholar

Joy, P. P., and Sherin, C. G. (1983). Diseases of passion fruit (Passiflora edulis). Pineapple Res. Stn. Kerala Agric. Univ. 686, 1.

Google Scholar

Jupe, F., Pritchard, L., Etherington, G. J., MacKenzie, K., Cock, P. J. A., Wright, F., et al. (2012). Identification and localisation of the NB-LRR gene family within the potato genome. BMC Genomics 13 (1), 75–14. doi:10.1186/1471-2164-13-75

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaur, S., Samota, M. K., Choudhary, M., Choudhary, M., Pandey, A. K., Sharma, A., et al. (2022). How do plants defend themselves against pathogens-Biochemical mechanisms and genetic interventions. Physiology Mol. Biol. Plants 28 (2), 485–504. doi:10.1007/s12298-022-01146-y

CrossRef Full Text | Google Scholar

Kohler, A., Rinaldi, C., Duplessis, S., Baucher, M., Geelen, D., Duchaussoy, F., et al. (2008). Genome-wide identification of NBS resistance genes in Populus trichocarpa. Plant Mol. Biol. 66 (6), 619–636. doi:10.1007/s11103-008-9293-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Kong, W., Ding, L., Cheng, J., and Wang, B. (2018). Identification and expression analysis of genes with pathogen-inducible cis-regulatory elements in the promoter regions in Oryza sativa. Rice 11 (1), 52. doi:10.1186/s12284-018-0243-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S., Stecher, G., and Tamura, K. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33 (7), 1870–1874. doi:10.1093/molbev/msw054

PubMed Abstract | CrossRef Full Text | Google Scholar

Lescot, M., Déhais, P., Thijs, G., Marchal, K., Moreau, Y., Van De Peer, Y., et al. (2002). PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 30 (1), 325–327. doi:10.1093/nar/30.1.325

PubMed Abstract | CrossRef Full Text | Google Scholar

Letunic, I., and Bork, P. (2021). Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49 (W1), W293–W296. doi:10.1093/nar/gkab301

PubMed Abstract | CrossRef Full Text | Google Scholar

Liao, Y., Smyth, G. K., and Shi, W. (2014). FeatureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30 (7), 923–930. doi:10.1093/bioinformatics/btt656

PubMed Abstract | CrossRef Full Text | Google Scholar

Librado, P., and Rozas, J. (2009). DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25 (11), 1451–1452. doi:10.1093/bioinformatics/btp187

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., Qiao, L., Zhang, X., Li, X., Zhan, H., Guo, H., et al. (2017). Genome-wide identification and resistance expression analysis of the NBS gene family in Triticum urartu. Genes Genomics 39 (6), 611–621. doi:10.1007/s13258-017-0526-7

CrossRef Full Text | Google Scholar

Liu, Y., Li, D., Yang, N., Zhu, X., Han, K., Gu, R., et al. (2021). Genome-wide identification and analysis of cc-nbs-lrr family in response to downy mildew and black rot in Chinese cabbage. Int. J. Mol. Sci. 22 (8), 4266. doi:10.3390/ijms22084266

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, S., Wang, J., Chitsaz, F., Derbyshire, M. K., Geer, R. C., Gonzales, N. R., et al. (2020). CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. 48 (D1), D265–D268. doi:10.1093/nar/gkz991

PubMed Abstract | CrossRef Full Text | Google Scholar

McDonnell, A. V., Jiang, T., Keating, A. E., and Berger, B. (2006). Paircoil2: improved prediction of coiled coils from sequence. Bioinformatics 22 (3), 356–358. doi:10.1093/bioinformatics/bti797

PubMed Abstract | CrossRef Full Text | Google Scholar

Meyers, B. C., Kozik, A., Griego, A., Kuang, H., and Michelmore, R. W. (2003). Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis. Plant Cell 15 (4), 809–834. doi:10.1105/tpc.009308

PubMed Abstract | CrossRef Full Text | Google Scholar

Michael, A., Ahlmann-eltze, C., Forbes, K., and Anders, S. (2023). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15, 550. doi:10.1186/s13059-014-0550-8

CrossRef Full Text | Google Scholar

Miroddi, M., Calapai, G., Navarra, M., Minciullo, P. L., and Gangemi, S. (2013). Passiflora incarnata L.: ethnopharmacology, clinical application, safety and evaluation of clinical trials. J. Ethnopharmacol. 150 (3), 791–804. doi:10.1016/j.jep.2013.09.047

PubMed Abstract | CrossRef Full Text | Google Scholar

Mistry, J., Chuguransky, S., Williams, L., Qureshi, M., Salazar, G. A., Sonnhammer, E. L. L., et al. (2021). Pfam: the protein families database in 2021. Nucleic Acids Res. 49 (D1), D412–D419. doi:10.1093/nar/gkaa913

PubMed Abstract | CrossRef Full Text | Google Scholar

Nepal, M. P., and Benson, B. V. (2015). CNL disease resistance genes in soybean and their evolutionary divergence. Evol. Bioinforma. 11, 49–63. doi:10.4137/EBO.S21782

PubMed Abstract | CrossRef Full Text | Google Scholar

Passiflora, P., Shi, M., Ali, M. M., He, Y., Ma, S., Rizwan, H. M., et al. (2021). Flavonoids accumulation in fruit peel and expression profiling of related genes in purple (Passiflora edulis f. Edulis) and Yellow (Passiflora edulis f. Flavicarpa) Passion Fruits. Plants 10 (11). doi:10.3390/plants10112240

CrossRef Full Text | Google Scholar

Paul, S., de la Fuente-Jiménez, J. L., Manriquez, C. G., and Sharma, A. (2020). Identification, characterization and expression analysis of passion fruit (Passiflora edulis) microRNAs. 3 Biotech. 10 (1), 1–9. doi:10.1007/s13205-019-2000-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Paul Shannon, A., Owen, O., and Nitin, S. (1971). Cytoscape: a software environment for integrated models. Genome Res. 13 (22), 426. doi:10.1101/gr.1239303.metabolite

PubMed Abstract | CrossRef Full Text | Google Scholar

Potter, S. C., Luciani, A., Eddy, S. R., Park, Y., Lopez, R., and Finn, R. D. (2018). HMMER web server: 2018 update. Nucleic Acids Res. 46 (W1), W200–W204. doi:10.1093/nar/gky448

PubMed Abstract | CrossRef Full Text | Google Scholar

Qi, D., de Young, B. J., and Innes, R. W. (2012). Structure-function analysis of the coiled-coil and leucine-rich repeat domains of the RPS5 disease resistance protein. Plant Physiol. 158 (4), 1819–1832. doi:10.1104/pp.112.194035

PubMed Abstract | CrossRef Full Text | Google Scholar

Qian, L. H., Wang, Y., Chen, M., Liu, J., Lu, R. S., Zou, X., et al. (2021). Genome-wide identification and evolutionary analysis of NBS-LRR genes from Secale cereale. Front. Genet. 12, 771814–771910. doi:10.3389/fgene.2021.771814

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramkumar, M. K., Mulani, E., Jadon, V., Sureshkumar, V., Krishnan, S. G., Senthil Kumar, S., et al. (2022). Identification of major candidate genes for multiple abiotic stress tolerance at seedling stage by network analysis and their validation by expression profiling in rice (Oryza sativa L.). 3 Biotech. 12 (6), 127–214. doi:10.1007/s13205-022-03182-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Rizwan, H. M., Zhimin, L., Harsonowati, W., Waheed, A., Qiang, Y., Yousef, A. F., et al. (2021). Identification of fungal pathogens to control postharvest passion fruit (Passiflora edulis) decays and multi-omics comparative pathway analysis reveals purple is more resistant to pathogens than a yellow cultivar. J. Fungi 7 (10), 879–923. doi:10.3390/jof7100879

CrossRef Full Text | Google Scholar

Robert-Seilaniantz, A., Grant, M., and Jones, J. D. G. (2011). Hormone crosstalk in plant disease and defense: more than just JASMONATE-SALICYLATE antagonism. Annu. Rev. Phytopathology 49, 317–343. doi:10.1146/annurev-phyto-073009-114447

PubMed Abstract | CrossRef Full Text | Google Scholar

Rosli, R., Amiruddin, N., Ab Halim, M. A., Chan, P. L., Chan, K. L., Azizi, N., et al. (2018). Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm. PLoS ONE 13 (4), e0194792. doi:10.1371/journal.pone.0194792

PubMed Abstract | CrossRef Full Text | Google Scholar

Sadaqat, M., Umer, B., Attia, K. A., Abdelkhalik, A. F., Azeem, F., Javed, M. R., et al. (2023). Genome-wide identification and expression profiling of two-component system (TCS) genes in Brassica oleracea in response to shade stress. Front. Genet. 14, 1142544–1142617. doi:10.3389/fgene.2023.1142544

PubMed Abstract | CrossRef Full Text | Google Scholar

Saidi, A., Safaeizadeh, M., and Hajibarat, Z. (2024). Differential expression of the genes encoding immune system components in response to Pseudomonas syringae and Pseudomonas aeruginosa in Arabidopsis thaliana. 3 Biotech. 14, 11. doi:10.1007/s13205-023-03852-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Sekine, K. T., Kawakami, S., Hase, S., Kubota, M., Ichinose, Y., Shah, J., et al. (2008). High level expression of a virus resistance gene, RCY1, confers extreme resistance to Cucumber mosaic virus in Arabidopsis thaliana. Mol. Plant-Microbe Interact. 21 (11), 1398–1407. doi:10.1094/MPMI-21-11-1398

PubMed Abstract | CrossRef Full Text | Google Scholar

Shao, Z. Q., Xue, J. Y., Wang, Q., Wang, B., and Chen, J. Q. (2019). Revisiting the origin of plant NBS-LRR genes. Trends Plant Sci. 24 (1), 9–12. doi:10.1016/j.tplants.2018.10.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Shao, Z. Q., Xue, J. Y., Wu, P., Zhang, Y. M., Wu, Y., Hang, Y. Y., et al. (2016). Large-scale analyses of angiosperm nucleotide-binding site-leucine-rich repeat genes reveal three anciently diverged classes with distinct evolutionary patterns. Plant Physiol. 170 (4), 2095–2109. doi:10.1104/pp.15.01487

PubMed Abstract | CrossRef Full Text | Google Scholar

Szklarczyk, D., Gable, A. L., Nastou, K. C., Lyon, D., Kirsch, R., Pyysalo, S., et al. (2021). The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49 (18), D605–D612. doi:10.1093/nar/gkaa1074

PubMed Abstract | CrossRef Full Text | Google Scholar

Takahashi, H., Miller, J., Nozaki, Y., Sukamto, M., Shah, J., Hase, S., et al. (2002). RCY1, an Arabidopsis thaliana RPP8/HRT family resistance gene, conferring resistance to cucumber mosaic virus requires salicylic acid, ethylene and a novel signal transduction mechanism. Plant J. 32 (5), 655–667. doi:10.1046/j.1365-313X.2002.01453.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Törönen, P., Medlar, A., and Holm, L. (2018). PANNZER2: a rapid functional annotation web server. Nucleic Acids Res. 46, W84. doi:10.1093/nar/gky350

PubMed Abstract | CrossRef Full Text | Google Scholar

ul Qamar, M. T., Sadaqat, M., Zhu, X.-T., Li, H., Huang, X., Fatima, K., et al. (2023). Comparative genomics profiling revealed multi-stress responsive roles of the CC-NBS-LRR genes in three mango cultivars. Front. Plant Sci. 14, 1285547 doi:10.3389/fpls.2023.1285547

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Yang, C., Wu, X., Wang, Y., Wang, B., Wu, X., et al. (2022). Genome-wide characterization of NBS-LRR family genes and expression analysis under powdery mildew stress in Lagenaria siceraria. Physiological Mol. Plant Pathology 118, 101798. doi:10.1016/j.pmpp.2022.101798

CrossRef Full Text | Google Scholar

Wei, H., Liu, J., Guo, Q., Pan, L., Chai, S., Cheng, Y., et al. (2020). Genomic organization and comparative phylogenic analysis of NBS-LRR resistance gene family in Solanum pimpinellifolium and Arabidopsis thaliana. Evol. Bioinforma. 16, 1176934320911055. doi:10.1177/1176934320911055

PubMed Abstract | CrossRef Full Text | Google Scholar

Wen, G. (2017). A simple process of RNA-sequence analyses by Hisat2, Htseq and DESeq2. ACM Int. Conf. Proceeding Ser. Part F1319, 11–15. doi:10.1145/3143344.3143354

CrossRef Full Text | Google Scholar

Wilgenbusch, J. C., and Swofford, D. (2003). Inferring evolutionary trees with PAUP. Curr. Protoc. Bioinforma. 00 (1), 28. doi:10.1002/0471250953.bi0604s00

PubMed Abstract | CrossRef Full Text | Google Scholar

Woldesemayat, A. A., Modise, D. M., Gemeildien, J., Ndimba, B. K., and Christoffels, A. (2018). Cross-species multiple environmental stress responses: an integrated approach to identify candidate genes for multiple stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and related model species. PLoS ONE 13 (3), e0192678. doi:10.1371/journal.pone.0192678

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, X., Chen, Y., Li, B., Zhang, Z., Qin, G., Chen, T., et al. (2022). Molecular mechanisms underlying multi-level defense responses of horticultural crops to fungal pathogens. Horticulture Research 9. doi:10.1093/hr/uhac066

PubMed Abstract | CrossRef Full Text | Google Scholar

Yaghobi, M., and Heidari, P. (2023). Genome-Wide analysis of aquaporin gene family in Triticum turgidum and its expression profile in response to salt stress. Genes 14 (1), 202. doi:10.3390/genes14010202

PubMed Abstract | CrossRef Full Text | Google Scholar

Yin, T., Han, P., Xi, D., Yu, W., Zhu, L., Du, C., et al. (2023). Genome-wide identification, characterization, and expression profile of NBS-LRR gene family in sweet orange (Citrus sinensis). Gene 854, 147117. doi:10.1016/j.gene.2022.147117

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan, S., Chan, H. C. S., and Hu, Z. (2017). Using PyMOL as a platform for computational drug design. Wiley Interdiscip. Rev. Comput. Mol. Sci. 7 (2), 1–10. doi:10.1002/wcms.1298

CrossRef Full Text | Google Scholar

Zaman, F., Zhang, M., Wu, R., Zhang, Q., Luo, Z., and Yang, S. (2023). Recent research advances of small regulatory RNA in fruit crops. Horticulturae 9 (3), 294. doi:10.3390/horticulturae9030294

CrossRef Full Text | Google Scholar

Zameer, R., Fatima, K., Azeem, F., Algwaiz, H. I. M., Sadaqat, M., Rasheed, A., et al. (2022). Genome-Wide characterization of superoxide dismutase (SOD) genes in daucus carota: novel insights into structure, expression, and binding interaction with hydrogen peroxide (H2O2) under abiotic stress condition. Front. Plant Sci. 13, 870241–870315. doi:10.3389/fpls.2022.870241

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, R., Zhang, C., Yu, C., Dong, J., and Hu, J. (2022a). Integration of multi-omics technologies for crop improvement: status and prospects. Front. Bioinforma. 2, 1027457–1027459. doi:10.3389/fbinf.2022.1027457

CrossRef Full Text | Google Scholar

Zhang, R., Zheng, F., Wei, S., Zhang, S., Li, G., Cao, P., et al. (2019). Evolution of disease defense genes and their regulators in plants. Int. J. Mol. Sci. 20 (2), 335–425. doi:10.3390/ijms20020335

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, W., Yuan, Q., Wu, Y., Zhang, J., and Nie, J. (2022b). Genome-Wide identification and characterization of the CC-NBS-LRR gene family in cucumber (Cucumis sativus L.). Int. J. Mol. Sci. 23 (9), 5048–5121. doi:10.3390/ijms23095048

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Wang, F., Yang, N., Chen, N., Hu, Y., Peng, X., et al. (2023). Bioinformatics analysis and function prediction of NBS-LRR gene family in Broussonetia papyrifera. Biotechnol. Lett. 45 (1), 13–31. doi:10.1007/s10529-022-03318-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y. M., Chen, M., Sun, L., Wang, Y., Yin, J., Liu, J., et al. (2020). Genome-Wide identification and evolutionary analysis of NBS-LRR genes from Dioscorea rotundata. Front. Genet. 11, 484–511. doi:10.3389/fgene.2020.00484

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, T., Wang, Y., Chen, J. Q., Araki, H., Jing, Z., Jiang, K., et al. (2004). Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes. Mol. Genet. Genomics 271 (4), 402–415. doi:10.1007/s00438-004-0990-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Zia, K., Rao, M. J., Sadaqat, M., Azeem, F., Fatima, K., ul Qamar, M. T., et al. (2022). Pangenome-wide analysis of cyclic nucleotide-gated channel (CNGC) gene family in citrus Spp. Revealed their intraspecies diversity and potential roles in abiotic stress tolerance. Front. Genet. 13, 1034921. doi:10.3389/fgene.2022.1034921

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: passion fruit, CNL, pathogen resistance, gene ontology, expression profiling, machine learning

Citation: Zia K, Sadaqat M, Ding B, Fatima K, Albekairi NA, Alshammari A and Tahir ul Qamar M (2024) Comparative genomics and bioinformatics approaches revealed the role of CC-NBS-LRR genes under multiple stresses in passion fruit. Front. Genet. 15:1358134. doi: 10.3389/fgene.2024.1358134

Received: 19 December 2023; Accepted: 16 February 2024;
Published: 26 February 2024.

Edited by:

Sajid Shokat, International Atomic Energy Agency, Austria

Reviewed by:

Vikender Kaur, Indian Council of Agricultural Research (ICAR), India
Parviz Heidari, Shahrood University of Technology, Iran

Copyright © 2024 Zia, Sadaqat, Ding, Fatima, Albekairi, Alshammari and Tahir ul Qamar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Muhammad Tahir ul Qamar, dGFoaXJ1bHFhbWFyQGdjdWYuZWR1LnBr

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.