- 1Plant Biotechnology Division, Unit of Central Coffee Research Institute, Coffee Board, Mysore, Karnataka, India
- 2Department of Horticulture, Faculty of Agriculture, Erzurum, Turkey
- 3Food Engineering Department, Faculty of Food Science and Technology, University of Agricultural Sciences and Veterinary Medicine, Cluj-Napoca, Romania
- 4Institute of Life Sciences, Faculty of Food Science and Technology, University of Agricultural Sciences and Veterinary Medicine, Cluj-Napoca, Romania
Coffee is a high value agricultural commodity grown in about 80 countries. Sustainable coffee cultivation is hampered by multiple biotic and abiotic stress conditions predominantly driven by climate change. The NAC proteins are plants specific transcription factors associated with various physiological functions in plants which include cell division, secondary wall formation, formation of shoot apical meristem, leaf senescence, flowering embryo and seed development. Besides, they are also involved in biotic and abiotic stress regulation. Due to their ubiquitous influence, studies on NAC transcription factors have gained momentum in different crop plant species. In the present study, NAC25 like transcription factor was isolated and characterized from two cultivated coffee species, Coffea arabica and Coffea canephora and five Indian wild coffee species for the first time. The full-length NAC25 gene varied from 2,456 bp in Coffea jenkinsii to 2,493 bp in C. arabica. In all the seven coffee species, sequencing of the NAC25 gene revealed 3 exons and 2 introns. The NAC25 gene is characterized by a highly conserved 377 bp NAM domain (N-terminus) and a highly variable C terminus region. The sequence analysis revealed an average of one SNP per every 40.92 bp in the coding region and 37.7 bp in the intronic region. Further, the non-synonymous SNPs are 8-11 fold higher compared to synonymous SNPs in the non-coding and coding region of the NAC25 gene, respectively. The expression of NAC25 gene was studied in six different tissue types in C. canephora and higher expression levels were observed in leaf and flower tissues. Further, the relative expression of NAC25 in comparison with the GAPDH gene revealed four folds and eight folds increase in expression levels in green fruit and ripen fruit, respectively. The evolutionary relationship revealed the independent evolution of the NAC25 gene in coffee.
Introduction
The genus Coffea (Family – Rubiaceae) consists of more than 125 species of which only two species Coffea arabica and Coffea canephora are commercially cultivated (Davis, 2011; Mishra and Slater, 2012). Most of the coffee species are diploid (2n = 2x = 22) and self-sterile except C. arabica, which is an allotetraploid (2n = 4x = 44) and self-fertile species originated by natural hybridization involving C. canephora and C. eugenioides (Lashermes et al., 1999; Pearl et al., 2004; Mishra et al., 2012). Most of the wild coffee species are native to the highlands of south-western Ethiopia to tropical African rainforest covering diverse ecological habitats of West Africa through Cameroon, Central African Republic, Democratic Republic of Congo, Uganda, Tanzania, Indian Ocean Islands, Madagascar, Comoros Islands, and the Mascarene Islands and Australasia (Davis et al., 2019; Mishra, 2019). In India, several indigenous diploid wild coffee species are discovered which include C. bengalensis, C. jenkinsii, C. wightiana, C. khasiana and C. travancorensis (Narasimhaswamy and Vishweshwara, 1963; Mishra et al., 2020; Jingade et al., 2021). Both C. jenkinsii and C. khasiana are distributed in Khasi and Jaintia Hills of the North-Eastern Himalayas and C. bengalensis is found growing wild in Cooch Behar and Lataguri forest of West Bengal and extended up to eastern Assam. Similarly, C. wightiana and C. travancorensis were wildly distributed in the forest of Travancore (Kerala) and Nilgiri hills of Tamil Nadu, India (Jingade et al., 2021). In fact, many wild crop species often offer a greater variety of traits which could form an important link between ecological adaption and the evolutionary process (Dulloo and Owadally, 1991). In this context, wild coffee species could serve as an important reservoir for the transfer and incorporate genes to breed new cultivars suitable for changed climatic conditions (Volk and Krishnan, 2020). Although plants are constantly challenged by a range of environmental stresses, they mitigate these effects by producing stress-inducing phytochemicals which involve a cascade of molecular and biochemical events. Extensive studies have underpinned that transcriptional factors (TFs) are involved in plant adaptation by acting as mediators by perceiving stress signals and directing downstream gene expression. Moreover, on an around, 7% of the coding part of the plant transcriptomes are contributed by the TFs (Singh et al., 2021). It is, therefore important to study the role of various TFs which will facilitate molecular breeding and genetic improvement of crop plants.
Among various TFs present in plants, NAC TFs constitute one of the largest groups of plant-specific transcription factors and play a very crucial role in many vital physiological and developmental processes of plants. It regulates the target genes by binding to NAC recognition sequence (NACRS) and thereby affecting different phenotypic characteristics such as lateral root development (Guo et al., 2005; Singh et al., 2021), formation and development of vegetative and floral organs (Mallory et al., 2004) and flower development (Zong et al., 2020). The NAC TFs binds to DNA or other protein kinases and regulate various physiological processes which involve secondary wall formation (Zhao et al., 2014; Hurtado et al., 2020), bud differentiation (Chen et al., 2021), embryo development (Liu et al., 2018), shoot branching (Ni et al., 2017) and fruit ripening (Martín-Pizarro et al., 2021). Among different types of NAC genes known so far, NAC25 gene play a crucial role in plants response to abiotic stress such as drought, heat, salinity and cold tolerance (Nakashima et al., 2007; Peng et al., 2009; Mao et al., 2012; Tweneboah and Oh, 2017; Han et al., 2020). It also regulates gibberellin-mediated endosperm expansion, and seed germination in plants (Sánchez-Montesino et al., 2019). In coffee, NAC TFs are known to be involved in bean development and sucrose metabolism (Dong et al., 2019).
The NAC super family protein has three distinguishable families of TFs which constitutes No Apical Meristem (NAM), Arabidopsis Transcription Activation Factors (ATAF) and the Cup-Shaped-Cotyledon families TFs (Olsen et al., 2005). The members of the NAC gene family in plants increased during evolution due to gene duplication events (Dong et al., 2019). Different NAC transcriptional factor (TF) families have been reported from different crops. About 117 different NAC TF families from Arabidopsis (Nuruzzaman et al., 2010), 151 from rice (Ooka et al., 2003), 163 from poplar (Hu et al., 2010), 79 from grape (Wang et al., 2013) and 63 from robusta coffee (Dong et al., 2019) have been identified so far. Although 63 NAC TFs are reported from C. canephora, the extent of molecular and sequence diversity of NAC TF across wild coffee species is unfortunately not available. Despite diploid coffee species constitute an important source for genetic variation to counter specific pests and diseases, alleviate abiotic stress, breed low caffeine coffee, and enhance organoleptic quality, there is scanty information available on the NAC gene in coffee species. The present study was therefore undertaken to explore the variability of the NAC gene among cultivated and Indian wild coffee species. Further, an attempt was also made to carry out expression analyses of the NAC25 gene in different tissues of coffee.
Materials and methods
Plant material
Two commercial cultivated coffee species, viz. C. arabica (S.795) and C. canephora (S.274) and five wild indigenous coffee species viz., C. travancorensis, C. bengalensis, C. wightiana, C. jenkinsii and C. khasiana belonging to different Indian geographic locations were selected in the present study. These coffee species are maintained ex situ at Plant tissue culture and Biotechnology Division, Coffee Research Institute, Mysore, Karnataka, India.
Deoxyribonucleic acid extraction
The young growing leaf tips of all the seven coffee species were collected and total genomic DNA was isolated using the modified CTAB protocol described by Mishra et al. (2020). The DNA quality was tested by separating it on 0.8% agarose gel stained with ethidium bromide (0.5 μg/ml) and quantified using Nanodrop (Eppendorf). The DNA samples were diluted to a working concentration of 10 ng/μl and stored at −20°C for future uses.
Ribonucleic acid extraction
The total RNA was extracted from the six different tissues viz., root, young leaf, immature flower bud, flower, green fruit and ripen fruit of C. canephora (S.274) using the potassium acetate method described by Huded et al. (2018). The total RNA was quantified using Nanodrop (Thermo Fisher) at 260/280 nm and was reverse transcribed into cDNA using a high-capacity RNA to cDNA conversion kit (Applied Biosystem, USA). Further, the first-strand cDNA synthesis was carried out in 20 μl reaction volume containing 10 μl of 2 × RT buffer, 1.0 μl of 2 × enzyme mixes, 4.0 μl of RNA sample and 5.0 μl of nuclease-free water. Reverse transcription was performed using a thermal cycler (BioRad) at 37°C for 60 min and 95°C for 5 min. and the cDNA was stored at −80°C for future use.
Primer designing and polymerase chain reaction amplification
The NAC25 like transcript gene sequence of Coffea eugenioides (XM_027296760.1; XM_027296758.1 and XM_027296751.1) available in the NCBI GenBank database1, was downloaded and primer pairs were designed using Primer3Plus program2. The designed primer pairs were checked byPrimer-BLAST tool in NCBI database3. The PCR amplification was carried out with genomic DNA samples of seven coffee species in a reaction volume of 15 μl using Bio-Rad Thermal cycler S1000. PCR reaction mixtures contained 3.0 μl of DNA (10 ng/μl) (cDNA samples as a template for characterization of transcript), 3.0 μl each of 3 μM primer (Forward primer – 5′-CGCAAATAGAGGCCTCAGCC-3′ and Reverse primer – 5′-CGCATGGCTCCCAAGATTCT-3), 1.5 μl of 2 mM dNTPs, 1.5 μl of 10X Taq buffer, 1.5 μl of 25 mM MgCl2 (Thermo Fisher Scientific, Waltham, USA) and 0.3 μl of 3 units/μl Taq DNA polymerase enzyme (GeNei, Bangalore, India). The reaction volume was made up to 15 μl using sterile distilled water. Standard PCR cycling parameters were used which includes an initial denaturation step of 5 min at 94°C, followed by 30 cycles of 94°C for 30 s, Primer annealing at 62°C for 1 min, primer extension at 72°C for 2 min and final extension of 10 min at 72°C. The amplified PCR products were mixed with 5 μl Bromophenol blue dye (99.5% deionized formamide, 10 mM EDTA pH 8, 0.05% Bromophenol blue, xylene-cyanol dye solution, 1 μl pure sterile water) and separated on 1.5% agarose gel (SeaKem, Rockland USA) containing 0.5 μg ethidium bromide/ml in 1x TBE (Tris-HCl, Boric acid, EDTA) buffer. After electrophoresis, the gels were visualized and documented using Gel Doc System (BioRad) with Multi Analyst software program.
Cloning and sequencing
The purified PCR products of NAC gene ∼2.4kb were ligated into pGEM-T easy cloning vector as per manufacturer’s instructions (pGEM-T clone kit, Promega) and transformed into Escherchia coli DH5α competent cells using the heat shock method. Successful transformants were selected using blue/white screening and colony PCR. The colonies tested positive for PCR were cultured and the recombinant plasmids were extracted from the culture using the alkaline lysis method as described by Sambrook and Russell (2001). The isolated plasmids were sequenced by the commercial sequencing facility at Eurofins Genomics India Pvt. Ltd., Bangalore, Karnataka, India.
Sequence analysis and construction of phylogenetic tree
The full-length NAC gene sequences of seven coffee species obtained by sequencing were BLASTn searched against the NCBI database (see text footnote 1). Based on the sequence homology studies, the intronic and exonic regions were identified from the full-length NAC gene sequence. Further, by joining the exonic regions full-length transcript sequences were derived and translated to protein sequence using the EMBOSS-Transeq tool of EMBL-EBI. Successively, the homologies of these protein sequences were analyzed using the BLASTp tool of the NCBI database. Further, to access the extent of variation in NAC25 gene sequences of different coffee species, multiple sequence alignment was performed using the MUSCLE program of MEGA X version 10.2.4 software with the default parameter (Kumar et al., 2018). The phylogenetic tree was constructed using full-length NAC25 gene sequences, by following the maximum likelihood (ML) method, based on the Tamura-Nei model (Tamura and Nei, 1993) with 1,000 bootstrap replications in MEGA X Version 10.2.4 Software developed by Penn State University, USA. Consequently, the protein sequences were analyzed and a phylogenetic tree was constructed using the maximum likelihood (ML) method based on the Jones-Taylor-Thornton (JTT) model (Jones et al., 1992) with 1,000 bootstrap replications in MEGA X Version 10.2.4.
Moreover, to annotate the NAC genes obtained from seven coffee species, a phylogenetic tree was constructed using these protein sequences along with a total of 73 protein sequences belonging to 37 different NAC genes of C. eugenioides retrieved from the NCBI database and 63 protein sequences of different NAC genes identified by Dong et al. (2019). Further, to ascertain the evolutionary relationship of NAC25 genes, a global phylogenetic analysis was carried out using the 147 NAC25 like protein sequences belonging to 72 different crop species retrieved from the NCBI protein database. Both the phylogenetic tree was constructed using the maximum likelihood (ML) method based on the Jones-Taylor-Thornton (JTT) model (Jones et al., 1992) with 1,000 bootstrap replications in MEGA X Version 10.2.4.
Sequence characterization
In the present study, the full length NAC gene was isolated from C. canephora (S.274) which is a diploid cultivated species with commercial importance and the same was considered as the reference sequence. The variations in the NAC gene sequence among six coffee species were analyzed and compared with C. canephora sequence. The sequence variations were studied using the multiple sequence alignment (MSA) tool of BioEdit software version 7.2.5 (Hall, 1999). Also, using multiple sequence alignment, Single Nucleotide Polymorphism (SNP), Insertion-Deletions (InDels) were manually identified. Further, the species-specific InDels sequences were identified for six coffee species by considering C. canephora (S.274) as a reference sequence. Further, the nucleotide base composition of the NAC gene from all the seven coffee species was determined using the BioEdit software version 7.2.5 (Hall, 1999). In addition, the NAC protein sequences from all the coffee species were analyzed and the amino acid content along with the theoretical isoelectric point (pI) was determined using the ExPASy-ProtParam online tool (Gasteiger et al., 2005). The hydrophilic coefficient was determined using ExPASy-ProtScale online tool (Kyte and Doolittle, 1982). Moreover, the conserved NAM domain was identified from full-length NAC protein sequences obtained from seven coffee species using conserved domain database (CDD) version3.18-55570 PSSMs of NCBI4 with default parameters. In full- length NAC25 gene (2.4kb) of all seven coffee species, exons and introns were marked using Gene Structure Display Server, version 2.0 (GSDS2.0) (Hu et al., 2015) and the NAM conserved domain was marked manually for easy distinction.
Expression analysis
To investigate the site of expression of the NAC gene, six different tissues viz., root, leaf, immature flower bud, flower, green fruit and ripen fruit of C. canephora (S.274) were selected for RT-PCR assay. The RT-PCR assay was performed using the Bio-Rad CFX-96 real-time system. The cDNA samples of six tissues were quantified using Nanodrop (Thermo Fisher) and diluted to uniform concentration for expression studies. Two replicates of each tissue in the PCR reaction mixtures were confined by the 1.2 μl of cDNA samples as a template, 1.5 μl each of 3 μM forward primer and reverse primer and 10.0 μl of SYBER green master mix (Thermo Fischer). The reaction volume was made up to 20 μl using sterile distilled water. The forward (5′-AAAGCTCGGCAAAGGAATGA-3′) and reverse primer (5′-CCCACASCTCACCATCAATGC-3′) pair amplifying 425bp from the exonic region of the highly variable C-terminal of NAC25 gene was designed and used for expression studies. The GAPDH specific gene (Forward primer –5′-ATTGTTGAGGGCCTTATGACCACT-3′ and Reverse primer – 5′-TGCCCGCAGCCTCCTCCTTA-3′) was used and the expression data was analyzed using 2–Δ Δ Ct method. The GAPDH gene was used as a reference gene.
Results and discussion
Polymerase chain reaction Amplification, cloning and sequencing
The full-length NAC gene of 2.4 kb was amplified from the genomic DNA samples of seven coffee species using NAC gene-specific primer (Figure 1). In each species, the 2.4 kb single amplicon was cloned using the pGEM-T easy cloning vector cloning kit and sequenced. The sequencing of the NAC gene amplicons yielded a minimum of 2.4 kb sequence in all the seven coffee species (Table 1). The highest NAC gene sequence of 2,493 bp was found in C. arabica (S.795) whereas the lowest sequence size of 2,456 bp was obtained in C. jenkinsii (Table 1).
Figure 1. Gel pictures showing the amplification of full length NAC gene from seven coffee species. Lane L:1kb plus GeneRuler from Thermo Fischer Scientific, Lane 1: C. canephora (S.274), Lane 2: C. arabica (S.795), Lane 3: C. travancorensis, Lane 4: C. bengalensis Lane 5: C. wightiana Lane 6: C. jenkinsii Lane 7: C. khasiana.
Table 1. Table depicting the sequence size and nucleotide base contents of the NAC25 gene isolated from Coffea species.
Sequence similarity and phylogeny based gene annotation
The sequence homology of the seven full-length NAC gene sequences was verified using BLASTn tool of the NCBI database. The BLAST analysis resulted in 99.06 to 100% sequence similarity with NAC25 like transcript sequence of C. eugenioides (XM_027296760.1, XM_027296758.1 and XM_027296751.1) with query coverage of 46 to 53%. The lower percentage of query coverage was due to the absence of full- length NAC gene (exon + intron) sequences in the NCBI database. However, the higher sequence similarity can be attributed to the presence of the NAC25 transcript sequences in the NCBI database.
Based on homology studies, the NAC25 transcript sequences of seven coffee species were derived from their respective full-length NAC25 sequences by delimiting the intronic regions. These transcript sequences were converted to protein sequences and the amino acid sequence homology was studied. These homology studies revealed the highest sequence homology of 84.80 to 91.16% with C. eugenioides NAC25 (XP_027152552.1) with 99% query coverage. Further, the phylogenetic trees were constructed using both full- length nucleotide and the protein sequences of seven coffee species (Figure 2). The clustering pattern of the seven coffee species in both the phylogenetic trees was largely similar. The seven coffee species are divided into two main clusters (Figures 2A,B). C. arabica (S.795) and C. canephora (S.274) were clustered together and closely placed along with C. khasiana, whereas, C. bengalensis and C. wightiana were closely placed with C. travancorensis in the phylogenetic tree constructed using full length and protein sequences. However, the C. jenkinsii, which was placed close to C. khasiana in the phylogenetic tree constructed using full-length NAC25 gene sequence, formed its own cluster in the tree constructed using protein sequences (Figure 2B). The change in the clustering pattern of C. jenkinsii could be attributed to the variations in the NAC25 gene sequence at the transcript level. Since, NAC TFs are associated with stress responses in plants, the evolutionary changes occurred in NAC gene sequence of C. jenkinsii could be attributed to the species adaptive mechanism in the course of evolution. The C. jenkinsii transcript sequence had 17 (1.22%) non-synonymous single nucleotide polymorphisms (nsSNP) coupled with 9bp deletions while C. khasiana transcript sequence had 57 (4.08%) nsSNPs with no deletions in comparison to the reference sequence.
Figure 2. Phylogenetic tree constructed usingFull length nucleotide sequence by Maximum Likelihood approach using Tamura-Nei model (A) and Protein sequence by Maximum Likelihood approach Jones-Taylor-Thornton (JTT) model (B).
The phylogenetic tree constructed, to annotate the seven NAC genes isolated in the present study, classified the 151 (including our seven NAC genes) different types of NAC genes into two major clusters (Figure 3). The first major cluster is composed of 33 NAC genes of different types of which 20 belong to C. eugenioides and 13 from C. canephora (Figure 3). The second major cluster was subdivided into two minor clusters and each minor cluster is further subdivided into two sub minor clusters. The NAC genes isolated from seven coffee species were closely grouped together with three NAC25 like genes of C. eugenioides in one of the sub minor clusters (Figure 3). Further, the NAC10 of C. canephora were also placed in the same sub minor cluster. However, the NAC10 gene of C. canephora had a short peptide length and only one intron unlike the NAC genes isolated in the present study which had an average peptide length of 463 amino acids and two introns. The NAC25 like genes of C. eugenioides had the peptide length of 432 amino acids and shared 99% sequence homology with the NAC gene isolated from different coffee species in the present study. Hence, all the seven NAC genes isolated from different coffee species were characterized as NAC25 genes further, these sequences were submitted to the BankIt portal of NCBI and accession numbers were obtained (Table 1).
Figure 3. Phylogenetic tree constructed by using 81 different classes of NAC protein sequences of C. eugenioides (orange squares), 63 different classes of NAC protein sequences of C. canephora (black circle) and the NAC protein sequences isolated in the present study (green square) using Maximum Likelihood approach JTT-model.
The evolutionary relationship of NAC25 genes isolated in the present study was further elucidated by constructing a separate phylogenetic tree (Figure 4) comprising 147 NAC25 protein sequences belonging to 72 different crop species (Supplementary Table 1). The phylogenetic tree revealed a close relationship between the NAC25 sequences, obtained from seven coffee species with the NAC25 sequence of C. eugenioides. Further, the NAC25 like genes isolated from different Coffea species were closely clustered with NAC25 likes genes of two Solanum species such as Nicotiana attenuata and Solanum tuberosum. The close clustering of Solanum NAC25 like genes with NAC25 like genes of Coffea is concurrent with an earlier study made by Lin et al. (2005) which suggested that compared to Arabidopsis, coffee is closely related to Solanaceae and tomato a member of Solanaceae had a nearly perfect gene-for-gene match with coffee and could have similar functions. Further, coffee and tomato have a similar genome size, chromosome karyotype (tomato, n = 12; coffee n = 11), and chromosome architecture and both belong to the Asterid I clade of dicot plant families Lin et al. (2005).
Figure 4. Phylogenetic tree constructed by using 147 NAC25 protein sequences belonging to 72 different crop species along with NAC25 protein sequences of C. eugenioides (orange squares) and NAC protein sequences isolated in the present study (green square) by Maximum Likelihood approach Jones-Taylor-Thornton (JTT) model.
Sequence characterization
The full-length sequence of the NAC25 gene, isolated from the seven coffee species ranged from 2,456 bp (C. jenkinsii) to 2,493 bp (C. arabica) with an average length of ∼2,477 bp (Table 1). Considerable variation in the NAC25 gene sequence length was reported from different crops plant species. In Arabidopsis thaliana, the length of the NAC25 gene (gene id: AT1G61110.0) was 1,619 bp in size (Alvarado and Terry, 2005), whereas, the length of the NAC25 gene isolated from Malus baccata was 1,122 bp (Han et al., 2020) and the gene isolated from Malus domestica (Li et al., 2020) was 1,335 bp. The average length of the NAC25 gene isolated from coffee species was higher when compared to the NAC25 genes isolated from different crop species. Further, characterization of the NAC25 gene revealed that the NAC25 genes isolated from the seven coffee species were composed of three exons spliced by two introns. Our finding is in concordance with earlier studies by Alvarado and Terry (2005) on the characterization of the NAC25 gene in A. thaliana and Dong et al. (2019) on the NAC25 gene in C. canephora. The average exon size of ∼1,392 bp and average intron size of ∼1,073 bp were obtained from the NAC25 gene of seven coffee species (Table 1).
In the present study, the peptide length of the NAC25 gene was ranged from 461 (C. travancorensis, C. bengalensis and C. jenkinsii) to 466 (C. wightiana) with an average of 463 amino acids and the hydrophilic coefficient was −0.490 (Table 2). Interestingly, the peptide length of NAC25 isolated from cultivars C. arabica and C. canephora was 464 amino acids. The peptide length of NAC25 gene isolated from C. canephora in the present study was higher than the length of CocNAC25 gene reported by Dong et al. (2019). This disparity in lengths could be attributed to the classification and nomenclature of NAC genes adopted by Dong et al. (2019). Dong et al. (2019) categorized the NAC genes based on their positions on chromosomes unlike, the NAC genes identified in the present study, which were classified based on sequence homology studies with the annotated NAC25 gene sequences available in the public database. However, the CocNAC36 and CocNAC43 genes identified by Dong et al. (2019) had peptide lengths of 400 and 455 amino acids, respectively. These lengths correspond to the peptide length of NAC25 genes isolated in the present study. Further, the variations in peptide length among the NAC25 genes isolated from different crops were observed. The peptide length of ANAC25 isolated from A. thaliana was 323 amino acids (Alvarado and Terry, 2005), the length of FtNAC25 isolated from Fagopyrumtataricum was 436 amino acids (Liu et al., 2019) and the length of MbNAC25 isolated from Malus baccata was 373 amino acids (Han et al., 2020). Further, the peptide sequence of NAC25 protein revealed the presence of amino acids such as aspartic acid, glycine, leucine, proline, serine and asparagine in larger numbers, unlike methionine, tryptophan and cysteine which were found in relatively lower numbers (Table 2). The earlier study by Han et al. (2020) also reported that the presence of leucine, lysine, proline, serine and threonine in higher numbers.
Also, in the present study, the NAC25 protein had the molecular weight ranged from 51.46 KDa (C. travancorensis) to 52.35KDa (C. wightiana) with an average of 51.78KDa and the theoretical isoelectric point (pI) ranged from 5.60 (C. khasiana) to 6.13 (C. wightiana) with an average of 5.86 (Table 2). In a previous study, the molecular weight of NAC25 genes isolated from different crops such as A. thaliana (Alvarado and Terry, 2005), C. canephora (Dong et al., 2019), F. tataricum (Liu et al., 2019), M. baccata (Han et al., 2020) and M. domestica (Li et al., 2020) were 36.31KDa, 34.27KDa, 48.62KDa, 41.49KDa and 43.189 KDa, respectively and their respective theoretical isoelectric point was 8.1, 8.9, 4.75, 8.7, and 7.02.
Conserved domain
The NAC gene sequences isolated from seven different coffee species had the typical characteristics of the NAC gene family. Proteins of the NAC family have the characteristic NAC domain of about 150 amino acids at the N-terminus region (NAM domain) and area highly conserved region (Mohanta et al., 2020). In contrast, the large C-terminal regions of NAC proteins are highly divergent and act as a transcriptional activator or repressor and therefore known as functional domain (Tran et al., 2004; Hu et al., 2006; Kim et al., 2007; Bian et al., 2021). The N-terminus of all the NAC25 genes had the presence of a highly conserved domain (NAM domain) and the variable C-terminus region carrying transcriptional regulatory sites. In the present study, the length of the conserved NAM domain was uniform in all the species with 377bp conserved sequence length (125 amino acids) (Figure 5). However, variation in the position of conserved domain across the species was observed (Figure 5A). The analysis of the conserved domain using the online Hidden Markov Model (HMM) program showed the best and maximum hit with the NAM domain of pfam02365 (PFAM ID: 02365). The analysis of the peptide sequence of NAM domain from all the seven coffee species revealed a higher content of glycine followed by lysine, proline and arginine. This NAM domain plays an important role in maintaining the structure and function of NAC proteins. Hence, the conserved nature of NAM domain was confirmed through multiple sequence alignment using NAC25 protein sequences of 10 coffee species and 12 different crop species (Figure 5B). The analysis demonstrated the evolutionarily conserved nature of the NAM domain across the species. Unlike the NAM domain, the highly variable region at the upstream region is known as the C-terminal region which is also said to be the function domain of NAC proteins.
Figure 5. (A) Figure depicting the exon, intron and conserved domain of full length NAC25 gene isolated from seven coffee species. (B) Comparison of NAM conserved domain of NAC25 gene with other crop species.
Sequence variation studies
Single nucleotide polymorphism/polymorphic sites
To study the sequence variations in NAC25 genes isolated from seven coffee species, the full-length NAC25 gene sequences of individual species were aligned with the reference sequence (C. canephora) and the SNPs were identified. The analysis revealed that C. travancorensis had the highest number of SNPs (142) unlike C. arabica which had the lowest number of SNPs (6) (Table 3 and Supplementary Table 1). Further, the total number of SNPs in exonic and intronic regions of the NAC25 gene isolated from each species was computed and presented in Table 3. It was observed that the frequency of detected SNPs ranged from 0.24% (C. arabica) to 5.75% (C. travancorensis). The sequencing of the NAC25 gene also revealed that an average of one SNP per every 40.92bp in the coding region and 37.7bp in the intronic region with an average of one SNP per 39bp of the entire gene length. Further, the total SNPs in exonic and intronic were classified into synonymous (sSNPs) and non-synonymous SNPs (nsSNPs) and it was observed that the nsSNPs are 8-11 fold higher compared to sSNPs in the non-coding and coding region of the NAC25 gene, respectively.
Table 3. Table depicting the sequence polymorphism and diversity of the NAC25 gene with reference to Coffea canephora (S.274).
In this study, the level of polymorphism created by the frequency of SNP is comparable to that reported by Kolkman et al. (2007) who reported an average of one SNP every 45.7 bp in sunflower but higher than what was previously reported for grapevine by Riahi et al. (2013) which recorded an average of one SNP per every 63 bp in the NAC gene sequence. The frequency of SNP in gene sequence varies considerably in different species. For example, in maize, there is one SNP for every 100 bp (Rafalski, 2002) whereas an average of 1 SNP per every 77 bp was observed for cotton (An et al., 2008) and one SNP per 273 bp was detected in soybean (Zhu et al., 2003). In an earlier study, Mishra et al. (2011) reported an average of one SNP for every 47 bp in the EST sequence of various coffee species and the present findings are comparable with the earlier results. In the present study, the frequency of SNP in the non-coding region is slightly higher than the coding region which is in agreement with the earlier studies showing that the mutations carried out with a higher frequency in non-coding regions than in coding regions (Kolkman et al., 2007; An et al., 2008). The presence of an eightfold higher frequency of nsSNPs over sSNPs in the NAC25 gene resulted in the change of the encoded amino acid in various coffee species, although no drastic change in the amino acid coding was recorded between two cultivated species. This could be important because SNPs play a significant role in the evolution of candidate genes coding for functional traits in plants. Indeed, most of the Indian coffee species have low/trace caffeine in their fruits compared to the cultivated varieties and therefore further work is needed to establish whether NAC25 gene has any role in controlling the caffeine metabolism.
Insertions and deletions
Insertions and deletions (InDels) are the results of one or more mutation processes, including DNA impairing (Levinson and Gutman, 1987), transposition (Bennett, 2004) crossover (Wang et al., 2018) and/or slippage (Moghaddam et al., 2014). Considering the importance, the InDels in the NAC25 gene sequence of each species were analyzed by aligning the individual gene sequence with the NAC25 sequence of C. canephora (S.274). The comparative analysis revealed that, C. khasiana had the lowest number of InDels and C. wightiana had the highest number of InDels (Table 3). The highest number of insertions, as well as the highest number of deletions, was observed in C. wightiana (51 insertions and 29 deletions) and no insertions were found in C. jenkinsii. The NAC25 gene from C. arabica had the lowest number of deletions. Even though InDels affect the sequence diversity, they are considered to be relatively rare events when compared to point mutations (Pascarella and Argos, 1992; Kondrashov, 2003). Unlike substitutions, InDels are less likely to be selectively neutral under constant selective pressure and are frequently deleterious (Panchenko et al., 2005). Further, the InDels that are accumulated during evolution may not be deleterious to a species or a group of species but, they can change protein structures and function (Jiang and Blouin, 2007; Wolf et al., 2007; Cai et al., 2008) leading to adaptations to new environments (Grishin, 2001). Considering the diverse role of NAC TFs in species adaption and stress resistance, it will be interesting to explore whether the structural variability (InDels and SNPs) of NAC25 gene obtained in different coffee species could be attributed to the functional significance.
Identification of species specific insertion-deletion
Sequence diversity was observed in the NAC25 gene isolated from seven coffee species. Hence, an attempt was made to identify species-7specific InDels in the NAC25 gene sequence by considering the NAC25 sequence of C. canephora diploid cultivated species as a reference. The comparative analysis fetched the sequence-specific InDels in all the species (Table 4). The highest number of species-specific insertions was found in the intronic region of all the species except C. jenkinsii (Table 4). None of the species had the presence of specific insertions in the exonic region except C. wightiana which had the insertion of 18 bp in the exonic region (Table 4). At the intronic region of the NAC25 gene, the lengths of species-specific insertion ranged from 24 bp in C. travancorensis, C. bengalensis and C. wightiana to 28 bp in C. arabica (Table 4). Similarly, at the exonic region of the NAC25 gene, the species-specific deletions of 9bp were observed in all the species except C. arabica and C. khasiana. At the intronic region of the NAC25 gene, C. travancorensis, C. bengalensis and C. wightiana had the specific deletion of 11bp (Table 4).
Table 4. Table depicting species specific Insertions and Deletions (InDels) causing frameshift (F) and non-frameshift (NF) mutation at the NAC25 gene sequence of six coffee species with reference to NAC25 gene of Coffea canephora (S.274).
Expression studies
A preliminary study was carried out to understand the relative level of expression of the NAC25 gene in C. canephora (S.274) using six different tissues such as root, leaf, immature flower bud, flower, green fruit and ripen fruit was studied using GAPDH as a reference gene. All the six tissues showed the expression of the NAC25 gene however, the expression levels are varied (Figure 6). Among the six tissues studied, higher expression levels were observed in leaf and flower tissues. The lowest expression was observed in root tissue followed by immature bud. Further, the relative expression of NAC25 in comparison with the GAPDH gene revealed four folds and eight folds increase in expression levels of the NAC25 gene in green fruit and ripen fruit, respectively (Figure 6). The higher relative expression levels of NAC25 in fruit tissue obtained in the present study is in concordance with the results obtained by Dong et al. (2019). Dong et al. (2019) carried out the expression analysis of 63 different NAC genes in four different stages of fruits such as small green, large green, yellow and red fruit of C. canephora and observed that 54 out of 63 different types of NAC genes showed different levels of expression during different stages of fruit development. Sánchez-Montesino et al. (2019) also obtained higher expression of NAC gene during endosperm development. Further, Han et al. (2020) observed higher level of NAC25 expression in young leaves and stem of Malus baccata and opined that the higher level of expression of NAC25 in young levees could be associated with abiotic stress tolerance. In Solanaceae family, NAC TFs confer resistance to fungal infection, drought and salt stress (Singh et al., 2013; Zhu et al., 2014). Since the NAC25 transcription factor isolated from coffee showed a close relationship with the NAC TF of members of the Solanaceae family, it could be possible that they could have similar functions.
Figure 6. Figure depicting the relative expression of NAC25 gene in different tissues of coffee (A) Root; (B) Leaf; (C) Immature Bud; (D) Flower; (E) Green fruit and (F) Ripen fruit.
Conclusion
In most coffee-growing countries, sustainable coffee cultivation is under considerable threat due to the increased prevalence of biotic and abiotic stress escalated by global climate change. The future coffee varieties should be suitably bred to respond appropriately to biotic and abiotic stresses which not only ensure their chances of survival but also affect the yields positively. In this context, it is imperative to identify and functionally characterize the NAC TFs and their promoters in coffee. At present, there is a large paucity of data on coffee transcription factors. Considering the importance of NAC TFs in plant growth, development and defense, further research on NAC TFs in coffee become urgent and indispensable.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.
Author contributions
MM designed the experiment, collected the material, participated in full length gene isolation, project administration and reviewed the manuscript. AH and PJ carried out the experiments and participated in manuscript preparation. SE, GI, RM, and DV participated in in silico analysis, review, and manuscript editing. All authors read and approved the published version of manuscript.
Acknowledgments
The authors thank Coffee Board, Govt of India for facilitating the Biotechnology research by providing funding under plan program.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.1009733/full#supplementary-material
Footnotes
- ^ https://www.ncbi.nlm.nih.gov/
- ^ http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi
- ^ https://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?INPUT_SEQUENCE=%20EU563945.2&LINK_LOC=nuccore
- ^ https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi
References
Alvarado, V. Y., and Terry, L. T. (2005). “Abstract: A tapetal specific NAC transcription factor involved in pollen development,” in Proceedings of the 16TH International Conference On Arabidopsis Research, (Wisconsin).
An, C., Saha, S., Jenkins, J. N., Ma, D. P., Scheffler, B. E., Kohel, R. J., et al. (2008). Cotton (Gossypium spp.) R2R3-MYB transcription factors SNP identification, phylogenomic characterization, chromosome localization, and linkage mapping. Theor. Appl. Genet. 116, 1015–1026. doi: 10.1007/s00122-008-0732-4
Bennett, P. M. (2004). Genome plasticity: Insertion sequence elements, transposons and integrons, and DNA rearrangement. Methods Mol. Biol. 266:71–113. 76, 071. doi: 10.1385/1-59259-
Bian, Z., Gao, H., and Wang, C. (2021). NAC Transcription factors as positive or negative regulators during ongoing battle between pathogens and our food crops. Int. J. Mol. Sci. 22:81. doi: 10.3390/ijms22010081
Cai, Y., Chin, H. F., Lazarova, D., Menon, S., Fu, C., Cai, H., et al. (2008). The structural basis for activation of the Rab Ypt1p by the TRAPP membrane-tethering complexes. Cell 133, 1202–1213. doi: 10.1016/j.cell.2008.04.049
Chen, Q., Jing, D., Wang, S., Xu, F., Bao, C., Luo, M., et al. (2021). The Putative Role of the NAC Transcription Factor EjNACL47 in Cell Enlargement of Loquat (Eriobotrya japonica Lindl.). Horticulturae 7:323. doi: 10.3390/horticulturae7090323
Davis, A. P. (2011). Psilanthus mannii, the type species of Psilanthus, transferred to Coffea. Nord. J. Bot. 29, 471–472.
Davis, A. P., Chadburn, H., Moat, J., O’Sullivan, R., Hargreaves, S., and Lughadha, E. N. (2019). High extinction risk for wild coffee species and implications for coffee sector sustainability. Sci. Adv. 5:eaav3473. doi: 10.1126/sciadv.aav3473
Dong, X., Jiang, Y., Yang, Y., Xiao, Z., Bai, X., Gao, J., et al. (2019). Identification and expression analysis of the NAC gene family in Coffea canephora. Agronomy 9:670. doi: 10.3390/agronomy9110670
Dulloo, M. E., and Owadally, A. W. (1991). “Conservation and utilization of wild coffee,” in Crop genetic resources of Africa Volume 1, eds F. Attere, H. Zedan, N. Q. Ng, and P. Perrino (Nigeria: IBPGR, UNEP, IITA & CNR), 231–238.
Gasteiger, E., Hoogland, C., Gattiker, A., Duvaud, S., Wilkins, M. R., Appel, R. D., et al. (2005). “Protein Identification and analysis tools on the ExPASy Server,” in The proteomics protocols handbook. springer protocols handbooks, ed. J. M. Walker (Totowa, NJ: Humana Press). doi: 10.1385/1-59259-890-0:571
Grishin, N. V. (2001). Fold change in evolution of protein structures. J. Struct. Biol. 134, 167–185. doi: 10.1006/jsbi.2001.4335
Guo, H. S., Xie, Q., Fei, J. F., and Chua, N. H. (2005). Micro RNA directs mRNA cleavage of the transcription factor NAC1 to down regulate auxin signals for Arabidopsis lateral root development. Plant Cell 17, 1376–1386. doi: 10.1105/tpc.105.030841
Hall, T. A. (1999). BioEdit: A user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symp. Series 41, 95–98.
Han, D., Du, M., Zhou, Z., Wang, S., Li, T., Han, J., et al. (2020). Overexpression of a Malus baccata NAC Transcription Factor Gene MbNAC25 Increases cold and salinity tolerance in Arabidopsis. Int. J. Mol. Sci. 21:1198.
Hu, B., Jin, J., Guo, A. Y., Zhang, H., Luo, J., and Gao, G. (2015). GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 31, 1296–1297. doi: 10.1093/bioinformatics/btu817
Hu, H., Dai, M., Yao, J., Xiao, B., Li, X., Zhang, Q., et al. (2006). Overexpressing a NAM, ATAF, and CUC (NAC) transcription factor enhances drought resistance and salt tolerance in rice. Proc. Natl. Acad. Sci. U.S.A. 103, 12987–12992. doi: 10.1073/pnas.0604882103
Hu, R., Qi, G., Kong, Y., Kong, D., Gao, Q., and Zhou, G. (2010). Comprehensive analysis of NAC domain transcription factor gene family in Populus trichocarpa. BMC Plant Biol. 10:145. doi: 10.1186/1471-2229-10-145
Huded, A. K. C., Jingade, P., and Mishra, M. K. (2018). A rapid and efficient SDS-based RNA isolation protocol from different tissues of coffee. 3 Biotech 8:183. doi: 10.1007/s13205-018-1209-z
Hurtado, F. M. M., Pinto, M. S., Oliveira, P. N., Riaño-Pachón, D. M., Inocente, L. B., and Carrer, H. (2020). Analysis of NAC domain transcription factor genes of Tectona grandis L. f. involved in secondary cell wall deposition. Genes (Basel) 11:20. doi: 10.3390/genes11010020
Jiang, H., and Blouin, C. (2007). Insertions and the emergence of novel protein structure: A structure-based phylogenetic study of insertions. BMC Bioinform. 8:444. doi: 10.1186/1471-2105-8-444
Jingade, P., Huded, A. K. C., and Mishra, M. K. (2021). First report on genome size and ploidy determination of five indigenous coffee species using flow cytometry and stomatal analysis. Braz. J. Bot. 44, 381–389. doi: 10.1007/s40415-021-00714-y
Jones, D. T., Taylor, W. R., and Thornton, J. M. (1992). The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 8, 275–282. doi: 10.1093/bioinformatics/8.3.275
Kim, S. G., Kim, S. Y., and Park, C. M. (2007). A membrane-associated NAC transcription factor regulates salt-responsive flowering via FLOWERING LOCUS T in Arabidopsis. Planta 226, 647–654. doi: 10.1007/s00425-007-0513-3
Kolkman, J. M., Berry, S. T., Leon, A. J., Slabaugh, M. B., Tang, S., Gao, W., et al. (2007). Single nucleotide polymorphisms and linkage disequilibrium in sunflower. Genetics 177, 457–468. doi: 10.1534/genetics.107.074054
Kondrashov, A. S. (2003). Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases. Hum. Mutat. 21, 12–27. doi: 10.1002/humu.10147
Kumar, S., Stecher, G., Li, M., Knyaz, C., and Tamura, K. (2018). MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096
Kyte, J., and Doolittle, R. F. (1982). A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105–132. doi: 10.1016/0022-2836(82)90515-0
Lashermes, P., Combes, M. C., Robert, J., Trouslot, P., D’Hont, A., Anthony, F., et al. (1999). Molecular characterisation and origin of the Coffea arabica L. genome. Mol. Gen. Genet. 261, 259–266. doi: 10.1007/s004380050965
Levinson, G., and Gutman, G. A. (1987). Slipped-strand mispairing: A major mechanism for DNA sequence evolution. Mol. Biol. Evol. 4, 203–221. doi: 10.1093/oxfordjournals.molbev.a040442
Li, H., Ran, K., Dong, Q., Zhao, Q., and Shi, S. (2020). Cloning, sequencing, and expression analysis of 32 NAC transcription factors (MdNAC) in apple. PeerJ 8:e8249. doi: 10.7717/peerj.8249
Lin, C., Mueller, L. A., McCarthy, J., Crouzillat, D., Pétiard, V., and Tanksley, S. D. (2005). Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts. Theor. Appl. Genet. 112, 114–130. doi: 10.1007/s00122-005-0112-2
Liu, M., Ma, Z., Sun, W., Huang, L., Wu, Q., Tang, Z., et al. (2019). Genome-wide analysis of the NAC transcription factor family in Tartary buckwheat (Fagopyrum tataricum). BMC Genom. 20:113. doi: 10.1186/s12864-019-5500-0
Liu, X., Wang, T., Bartholomew, E., Black, K., Dong, M., Zhang, Y., et al. (2018). Comprehensive analysis of NAC transcription factors and their expression during fruit spine development in cucumber (Cucumis sativus L.). Hortic. Res. 5:31. doi: 10.1038/s41438-018-0036-z
Mallory, A. C., Dugas, D. V., Bartel, D. P., and Bartel, B. (2004). MicroRNA regulation of NAC-domain targets is required for proper formation and separation of adjacent embryonic, vegetative, and floral organs. Curr. Biol. 14, 1035–1046. doi: 10.1016/j.cub.2004.06.022
Mao, X., Zhang, H., Qian, X., Li, A., Zhao, G., and Jing, R. (2012). TaNAC2, a NAC-type wheat transcription factor conferring enhanced multiple abiotic stress tolerances in Arabidopsis. J. Exp. Bot. 63, 2933–2946. doi: 10.1093/jxb/err462
Martín-Pizarro, C., Vallarino, J. G., Osorio, S., Meco, V., Urrutia, M., Pillet, J., et al. (2021). The NAC transcription factor FaRIF controls fruit ripening in strawberry. Plant Cell 33, 1574–1593. doi: 10.1093/plcell/koab070
Mishra, M. K. (2019). “Genetic resources and breeding of coffee (Coffea spp.),” in Advances in plant breeding strategies: Nut and beverage crops, eds J. M. Al-Khayri, M. Jain, and D. V. Johnson (Cham: Springer). doi: 10.1007/978-3-030-23112-5_12
Mishra, M. K., and Slater, A. (2012). Recent advances in the genetic transformation of coffee. Biotechnol. Res. Int. 2012:580857. doi: 10.1155/2012/580857
Mishra, M. K., Huded, A. K. C., and Jingade, P. (2020). Assessment of the suitability of molecular SCoT markers for genetic analysis of coffee species. Botanica 26, 184–196. doi: 10.2478/botlit-2020-0019
Mishra, M. K., Sandhyarani, N., Suresh, N., Satheesh, K. S., Soumya, P. R., Yashodha, M. H., et al. (2012). Genetic Diversity among Indian coffee cultivars determined via molecular markers. J. Crop Improv. 26, 727–750. doi: 10.1080/15427528.2012.696085
Mishra, M. K., Tornincasa, P., De Nardi, B., Asquini, E., Dreos, R., Del Terra, L., et al. (2011). Genome organization in coffee as revealed by EST PCR RFLP, SNPs and SSR analysis. J. Crop Sci. Biotechnol. 14:25. doi: 10.1007/s12892-010-0035-6
Moghaddam, S. M., Song, Q., Mamidi, S., Schmutz, J., Lee, R., Cregan, P., et al. (2014). Developing market class specific InDel markers from next generation sequence data in Phaseolus vulgaris L. Front Plant Sci. 5:185. doi: 10.3389/fpls.2014.00185
Mohanta, T. K., Yadav, D., Khan, A., Hashem, A., Tabassum, B., Khan, A. L., et al. (2020). Genomics, molecular and evolutionary perspective of NAC transcription factors. PLoS One 15:e0231425. doi: 10.1371/journal.pone.0231425
Nakashima, K., Tran, L. S., Nguyen, V. D., Fujita, M., Maruyama, K., Todaka, D., et al. (2007). Functional analysis of a NAC-type transcription factor OsNAC6 involved in abiotic and biotic stress-responsive gene expression in rice. Plant J. 51, 617–630. doi: 10.1111/j.1365-313X.2007.03168.x
Narasimhaswamy, R. L., and Vishweshwara, S. (1963). A note on the occurrence of three species of Coffea indigenous to India. Turrialba 5, 65–71.
Ni, J., Zhao, M. L., Chen, M. S., Pan, B. Z., Tao, Y. B., and Xu, Z. F. (2017). Comparative transcriptome analysis of axillary buds in response to the shoot branching regulators gibberellin A3 and 6-benzyladenine in Jatropha curcas. Sci. Rep. 7:11417. doi: 10.1038/s41598-017-11588-0
Nuruzzaman, M., Manimekalai, R., Sharoni, A. M., Satoh, K., Kondoh, H., Ooka, H., et al. (2010). Genome-wide analysis of NAC transcription factor family in rice. Gene 465, 30–44. doi: 10.1016/j.gene.2010.06.008
Olsen, A. N., Ernst, H. A., Leggio, L. L., and Skriver, K. (2005). NAC transcription factors: Structurally distinct, functionally diverse. Trends Plant Sci. 10, 79–87. doi: 10.1016/j.tplants.2004.12.010
Ooka, H., Satoh, K., Doi, K., Nagata, T., Otomo, Y., Murakami, K., et al. (2003). Comprehensive analysis of NAC family genes in Oryza sativa and Arabidopsis thaliana. DNA Res. 10, 239–247. doi: 10.1093/dnares/10.6.239
Panchenko, A. R., Wolf, Y. I., Panchenko, L. A., and Madej, T. (2005). Evolutionary plasticity of protein families: Coupling between sequence and structure variation. Proteins 61, 535–544. doi: 10.1002/prot.20644
Pascarella, S., and Argos, P. (1992). A data bank merging related protein structures and sequences. Protein Eng. 5, 121–137. doi: 10.1093/protein/5.2.121
Pearl, H. M., Nagai, C., Moore, P. H., Steiger, D. L., Osgood, R. V., and Ming, R. (2004). Construction of a genetic map for arabica coffee. Theor. Appl. Genet. 108, 829–835. doi: 10.1007/s00122-003-1498-3
Peng, H., Cheng, H. Y., Chen, C., Yu, X. W., Yang, J. N., Gao, W. R., et al. (2009). A NAC transcription factor gene of chickpea (Cicerarietinum), CarNAC3, is involved in drought stress responses and various developmental processes. J. Plant Physiol. 166, 1934–1945. doi: 10.1016/j.jplph.2009.05.013
Rafalski, A. (2002). Applications of single nucleotide polymorphisms in crop genetics. Curr. Opin. Plant Biol. 5, 94–100. doi: 10.1016/s1369-5266(02)00240-6
Riahi, L., Zoghlami, N., Dereeper, A., Mliki, A., and This, P. (2013). Single nucleotide polymorphism and haplotype diversity of the gene NAC4 in grapevine. Ind. Crops Prod. 43, 718–724. doi: 10.1016/j.indcrop.2012.08.021
Sambrook, J., and Russell, D. W. (2001). Molecular cloning: A laboratory manual, 3rd Edn, Vol. 1. New York, NY: Cold Spring Harbor Laboratory Press.
Sánchez-Montesino, R., Bouza-Morcillo, L., Marquez, J., Ghita, M., Duran-Nebreda, S., Gómez, L., et al. (2019). A regulatory module controlling ga-mediated endosperm cell expansion is critical for seed germination in Arabidopsis. Mol. Plant 12, 71–85. doi: 10.1016/j.molp.2018.10.009
Singh, A. K., Sharma, V., Pal, A. K., Acharya, V., and Ahuja, P. S. (2013). Genome-wide organization and expression profiling of the NAC transcription factor family in potato (Solanum tuberosum L.). DNA Res. 20, 403–423.
Singh, S., Koyama, H., Bhati, K. K., and Alok, A. (2021). Thebiotechnological importance of the plant-specific NAC transcription factor family in crop improvement. J. Plant Res. 134, 475–495. doi: 10.1007/s10265-021-01270-y
Tamura, K., and Nei, M. (1993). Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol. Biol. Evol. 10, 512–526. doi: 10.1093/oxfordjournals.molbev.a040023
Tran, L. S., Nakashima, K., Sakuma, Y., Simpson, S. D., Fujita, Y., Maruyama, K., et al. (2004). Isolation and functional analysis of Arabidopsis stress-inducible NAC transcription factors that bind to a drought-responsive cis-element in the early responsive to dehydration stress 1 promoter. Plant Cell 16, 2481–2498. doi: 10.1105/tpc.104.022699
Tweneboah, S., and Oh, S. K. (2017). Biological roles of NAC transcription factors in the regulation of biotic and abiotic stress responses in Solanaceous crops. J. Plant Biotechnol. 44, 1–11. doi: 10.5010/JPB.2017.44.1.001
Volk, G. M., and Krishnan, S. (2020). “Case study: Coffee wild species and cultivars,” in Crop wild relatives in genebanks, eds G. M. Volk and P. Byrne (Fort Collins, CO: Colorado State University).
Wang, J., Kong, L., Yu, K., Zhang, F., Shi, X., Wang, Y., et al. (2018). Development and validation of InDel markers for identification of QTL underlying flowering time in soybean. Crop J. 6, 126–135. doi: 10.1016/j.cj.2017.08.001
Wang, N., Zheng, Y., Xin, H., Fang, L., and Li, S. (2013). Comprehensive analysis of NAC domain transcription factor gene family in Vitis vinifera. Plant Cell Rep. 32, 61–75. doi: 10.1007/s00299-012-1340-y
Wolf, Y., Madej, T., Babenko, V., Shoemaker, B., and Panchenko, A. R. (2007). Long-term trends in evolution of indels in protein sequences. BMC Evol. Biol. 7:19. doi: 10.1186/1471-2148-7-19
Zhao, Y., Sun, J., Xu, P., Zhang, R., and Li, L. (2014). Intron-mediated alternative splicing of WOOD-ASSOCIATED NAC TRANSCRIPTION FACTOR1B regulates cell wall thickening during fiber development in Populus species. Plant Physiol. 164, 765–776. doi: 10.1104/pp.113.231134
Zhu, M., Chen, G., Zhang, J., Zhang, Y., Xie, Q., Zhao, Z., et al. (2014). The abiotic stress-responsive NAC-type transcription factor SlNAC4 regulates salt and drought tolerance and stress-related genes in tomato (Solanum lycopersicum). Plant Cell Rep. 33, 1851–1863. doi: 10.1007/s00299-014-1662-z
Zhu, Y. L., Song, Q. J., Hyten, D. L., Van Tassell, C. P., Matukumalli, L. K., Grimm, D. R., et al. (2003). Single-nucleotide polymorphisms in soybean. Genetics. 163, 1123–1134. doi: 10.1093/genetics/163.3.1123
Keywords: Coffea, NAC25, conserved domain (NAM), sequence diversity, evolutionary relationship, expression analysis
Citation: Huded AKC, Jingade P, Mishra MK, Ercisli S, Ilhan G, Marc RA and Vodnar D (2022) Comparative genomic analysis and phylogeny of NAC25 gene from cultivated and wild Coffea species. Front. Plant Sci. 13:1009733. doi: 10.3389/fpls.2022.1009733
Received: 02 August 2022; Accepted: 30 August 2022;
Published: 16 September 2022.
Edited by:
Mostafa Farajpour, Mazandaran Agricultural and Natural Resources Research and Education Center, IranReviewed by:
Mehdi Younessi, University of Tabriz, IranZeeshan Zafar, National University of Sciences and Technology (NUST), Pakistan
Copyright © 2022 Huded, Jingade, Mishra, Ercisli, Ilhan, Marc and Vodnar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Manoj Kumar Mishra, bWFub2ptaXNocmEubUBnbWFpbC5jb20=; Romina Alina Marc, cm9taW5hLnZsYWljQHVzYW12Y2x1ai5ybw==; Dan Vodnar, ZGFuLnZvZG5hckB1c2FtdmNsdWoucm8=