- 1College of Pharmacy, Anhui University of Chinese Medicine, Hefei, China
- 2Institute of Conservation and Development of Traditional Chinese Medicine Resources, Anhui Academy of Chinese Medicine, Hefei, China
- 3Joint Research Center for Chinese Herbal Medicine of Anhui of IHM, Anhui University of Chinese Medicine, Hefei, China
- 4Anhui Province Key Laboratory of Research and Development of Chinese Medicine, Hefei, China
Melon (Cucumis melo L.) is one of the most extensively grown horticulture crops of the world. Based on the morphological characters, melon was formerly divided into two subspecies, Cucumis melo ssp. melo and C. melo ssp. agrestis. However, the present methods are still inadequate to distinguish between them. The phylogenetic analysis based on chloroplast genome sequences could provide essential evidence for the classification of melon varieties. We sequenced the chloroplast genomes of nine different melon varieties by the Illumina Hiseq and performed bioinformatic analyses including repeat element analysis, genome comparison and phylogenetic analysis. The results showed that the melon chloroplast genome has a typical quadripartite structure that was conserved across the analyzed sequences. Its length ranges between 155, 558 and 156, 569 bp, with a total GC content varying from 36.7% to 37%. We found 127–132 genes in melon chloroplast genomes, including 85–87 protein-coding regions, 34–37 tRNA and 6-8 rRNA genes. The molecular structure, gene order, content, codon usage, long repeats, and simple sequence repeats (SSRs) were mostly conserved among the nine sequenced genomes. Phylogenetic analysis showed that the chloroplast genome could clearly distinguish between C. melo ssp. melo and C. melo ssp. agrestis. This study not only provides valuable knowledge on melon chloroplasts, but also offers a theoretical basis and technical support for the genetic breeding of melons.
1 Introduction
Melon (Cucumis melo L.) is an important emblematic crop of the Cucurbitaceae family that is widely distributed in tropical and subtropical regions and is thought to have originated in Africa and Asia (Hu et al., 2019; Zhang et al., 2023). It has a long history of planting and is cultivated worldwide due to economic and nutraceutical importance. China is the biggest melon producer in the world, accounting for about half of the global production in recent years. In 2021, global melon yield was over 28.62 million tons, with the total yield in China being over 14.02 million tons (FAO Statistics 2023; https://www.fao.org/faostat/zh/#data/QCL/visualize). It has a pleasant aromatic flavor and is a rich source of soluble sugars, minerals, organic acids, vitamins and other health-promoting substances (Hu et al., 2019; Zhang et al., 2023).
There are many melon species in China and around the world, the feature diversity is primarily expressed in rind color, flesh color, shape, and rind netting of the fruit. At present, there are many studies on melon diversity from rind color, flesh color and other aspects to reveal their influencing factors (Gur et al., 2017; Liang et al., 2023; Shahwar et al., 2023). However, the number of melon varieties produced and sold in China has increased over the years as a result of the multiple melon varieties being continually improved in recent years, making variety evaluation, discrimination and breeding innovation extremely difficult (Gao et al., 2012). The traditional methods of seed quality evaluation and management are focused on morphological characteristics. Previous studies revealed that melon mostly consists of two populations, C. melo ssp. melo and C. melo ssp. agrestis, based on ovary pubescence (Jeffrey, 1980). In general, C. melo ssp. melo plants have more vigorous vegetative growth, thicker pulp, higher sugar content and higher tolerance to biotic and abiotic stress than C. melo ssp. agrestis plants (Liu et al., 2020). However, there are minor phenotypic differences between the seeds of C. melo ssp. melo and C. melo ssp. agrestis, making morphological evaluation challenging. In addition, recognition based on morphological descriptors is time-consuming, and subject to environmental influences. The above concerns make it more difficult to monitor the melon seed market. Therefore, a reliable method is needed to accurately identify and distinguish C. melo ssp. agrestis and C. melo ssp. melo.
The chloroplast is a crucial organelle that plays a significant role in photosynthesis, carbon fixation, translation, and transcription (Daniell et al., 2016; Arab et al., 2022). The chloroplast genome is a source of sequence diversity that can be used to distinguish species (Song et al., 2017). In angiosperms, the chloroplast genome sequence typically contains between 100 and 130 genes, the majority of which are involved in photosynthesis, transcription, and translation and range in size from 120 to 170 kb (Jiao et al., 2022). The chloroplast genomic structure is conserved and stable; it is a circular molecule consisting two copies of the reverse repetition regions (IRs) separating the small single-copy region (SSC) and large single-copy region (LSC) (Yang et al., 2010; Green, 2011; Wicke et al., 2011; Roy et al., 2016; Ivanova et al., 2017). Chloroplast genome structure is simple and conserved, compared with the nuclear genome has more advantages. It is generally monophyletic and mostly maternal inheritance (Drouin et al., 2008; Krak et al., 2016). The chloroplast genome evolves slower than the nuclear genome and includes a large amount of genetic information (Song et al., 2017; Zuo et al., 2017). It has been successfully used to analyze the phylogenetic relationships between many challenging species and study the structural characteristics, variation, and evolution of plants in the recent past (Xi et al., 2012; Li et al., 2018; Xie et al., 2019; Alzahrani, 2021). For example, a previous phylogenetic study based on the chloroplast genome, revealed that Benincasa hispida was closely related to Cucumis, Citrullus and Lagenaria as a sister group (Zhang et al., 2021; Song et al., 2022). Luffa aegyptiaca and Luffa acutangula were found to be closely associated, and the Luffa was distinguished from the other species of the Sicyocae subtribe as a monophyletic clade (Yundaeng et al., 2020).
In this study, nine main melon varieties were collected in Chinese mainland, which fully represented the morphological diversity of the two subspecies, and the complete chloroplast genome was assembled by the Illumina Hiseq 2,500 platform. We analyzed the characteristics of the chloroplast genomic structures of C. melo ssp. melo and C. melo ssp. agrestis, and carried out comparative genome, systematic evolution, and genetic structure analyses with the published chloroplast genomes of other species of Cucurbitaceae. Our work not only adds to the information of melon chloroplast genomic, but also provides the foundation for future research on plastid-mics, genetic evolution, and accurate molecular identification of melon varieties.
2 Materials and methods
2.1 Plant materials and DNA extraction
A total of nine samples from different varieties were collected across China to represent two subspecies of Melon. The melon seeds were carefully obtained from the melon using sterilized tweezers. Seeds were preserved by the School of Pharmacy, Anhui University of Chinese Medicine (Hefei, China). The phenotypic characteristics of melons are shown in Supplementary Figure S1. The seeds of the nine melons were collected and rinsed thoroughly with running water. Then, they were thoroughly cleaned with sterile water several times before being dried in a sampling bag with silica gel. The modified CTAB method (Cota-Sánchez et al., 2006) was used to extract the total genomic DNA from the collected melon seeds. Genomic DNA integrity was evaluated by 1% (w/v) agarose gel electrophoresis and the quality was assessed by an ultra-micro spectrophotometer (Denovix DS-11+, USA).
2.2 Chloroplast genome sequencing, assembly and annotation
The prepared DNA samples were sequenced by the Illumina Hiseq 2,500 (Illumina, USA) in Genesky Biotechnologies (Shanghai, China). The original data was subjected to quality control and filtering, removing the adapter and low-quality data to obtain high-quality sequencing data and improve the accuracy of subsequent bioinformatics analysis. The quality of the filtered data was checked with the FastQC v0.11.8 tool. The chloroplast genomes of melons were assembled by using the metaSPAdes v 3.13.0 (Bankevich et al., 2012). The gavas2 (http://47.96.249.172:16019/analyzer/home) was used to annotate the assembled genome (Shi et al., 2019). Lastly, OrganellarGenomeDRAW v 1.1.1 (https://chlorobox.mpimp-golm.mpg.de/OGDraw) was used to create the circular maps of each chloroplast genome (Greiner et al., 2019). The raw data was uploaded to the National Centre for Biotechnology Information database (https://www.ncbi.nlm.nih.gov/) (accession numbers: OR643673- OR643681).
2.3 Repeat element analysis
The REPuter v1.0 (https://bibiserv.cebitec.unibielefeld.de/reputer) software (Kurtz et al., 2001) was used to analyze repeat sequences, including palindromic, reverse, forward, and complement repeat, with a hamming distance of 3, maximum computed repeats of 5, 000 bp and a minimal repeat size of 30 bp. The simple sequence repeats (SSRs) in the genome sequences of different melon varieties were analyzed by the MISA v2.1 (https://webblast.ipk-gatersleben.de/misa/index.php) software (Thiel et al., 2003). The minimum frequencies set for the detection of mono-, di-, tri-, tetra-, penta- and hexanucleotide repeats were 10, 5, 4, 3, 3 and 3, respectively.
2.4 Genome comparison
In this study, rpl22, rps19, rpl2, ndhF, ycf1, psbA and trnH-GUG genes were selected by default to analyze IR boundaries which were visualized between the four major regions (LSC/IRb/SSC/IRa) of the chloroplast genome. Relative synonymous codon usage (RSCU) of protein-coding genes was conducted to determine codon bias. The complete chloroplast genomes of different varieties melon were compared by mVISTA v2.0 (http://genome.lbl.gov/vista/index.shtml) in the shuffle-LAGAN mode (Frazer et al., 2004).
2.5 Phylogenetic analysis
A phylogenetic tree was constructed based on the chloroplast genomes with Indofevillea khasiana (NC046859) as an outgroup (He et al., 2024) (Supplementary Table S1). Before constructing the phylogenetic tree, the sequences of all chloroplast genomes were aligned by mafft v7.017 (Katoh et al., 2005). Finally, the aligned sequences were collated by trimAI plug-in in PhyloSuite v1.2.2 (Zhang et al., 2020) to create a phylogenetic tree by the MrBayes method in the PhyloSuite v1.2.2, while taking into account the following criteria: models-GTR + F + I + G4, burninfrac = 0.25, sample frequency = 101.
3 Results
3.1 General characteristics of the chloroplast genome of melons (Cucumis melo L.)
The lengths of the complete chloroplast genomes ranged from 155, 558 bp (C. melo ‘Jiashigua’) to 156, 569 bp (C. melo ‘BoyangNo.9’) (Table 1). The chloroplast genomes of the melon (Cucumis melo L.) exhibited a very conservative and characteristic quadripartite structure, consisting of a pair of inverted repeats (IRs, 20, 238 bp-25, 797 bp), LSC (86, 334 bp-86, 428 bp) and SSC region (SSC, 18, 087 bp-28, 600 bp) (Figure 1). The LSC and SSC were separated by a pair of IR regions. The total GC content of melon (Cucumis melo L.) chloroplast genomes was comparable, ranging from 36.7% to 37% (C. melo ‘Mitiangua’ and C. melo ‘Meilong’) (Table 1). The GC contents of the four regions showed that the IR regions had the highest GC content (41.6%–42. 8%), followed by the LSC (34.7%) and SSC (30.8%–36.8%) regions (Supplementary Table S2). The distribution of GC content was similar to that of most other angiosperms (Zhou et al., 2017). The substantial number of sequences encoding rRNA in IR causes this phenomenon. Several studies have demonstrated that rRNA has high GC content, which is also a characteristic of the majority of angiosperm chloroplast genomes (Zheng et al., 2020).
Figure 1. Gene map of the chloroplast genome from Cucumis melo ssp. melo (A) and Cucumis melo ssp. agrestis (B). Genes transcribed clockwise and counter-clockwise are showed inside and outside the circle, respectively. Genes belonging to different functional groups are color-coded. Genes shown outside the map are transcribed in a clockwise direction, while those inside the map are transcribed in a counterclockwise direction. Dark gray in the inner circle corresponds to GC content. LSC, large single-copy region; SSC, small single-copy region; IRA and IRB, inverted repeat A/B region.
Melon (Cucumis melo L.) chloroplast genomes showed similar content and order, and encode 127–132 genes, including 85–87 protein-coding genes (PCGs), 34–37 tRNA (transfer RNA) genes and 6-8 rRNA (ribosomal RNA) genes (Table 1). Among these, majority are present as a single-copy in LSC or SSC regions, and 25 genes are duplicated in the IR regions (Table 2). However, the different genes in each melon (Cucumis melo L.) chloroplast genome were unevenly distributed. Only minor differences were observed pertaining to some of the protein coding genes in the melon (Cucumis melo L.) chloroplast genomes. For instance, compared to C. melo ‘Yugu’ and C. melo ‘Meilong’, C. melo ‘Jiashigua’ and C. melo ‘Huangpin’ lost the rpl2 gene; C. melo ‘BoyangNo.9’, C. melo ‘Mitiangua’, C. melo ‘Zhimami’, C. melo ‘Yangjiaomi’ and C. melo ‘Lvbao’ lost the infA gene; C. melo ‘Huangpin’, C. melo ‘Yangjiaomi’ and C. melo ‘Lvbao’ lost the psbA gene. Furthermore, considering tRNA and rRNA genes, C. melo ‘Mitiangua’ had one copy of trnN-GUU while trnR-ACG and rrn5 were missing.
We then focused on intron containing genes since they play a crucial role in the regulation of gene expression by accumulating more mutations than exons and promoting the expression of foreign genes in plants (Kelchner, 2002; Xu et al., 2003). We found 23 different genes containing introns. Among the 23 genes, 11 were tRNA genes (trnA-TGC、trnA-UGC、trnG-TCC、trnG-UCC、trnI-GAT、trnK-TTT、trnK-UUU、trnL-TAA、trnL-UAA、trnV-TAC、trnV-UAC) and 12 were protein-coding genes (petB、petD、atpF、ndhA、ndhB、rpoC1、rps16、rps12、rpl16、rpl2、clpP、ycf3). While clpP and ycf3 were the only genes with two introns, the other 21 genes had one intron (Table 2).
3.2 Analysis of repeat sequences and simple sequence repeats (SSRs)
Interspersed repeats are distinct from tandem repeats, they are distributed in the genome in a decentralized manner (Jiao et al., 2022). There are three interspersed repeat types in the complete chloroplast genome of melons: forward, palindromic and reverse repeats. In total, 487 repeat sequences were detected in the nine chloroplast genomes, including 145 forward, 309 palindromic and 33 reverse repeats (Supplementary Table S3). Although there were differences in the number of repeats across the chloroplast genomes of melons, the tendency was similar and conserved. Among all melons, the most common were the palindromic repeats, followed by forward repeats and the least common were the complement-type repeats (Figure 2A). Similarly, forward and palindromic repeats were more abundant than reverse and complement repeats, and these were observed as 63.45%, 29.77%, 6.78% and 0% of the genome, respectively (Figure 2B).
Figure 2. Analysis of repeat sequences and simple sequence repeats (SSRs) in the chloroplast genomes of the nine melon varieties (A) The number of repeat sequences in the nine chloroplast genomes; (B) Proportion of different repeat types; (C) The numbers of different types of simple sequence repeats (SSRs); (D) Proportions of different types of simple sequence repeats (SSRs); (E) Number of simple sequence repeats (SSRs) in LSC, SSC and IR; (F) Proportions of different simple sequence repeats (SSRs) in LSC, SSC and IR. The A-I below the horizontal coordinate represents the different varieties, respectively are Cucumis melo ‘Yugu’, Cucumis melo ‘Meilong’, Cucumis melo ‘Jiashigua’, Cucumis melo ‘Huangpin’, Cucumis melo ‘Mitiangua’, Cucumis melo ‘Boyang NO.9’, Cucumis melo ‘Zhimami’, Cucumis melo ‘Yangjiaomi’ and Cucumis melo ‘Lvbao’.
A total of six categories of simple sequence repeats (SSRs) were detected in the nine melon chloroplast genomes, ranging from 68 in C. melo ‘Huangpin’ to 72 in C. melo ‘Meilong’ and C. melo ‘Zhimami’ (Supplementary Table S4). The most abundant SSRs were mononucleotide repeats, whose proportions relative to the total number of SSRs in the chloroplast genome were similar in all the analyzed varieties of melon (Figures 2C, D). We also analyzed the distribution of SSRs in different regions of the chloroplast genomes of the nine melons. Although the number of SSRs in the LSC, SSC and IR regions of the chloroplast genomes of the nine melons was slightly different, their distribution trends were similar (Figure 2E). Comparatively, most SSRs were distributed in LSC regions, accounting for an average of 71.02%, considering the total number of SSRs (Figure 2F).
3.3 IR boundary analysis
Since the expansion of the contraction domain of IR boundary causes changes in the length of the chloroplast genome (Shen et al., 2017), we investigated the location of genes in the junctional region to assess the boundary. The IR region of chloroplast genomes was highly conserved across the nine melon species, and slight structural variations were found in the JSB and JSA regions (Figure 3). Among them, there were no differences between the IR boundaries of C. melo ‘Lvbao’ and C. melo ‘Yangjiaomi’. C. melo ‘Yugu’ and C. melo ‘Meilong’ differed only in the LSC/IR boundary. In the two melon varieties, there was no gene that crossed the LSC/IR boundary, and there was only a small difference in the distance between the gene and the boundary. The chloroplast genomes of C. melo ‘Jiashigua’ and C. melo ‘Mitiangua’ were significantly different from other species, indicating that these species may have gone through a special evolutionary process. Furthermore, trnA and rrn23 were duplicated in the IR/SSC borders in C. melo ‘Jiashigua’, rrn4.5 in those of the C. melo ‘Mitiangua’ and ycf1 in those of the other five melon genomes. Overall, the IR/LSC boundary regions of the nine chloroplast genomes of melon were highly conserved, although the IR/SSC boundary regions exhibited variations.
Figure 3. Comparisons of the LSC, SSC and IR region borders of chloroplast genomes among different melon varieties (Cucumis melo L.).
3.4 Codon usage analysis
Codon usage, amino acid frequency and relative synonymous codon usage (RSCU) analyses were performed on the nine sequenced melon chloroplast genomes. The total number of codons ranged between 25, 902 (C. melo ‘Huangpin’)-26, 679 (C. melo ‘Zhimami’). Moreover, the total number of codons did not change significantly, and the type of codons were consistent with the type of amino acids. The codons for the amino acids methionine (Met) and Tryptophan (Trp) had RSCU values of 1.00 and thus no codon bias, but the codons for the other amino acids, which are encoded by several synonymous codons, have RSCU values ranging from 2 to 6 (Supplementary Table S5). AGC (0.359) and TTA (1.887) had the lowest and highest RSCU values, respectively, among the nine analyzed sequences (Supplementary Table S5). The distribution of codon usage revealed that all codons ending in A or T had an RSCU >1, except CTA (leucine, 0.826–0.841) and ATA (isoleucine, 0.922–0.943). TTG (leucine, 1.224–1.244) alone had an RSCU >1 (Figure 4). This indicated that codons ending in A or T were preferred, whereas codons ending in C or G were not.
Figure 4. Codon content for 20 amino acids in the chloroplast genomes of the nine melon species. (A) Codon content for CDS in the four C. melo ssp. melo chloroplast genomes, each column in the bar graph represents a species. The corresponding species from left to right are C. melo ‘Yugu’, C. melo ‘Meilong’, C. melo Jiashigua’ and C. melo ‘Huangpin’. (B) Codon content for CDS in the rest five C. melo ssp. agrestis chloroplast genomes, the corresponding species from left to right are C. melo ‘Mitiangua’, C. melo ‘BoyangNo.9’, C. melo ‘Zhimami’, C. melo ‘Yangjiaomi’, and C. melo ‘Lvbao’.
3.5 Comparative genome analysis
With the sequence data from C. melo ‘Shengkaihua’ as a reference, the mVISTA v2.0 software was used to compare and assemble the complete chloroplast genomes of the nine melon varieties. The results are shown in Figure 5. The comparative investigation revealed a high degree of uniformity in the chloroplast genomes of the nine melon varieties (Cucumis melo L.). However, minor differences were found mainly in the untranslated region (UTR). Additionally, only psbA, rps19 and rpl2 differed significantly from the other genes in the coding region, while the others were relatively alike. In the non-coding region, there were significant differences among the intergenic regions, such as tRNA-His and tRNA-Leu. Our findings indicate that there was no major variation within the complete chloroplast genomes from across the melon species.
Figure 5. Visualization of alignment of melon (Cucumis melo L.) chloroplast genome sequences The vertical scale indicates the percentage of identity, ranging from 50% to 100%.
3.6 Phylogenetic analysis
In order to study the phylogenetic relationship between the different groups of melons, a phylogenetic tree of the nine different melon varieties and 18 other species of Cucurbitaceae was constructed based on the chloroplast genomes using MrBayes, with Indofevillea khasiana as an outgroup (Figure 6). We have obtained a stable phylogenetic tree with strong support, where each genus and species are monophyletic and melons were clearly divided into two branches. C. melo ‘Yugu’, C. melo ‘Meilong’, C. melo ‘Jiashigua’, C. melo ‘Huangpin’ and Cucumis melo var. cantalupo, were clustered into a monophyletic group, which belong to C. melo ssp. melo. Other varieties of melon including C. melo ‘Mitiangua’, C. melo ‘BoyangNo.9’, C. melo ‘Zhimami’, C. melo ‘Yangjiaomi’, C. melo ‘Lvbao’, C.ucumis melo subsp. agrestis and C. melo cultivar Shengkaihua were clustered into another monophyletic group, which belong to C. melo ssp. agrestis. Furthermore, C. melo ‘Jiashigua’ and C. melo ‘Huangpin’ were relatively closer compared with other varieties; C. melo ‘Yangjiaomi’ and C. melo ‘Lvbao’ had a close relationship. Therefore, the chloroplast genomes of the nine melon (Cucumis melo L.) varieties provide rich phylogenetic information for distinguishing C. melo ssp. melo and C. melo ssp. agrestis.
Figure 6. Phylogenetic tree based on the complete chloroplast genome sequences of 27 species of Cucurbitaceae generated by MrBayes. Bootstrap values are shown under each node.
4 Discussion
In this study, we assembled and compared the chloroplast genomes of nine different melon varieties. The genome size, gene content, gene sequence and GC content were all similar across the melon varieties indicating that the chloroplast genome of melon is highly conserved (Cheng et al., 2020). The chloroplast genomes from the nine different melon species exhibit a typical quadripartite molecular structure including four important regions: a large single-copy (LSC) region, a small single-copy (SSC) region and a pair of inverted repeats (IRs) region (Powell et al., 1995; Jansen et al., 2005). According to earlier studies, the sizes of angiosperm chloroplast genomes range from 120 to 160 kb (Palmer, 1985). The lengths of chloroplast genome sequences of the nine genomes were quite different, ranging from 155, 558 bp (C. melo ‘Jiashigua’) to 156, 569 bp (C. melo ‘BoyangNo.9’) and the total length of chloroplast genome of C. melo ssp. melo was shorter than that of C. melo ssp. agrestis. This indicates that although the melon has been artificially domesticated for a long time and resulted in different varieties, its chloroplast genome did not undergo recombination. Thus, the sizes of the chloroplast genomes were not significantly changed.
The IR region, which is the most conserved of all regions, can change the size of the chloroplast genome due to expansion and contraction (He et al., 2017). We investigated the location of genes in the junctional region of the chloroplast genome in nine melon varieties. In general, there is a significant correlation between genome size and IR length. For instance, the C. melo ‘Jiashigua’ had the smallest chloroplast genome (155,558 bp) and the shortest IRs (20, 238 bp and 20, 292 bp), whereas the C. melo ‘Boyang No.9’ had the largest chloroplast genomes (156,569 bp) and the longest IRs (25,797 bp). The gene distribution at the boundaries of the four regions of the chloroplast genome followed a similar rule among the nine chloroplast genomes, except for C. melo ‘Jiashigua’ and C. melo ‘Mitiangua’. The main differences were concentrated in the IR/LSC region. In contrast to previous studies, where duplication of ycf1 and rps15 were reported in the Luffa acutangula and Luffa aegyptiaca (Yundaeng et al., 2020) we found ycf1 duplication in all varieties, except in C. melo ‘Jiashigua’ and C. melo ‘Mitiangua’. This shows that C. melo ‘Jiashigua’ and C. melo ‘Mitiangua’ have experienced a unique evolutionary process distinct from the other varieties, which have likely evolved in a similar manner. Additionally, we found the IR bounds of C. melo ‘Lvbao’ and C. melo ‘Yangjiaomi’ to be identical. Our subsequent phylogenetic analysis supports our hypothesis that C. melo ‘Lvbao’ and C. melo ‘Yangjiaomi’ have a closer genetic relationship than other melon species.
SSRs are important molecular markers in population genetics, polymorphism investigations and evolutionary research (Ebert and Peakall, 2009; Daniell et al., 2016; Zhang et al., 2018). A total of six different types of SSRs were detected and the nine different melon species had identical numbers of SSRs. These were found more frequently in the LSC region than in the SSC and IR regions, which is consistent with the findings in other plants (Qian et al., 2013; Shen et al., 2022). We found that the SSRs in the chloroplast genome of melons were especially rich in A and T, which was consistent with earlier research, which found that many plants’ chloroplast SSRs contained higher proportions of polyadenine (polyA) and polythymine (polyT) than polycytosine (polyC) and polyguanine (polyG) (Kuang et al., 2011). We also identified more SSRs in C. melo ssp. agrestis than C. melo ssp. melo. This may indicate the presene of some unique SSRs in C. melo ssp. agrestis, which can be used to identify C. melo ssp. agrestis.
The codon usage analysis helps to understand the evolution process (Wang et al., 2017). Among the chloroplast genes of the nine melon species, most codons encoded leucine (Leu), while the number of codons encoding cysteine (Cys) were the least, and codons ending in A/T were preferred. This feature is consistent with that of most plant species (Zheng et al., 2020). Codon bias is due to the different abundances of tRNAs corresponding to different codons in the cell, and affects translation initiation, elongation, and accuracy (Hanson and Coller, 2018). In this study, the codon preferences of alanine (Ala), aspartic acid (Asp), glutamic acid (Glu) and proline (Pro) in C. melo ‘Mitiangua’ and C. melo ‘Boyang No.9’ were different from that of other melon species based on the RSCU values, suggesting that the two species incurred unique mutations, and underwent distinctive genetic drift and natural selection.
The majority of angiosperms have haploid chloroplast genomes, which are a valuable resource for research into the phylogeny and evolution of plants due to their uni-parental inheritance (Wang H. X. et al., 2020). The phylogenetic relationship among the different genera in the Cucurbitaceae has been explored based on the chloroplast genome (Wang L. et al., 2020; Cheng et al., 2020). However, few studies apply the chloroplast genome to the in-depth study of distinguishing between C. melo ssp. melo and C. melo ssp. agrestis. In this study, we constructed a phylogenetic tree based on 27 chloroplast genomes, and all Cucumis plants were found to be grouped together into one broad branch with C. melo ssp. melo and C. melo ssp. agrestis notably divided into two branches, which was consistent with previous studies based on molecular evidence (Zhao et al., 2019; Liu et al., 2020). The melon varieties can be divided into two groups: C. melo ssp. melo and C. melo ssp. agrestis. The two groups are further divided into four sub-groups, and the sub-groups are further divided into several small groups, which clarifies the genetic relationship between different varieties. Cucumis melo var. cantalupo and C. melo ‘Yugu’ clustered together in this phylogenetic tree, but they were not closely related to other C. melo ssp. melo indicating that they might have experienced a domestication process distinct from other C. melo ssp. melo. Otherwise, C. melo ‘Lvbao’ has the closest relationship with C. melo ‘Yangjiaomi’, which was consistent with the results of IR boundary analysis. Similarly, C. melo ‘Huangpin’ and C. melo ‘Jiashigua’ clustered together. It suggested that the artificial domestication processes of C. melo ‘Lvbao’ and C. melo ‘Yangjiaomi’, C. melo ‘Huangpin’ and C. melo ‘Jiashigua’ would have been similar with small genetic variations.
5 Conclusion
The complete chloroplast genomes of different melon (Cucumis melo L.) species displayed the typical quadripartite structure of a land plant. More specifically, their structure, gene composition, GC content, and codon bias were alike to those of typical angiosperms, which proved that the chloroplast genome of melon (Cucumis melo L.) to be relatively conservative. Additionally, the phylogenetic results showed that the C. melo ssp. melo and C. melo ssp. agrestis clustered into a monophyletic group. The findings of this study not only improve our understanding of the internal structure of the chloroplast genome of the melon (Cucumis melo L.) but also provide a rapid and simple method for identifying C. melo ssp. melo and C. melo ssp. agrestis. At the same time, this study also provides a better understanding of the phylogeny and genetic improvement of melon (Cucumis melo L.) germplasm resources.
Data availability statement
The data presented in the study are deposited in the NCBI repository, accession number OR643673- OR643681.
Author contributions
JH: Writing–original draft. JY: Writing–review and editing. JL: Writing–review and editing. WL: Writing–review and editing. ZZ: Writing–review and editing. YL: Writing–review and editing. LJ: Writing–review and editing. LZ: Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was supported by Key project at central government level: The ability establishment of sustainable use for valuable Chinese medicine resources(grant numbers 2060302-1901-05); Young Elite Scientists Sponsorship Program by CACM(grant numbers CACM-2023-QNRC2-B23); Excellent Young Scholars Project of Natural Science Foundation of Anhui Province in China (grant numbers 2208085Y30); Science Research Project at the Universities of Anhui Province for Distinguished Young Scholars(grant numbers 2023AH020036); the Key Project Foundation of Support Program for the Excellent Young Faculties in Universities of Anhui Province in China (grant numbers gxyqZD2022051); research Funds of Joint Research Center for Chinese Herbal Medicine of Anhui of IHM (grant numbers yjzx2023002) and Traditional Chinese Medicine high-level key discipline construction project of National Administration of Traditional Chinese Medicine -Science of Chinese medicinal material resources (pharmaceutical botany) (zyyzdxk-2023095); Scientific Research Team Program of Anhui Colleges and Universities(Grant no.2022AH010036).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1417266/full#supplementary-material
References
Alzahrani, D. A. (2021). Complete chloroplast genome of Abutilon fruticosum: genome structure, comparative and phylogenetic analysis. Plants (Basel) 10 (2), 270. doi:10.3390/plants10020270
Arab, M. M., Brown, P. J., Abdollahi-Arpanahi, R., Sohrabi, S. S., Askari, H., Aliniaeifard, S., et al. (2022). Genome-wide association analysis and pathway enrichment provide insights into the genetic basis of photosynthetic responses to drought stress in Persian walnut. Hortic. Res. 9, uhac124. doi:10.1093/hr/uhac124
Bankevich, A., Nurk, S., Antipov, D., Gurevich, A. A., Dvorkin, M., Kulikov, A. S., et al. (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19 (5), 455–477. doi:10.1089/cmb.2012.0021
Cheng, H., Kong, W. P., Zhang, M. M., and Hou, D. (2020). The complete chloroplast genome of Cucumis anguria var. anguria (Cucurbitaceae) and its phylogenetic implication. Mitochondrial DNA B Resour. 5 (1), 654–655. doi:10.1080/23802359.2019.1711231
Cota-Sánchez, J. H., Remarchuk, K., and Ubayasena, K. (2006). Ready-to-use DNA extracted with a CTAB method adapted for herbarium specimens and mucilaginous plant tissue. Plant Mol. Biol. Rep. 24 (2), 161–167. doi:10.1007/bf02914055
Daniell, H., Lin, C. S., Yu, M., and Chang, W. J. (2016). Chloroplast genomes: diversity, evolution, and applications in genetic engineering. Genome Biol. 17 (1), 134. doi:10.1186/s13059-016-1004-2
Drouin, G., Daoud, H., and Xia, J. (2008). Relative rates of synonymous substitutions in the mitochondrial, chloroplast and nuclear genomes of seed plants. Mol. Phylogenet. Evol. 49 (3), 827–831. doi:10.1016/j.ympev.2008.09.009
Ebert, D., and Peakall, R. (2009). Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species. Mol. Ecol. Resour. 9 (3), 673–690. doi:10.1111/j.1755-0998.2008.02319.x
Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M., and Dubchak, I. (2004). VISTA: computational tools for comparative genomics. Nucleic. Acids. Res. 32 (Web Server issue), W273–W279. doi:10.1093/nar/gkh458
Gao, P., Ma, H., Luan, F., and Song, H. (2012). DNA fingerprinting of Chinese melon provides evidentiary support of seed quality appraisal. PloS one 7 (12), 52431. doi:10.1371/journal.pone.0052431
Green, B. R. (2011). Chloroplast genomes of photosynthetic eukaryotes. Plant J. 66 (1), 34–44. doi:10.1111/j.1365-313X.2011.04541.x
Greiner, S., Lehwark, P., and Bock, R. (2019). OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic. Acids. Res. 47 (W1), W59–W64. doi:10.1093/nar/gkz238
Gur, A., Tzuri, G., Meir, A., Sa'ar, U., Portnoy, V., Katzir, N., et al. (2017). Genome-wide linkage-disequilibrium mapping to the candidate gene level in melon (Cucumis melo). Sci. Rep. 7 (1), 9770. doi:10.1038/s41598-017-09987-4
Hanson, G., and Coller, J. (2018). Codon optimality, bias and usage in translation and mRNA decay. Nat. Rev. Mol. Cell. Biol. 19 (1), 20–30. doi:10.1038/nrm.2017.91
He, L., Qian, J., Li, X., Sun, Z., Xu, X., and Chen, S. (2017). Complete chloroplast genome of medicinal plant Lonicera japonica: genome rearrangement, intron gain and loss, and implications for phylogenetic studies. Molecules 22 (2), 249. doi:10.3390/molecules22020249
He, S., Xu, B., Chen, S., Li, G., Zhang, J., Xu, J., et al. (2024). Sequence characteristics, genetic diversity and phylogenetic analysis of the Cucurbita ficifolia (Cucurbitaceae) chloroplasts genome. BMC genomics 25 (1), 384. doi:10.1186/s12864-024-10278-2
Hu, J., Gao, L., Xu, Y., Li, Q., Zhu, H., Yang, L., et al. (2019). Microsatellite markers reveal genetic diversity and relationships within a melon collection mainly comprising asian cultivated and wild germplasms. Biomed. Res. Int. 2019, 7495609. doi:10.1155/2019/7495609
Ivanova, Z., Sablok, G., Daskalova, E., Zahmanova, G., Apostolova, E., Yahubyan, G., et al. (2017). Chloroplast genome analysis of resurrection tertiary relict Haberlea rhodopensis highlights genes important for desiccation stress response. Front. Plant Sci. 8, 204. doi:10.3389/fpls.2017.00204
Jansen, R. K., Raubeson, L. A., Boore, J. L., dePamphilis, C. W., Chumley, T. W., Haberle, R. C., et al. (2005). Methods for obtaining and analyzing whole chloroplast genome sequences. Methods Enzymol. 395, 348–384. doi:10.1016/S0076-6879(05)95020-9
Jeffrey, C. (1980). A review of the Cucurbitaceae. Bot. J. Linn. Soc. 81 (3), 233–247. doi:10.1111/j.1095-8339.1980.tb01676.x
Jiao, Y., Feng, G., Huang, L., Nie, G., Li, Z., Peng, Y., et al. (2022). Complete chloroplast genomes of 14 subspecies of D. glomerata: phylogenetic and comparative genomic analyses. Genes (Basel) 13 (9), 1621. doi:10.3390/genes13091621
Katoh, K., Kuma, K., Toh, H., and Miyata, T. (2005). MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic. Acids. Res. 33 (2), 511–518. doi:10.1093/nar/gki198
Kelchner, S. A. (2002). Group II introns as phylogenetic tools: structure, function, and evolutionary constraints. Am. J. Bot. 89 (10), 1651–1669. doi:10.3732/ajb.89.10.1651
Krak, K., Vít, P., Belyayev, A., Douda, J., Hreusová, L., and Mandák, B. (2016). Allopolyploid origin of Chenopodium album s. str. (chenopodiaceae): a molecular and cytogenetic insight. PloS one 11 (8), e0161063. doi:10.1371/journal.pone.0161063
Kuang, D. Y., Wu, H., Wang, Y. L., Gao, L. M., Zhang, S. Z., and Lu, L. (2011). Complete chloroplast genome sequence of Magnolia kwangsiensis (Magnoliaceae): implication for DNA barcoding and population genetics. Genome 54 (8), 663–673. doi:10.1139/g11-026
Kurtz, S., Choudhuri, J. V., Ohlebusch, E., Schleiermacher, C., Stoye, J., and Giegerich, R. (2001). REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic. Acids. Res. 29 (22), 4633–4642. doi:10.1093/nar/29.22.4633
Li, Y., Zhang, J., Li, L., Gao, L., Xu, J., and Yang, M. (2018). Structural and comparative analysis of the complete chloroplast genome of Pyrus hopeiensis-"Wild plants with a tiny population"-and three other pyrus species. Int. J. Mol. Sci. 19 (10), 3262. doi:10.3390/ijms19103262
Liang, X., Li, Q., Cao, L., Du, X., Qiang, J., Hou, J., et al. (2023). Natural allelic variation in the EamA-like transporter, CmSN, is associated with fruit skin netting in melon. Theor. Appl. Genet. 136 (9), 192. doi:10.1007/s00122-023-04443-6
Liu, S., Gao, P., Zhu, Q., Zhu, Z., Liu, H., Wang, X., et al. (2020). Resequencing of 297 melon accessions reveals the genomic history of improvement and loci related to fruit traits in melon. Plant Biotechnol. J. 18 (12), 2545–2558. doi:10.1111/pbi.13434
Palmer, J. D. (1985). Comparative organization of chloroplast genomes. Annu. Rev. Genet. 19, 325–354. doi:10.1146/annurev.ge.19.120185.001545
Powell, W., Morgante, M., McDevitt, R., Vendramin, G. G., and Rafalski, J. A. (1995). Polymorphic simple sequence repeat regions in chloroplast genomes: applications to the population genetics of pines. Proc. Natl. Acad. Sci. U. S. A. 92 (17), 7759–7763. doi:10.1073/pnas.92.17.7759
Qian, J., Song, J., Gao, H., Zhu, Y., Xu, J., Pang, X., et al. (2013). The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza. PloS one 8 (2), e57607. doi:10.1371/journal.pone.0057607
Roy, P. S., Rao, G. J., Jena, S., Samal, R., Patnaik, A., Patnaik, S. S. C., et al. (2016). Nuclear and chloroplast DNA Variation provides insights into population structure and multiple origin of native aromatic rices of odisha, India. PloS one 11 (9), e0162268. doi:10.1371/journal.pone.0162268
Shahwar, D., Khan, Z., and Park, Y. (2023). Molecular marker-assisted mapping, candidate gene identification, and breeding in melon (Cucumis melo L.): a review. Int. J. Mol. Sci. 24 (20), 15490. doi:10.3390/ijms242015490
Shen, J., Li, X., Chen, X., Huang, X., and Jin, S. (2022). The complete chloroplast genome of carya cathayensis and phylogenetic analysis. Genes (Basel) 13 (2), 369. doi:10.3390/genes13020369
Shen, X., Wu, M., Liao, B., Liu, Z., Bai, R., Xiao, S., et al. (2017). Complete chloroplast genome sequence and phylogenetic analysis of the medicinal plant Artemisia annua. Molecules 22 (8), 1330. doi:10.3390/molecules22081330
Shi, L., Chen, H., Jiang, M., Wang, L., Wu, X., Huang, L., et al. (2019). CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic. Acids. Res. 47 (W1), W65–W73. doi:10.1093/nar/gkz345
Song, W., Chen, Z., He, L., Feng, Q., Zhang, H., Du, G., et al. (2022). Comparative chloroplast genome analysis of wax gourd (Benincasa hispida) with three benincaseae species, revealing evolutionary dynamic patterns and phylogenetic implications. Genes (Basel) 13 (3), 461. doi:10.3390/genes13030461
Song, Y., Chen, Y., Lv, J., Xu, J., Zhu, S., Li, M., et al. (2017). Development of chloroplast genomic resources for oryza species discrimination. Front. Plant Sci. 8, 1854. doi:10.3389/fpls.2017.01854
Thiel, T., Michalek, W., Varshney, R. K., and Graner, A. (2003). Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106 (3), 411–422. doi:10.1007/s00122-002-1031-0
Wang, H. X., Michael, J., Barrett, R. L., et al. (2020a). Plastome phylogenomic insights into the sino-Japanese biogeography of diabelia (caprifoliaceae). J. Syst. Evol. 58 (6), 927–987. doi:10.1111/jse.12560
Wang, L., Lu, G., Liu, H., Huang, L., Jiang, W., Li, P., et al. (2020b). The complete chloroplast genome sequence of Gynostemma yixingense and comparative analysis with congeneric species. Genet. Mol. Biol. 43 (4), e20200092. doi:10.1590/1678-4685-GMB-2020-0092
Wang, W., Yu, H., Wang, J., Lei, W., Gao, J., Qiu, X., et al. (2017). The complete chloroplast genome sequences of the medicinal plant Forsythia suspensa (Oleaceae). Int. J. Mol. Sci. 18 (11), 2288. doi:10.3390/ijms18112288
Wicke, S., Schneeweiss, G. M., dePamphilis, C. W., Müller, K. F., and Quandt, D. (2011). The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant Mol. Biol. 76 (3-5), 273–297. doi:10.1007/s11103-011-9762-4
Xi, Z., Ruhfel, B. R., Schaefer, H., Amorim, A. M., Sugumaran, M., Wurdack, K. J., et al. (2012). Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales. Proc. Natl. Acad. Sci. U. S. A. 109 (43), 17519–17524. doi:10.1073/pnas.1205818109
Xie, D. F., Yu, H. X., Price, M., Xie, C., Deng, Y. Q., Chen, J. P., et al. (2019). Phylogeny of Chinese Allium species in section daghestanica and adaptive evolution of Allium (Amaryllidaceae, Allioideae) species revealed by the chloroplast complete genome. Front. Plant. Sci. 10, 460. doi:10.3389/fpls.2019.00460
Xu, J., Feng, D., Song, G., Wei, X., Chen, L., Wu, X., et al. (2003). The first intron of rice EPSP synthase enhances expression of foreign gene. Sci. China C. Life Sci. 46 (6), 561–569. doi:10.1360/02yc0120
Yang, M., Zhang, X., Liu, G., Yin, Y., Chen, K., Yun, Q., et al. (2010). The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.). PLOS ONE 5 (9), e12762. doi:10.1371/journal.pone.0012762
Yundaeng, C., Nawae, W., Naktang, C., Shearman, J. R., Sonthirod, C., Sangsrakru, D., et al. (2020). Chloroplast genome data of Luffa acutangula and Luffa aegyptiaca and their phylogenetic relationships. Data Brief. 33, 106470. doi:10.1016/j.dib.2020.106470
Zhang, D., Gao, F., Jakovlić, I., Zou, H., Zhang, J., Li, W. X., et al. (2020). PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Resour. 20 (1), 348–355. doi:10.1111/1755-0998.13096
Zhang, J., Yang, J., Lv, Y., Zhang, X., Xia, C., Zhao, H., et al. (2023). Genetic diversity analysis and variety identification using SSR and SNP markers in melon. BMC Plant Biol. 23 (1), 39. doi:10.1186/s12870-023-04056-7
Zhang, Q. P., Wei, X., Liu, W. S., Liu, N., Zhang, Y. p., Xu, M., et al. (2018). The genetic relationship and structure of some natural interspecific hybrids in Prunus subgenus Prunophora, based on nuclear and chloroplast simple sequence repeats. Genet. Resour. Crop. Ev. 65 (2), 625–636. doi:10.1007/s10722-017-0559-4
Zhang, T., Xie, J. J., Zhang, J., Yang, Z. A., Li, X., and He, S. L. (2021). Analysis of Cucurbita ficifolia (Cucurbitaceae) chloroplast genome and its phylogenetic implications. Mitochondrial DNA B 6 (10), 3033–3035. doi:10.1080/23802359.2021.1959440
Zhao, G., Lian, Q., Zhang, Z., Fu, Q., He, Y., Ma, S., et al. (2019). A comprehensive genome variation map of melon identifies multiple domestication events and loci influencing agronomic traits. Nat. Genet. 51 (11), 1607–1615. doi:10.1038/s41588-019-0522-8
Zheng, G., Wei, L., Ma, L., Wu, Z., Gu, C., and Chen, K. (2020). Comparative analyses of chloroplast genomes from 13 Lagerstroemia (Lythraceae) species: identification of highly divergent regions and inference of phylogenetic relationships. Plant Mol. Biol. 102 (6), 659–676. doi:10.1007/s11103-020-00972-6
Zhou, J., Chen, X., Cui, Y., Sun, W., Li, Y., Wang, Y., et al. (2017). Molecular structure and phylogenetic analyses of complete chloroplast genomes of two Aristolochia medicinal species. Int. J. Mol. Sci. 18 (9), 1839. doi:10.3390/ijms18091839
Keywords: chloroplast genome, C. melo ssp. melo, C. melo ssp. agrestis, species identification, phylogenetic relationship
Citation: Hu J, Yao J, Lu J, Liu W, Zhao Z, Li Y, Jiang L and Zha L (2024) The complete chloroplast genome sequences of nine melon varieties (Cucumis melo L.): lights into comparative analysis and phylogenetic relationships. Front. Genet. 15:1417266. doi: 10.3389/fgene.2024.1417266
Received: 14 April 2024; Accepted: 10 June 2024;
Published: 09 July 2024.
Edited by:
Subhojit Datta, Indian Council of Agricultural Research, IndiaReviewed by:
Cheng Song, West Anhui University, ChinaJuan Liu, University of Chinese Academy of Sciences, China
Copyright © 2024 Hu, Yao, Lu, Liu, Zhao, Li, Jiang and Zha. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Liangping Zha, emxwX2FodGNtQDEyNi5jb20=; Lu Jiang, bHVqaWFuZzU2OTZAMTI2LmNvbQ==
†These authors have contributed equally to this work