Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci. , 01 April 2025

Sec. Technical Advances in Plant Science

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1573967

This article is part of the Research Topic Advancements of Haploid Technology in Crops: New Horizons in Breeding and Genetics View all articles

Genome-wide development of SSR molecular markers for modern sugarcane cultivars

Yi Xu,Yi Xu1,2Siyuan Chen,Siyuan Chen1,2Shuqi ChenShuqi Chen1Xiangzhen Wei,Xiangzhen Wei1,2Heyang ShangHeyang Shang1Qing Zhang,Qing Zhang1,3Jisen Zhang,*Jisen Zhang1,3*
  • 1State Key Laboratory for Conservation and Utilization of Subtropical Agro - bioresources, Guangxi University, Nanning, China
  • 2College of Life Science and Technology, Guangxi University, Nanning, China
  • 3College of Agriculture, Guangxi University, Nanning, China

Modern sugarcane cultivars are derived from interspecific hybridization between S. officinarum and S. spontaneum with complex genetic backgrounds, and their lack of SSR markers limits the genetic improvement of sugarcane. In this study, We searched for and identified SSR loci within the genomes of 14 Poaceae plants. Notably, a significant positive correlation (r = 0.958) was detected between genome size and the number of SSRs. We identified SSR loci in the whole genome of XTT22, a modern sugarcane cultivar. A total of 1,054,918 SSR loci were identified, with a frequency of 123 loci/Mb and an average of 1 SSR locus per 8.11 kb, with Chr1 having the highest content and frequency of SSR loci. Among different repeat types, the number of mononucleotide repeats (620, 901) and dinucleotide repeats (238, 261) was the largest, accounting for 81.45% of the total number of SSR loci, and the number of SSR decreases with the increase of the number of SSR repeat motifs. Based on the above SSR loci, 910,519 primer pairs were obtained, and 459 SSR markers with polymorphism were screened. The polymorphism rate of SSR markers among different SSR repeat types ranged from 81.97% to 97.90%, and the pentanucleotide repeat type had the highest number of SSR markers. In order to test the universality of the developed SSR markers in sugarcane and its related species, 24 polymorphic SSR markers were randomly selected for verification in 33 sugarcane and its related species and amplified 134 alleles in total. Each pair of primers amplified 1-11 alleles, with an average of 5.58 alleles per pair. This study is the first to systematically develop SSR molecular markers for modern sugarcane cultivars at the genome-wide level, which not only enriches the number of existing SSR markers of modern sugarcane cultivars, but also provides important molecular markers to support the molecular marker-assisted breeding of sugarcane.

1 Introduction

Sugarcane is the most important sugar crop in China. As the world’s third-largest producer of sugar, China derives 90% of its sugar from sugarcane (Hu, 2018). Sugarcane is also a highly promising energy crop and one of the most efficient crops in converting solar energy into usable chemical energy. It accounts for over 40% of the world’s total biomass ethanol production and is a vital component of the biomass ethanol industry. Compared to other energy crops, sugarcane demonstrates outstanding advantages in ethanol yield and production costs, making it widely recognized as one of the most excellent renewable bioenergy crops worldwide (Zhang et al., 2018b).

SSR markers are DNA fragments composed of repetitive nucleotide motifs ranging from 1 to 6 base pairs, and they are present in the genomes of prokaryotes and eukaryotes (Xu et al., 2021b). SSR markers have advantages such as high polymorphism, high specificity, simplicity of operation, relatively low cost, easy detection, and genetic co-dominance. They have been widely used in population differentiation, genetic diversity studies, kinship analysis, linkage analysis, gene mapping, and genetic diversity research, providing convenience for molecular breeding of various crops (Xu et al., 2021b). SSR molecular markers have been used in various plants such as wheat (Triticum aestivum) (Xu et al., 2021a), barley (Hordeum vulgare) (Mohammadi et al., 2020), rice (Oryza sativa) (Li et al., 2023), cotton (Gossypium spp.) (Wang et al., 2023a), and maize (Zea mays) (Belalia et al., 2018) to construct genetic linkage maps, perform variety identification, and analyze genetic diversity. In wheat hybrid populations, SSR markers were used for QTL identification, and six QTLs were identified for leaf rust resistance, which can be directly applied in marker-assisted selection (MAS) for resistance studies (Mohammadi et al., 2020). In peanut, genetic diversity analysis of 101 varieties was conducted using 40 SSR markers, and an evolutionary tree was constructed based on genotyping data, providing a basis for assessing the genetic relationships among varieties and selecting breeding materials (Hong et al., 2021).

Currently, most of the developed SSR markers in sugarcane are EST-SSR markers (Jiang et al., 2023), as well as SSR markers developed based on haploid genomes or closely related species. Previous studies have identified numerous SSRs in primitive sugarcane varieties (Liu et al., 2022). However, these SSRs have not been effectively applied in the molecular breeding of sugarcane cultivars. Therefore, the development of SSR markers based on the whole genome of sugarcane is still in its early stages and is not yet mature. These markers have low polymorphism, low repeatability, and lack richness in the genome. The modern sugarcane cultivar XTT22 has the longest cultivation history and is a key variety in most modern sugarcane cultivars due to its excellent economic traits. Therefore, in this study, SSR markers based on the XTT22 genome were developed with the aim of enriching the existing repertoire of SSR markers in the sugarcane genome. These markers serve as important resources for the construction of sugarcane genetic maps and the screening of molecular markers for marker-assisted breeding.

2 Materials and methods

2.1 Data sources

In this study, we utilized a variety of materials including 14 Poaceae plants, comprising sugarcane varieties, sugarcane related species, and representative Poaceae species. The sugarcane varieties consisted of one modern sugarcane cultivar and two each of S. officinarum and S. spontaneum. The sugarcane related species included Erianthus rufipilus and Saccharum arundinaceum. Representative Poaceae species encompassed barley (Hordeum vulgare), Aegilops tauschii, maize (Zea mays), Sorghum bicolor, millet (Setaria italica), rice (Oryza sativa), and Brachypodium distachyon.

The genome data for S. spontaneum (AP85-441, Np-X), S. officinarum (LA-purple, Badila), Hordeum vulgare, Aegilops tauschii, Zea mays, Sorghum bicolor, Setaria italica, Oryza sativa, Brachypodium distachyon and E. rufipilus (YN2009-3) were obtained from the NCBI database (https://www.ncbi.nlm.nih.gov/). The genomic data for modern sugarcane cultivation varieties (XTT22) can be obtained at sugarcane.gxu.edu.cn/scdb/, and S. arundinaceum (HN92-105) have not been published.

2.2 SSR loci identification and primer design

This study employed the MISA (Microsatellite identification tool) software (http://pgrc.ipk-gatersleben.de/misa) to scan SSR (Simple Sequence Repeat) loci in the whole genome sequences of 14 Poaceae species. The least numbers of SSRs used for identification of mono-, di-, tri-, tetra-, penta-, hexanucleotides and compound type were 10, 7, 6, 5, 4, 4 and 2, respectively, with a maximum spacing of 100 bp between adjacent SSRs. Next, using the identified flanking conserved sequences of the SSR sites from the entire genome of the XTT22, we utilized the Primer3.0 software (http://primer3.sourceforge.net/releases.php) for batch development of SSR primers. The primer design parameters were set as follows: the desired product length ranged from 100 to 300 bp, the primer length ranged from 18 to 23 bp, and the GC-content was set between 40% and 60%. The optimal annealing temperature was set at 60°C. All other parameters were left at their default values.

2.3 Plant materials and DNA extraction

The validation materials for polymorphism of SSR markers in this study include four S. spontaneum (Np-X, SES-208, 2013-8, Hainan 2012-124), three S. officinarum (Badila, LA-purple, NC20), and three modern sugarcane cultivation materials (XTT22, F134, R570). The validation materials for the universality of SSR markers in this study include nine S. spontaneum, eight S. officinarum, 10 modern sugarcane cultivation materials, three S. robustum, and three E. rufipilus.

Total DNA was extracted from fresh leaves using the plant genomic DNA kit (TIANGEN, China). DNA concentrations were checked using BioPhotometer D30 (Eppendorf, Germany) and the quality of DNA was measured by 1.2% agarose gel electrophoresis.

2.4 Polyacrylamide gel electrophoresis (PAGE) run protocol and

Electrophoresis gel preparation: Use 50 mL of non - denaturing polyacrylamide solution for each pair of glass plates. The formula for the 9% solution includes 15 mL of 30% polyacrylamide (29:1), 35 mL of 1×TBE, 500 μL of 10% APS, and 50 μL of TEMED. It takes about 30 minutes to solidify at room temperature. Mix the solutions in a beaker, and pour the gel after the bubbles disappear. After the gel has solidified for about 30 minutes, place the glass plates vertically in the electrophoresis groove, add TBE buffer, and load 2 μL of PCR products with a DNA concentration of 5 ng/μL into the wells in sequence. Load 2 μL of 50 bp Ladder (for PAGE) in the middle well for band comparison. Set the voltage of the electrophoresis apparatus to 150 V and turn off the power after 180 minutes. Prepare the staining solution by adding 10 μL of Gelred to 1000 mL of clean water. After electrophoresis, remove the gel plate, separate the glass plates, make the gel adhere to one plate, immerse it in the staining solution for 10 - 15 minutes, and then observe and take photos using a gel imaging system.

The results of non-denaturing polyacrylamide gel electrophoresis were analyzed. For materials lacking bands at the same amplification site with the same primer pair, a value of 0 was recorded, while materials with bands were recorded as 1. These electrophoresis outcomes were processed to establish a 0 - 1 similarity matrix. Subsequently, the Unweighted Pair - Group Method with Arithmetic Mean (UPGMA) was employed to construct a phylogenetic tree within the NTSYS software.

3 Results

3.1 Comparison of SSR loci among 14 Poaceae species

In this study, we conducted a search and identification of SSR loci in 14 Poaceae species’ genomes. The genome sizes of these Poaceae species ranged from 271 Mb (Brachypodium distachyon) to 8,560 Mb (XTT22, a modern sugarcane cultivar) (Table 1). We observed a significant positive correlation between genome size and the number of SSRs (r=0.958). Additionally, there was a significant negative correlation between genome size and the frequency of SSRs (r=-0.461), consistent with previous research findings in flowering plants (Vinatzer et al., 2013).

Table 1
www.frontiersin.org

Table 1. Distribution characteristics of SSR loci in 14 Poaceae species.

Among the 14 Poaceae plants, rice exhibited the highest SSR frequency (384/Mb), significantly surpassing the average. Brachypodium distachyon followed with a frequency of 200/Mb, while S. officinarum, S. spontaneum, and cultivated species demonstrated the lowest SSR frequencies (118-126/Mb).

After analyzing the SSR loci characteristics of various Poaceae species, it was discovered that there is some similarity in the distribution of SSR loci among these plants. It was observed that mononucleotide repeats repeat types are the most common SSR types in most Poaceae species (Table 1), which is in contrast to previous findings on angiosperms where the most abundant SSR types are three and four nucleotide repeat types (Vinatzer et al., 2013). It is possible that the difference observed is due to different search criteria used when identifying SSRs. For instance, this study considered SSRs with 10 and 11 repeats of mononucleotide repeat type, which was not done in previous studies on angiosperms.

3.2 Characteristics of SSR loci in the genome of modern sugarcane cultivars

A total of 1,054,918 SSR loci were identified in the genome of XTT22 (Table 2; Figure 1). Among them, Chr2H (19,000) holds the highest number of SSRs, while Chr7L (6,441) possesses the lowest (Supplementary Table 1). These SSR loci consisted of 3,935 different types of SSR motifs, with a frequency of 123 per Mb, equivalent to an average of 1 SSR locus every 8.11 kb. Mononucleotide repeat types were the most abundant, with 620,901 loci, accounting for 58.86% of the total. Dinucleotide and trinucleotide repeat types followed, with 238,261 and 103,920 loci, respectively, representing 22.59% and 9.85% of the total. The proportions of tetranucleotide, pentanucleotide, and hexanucleotide repeat types were all below 5%, with 27,575, 34,565, and 29,696 loci, respectively, accounting for 2.61%, 3.28%, and 2.82% of the total. The A/T type represents the highest proportion among single - nucleotide repeat types, constituting 84.09% of the total number of single - nucleotide repeat types. Regarding dinucleotide repeat types, there are four in total, with AT/TA being the dominant element, accounting for 57.92% of this repeat type. For tetranucleotide and pentanucleotide repeat elements, there are 60 and 235 types respectively, and for hexanucleotide repeat elements, there are 975 types. The major repeat elements include TGT/ACA, AAAT/GATG, and CGAGC/AAAAG. Among all the SSR loci, the minimum number of repeats was 4, and the maximum number of repeats was 522. SSR loci with repeat numbers ranging from 4 to 20 accounted for 88.97% of the total. The most abundant SSR loci had 10 repeats, with a total of 355,762 loci, representing 35.15% of the total. This is mainly due to the fact that the number of mononucleotides repeat loci with 10 repeats is higher than the total number of SSR loci with repeat types ranging from dinucleotide to hexanucleotide repeats.

Table 2
www.frontiersin.org

Table 2. Repeat type, number and proportion of SSR in the genome of modern sugarcane cultivar XTT22.

Figure 1
www.frontiersin.org

Figure 1. Distribution and frequency of whole genome SSR. (A) Chromosome of XTT22. (B–H) denote the distribution of Mononucleotide (The density ranges from 0 to 134 SSRs per 500kb), Dinucleotide (The density ranges from 0 to 81 SSRs per 500kb), Trinucleotide (The density ranges from 0 to 28 SSRs per 500kb), Tetranucleotide (The density ranges from 0 to 34 SSRs per 500kb), Pentanucleotide (The density ranges from 0 to 15 SSRs per 500kb), Hexanucleotide (The density ranges from 0 to 12 SSRs per 500kb), and compound (The density ranges from 0 to 180 SSRs per 500kb) repeat SSRs, respectively.

One of the fundamental factors for the usability of SSR markers is the level of SSR polymorphism, which can be influenced by the length of the SSR motif (Zhang et al., 2022). Criteria for low, moderate, and high levels of polymorphism are typically defined as SSR motif lengths below 12 bp, between 12-20 bp, and above 20 bp, respectively. In this study, the distribution of SSR motif lengths in the genome of the XTT22 was analyzed (Figure 2). The highest number of SSR motifs was observed in the range of less than 12 bp (10-11 bp), totaling 329,133 motifs and accounting for 43.51% of the total SSRs. This is mainly due to the abundance of mononucleotide repeat types in this length range. The next most abundant category was motifs with lengths between 12-20 bp, with a count of 301,619 motifs, representing 30.65% of the total. Motifs with lengths exceeding 20 bp had the fewest occurrences, with 254,242 motifs, accounting for 25.84% of the total. These results indicate that the distribution of SSR motif lengths in the entire genome of the modern sugarcane cultivar primarily falls within the range of less than 12 bp, indicating a low level of polymorphism.

Figure 2
www.frontiersin.org

Figure 2. Distribution of SSR motif length in the genome of XTT22.

Statistical analysis was conducted on the SSR loci in each chromosome of the XTT22. The results showed that Chr01 had the highest number of SSR loci (165,914). The SSR frequency and average distance between two SSRs were statistically analyzed in each chromosome of the XTT22. The results showed that there were 108.07-122.90 SSR loci per Mb, with the highest SSR frequency observed on Chr01 at 122.90 loci/Mb. The average distance between two SSRs on Chr01 was 8.14 kb. On the other hand, Chr05 had the lowest SSR frequency at 102.81 loci/Mb, with an average distance of 9.73 kb between two SSRs.

3.3 Development of whole genome level SSR markers in modern sugarcane cultivars

Based on the flanking conserved sequences of SSR loci in the XTT22 genome of modern sugarcane cultivars, a batch design of SSR primers was conducted using Primer3.0 software. A total of 910,519 primer pairs were successfully designed, with 882,677 primer pairs distributed widely distributed on 10 chromosomes. The remaining primer pairs that were not located on chromosomes were not included in the statistical analysis. The frequency of designed primers was 107.43 per Mb, with an average of one SSR primer motif every 9.31 kb. Chr01 had the highest number of SSR primers (154,393), while Chr05 had the fewest (35,502). Among the chromosome, chr03 had the highest SSR frequency, with 114.69 primers per Mb and an average distance of 8.72 kb between two SSRs. Chr05 had the lowest SSR frequency, with 94.42 primers per Mb and an average distance of 10.59 kb between two SSRs.

Subsequently, the amplification efficiency of the 882,677 SSR primer pairs in the XTT22 genome was analyzed using e-PCR. A total of 47,165 primer pairs were able to generate a single amplification product, and these primers were referred to as specific SSR candidate markers. Among them, 35,215 (74.66%) SSR candidate markers were of the standard type, while 11,950 (25.34%) were of the compound type (Table 3). Among the specific SSR candidate markers of the standard type, the mononucleotide repeat type accounted for the highest proportion (49.22%), while the four-nucleotide repeat type was the lowest (2.20%). The distribution of specific SSR candidate markers in the XTT22 genome was analyzed, and Chr10 had the highest number of specific SSR candidate markers, with 10,716 primer pairs. This accounted for approximately 50% of all specific SSR candidate markers, indicating that SSR primers had a much better specificity on Chr10 compared to other chromosomes. Chr01 and chr02 followed with 4,469 and 4,293 specific SSR candidate markers, respectively. Chr05 and chr08 had the fewest number of specific SSR candidate markers among all chromosomes, with 373 and 326 specific SSR candidate markers, respectively. This closely correlated with the size of their respective chromosomes (Table 3).

Table 3
www.frontiersin.org

Table 3. Distribution characteristics of genome-wide specific SSR candidate markers in XTT22.

Among all chromosomes, Chr10 had the highest number of specific SSR candidate marker types. The two most abundant types of specific SSR candidate markers on each chromosome were the mononucleotide e repeat type and the compound type. Their combined quantity exceeded half of the specific SSR candidate markers on each chromosome. However, studies have shown that the mononucleotide repeat type is not suitable for marker development (Cavagnaro et al., 2010). On the other hand, the compound type presents significant challenges and limitations in marker development due to its complex motifs and the possibility of different combinations of repeat numbers within the motifs.

3.4 Polymorphism verification of whole genome level SSR markers in modern sugarcane cultivars

SSR primers with known positions in the genome and specific amplification products have higher utility value. The polymorphism of SSRs is closely related to the length of their motifs. In this study, standard-type specific SSR candidate markers with motif lengths greater than 30 bp were selected from the pool of 47,165 candidates for polymorphism validation in 10 sugarcane materials. A total of 538 specific SSR candidate markers met the screening criteria, and PCR amplification was performed on the 10 different sugarcane materials. The results showed that 507 primer pairs (94.24%) produced amplification products, and among them, 459 primer pairs (85.32%) exhibited polymorphism across the 10 materials (Figure 3). These SSR markers had clear and stable bands, making them easy to identify.

Figure 3
www.frontiersin.org

Figure 3. Native-PAGE results of primers 1J(TCAT)10, 1C(TCTAT)20, 2J(CCCTT)7, 2B(TAGGGT)6, 3I(TATAT)14 and 3I(GTTGT)7. 1-10 sugarcane materials were: 1: Np-X; 2: SES208; 3: 2013-8; 4: HN2012-124; 5: Badila; 6: LA purple; 7: NC20; 8: XTT22; 9: F134; 10: R570. M: 50 bp Ladder (for PAGE).

The polymorphism rates of the 459 polymorphic markers on each chromosome of the XTT22 genome varied and ranged from 66.67% to 100%. Among them, the highest number of specific SSR candidate markers were screened on chr10 (209 primer pairs), and 182 pairs (87.08%) exhibited polymorphism across the 10 different sugarcane materials. Chr05 and chr08 had the highest polymorphism rates, both reaching 100%. This may be attributed to the relatively smaller number of SSR primers initially developed for these chromosomes. Chr02 and chr06 had relatively lower polymorphism rates, at 66.67% and 76.32%, respectively. The polymorphism rates of SSR markers on other chromosomes showed little difference and were all above 80%. This result indicates that the polymorphic SSR markers developed in this study also exhibit high polymorphism rates at the chromosome level.

A statistical analysis of the different types of SSR repeats among the 459 markers revealed that the pentanucleotide repeat type had the highest number of SSR markers, with 198 primer pairs validated, of which 171 (86.36%) exhibited polymorphism. The number of tetranucleotide and hexanucleotide repeat types was very similar, with 150 and 138 markers, respectively, but they showed significant differences in polymorphism rates (81.97% and 97.90%, respectively). Among the 459 polymorphic SSR markers, there was a diverse range of motif lengths, with the majority falling within the 30-40 bp range. The shortest motif length observed was 32 bp, while the longest reached 192 bp. The highest number of SSR markers was found in the 30-40 bp motif length range, totaling 197 markers. The next highest number of SSR markers was in the 41-50 bp motif length range, with 69 markers. In contrast, the fewest SSR markers were observed in the 81-90 bp motif length range, comprising only 12 markers. When the motif length ranged from 91 to 100 bp, SSR markers exhibited the highest polymorphic rate (90.70%). Furthermore, the polymorphic rates of SSR markers in different motif length ranges exceeded 80%. These results indicate that SSR markers with motif lengths greater than 30 bp hold significant potential for future applications in genetic diversity analysis and kinship identification of different sugarcane varieties.

In this study, experiments were also conducted using 21 core SSR markers developed by the International Society of Sugarcane Technologists (ISSCT) on the 10 sugarcane materials (Supplementary Table 2). The results showed that 8 primers amplified clear and distinguishable bands, while 7 primers did not yield any bands across the 10 materials. The remaining markers either showed non-polymorphic bands or weak and unreadable bands. The polymorphism rate of these 21 universal markers across the 10 sugarcane materials was 38.10%, significantly lower than the polymorphic SSR markers developed in this study (85.32%).

3.5 Universality analysis of whole genome level SSR markers in modern sugarcane cultivars

S. officinarum, characterized by high sugar content but relatively weak stress resistance, and S. spontaneum, known for its strong stress tolerance but lower sugar content in stalks, are the two main ancestral species of modern cultivated sugarcane varieties. Modern cultivated sugarcane varieties combine the strong stress tolerance of S. spontaneum with the high sugar content of S. officinarum. The polymorphism of SSR markers has been validated in materials of S. officinarum, S. spontaneum, and cultivated sugarcane varieties. To investigate whether these SSR markers are applicable in a wider range of sugarcane species and their related genera, 24 SSR markers were randomly selected for testing their universality in materials of S. spontaneum, S. officinarum, cultivated sugarcane, S. robustum, and other closely related species within the genus Saccharum.

The sequence information, number of amplified alleles (Na), and polymorphic information content (PIC) of the 24 polymorphic SSR markers were statistically analyzed (Supplementary Table 3). These markers amplified 134 different alleles across 33 sugarcane and related species materials, with an average of 5.58 alleles per primer. The PIC value reflects the ability of SSR markers to differentiate populations. If PIC is greater than 0.5, the SSR marker is considered highly polymorphic. Within the range of 0.25-0.5, it is considered moderately polymorphic. If PIC is less than 0.25, it is considered low polymorphic. The developed 24 polymorphic SSR markers in this study exhibited a PIC range of 0.45-0.95, with an average PIC value of 0.75, indicating rich polymorphism in the amplified bands. None of the SSR markers had a PIC value below 0.25, while two markers had PIC values between 0.25 and 0.5. The remaining 22 markers had PIC values exceeding 0.5, indicating highly polymorphic markers. This suggests that these SSR markers developed of this study can play a crucial role in molecular breeding of sugarcane.

The cluster diagram (Figure 4) of the 33 sugarcane materials were calculated using the NTSYSPC2.1 software. The results showed that the genetic similarity coefficients ranged from 0.19 to 0.93, with an average of 0.58, indicating significant genetic variation among these sugarcane and related species materials. S. officinarum (Keong Java) and the E. rufipilus (YN2009-3 exhibited) the lowest genetic similarity coefficient of 0.19, indicating a large genetic distance and distant relationship between these two materials. On the other hand, the two S. officinarum KARA KARA WA and Cana Blanca had the highest genetic similarity coefficient of 0.93, indicating a close genetic distance and highly similar genetic backgrounds between these two materials. The 24 SSR markers successfully differentiated the 33 different sugarcane materials (Figure 4). When the genetic similarity coefficient reached 0.56, these materials could be classified into three main groups: the first group consisted of 9 S. spontaneum, the second group consisted of 21 materials including 8 S. officinarum, 10 cultivated varieties, and 3 S. robustum, and the third group consisted of 3 E. rufipilus materials. This indicates that the SSR markers developed in this study can be used for genetic relationship identification, cultivar identification, and genetic diversity analysis among sugarcane and its related species. Molecular markers can provide insights into the genetic background of sugarcane varieties with unclear pedigrees or unknown origins, enabling more targeted parental selection and guidance for breeding purposes.

Figure 4
www.frontiersin.org

Figure 4. Cluster analysis of thirty Saccharum and three Erianthus materials.

4 Discussion

4.1 Evaluation of SSR density and characterization in whole-genome of modern cultivated sugarcane

Among the 14 Poaceae species analyzed, excluding Hordeum vulgare, the most abundant repeat type was the mononucleotide repeat type, followed by the dinucleotide repeat type. This finding is consistent with the results reported for Diospyros oleifera (Xu et al., 2023), strawberry (Miao et al., 2021), and Bunium persicum (Bansal et al., 2022), but differs with cotton (Zou et al., 2012). S. officinarum, S. spontaneum, and modern cultivated sugarcane showed similar patterns in terms of nucleotide repeat type content, except for the tetranucleotide repeat type, which showed some variations. In most Poaceae species, the most abundant repeat motif in the mononucleotide repeat type was A/T, except for Hordeum vulgare, Aegilops tauschii, and Zea mays, where G/C content was highest. In the dinucleotide repeat type, most Poaceae species had AT/TA as the most abundant motif, which is similar to soybean (Blair et al., 2008), while foxtail millet and Brachypodium distachyon had GG/CC as the most frequent motif. In the trinucleotide repeat type, TGT/ACA was the most common motif S. officinarum, S. spontaneum, and modern cultivated sugarcane, which is different from loquat (Jiang et al., 2021), where AAG/CTT had the highest proportion. Overall, in most of the analyzed Poaceae species, the most common repeat types in the mononucleotide to trinucleotide repeat motifs contained A and T nucleotides, which is consistent with the results observed in eukaryotes. This could be attributed to the fact that during DNA replication, the hydrogen bonds between A and T require less energy compared to those between G and C, making the A/T structure of SSR motifs more prone to occur.

Microsatellite is an important mutation in the genome because of its wide distribution and high frequency, which plays a significant role in genomics research (Tribhuvan et al., 2019). The wide distribution and high frequency of microsatellites in the genome indicate their universality and importance. The high frequency of these repetitive sequences suggests their potential role in genetic variation and evolutionary processes (Zhao et al., 2023). These findings contribute to our understanding of the genomic features of microsatellites in plants and their relevance in genomic research (Xu et al., 2021b). Prior research has determined that SSR significantly contributes to genome plasticity and environmental adaptation (Yuan et al., 2021). Given the high complexity of modern sugarcane cultivars’ genomes, the identification and utilization of SSR loci could potentially establish a foundation for unraveling the genomic evolution of these modern sugarcane cultivars.

4.2 Potential application of SSR markers of molecular breeding in sugarcane

SSR markers are often used in genetic research and molecular marker-assisted (MAS) breeding. Among the 538 pairs of specific SSR primers developed in this study, a total of 507 primer pairs showed amplification products, resulting in an amplification efficiency of 94.24%. This efficiency was higher than the SSR primer amplification efficiency reported by Wang et al. (2020) based on the haploid cultivar R570 (90.24%) and Gordeiro et al. (2001) based on the sugarcane EST library (60.00%). Additionally, out of the 459 primer pairs, 459 showed polymorphisms in 10 sugarcane materials, with a polymorphic rate of 85.32% (459/538). This polymorphic rate was significantly higher than the polymorphic rate of 32.62% (349/1070) reported by Xiao et al. (2020) for EST-SSR markers in 12 sugarcane materials. The higher polymorphic rate observed in the SSR markers developed from the sugarcane whole-genome sequence in this study compared to the EST-SSR markers developed from the sugarcane EST library by previous researchers may be attributed to the fact that EST-SSR markers mainly represent polymorphisms in gene coding regions (Paliwal et al., 2022), whereas the whole-genome SSR markers are predominantly distributed in non-coding regions, where the mutation rate may be higher than in coding regions. Further, 21 core SSR markers developed by the International Society of Sugar Cane Technologists (ISSCT) were utilized. These markers belong to the first molecular identification database for sugarcane based on SSR markers in the world (Pan, 2010). They were widely used in previous studies when sugarcane genome data were not available. However, with the continuous development of sugarcane genome sequencing, the limited coverage and low representation of these markers in the sugarcane genome have become apparent. Therefore, there is a need for SSR markers with broader coverage and higher polymorphism to serve molecular marker-assisted breeding and genetic mapping in sugarcane. The SSR candidate markers developed in this study are distributed across the chromosomes of modern sugarcane cultivars and exhibit high coverage. They can provide directions for future investigations into the genetic relationships of sugarcane hybrids at the chromosomal level.

S. officinarum serve as the source of high sucrose genes in modern cultivated sugarcane, while S. spontaneum provides resistance traits, making it the most valuable wild species in sugarcane breeding (Jackson, 2005). Modern cultivated sugarcane combines the strong stress resistance of S. spontaneum and the high sucrose content of S. officinarum (Zhang et al., 2018a). Among the selected modern sugarcane cultivars, POJ2878, F134, Chuanzhe 2, and XTT22 represent three stages of parental replacement in sugarcane breeding in China and are considered backbone parents (Zhang et al., 2009). S. robustum, with its advantages of insect resistance, lodging resistance, and drought tolerance, is an important wild species in the Saccharum genus and highly valued by sugarcane breeders (Yu et al., 2023). E. rufipilus, a closely related species of sugarcane, has excellent characteristics of drought tolerance, cold tolerance, and disease resistance (Wang et al., 2010) and has been used in sugarcane breeding through interspecific hybridization. The three selected Miscanthus materials are of great value in sugarcane breeding research, with Yunnan 2009-3, a diploid E. rufipilus, having available genomic data (Wang et al., 2023b), which will play a significant role in the analysis of sugarcane origin and evolution in the future. The SSR markers selected in this study have important research value in the experimental validation materials. Therefore, the identified universal SSR markers can provide a certain number of markers for breeding research in these materials and serve as a reference for selecting more optimized parents in future sugarcane breeding.

5 Conclusion

SSR markers have been widely used in various aspects of sugarcane research, including genetic diversity analysis, variety identification, kinship analysis, and QTL mapping, due to their co-dominance, genome-wide coverage, high polymorphism, and good experimental reproducibility. In this study, SSR loci were identified in the genomes of 14 Poaceae species for the first time. It was found that there was a positive correlation between genome size and the number of SSRs, and a negative correlation between genome size and the frequency of SSRs. Mononucleotide repeats were common in most Poaceae plants. A total of 1,054,918 SSR loci were identified in the genome of the modern sugarcane cultivar XTT22. A total of 910,519 pairs of primers were developed, and after screening, 459 polymorphic SSR markers were obtained. The polymorphism rate of these markers was as high as 85.32%, which was significantly higher than that of the universal markers (38.10%) developed by the International Society of Sugarcane Technologists. The 24 SSR markers showed rich polymorphism in 33 sugarcane and related species materials, and could effectively distinguish these materials. These markers can be used for the identification of genetic relationships, cultivar discrimination, and genetic diversity analysis of sugarcane and related species, providing crucial support for molecular breeding of sugarcane.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author contributions

YX: Funding acquisition, Writing – original draft, Writing – review & editing. SYC: Data curation, Formal Analysis, Software, Writing – original draft. SQC: Data curation, Formal Analysis, Software, Writing – original draft. XW: Formal Analysis, Writing – original draft. HS: Methodology, Writing – original draft. QZ: Investigation, Validation, Writing – review & editing. JZ: Funding acquisition, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the National Natural Science Foundation of China (32301881 and 32272196) and Guangxi Natural Science Foundation (2024GXNSFBA010353).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1573967/full#supplementary-material

Abbreviations

SSR, Simple Sequence Repeat; Chr, Chromosome; ISSCT, International Society of Sugarcane Technologists; Na, Amplified alleles; PIC, Polymorphic Information Content.

References

Bansal, S., Kumar, A., Lone, A. A., Khan, M. H., Malhotra, E. V., Singh, R. (2022). Development of novel genome−wide simple sequence repeats (SSR) markers in Bunium persicum. Ind. Crops Products 178, 114625. doi: 10.1016/j.indcrop.2022.114625

Crossref Full Text | Google Scholar

Belalia, N., Lupini, A., Djemel, A., Morsli, A., Mauceri, A., Lotti, C., et al. (2018). Analysis of genetic diversity and population structure in Saharan maize (Zea mays L.) populations using phenotypic traits and SSR markers. Genet. Resour. Crop Evol. 66, 243–257. doi: 10.1007/s10722-018-0709-3

Crossref Full Text | Google Scholar

Blair, M. W., Buendía, H. F., Giraldo, M. C., Métais, I., Peltier, D. (2008). Characterization of AT-rich microsatellites in common bean (Phaseolus vulgaris L.). Theor. Appl. Genet. 118, 91–103. doi: 10.1007/s00122-008-0879-z

PubMed Abstract | Crossref Full Text | Google Scholar

Cavagnaro, P. F., Senalikd, A., Yang, L. M., Simon, P. W., Harkins, T. T., Kodira, C. D., et al. (2010). Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.). BMC Genomics 11, 1–18. doi: 10.1186/1471-2164-11-569

PubMed Abstract | Crossref Full Text | Google Scholar

Gordeiro, G. M., Casu, R., Mcintyre, C. L., Manners, J. M., Henry, R. J. (2001). Microsatellite markers from sugarcane (Saccharum spp.) ESTs cross transferable to erianthus and sorghum. Plant Sci. 160, 1115–1123. doi: 10.1016/S0168-9452(01)00365-X

PubMed Abstract | Crossref Full Text | Google Scholar

Hong, Y., Pandey, M. K., Lu, Q., Liu, H., Gangurde, S. S., Li, S., et al. (2021). Genetic diversity and distinctness based on morphological and SSR markers in peanut. Agron. J. 113, 4648–4660. doi: 10.1002/agj2.v113.6

Crossref Full Text | Google Scholar

Hu, Z. (2018). Analysis on the production situation and development trend of sugarcane production in China. Sugarcane Canesugar 4, 58–66. doi: 10.3969/j.issn.1005-9695.2018.04.012

Crossref Full Text | Google Scholar

Jackson, P. A. (2005). Breeding for improved sugar content in sugarcane. Field Crops Res. 92, 277–290. doi: 10.1016/j.fcr.2005.01.024

Crossref Full Text | Google Scholar

Jiang, H., Waseem, M., Wang, Y., Basharat, S., Zhang, X., Li, Y., et al. (2023). Development of simple sequence repeat markers for sugarcane from data mining of expressed sequence tags. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1199210

PubMed Abstract | Crossref Full Text | Google Scholar

Jiang, S., Zhang, X., An, H., Xu, F., Zhang, J. (2021). Development and analysis of polymorphism of SSR markers in the whole genome of loquat. Acta Hortic. Sin. 48, 1013–1022. doi: 10.16420/j.issn.0513-353x.2020-0497

Crossref Full Text | Google Scholar

Li, R., Huang, Y., Yang, X., Su, M., Xiong, H., Dai, Y., et al. (2023). Genetic diversity and relationship of shanlan upland rice were revealed based on 214 upland rice SSR markers. Plants 12, 2876. doi: 10.3390/plants12152876

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, L., Wang, H.-B., Li, Y.-H., Chen, S.-Q., Wu, M.-X., Dou, M.-J., et al. (2022). Genome-wide development of interspecific microsatellite markers for Saccharum officinarum and Saccharum spontaneum. J. Integr. Agric. 21, 3230–3244. doi: 10.1016/j.jia.2022.08.129

Crossref Full Text | Google Scholar

Miao, L. X., Yang, X. F., Zhang, Y. C., Shen, Y. Y., Li, X. B., Jiang, G. H. (2021). Development of genome-wide SSR marker and application in strawberry. Mol. Plant Breed. 19, 1210–1222. doi: 10.13271/j.mpb.019.001210

Crossref Full Text | Google Scholar

Mohammadi, S. A., Abdollahi Sisi, N., Sadeghzadeh, B. (2020). The influence of breeding history, origin and growth type on population structure of barley as revealed by SSR markers. Sci. Rep. 10. doi: 10.1038/s41598-020-75339-4

PubMed Abstract | Crossref Full Text | Google Scholar

Paliwal, R., Singh, R., Choudhury, D. R., Tiwari, G., Kumar, A., Bhat, K. C., et al. (2022). Molecular Characterization of Tinospora cordifolia (Willd.) Miers Using Novel g-SSR Markers and Their Comparison with EST-SSR and SCoT Markers for Genetic Diversity Study. Genes 13, 2042. doi: 10.3390/genes13112042

PubMed Abstract | Crossref Full Text | Google Scholar

Pan, Y.-B. (2010). Databasing molecular identities of sugarcane (Saccharum spp.) clones constructed with microsatellite (SSR) DNA markers. Am. J. Plant Sci. 01, 87–94. doi: 10.4236/ajps.2010.12011

Crossref Full Text | Google Scholar

Shi, J., Huang, S., Fu, D., Yu, J., Wang, X., Hua, W., et al (2013). Evolutionary dynamics of microsatellite distribution in plants: insight from the comparison of sequenced brassica, arabidopsis and other angiosperm species. PloS One 8, e59988. doi: 10.1371/journal.pone.0059988

PubMed Abstract | Crossref Full Text | Google Scholar

Tribhuvan, K. U., S. V, A. M., Sharma, P., Das, A., Kumar, K., Tyagi, A., et al. (2019). Identification of genomic SSRs in cluster bean (Cyamopsis tetragonoloba) and demonstration of their utility in genetic diversity analysis. Ind. Crops Products 133, 221–231. doi: 10.1016/j.indcrop.2019.03.028

Crossref Full Text | Google Scholar

Wang, H.-B., Qi, S.-T., Chen, S.-Q., Guo, J.-L., Que, Y.-X. (2020). Development and application of SSR loci in monoploid reference genome of sugarcane cultivar. Acta Agronomica Sin. 46, 631–642. doi: 10.3724/SP.J.1006.2020.94135

Crossref Full Text | Google Scholar

Wang, T., Wang, B., Hua, X., Tang, H., Zhang, Z., Gao, R., et al. (2023b). A complete gap-free diploid genome in Saccharum complex and the genomic footprints of evolution in the highly polyploid Saccharum genus. Nat. Plants 9, 554–571. doi: 10.1038/s41477-023-01378-0

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, X. H., Yang, Q. H., Li, F. S., He, L. L., He, S. C. (2010). Characterization of the Chromosomal Transmission of Intergeneric Hybrids of Saccharum spp. and Erianthus fulvus by Genomic in situ Hybridization. Crop Sci. 50, 1642–1648. doi: 10.2135/cropsci2010.01.0004

Crossref Full Text | Google Scholar

Wang, L., Zheng, Y., Zhang, H., Wang, D., Wang, M., Wang, H., et al. (2023a). Genetic characterization of cotton varieties and genetic threshold value determination for similar variety selection in cotton DUS testing. Genet. Resour. Crop Evol. 70, 2463–2477. doi: 10.1007/s10722-023-01575-y

Crossref Full Text | Google Scholar

Xiao, N., Wang, H., Yao, W., Zhang, M., Ming, R., Zhang, J. (2020). Development and evaluation of SSR markers based on large scale full-length transcriptome sequencing in sugarcane. Trop. Plant Biol. 13, 343–352. doi: 10.1007/s12042-020-09260-5

Crossref Full Text | Google Scholar

Xu, X., Duan, Z., Su, J., Li, X., Wu, J., Yao, Z. (2021a). QTL mapping of adult-plant resistance to leaf rust based on SSR markers and SNP sequencing of Chinese wheat landrace Xu’ai (Triticum aestivum L.). Genet. Resour. Crop Evol. 68, 1359–1373. doi: 10.1007/s10722-020-01067-3

Crossref Full Text | Google Scholar

Xu, Y., Li, D., Afzal, M. Z., Yao, J., Xu, J., Lin, L., et al. (2021b). Genome-wide analysis and polymorphism evaluation of microsatellites involved in photoperiodic flowering-time genes in kenaf (Hibiscus cannabinus L.). J. Natural Fibers 19, 8332–8344. doi: 10.1080/15440478.2021.1964126

Crossref Full Text | Google Scholar

Xu, Y., Liu, C., Yang, X., Wu, K., Gong, B. (2023). Genome-wide microsatellite characterization and marker development in Diospyros oleifera. Ind. Crops Products 203, 117182. doi: 10.1016/j.indcrop.2023.117182

Crossref Full Text | Google Scholar

Yu, X., Tian, Y., Jing, Y., An, R., Lang, R., Dong, L., et al. (2023). Chlorophyll fluorescence characteristics on leaves of germplasm of Saccharum robustum 57NG208 Fi generation. Southwest China J. Agric. Sci. 36, 515–521. doi: 10.16213/j.cnki.scjas.2023.3.009

Crossref Full Text | Google Scholar

Yuan, J., Zhang, X., Wang, M., Sun, Y., Liu, C., Li, S., et al. (2021). Simple sequence repeats drive genome plasticity and promote adaptive evolution in penaeid shrimp. Commun. Biol. 4, 186. doi: 10.1038/s42003-021-01716-y

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, Q., Qi, Y., Zhang, C., Chen, Y., Deng, H. (2009). Pedigree analysis of genetic relationship among core parents of sugarcane in Mainland China. Guangdong Agric. Sci. 10, 44–48. doi: 10.16768/j.issn.1004-874x.2009.10.017

Crossref Full Text | Google Scholar

Zhang, X., Yu, Z., Jin, X.-H., Yu, X.-X., Li, J.-W., Li, J.-Q. (2022). Development and characterization analysis of potato SSR primers and the amplification research in colored potato materials. Acta Agronomica Sin. 48, 920–929. doi: 10.3724/SP.J.1006.2022.14065

Crossref Full Text | Google Scholar

Zhang, J., Zhang, Q., Li, L., Tang, H., Zhang, Q., Chen, Y., et al. (2018a). Recent polyploidization events in three Saccharum founding species. Plant Biotechnol. J. 17, 264–274. doi: 10.1111/pbi.2019.17.issue-1

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, J., Zhang, X., Tang, H., Zhang, Q., Hua, X., Ma, X., et al. (2018b). Allele-defined genome of the autopolyploid sugarcane Saccharum spontaneum L. Nat. Genet. 50, 1565–1573. doi: 10.1038/s41588-018-0237-2

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, M., Shu, G., Hu, Y., Cao, G., Wang, Y. (2023). Pattern and variation in simple sequence repeat (SSR) at different genomic regions and its implications to maize evolution and breeding. BMC Genomics 24, 136. doi: 10.1186/s12864-023-09156-0

PubMed Abstract | Crossref Full Text | Google Scholar

Zou, C. S., Lu, C. R., Zhang, Y. P., Song, G. L. (2012). Distribution and characterization of simple sequence repeats in Gossypium raimondii genome. Bioinformation 8, 801–806. doi: 10.6026/97320630008801

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: modern sugarcane cultivars, genome-wide, SSR markers, polymorphism, molecular breeding

Citation: Xu Y, Chen S, Chen S, Wei X, Shang H, Zhang Q and Zhang J (2025) Genome-wide development of SSR molecular markers for modern sugarcane cultivars. Front. Plant Sci. 16:1573967. doi: 10.3389/fpls.2025.1573967

Received: 10 February 2025; Accepted: 18 March 2025;
Published: 01 April 2025.

Edited by:

Nigel G. Halford, Rothamsted Research, United Kingdom

Reviewed by:

Lakshmi Narayana Vemireddy, Acharya N. G. Ranga Agricultural University, India
Siddanna Savadi, Directorate of Cashew Research (ICAR), India

Copyright © 2025 Xu, Chen, Chen, Wei, Shang, Zhang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jisen Zhang, emppc2VuQDEyNi5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

95% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more