Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 15 March 2022
Sec. Plant Breeding
This article is part of the Research Topic Marker-Assisted Selection (MAS) in Crop Plants View all 45 articles

Genome-Wide Development of Polymorphic Microsatellite Markers and Association Analysis of Major Agronomic Traits in Core Germplasm Resources of Tartary Buckwheat

\r\nSiyu Hou,&#x;Siyu Hou1,2†Xuemei Ren,&#x;Xuemei Ren1,2†Yang Yang,Yang Yang1,2Donghang WangDonghang Wang1Wei DuWei Du1Xinfang WangXinfang Wang1Hongying Li,Hongying Li1,2Yuanhuai Han,Yuanhuai Han1,2Longlong Liu*Longlong Liu3*Zhaoxia Sun,*Zhaoxia Sun1,2*
  • 1College of Agriculture, Institute of Agricultural Bioengineering, Shanxi Agricultural University, Taigu, China
  • 2Shanxi Key Laboratory of Minor Crop Germplasm Innovation and Molecular Breeding, Taiyuan, China
  • 3Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Taiyuan, China

Tartary buckwheat (TB; Fagopyrum tataricum Gaertn.) is an important multigrain crop and medicinal plant, but functional genomics and molecular breeding research in this species have been lacking for quite some time. Here, genome-wide screening was performed to develop simple sequence repeat (SSR) markers associated with six major agronomic traits and the rutin contents of 97 core germplasm resources. A total of 40,901 SSR loci were identified; they were uniformly distributed throughout the TB genome, with a mean distance of 11 kb between loci. Based on these loci, 8,089 pairs of SSR primers were designed, and 101 primer pairs for polymorphic SSR loci were used to genotype the 97 core germplasm resources. The polymorphic SSR loci showed high genetic variation in these core germplasm resources, with an average polymorphic information content (PIC) value of 0.48. In addition, multiple SSR markers, such as SXAU8002 [100-grain weight (HGW)] and SXAU8006 [stem diameter (SD)], were found to be associated with agronomic traits in the two environments. Finally, based on gene functional annotation and homology analysis, a candidate gene, FtPinG0007685500, that may affect the node number and SD of the main stem by participating in lignin synthesis was identified. This study reports the mining of genome-wide SSR loci and the development of markers in TB, which can be used for molecular characterization of the germplasm in its gene pool. In addition, the detected markers and candidate genes could be used for marker-assisted breeding and functional gene cloning in TB.

Introduction

Tartary buckwheat (TB; Fagopyrum tataricum Gaertn.) is a species of Fagopyrum that is recognized as an important edible crop and medicinal plant due to its high abundance of flavonoids with antioxidant activity (Zhang et al., 2012b; Hou et al., 2021). TB is cultivated in diverse ecological zones worldwide, especially as a minor crop, and it is a popular food in southwestern and northern China (Tang et al., 2016). TB has higher nutritional and economic value than common buckwheat, including higher levels of vitamin B, phenolic compounds, and other antioxidants, such as rutin (Uddin et al., 2013; Zhou et al., 2015). Rutin (quercetin-3-O-rutinoside) is one of many well-known flavonoids with high biological activity. This compound exerts antioxidant and anti-inflammatory effects by inhibiting blood platelet aggregation, increasing capillary strength, and reducing cholesterol levels in the blood (Luthar et al., 2020). The recommended daily dose of rutin in the human diet is approximately 50 mg/day (Brunori et al., 2010). As TB is a typical functional food resource, it is necessary to analyze the genetic basis of its important agronomic and quality traits, such as plant height (PH), yield, and rutin content.

Although TB has great nutritional and medicinal values, molecular biology research on TB has long lagged behind that on other crops due to its relatively small planting area worldwide and the comparatively late completion of genome sequencing. Recently, the publication of the TB reference genome enabled the rapid identification and breeding application of important functional genes in TB (Zhang et al., 2017). These genome data also facilitate the construction of fine genetic maps based on single nucleotide polymorphisms (SNPs), simple sequence repeats (SSRs), and sequencing-based markers, and make it possible to identify agronomically important genes by map-based cloning. In recent years, many different types of molecular markers, including amplified fragment-length polymorphisms (AFLPs) (Hou et al., 2009), random amplified polymorphic DNA (RAPD) (Sharma and Jana, 2002; Zhang et al., 2012a), intersimple sequence repeats (ISSRs) (Garima et al., 2013), and SNPs (Sohn et al., 2021), have been developed and used for genetic diversity analysis in TB. Although genetic linkage maps of several different segregating populations have been established based on SSRs and other markers, the marker density of these maps was extremely sparse, which limited the detection of QTLs for important agronomic traits of TB (Du et al., 2013; Li et al., 2017). Recently, a high-density TB genetic map was constructed by using the genotyping results from restriction site-associated DNA (RAD) sequencing, and several QTLs for 1,000-grain weight and hull type were found (Shi et al., 2021). Although traditional linkage mapping based on segregating populations has been effective for mining important functional genes of TB, there have been few reports on the use of association analysis in natural populations to explore the genetic basis of important agronomic traits in TB. Hou et al. developed a total of 23 polymorphic SSR primer pairs with 49.71 effective alleles from genome survey data and revealed the genetic relationships among 64 accessions strongly associated with their geographic regions by UPMGA clustering analysis (Hou et al., 2016). Furthermore, genomic selection breeding using genome resequencing methods for developing new SNP markers would be an effective approach for buckwheat functional gene cloning and yield improvement in the future.

China is the point of origin of buckwheat and has the most abundant germplasm resources for this crop (Tsuji and Ohnishi, 2000). These abundant resources provide a valuable gene pool for the identification and utilization of buckwheat traits. In this study, six agronomic traits and the rutin contents of 97 core TB germplasm resources were evaluated over 2 years; then, the SSR loci in the genome of TB were explored in depth. Finally, the TB accessions were genotyped for 101 representative markers chosen from 40,901 SSR loci, and association studies were conducted for the abovementioned traits. This study provides more complete and accurate information for further gene cloning and marker-assisted selection (MAS) in buckwheat breeding.

Materials and Methods

Tartary Buckwheat Germplasm

A total of 97 buckwheat accessions, including 78 domestic varieties and 19 foreign varieties from the United States, Nepal, Bhutan, and Russia, were used in this study (Supplementary Table 1). All materials were provided by the Institute of Agricultural Bioengineering, Shanxi Agricultural University, and grown during the cropping season (March to September of 2018 and 2019) on the experimental farm of the College of Agronomy, Shanxi Agricultural University, Taigu, Shanxi, China (37°12′N, 112°28′E). Field trials were conducted in a randomized complete block design with three replicates. Each variety was planted in 3 rows, and the plant spacing was 30 cm, with a row spacing of 50 cm. The field experiment followed standard local agronomic management practices for buckwheat.

Agronomic Trait Measurement and Rutin Content Determination

Ten plants with consistent growth were randomly selected from each variety to investigate agronomic traits at the seed filling stage (nearly 15 days after flowering). The investigated traits included plant height (PH; cm), the number of main stem nodes (MNs), and stem diameter (SD; measured with a vernier caliper, mm). At harvest, the whole plants were carefully removed from the field to measure the root length (RL). In addition, ten similarly sized plants were randomly selected, and the branches at both ends of the main stem were separated. The number of branches was recorded as the number of lateral branches (LBs). The 100-grain weight (HGW) of each variety was measured with 3 replicates in total. Plants of a similar size were selected to reduce the error in the collection of materials and the measurement of related phenotypic traits. The best linear unbiased estimator (BLUE) values of all traits over 2 years were calculated by using the R package lme4.1

High performance liquid chromatography (HPLC) was used to determine the rutin contents of TB seeds. The detailed steps of this procedure were as follows: TB seeds were placed into a clean mortar and ground into a powder, which was then transferred to a 2 ml centrifuge tube, and 0.15 g of powder was accurately weighed; then, 1.5 ml of methanol (chromatographic grade) was added to the centrifuge tube, which was placed in an ultrasonic water bath for 30 min (50°C). Thereafter, the tube was centrifuged for 10 min at 10,000 × g at 4°C, and 20 μl of the supernatant was diluted with 1 ml of methanol. Finally, 100 μl of the extract was placed in a liquid chromatography vial protected from light. The analysis conditions were as follows: the column temperature was maintained at 30°C, the mobile phase was methanol:water at 46:54, the injection volume was 5 μl, the detection wavelength was 257 nm, and the standard sample was rutin (Sigma, purity ≥ 95%). An Agilent Series 1100 liquid chromatograph (United States) and C18 column (150 mm × 4.6 mm, 5 μm; Agilent, United States) were selected.

The standard curve was drawn according to the method of Sun et al. (2020). The linear regression equation was obtained according to the standard curve, and the contents of rutin in different samples were calculated according to the following equation (Supplementary Figure 1):

Y=11.302X-84.113

The univariate linear regression equation indicated a significant relative coefficient (R2 = 0.9877) between the argument X (peak area of rutin compound by HPLC) and dependent variable Y (rutin content).

Statistical Analysis of Phenotypic Traits

Descriptive statistical analysis was performed using IBM SPSS Statistics 24,2 and the genotype and environment were treated as fixed and random effects, respectively. The correlation coefficients among all agronomic traits were calculated using the R package Hmisc,3 and the factor extraction and analysis of agronomic traits were performed using the R package Psych.4 In addition, clustering analysis of the natural TB population was performed using PowerMarker 3.25.5

Development of Genome-Wide Simple Sequence Repeat Markers

The reference genome information for ‘‘Pinku1’’ was downloaded from the MBKBASE website,6 and SSR locus information was collected by MISA7 (Beier et al., 2017). The filtering conditions were set as follows: 10 repeats for dinucleotides, 7 repeats for trinucleotides, and 5 repeats for tetranucleotides, pentanucleotides, and hexanucleotides. According to the location information for the obtained SSR loci, primer sequences were designed in batches, and a 200 kb interval between two primers was chosen for further analysis. All primers were synthesized by Thermo Fisher Scientific (Supplementary Table 2).

Simple Sequence Repeat Marker Detection and Association Analysis

For each accession, the healthy leaves from three plants were pooled and chosen for DNA extraction via the modified CTAB method (following the manufacturer’s instructions, SL2071-Coolaber, Beijing, China). The DNA from 97 TB accessions was used as a template for PCR amplification, and the primers are listed in Supplementary Table 3. PCR products were detected by 8% polyamide gel electrophoresis with silver staining. For each SSR primer pair, different combinations of bands produced by different genotypes were counted individually, e.g., 1:1, 2:2, and 3:3, so different markers produced different numbers of alleles (Supplementary Table 4). The polymorphic information content (PIC), gene diversity, and other genetic polymorphic information of the selected SSR markers were calculated using PowerMarker 3.25. In addition, genetic distances were calculated according to Nei, cluster analysis was performed via the neighbor-joining method, and the genetic relationship coefficients were calculated by using Spatial Pattern Analysis of Genetic Diversity (SPAGeDi)8 (Hardy, 2016). The population structure was analyzed using Structure 2.3.4,9 and the optimal K was chosen with Structure Harvester,10 which used the Q matrix for the association analysis. Finally, an association study of the phenotypic data and genotypic data was carried out using Tassel 5.0,11 and the Q matrix was used as a covariate for adjustment (Bradbury et al., 2007).

Results

Agronomic Characteristics and Rutin Content Evaluation

All 7 traits examined in the two environments (2018 and 2019) and BLUE values showed approximately normal distributions with low skewness (Table 1). Among the traits, HGW, LB, SD, PH, rutin, and RL showed relatively small skewness values, while MN presented a large skewness value and showed a weakly skewed distribution. The variation coefficients of the 7 traits ranged from 10.30 to 36.90%, with the minimum HGW being observed in 2018 and the maximum SD in 2019. LB, RL, and SD showed relatively large variation coefficients, indicating that the materials selected for this study were broadly representative of these traits. In general, all traits showed significant differences among genotypes and environments, and they were affected by genotype × environment interactions.

TABLE 1
www.frontiersin.org

Table 1. Descriptive statistics of the main agronomic traits and rutin contents in the two environments and best linear unbiased estimator (BLUE).

Correlation Coefficients of Agronomic Traits and Rutin Contents

The basic correlations between different agronomic traits were calculated using Pearson’s approach in this study. SD was significantly and positively correlated with LB and PH in 2 years, with correlation coefficients of 0.37–0.60 and 0.54–0.63, respectively (Supplementary Table 5). SD and HGW showed a significant positive correlation, with a correlation coefficient of 0.36 in 2018. RL and SD and MN and SD showed significant positive correlations (0.49 and 0.69, respectively) in 2019. In addition, there was a significant negative correlation between rutin content and PH in 2019. PH was influenced by many other traits, such as SD, LB, and MN, in the two environments. In addition, the BLUE values of RL, LB, MN, SD, and PH showed significant positive correlations, as follows: RL-LB with a coefficient of 0.26, LB-MN with a coefficient of 0.26, LB-SD with a coefficient of 0.44, PH-MN with a coefficient of 0.28, PH-SD with a coefficient of 0.49, and MN-SD with a coefficient of 0.48 (Table 2). Overall, these agronomic traits showed relatively stable correlations in the two environments.

TABLE 2
www.frontiersin.org

Table 2. Correlations between the BLUE values of six main agronomic traits and rutin content.

Simple Sequence Repeat Markers and Genetic Structural Analysis of Tartary Buckwheat

A total of 40,901 SSR loci were uniformly distributed across the whole TB genome, with a mean distance of 11 kb between loci. There were 26,156 dinucleotide repeat SSRs, accounting for 63.95% of the total SSR loci, and the remaining SSRs were trinucleotides (28.60%), tetranucleotides (5.52%), pentanucleotides (1.36%), and hexanucleotides (0.57%) (Supplementary Table 6). The average distribution density of SSR loci on the 8 chromosomes (Chrs) ranged from 87.69 to 92.27 Mb, and the density on Chrs 4 and 8 was relatively high at 91.32 and 92.27 Mb, respectively.

Among the different types of SSR markers, the identified motifs for the dinucleotide repeats included four types, namely, AT/TA, AG/CT, AC/GT, and CG/GC (Figure 1). Among these types, the greatest number (24,705) of motifs was AT/TA, accounting for 94.45% of all dinucleotide repeats, while AG/CT and AC/GT accounted for 3.37% (881) and 2.18% (569), respectively. Only one CG/GC motif was found. There were 6 types of trinucleotide repeat units, among which AAT/ATT had the maximum number of units at 4,752, accounting for 40.62% of the total. The number of AGG/CCT repeats was the least at 366, accounting for 3.13% of all SSRs. There were 8 types of tetranucleotide repeat units, among which the number of AAAT/ATTT repeats was the highest at 1,129, accounting for 49.98%. The numbers of AAAG/CTTT, ACAT/ATGT, AAGG/CCTT, AGAT/ATCT, ACGC/CGTG, AATT/TTAA, and AAAC/GTTT repeats were 360, 237, 101, 98, 92, 90, and 40, accounting for 15.94, 10.49, 4.47, 4.34, 4.074, 3.98, and 1.77%, respectively. The number of AAGAG/CTCTT pentanucleotides was the highest at 190, accounting for 34.05%, followed by AAAAT/ATTTT (94, 16.85%); AAATT/AATTT and AAGAT/ATCTT pentanucleotides were also found. AAACCG/CGGTTT hexanucleotides accounted for 12.66%. Overall, the major repeat motifs were AT/TA (60.4%), AAT/ATT (11.62%), AAG/CTT (7.59%), ATC/ATG (3.11%), AAAT/ATTT (2.76%), AG/CT (2.15%), AAC/GTT (1.83%), AGC/CTG (1.43%), AC/GT (1.39%), and ACC/GGT (1.24%) (Supplementary Table 7).

FIGURE 1
www.frontiersin.org

Figure 1. Proportions of different types of simple sequence repeat (SSR) units in the Tartary buckwheat (TB) genome. Each pie chart represents the SSR loci with different repeat units in the genome.

A total of 8,089 pairs of SSR primers were designed after extracting the SSR locus sequences using the BMKCloud platform. To ensure maximum coverage of the TB genome, 1,119 of 8,089 SSR loci evenly distributed on Chrs and separated by more than 200 kb were used to detect SSR marker polymorphisms in a small natural population of 20 accessions. Finally, 101 primer pairs for polymorphic SSR loci were chosen for genotyping and association analysis (Figure 2 and Supplementary Tables 2, 3). The genome specificity of all SSR primers was verified by BLAST alignment, and all SSR primers were specific to their chromosomal locations. Here, Chr 1 had the greatest number of effective markers (20), while the fewest (7) occurred on Chr 6; the numbers of alleles for each marker varied from 2 to 7. For example, the highest number of alleles was found for the polymorphism marker SXAU3168 (Chr 3) at 7, while the average number of polymorphic loci on Chr 5 was 3.79. The variation in the PIC ranged from 0.43 to 0.56, with an average of 0.48, indicating that this population was relatively rich in genetic variation (Supplementary Table 8).

FIGURE 2
www.frontiersin.org

Figure 2. The chromosomal distribution of the 101 SSR markers used in this study.

The stratification of the population structure of different genotypes can cause false positives in trait-genotype association studies. Therefore, the optimal number of subgroups (K) was calculated from the effective PIC using Structure 2.3.4, and K = 2 was chosen as the optimal value (Figure 3A). In addition, the population structures of the 97 germplasm resources were displayed by the Bayesian-based structure analysis with K = 2 (Figure 3B).

FIGURE 3
www.frontiersin.org

Figure 3. Population structure and relationship analysis of 97 core TB germplasm resources. (A) Distribution of ΔK at different K-values. (B) Population structure of 97 core TB germplasm resources inferred by genotyping 101 SSR markers (K = 2). Not all names of the accessions are displayed. (C) Neighbor-joining cluster diagram of TB resources based on SSR markers. (D) Affinity within the population and the proportion of different levels of kinship.

In this study, the germplasm resources were clustered via the “neighbor-joining” method based on Nei’s (1973) genetic distance (Figure 3C). The germplasm resources were also divided into two groups. Group I contained 49 accessions that came from Anhui (3), Guangxi (1), Gansu (4), Hubei (8), Hunan (2), Shanxi (1), Qinghai (1), and Yunnan (2) in China; the United States (8); Russia (3); Bhutan (1); and unknown locations (15). Group II contained 48 accessions that came from Guangxi (4), Guizhou (8), Sichuan (5), Hunan (1), Qinghai (1), and Yunnan (1) in China; Russia (1); Nepal (1); Bhutan (5); and unknown locations (21). The kinship values of these accessions were calculated using SPAGeDi software based on the SSR polymorphic information. The kinship value (92.71%) of the accessions was less than 0.2 (Figure 3D), indicating that the accessions exhibited low genetic relatedness at the individual level and that the population contained extensive genetic diversity.

Association Analysis and Candidate Gene Mining

A general linear model (GLM) was used to detect the associations between SSR markers and variations in the 97 TB cultivars with thresholds of p < 0.05, 0.01, and 0.001. At p < 0.05, 144 SSR loci were significantly associated with 7 traits in 2018, among which 27, 25, 13, 13, 4, 3, and 2 SSR loci were associated with HGW, PH, MN, SD, LB, RL, and rutin content, explaining 4.3–21.35%, 5.05–18.96%, 4.23–14.84%, 4.8–24.57%, 4.16–21.0%, 4.03–15.03%, and 4.54–16.95% of the observed phenotypic variation, respectively. Sixty-seven SSR loci were significantly associated with the 7 traits in 2019. More than 10 SSR loci were associated with PH, RL, and HGW, and the explained variance was 3.5–17.0%. At p < 0.01, only 16 SSR loci were significantly associated with 6 traits in the two environments, with the exception of MN in 2019. At p < 0.001, only 1 SSR locus was significantly associated with LB and SD in the two environments, with additional loci associated with PH in 2018 and rutin content in 2019. Moreover, SXAU6182 and SXAU7127 were significantly associated with LB_BLUE.

The results of the mixed linear model (MLM) GWAS analysis showed that 55 SSR markers were significantly associated with 6 agronomic traits and rutin content at the critical significance threshold of p < 0.05 (Supplementary Table 9). Among them, 20, 16, and 19 SSR markers were detected for 2018, 2019, and BLUE values, respectively. Rutin content was associated with six SSRs in 2018, followed by RL (4), LB (4), SD (3), HGW (1), PH (1), and MN (1). HGW was correlated with four SSRs, followed by LB (3), SD (3), rutin content (2), MN (2), and RL (2). In addition, the BLUE values of HGW, LB, and RL were associated with 5, 4, and 4 SSR markers, respectively. Overall, relatively few SSR markers were detected for the two environments and BLUE values.

The SSR loci associated with the same trait in at least two environments were further counted and considered stable and valuable across environments (Figure 4 and Table 3). Twenty-two SSR loci were found to be associated with the same trait in at least two environments, including SXAU1108 (SD), SXAU2034 (rutin), SXAU4217 (HGW), and SXAU7127 (LB). The seven markers associated with HGW were SXAU1120, SXAU3357, SXAU4217, SXAU4308, SXAU4243, SXAU4246, and SXAU8002, respectively; the four markers associated with PH were SXAU1827, SXAU3271, SXAU5232, and SXAU8006, respectively; the three markers associated with RL were SXAU1064, SXAU4224, and SXAU8064, respectively; and the six markers associated with LB were SXAU1064, SXAU1120, SXAU6182, SXAU7127, SXAU7967, and SXAU8006, respectively. In addition, some SSR markers were significantly associated with multiple traits in both environments, such as SXAU1064 (LB and RL), SXAU1120 (SD, HGW, and LB), and SXAU8006 (SD, LB, and PH). These findings suggested that these regions could affect multiple traits in TB. Notably, SXAU4308 had significant effects on HGW in all environments at p < 0.05, with a phenotypic interpretation rate ranging from 0.04 to 0.17. Thus, this marker could be used as an optimal locus in marker-assisted breeding and yield improvement. In addition, SXAU4332 was significantly associated with MN in the two environments. After the detailed screening of gene functional annotations, a total of 21 candidate genes were found within the 300-kb region surrounding marker SXAU4332. Among the 21 genes, FtPinG0007685500 encodes an MYB transcription factor containing an MYB domain, which is a homolog of the rice MYB41 gene (LOC11429417).

FIGURE 4
www.frontiersin.org

Figure 4. Venn diagram of associated sites based on a general linear model.

TABLE 3
www.frontiersin.org

Table 3. Details of the SSR markers associated with 6 agronomic traits and rutin content via general linear model (GLM) and mixed linear model (MLM).

Discussion

Agronomic traits are the unique traits of crop varieties that are determined by their genetic background and the influence of the environment (Cao et al., 2016). The accurate investigation of agronomic traits contributes to the utilization of excellent traits and germplasm innovation (Jia et al., 2013). Brunori et al. (2012) found that the yield traits of 13 TB varieties from different regions were significantly different. In our study, significant differences in phenotypic characteristics and quality characteristics (rutin content) were found among different TB varieties and environments. These results indicated that these varieties show differences in adaptability to the environment and certain differences in phenotype. Therefore, it is necessary to cultivate new varieties that can adapt to complex environmental conditions and present excellent comprehensive phenotypes to meet the requirements of different regions.

Previous studies showed that there were significant correlations between PH and MSN, PH and SD, and MSN and SD (Liang et al., 2020). Similar results were obtained in this study, and SD was positively correlated with MSN and PH. In addition, PH directly affected HGW and other yield traits. Our results showed that buckwheat yield was directly affected by LB, PH, and the node number of the main stem. It was also found that PH was positively correlated with lateral branch number, HGW, and RL. Therefore, improvement of the plant-type structure is an effective direction for improving the yield and quality traits of TB.

Recently, the application of association analysis to plant molecular marker-assisted breeding has effectively accelerated the selection of superior varieties (Duan et al., 2020; Chu et al., 2021), especially in major crops. Breseghello (2006) used grain traits of 95 wheat varieties to conduct association analysis with 18 SSR markers and found that Xgwm30, Xwmc111, and Xgwm261 were significantly correlated with grain width, which contributed to related QTL studies. Agrama et al. (2007) analyzed 123 SSR markers among 92 rice varieties and obtained markers significantly associated with grain length, grain width, yield, and 1,000-grain weight. Feng et al. (2019) used GLM, MLM, and mrMLM to carry out marker-trait association analysis; 33 PSB single nucleotides were detected on 17 Chrs, and the proportion of phenotypic variation explained by the markers ranged from 1 to 32%. Lai et al. (2013) used GLMs and MLMs in the association analysis of barley SSR markers and agronomic traits and simultaneously detected 3 markers related to PH, 2 markers related to the spikelet-bearing density, and 1 marker related to awn length, with a maximum contribution rate of 6.95%.

However, the realization of association studies relies on the use of a large number of effective markers rich in polymorphic information, which have been lacking in TB studies. Based on high-throughput data, we conducted a detailed screening of the variation in SSR loci in the TB genome. Here, 8,089 SSR loci were identified in the TB genome, and 101 loci with high polymorphism were detected in 97 accessions. These SSR markers greatly enriched the TB molecular marker database, and multiple loci were confirmed to be related to important agronomic traits of TB. In this study, seven SSR markers associated with HGW, four markers associated with PH, three markers associated with RL, and six markers associated with LB were obtained by association analysis using 101 SSR loci in the two environments. Their phenotypic interpretation rates ranged from 1.32 to 5.07%, which lays a foundation for MAS in the cultivation of excellent TB varieties. In addition, an MYB transcription factor gene (FtPinG0007685500) was found near the significant marker SXAU4332; this gene is homologous to MYB92 with high similarity of 97.73%. MYB92 was confirmed to be involved in the formation of the lifetime cell wall and control lignin synthesis in hybrid poplars. The SXAU4332 marker was significantly correlated with the node number and the SD of the main stem, and FtPinG0007685500 may have a similar function in TB. Although additional experimental verification is necessary, FtPinG0007685500 can still be regarded as an important candidate gene affecting the node number and SD of the main stem. In general, these SSR markers and important candidate loci and genes will contribute to the improvement of important agronomic and quality traits and molecular breeding in TB.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

ZS and SH conceived and designed the experiments. SH, XR, and YY wrote the manuscript. DW, WD, and XW performed the experiments and analyzed the data. YH, LL, HL, and SH administrated the project. YH and SH revised the manuscript. All authors have read and gave final approval for publication.

Funding

This study was supported by the Applied Basic Research Program of the Shanxi Academy of Agricultural Sciences (YGC2019FZ2), the Agricultural Science and Technology Innovation Project of Shanxi Agricultural University (YGC20201), the Research Program Sponsored by the Shanxi Key Laboratory of Minor Crop Germplasm Innovation and Molecular Breeding, Shanxi Agricultural University (202105D121010-20), the National Key R&D Program of China (2017YFE0117600), and the China Agriculture Research System of MOF and MARA (CARS-07-A-2).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.819008/full#supplementary-material

Footnotes

  1. ^ https://github.com/lme4/lme4
  2. ^ https://www.ibm.com/cn-zh/products/spss-statistics
  3. ^ https://hbiostat.org/R/Hmisc/
  4. ^ https://personality-project.org/r/psych/
  5. ^ https://brcwebportal.cos.ncsu.edu/powermarker/
  6. ^ http://www.mbkbase.org/Pinku1/
  7. ^ https://webblast.ipk-gatersleben.de/misa/index.php?action=3&help=2
  8. ^ https://github.com/reedacartwright/spagedi
  9. ^ http://web.stanford.edu/group/pritchardlab/structure_software/release_versions/v2.3.4/html/structure.html
  10. ^ http://taylor0.biology.ucla.edu/structureHarvester/
  11. ^ https://tassel.bitbucket.io/

References

Agrama, H. A., Eizenga, G. C., and Yan, W. (2007). Association mapping of yield and its components in rice cultivars. Mol. Breeding. 19, 341–356. doi: 10.1007/s11032-006-9066-6

CrossRef Full Text | Google Scholar

Beier, S., Thiel, T., Münch, T., Scholz, U., and Mascher, M. (2017). MISA-web: a web server for microsatellite prediction. Bioinformatics. 33, 2583–2585. doi: 10.1093/bioinformatics/btx198

PubMed Abstract | CrossRef Full Text | Google Scholar

Bradbury, P. J., Zhang, Z. W., Kroon, D. E., Casstevens, T. M., Ramdoss, Y., and Buckler, E. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 23, 2633–2635. doi: 10.1093/bioinformatics/btm308

PubMed Abstract | CrossRef Full Text | Google Scholar

Breseghello, F. (2006). Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172, 1165–1177. doi: 10.1534/genetics.105.044586

PubMed Abstract | CrossRef Full Text | Google Scholar

Brunori, A., Baviello, G., Corrado, Z., Gianluca, C., Gergő, S., and Végvári, G. (2010). The use of Fagopyrum tataricum gaertn. whole flour to confer preventive contents of rutin to some traditional tuscany biscuits. Annals of the University Dunarea de Jos of Galati. Fascicle VI Food Technol. 34, 148–151.

Google Scholar

Brunori, A., Baviello, G., Kajdi, F., Silva, J., and Gyrgy, V. (2012). Grain yield and rutin content of common and tartary buckwheat varieties grown in north-western hungary. Euro. J. Plant Sci. Biotechnol. 6, 70–74.

Google Scholar

Cao, K., Zhou, Z. K., Wang, Q., Guo, J., Zhao, P., Zhu, G. R., et al. (2016). Genome-wide association study of 12 agronomic traits in peach. Nat. Commun. 7:13246. doi: 10.1038/ncomms13246

PubMed Abstract | CrossRef Full Text | Google Scholar

Chu, H. W., Tu, R. J., Niu, F., Zhou, J. H., Sun, B., Luo, Z. Y., et al. (2021). A New PCR/LDR-Based Multiplex Functional Molecular Marker for Marker-Assisted Breeding in Rice. Rice Sci. 28, 6–10.

Google Scholar

Du, X. L., Zhang, Z. W., Wu, B., Li, Y. Q., and Yu, A. H. (2013). Construction and analysis of genetic linkage map in tartary buckwheat (Fagopyrum tataricum) using SSR. Chin Agri. Sci. Bull. 29, 61–65.

Google Scholar

Duan, X. X., Yu, H. X., Ma, W. J., Sun, J. Q., Yang, R. C., Ning, T. Y., et al. (2020). A major and stable QTL controlling wheat thousand grain weight: identification, characterization, and CAPS marker development. Mol Breeding 40:68. doi: 10.1007/s11032-020-01147-3

CrossRef Full Text | Google Scholar

Feng, G. L., Zhai, F. Y., Liu, H. L., and Ai, N. J. (2019). Identification of genomewide single-nucleotide polymorphisms associated with presummer, summer and bolls in upland cotton. J Genet 98:72. doi: 10.1007/s12041-019-1118-x

CrossRef Full Text | Google Scholar

Garima, K., Pandey, A., Dobhal, R., and Gupta, S. (2013). Population genetic study of Fagopyrum tataricum from Western Himalaya using ISSR markers. Biochem. Genet. 51, 750–765. doi: 10.1007/s10528-013-9604-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Hardy, O. J. (2016). Population genetics of autopolyploids under a mixed mating model and the estimation of selfing rate. Mol. Ecol. Resour. 16, 103–117. doi: 10.1111/1755-0998.12431

PubMed Abstract | CrossRef Full Text | Google Scholar

Hou, S. Y., Du, W., Hao, Y. R., Han, Y. H., Li, H. Y., Liu, L. L., et al. (2021). Elucidation of the Regulatory Network of Flavonoid Biosynthesis by Profiling the Metabolome and Transcriptome in Tartary Buckwheat. J. Agric. Food Chem. 69, 7218–7229. doi: 10.1021/acs.jafc.1c00190

PubMed Abstract | CrossRef Full Text | Google Scholar

Hou, S. Y., Sun, Z. X., Linghu, B., Xu, D. M., Wu, B., Zhang, B., et al. (2016). Genetic Diversity of Buckwheat Cultivars (Fagopyrum tartaricum Gaertn.) Assessed with SSR Markers Developed from Genome Survey Sequences. Plant Mol. Biol. Rep. 34, 233–241. doi: 10.1007/s11105-015-0907-5

CrossRef Full Text | Google Scholar

Hou, Y. J., Zhang, Z. W., Wu, B., and Li, Y. Q. (2009). Genetic Diversity in Tartary Buckwheat Revealed by AFLP Analysis. Scientia Agri. Sinica. 12, 4166–4174. doi: 10.3864/j.issn.0578-1752.2009.12.006

CrossRef Full Text | Google Scholar

Jia, G. Q., Huang, X. H., Zhi, H., Zhao, Y., Zhao, Q., Li, W. J., et al. (2013). A haplotype map of genomic variations and genome-wide association studies of agronomic traits in foxtail millet (Setaria italica). Nat. Genet. 45, 957–961. doi: 10.1038/ng.2673

PubMed Abstract | CrossRef Full Text | Google Scholar

Lai, Y., Wang, P. X., Fan, G. Q., Jing, S. E., Wang, J., Yang, K., et al. (2013). Genetic Diversity and Association Analysis Using SSR Markers in Barley. Scientia Agri. Sinica 46, 233–224. doi: 10.3864/j.issn.0578-1752.2013.02.002

CrossRef Full Text | Google Scholar

Li, R. Y., Liang, L. B., Shi, T. X., Huo, D. A., and Chen, Q. F. (2017). Construction of a microsatellite based genetic map of tartary buckwheat using F5 recombinant inbred lines. J. Guizhou Normal Univ. 35, 31–45.

Google Scholar

Liang, S. H., Li, J., Zhou, D., Ren, H. L., Gao, J. F., and Wang, P. K. (2020). Genetic Diversity Analysis on the Morphological Traits of Tartary Buckwheat Germplasm from Major Production Regions of China. Mol. Plant Breeding 18, 7254–7266. doi: 10.13271/j.mpb.018.007254

CrossRef Full Text | Google Scholar

Luthar, Z., Zhou, M. L., Golob, A., and Germ, M. (2020). Breeding Buckwheat for Increased Levels and Improved Quality of Protein. Plants. 10:14. doi: 10.3390/plants10010014

PubMed Abstract | CrossRef Full Text | Google Scholar

Sharma, T., and Jana, S. (2002). Species relationships in Fagopyrum revealed by PCR-based DNA fingerprinting. Theor. Appl. Genet. 105, 306–312. doi: 10.1007/s00122-002-0938-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, T. X., Li, R. Y., Zheng, R., Chen, Q. F., Li, H., Huang, J., et al. (2021). Mapping QTLs for 1000-grain weight and genes controlling hull type using SNP marker in Tartary buckwheat (Fagopyrum tataricum). BMC Genomics 22:142. doi: 10.1186/s12864-021-07449-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Sohn, H. B., Kim, S. J., Hong, S. Y., Park, S. G., and Kim, Y. H. (2021). Development of 50 InDel-based barcode system for genetic identification of tartary buckwheat resources. PLoS One. 16:e0250786. doi: 10.1371/journal.pone.0250786

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, Z. X., Linghu, B., Hou, S. Y., Liu, R. H., Wang, L., Hao, Y. R., et al. (2020). Tartary Buckwheat FtMYB31 Gene Encoding an R2R3-MYB Transcription Factor Enhances Flavonoid Accumulation in Tobacco. J. Plant Growth Regul. 39, 564–574. doi: 10.1007/s00344-019-10000-7

CrossRef Full Text | Google Scholar

Tang, Y., Ding, M. Q., Tang, Y. X., Wu, Y. M., Shao, J. R., and Zhou, M. L. (2016). Germplasm Resources of Buckwheat in China. Molecular Breeding and Nutritional Aspects of Buckwheat. Cambridge: Academic Press, 13–20. doi: 10.1016/B978-0-12-803692-1.00002-X

CrossRef Full Text | Google Scholar

Tsuji, K., and Ohnishi, O. (2000). Origin of cultivated Tatary buckwheat (Fagopyrum tataricum Gaertn.) revealed by RAPD analyses. Genet Resour. Crop Ev. 47, 431–438. doi: 10.1023/A:1008741513277

CrossRef Full Text | Google Scholar

Uddin, M. R., Li, X. H., Kim, Y. B., Chae, S. C., and Park, S. U. (2013). Phenolic compounds in different organs of tartary buckwheat (Fagopyrum tataricum Gaertn.) cultivars. Aust. J. Crop Sci. 7, 1861–1865.

Google Scholar

Zhang, L. J., Li, X. X., Ma, B., Gao, Q., Du, H. L., Han, Y. H., et al. (2017). The tartary buckwheat genome provides insights into rutin biosynthesis and abiotic stress tolerance. Mol. Plant. 10, 1224–1237. doi: 10.1016/j.molp.2017.08.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, W. Y., Fang, Z. W., and Wang, K. H. (2012a). Genetic diversity of Fagopyrum tataricum using SRAP markers. Guangdong Agr. Sci. 39, 148–150+152.

Google Scholar

Zhang, Z. L., Zhou, M. L., Tang, Y., Li, F. L., Tang, Y. X., Shao, J. R., et al. (2012b). Bioactive compounds in functional buckwheat food. Food Res. Int. 49, 389–395. doi: 10.1016/j.foodres.2012.07.035

CrossRef Full Text | Google Scholar

Zhou, Y. M., Wang, H., Cui, L. L., Zhou, X. L., Wen, T., and Song, X. L. (2015). Evolution of nutrient ingredients in tartary buckwheat seeds during germination. Food Chem. 186, 244–248. doi: 10.1016/j.foodchem.2015.03.115

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Tartary buckwheat, genome-wide SSR marker development, association study, core germplasm, agronomic traits

Citation: Hou S, Ren X, Yang Y, Wang D, Du W, Wang X, Li H, Han Y, Liu L and Sun Z (2022) Genome-Wide Development of Polymorphic Microsatellite Markers and Association Analysis of Major Agronomic Traits in Core Germplasm Resources of Tartary Buckwheat. Front. Plant Sci. 13:819008. doi: 10.3389/fpls.2022.819008

Received: 20 November 2021; Accepted: 31 January 2022;
Published: 15 March 2022.

Edited by:

Baohua Wang, Nantong University, China

Reviewed by:

Fa Cui, Ludong University, China
Mulatu Geleta, Swedish University of Agricultural Sciences, Sweden

Copyright © 2022 Hou, Ren, Yang, Wang, Du, Wang, Li, Han, Liu and Sun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Longlong Liu, lllong781211@sina.com; Zhaoxia Sun, 18636071356@163.com, zhaoxiasun@sxau.edu.cn

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.