- United States Department of Agriculture – Agricultural Research Service (USDA-ARS) Corn, Soybean and Wheat Quality Research Unit, Wooster, OH, United States
Soft winter wheat has been adapted to the north-central, north-western, and south-central United States over hundreds of years for optimal yield, height, heading date, and pathogen and pest resistance. Environmental factors like weather affect abiotic traits such as pre-harvest sprouting resistance. However, pre-harvest sprouting has rarely been a target for breeding. Owing to changing weather patterns from climate change, pre-harvest sprouting resistance is needed to prevent significant crop losses not only in the United States, but worldwide. Twenty-two traits including age of breeding line as well as agronomic, flour quality, and pre-harvest sprouting traits were studied in a population of 188 lines representing genetic diversity over 200 years of soft winter wheat breeding. Some traits were correlated with one another by principal components analysis and Pearson’s correlations. A genome-wide association study using 1,978 markers uncovered a total of 102 regions encompassing 226 quantitative trait nucleotides. Twenty-six regions overlapped multiple traits with common significant markers. Many of these traits were also found to be correlated by Pearson’s correlation and principal components analyses. Most pre-harvest sprouting regions were not co-located with agronomic traits and thus useful for crop improvement against climate change without affecting crop performance. Six different genome-wide association statistical models (GLM, MLM, MLMM, FarmCPU, BLINK, and SUPER) were utilized to search for reasonable models to analyze soft winter wheat populations with increased markers and/or breeding lines going forward. Some flour quality and agronomic traits seem to have been selected over time, but not pre-harvest sprouting. It appears possible to select for pre-harvest sprouting resistance without impacting flour quality or the agronomic value of soft winter wheat.
1 Introduction
Soft winter wheat has been improved primarily for agronomic traits such as yield, heading date (HD), plant height (Hght), and resistance to pathogens and pests for hundreds of years (Hedden, 2003; Cavanagh et al., 2013; Ward et al., 2019). However, other trait improvements could increase long-term product success, such as improved grain or flour quality traits and resistance to pre-harvest sprouting (PHS). This is a weather-dependent abiotic stress affecting quality and marketability of grains around the world (Patwa and Penning, 2020). Alleles to improve traits not selected in modern varieties may exist in older breeding lines and await discovery. Marker-assisted selection could be used to improve wheat varieties. However, with increases in population and food demand requiring doubling of production, quality improvement cannot come at the expense of agronomic traits (Ray et al., 2013).
PHS, measured as grain soundness, occurs throughout the world and can damage wheat, rice, barley, maize, sorghum, and rye leading to yield losses of 10%–50% and up to 30% reduction in sale value (Patwa and Penning, 2020). It has become more prevalent in Africa, Australia, Canada, China, Europe, India, Japan, and the United States from climate change (Singh et al., 2021). PHS is caused by complex physiological and biochemical mechanisms including seed dormancy, abscisic acid concentration and sensitivity, alpha amylase activity (AA), differential water imbibition due to seed coat and spike morphology, humidity, and temperature (King and Richards, 1984; DePauw and McCaig, 1991; Yanagisawa et al., 2005; Chono et al., 2006; Patwa and Penning, 2020). However, less is known about the genetics of PHS resistance needed to produce markers meaning that marker development is difficult. Current knowledge of the genetic factors affecting PHS is limited and includes signaling genes such as TaPHS1/MOTHER OF FLOWERING TIME (MFT), MAP KINASE KINASE 3 (MKK3), and VIVIPAROUS 1 (VP1); genes involved in abscisic acid synthesis and degradation like 9-cis-epoxycarotenoid dioxygenase (HvNCED1) and ABA 8’-hydroxylase (HvCYP707A1); and the mutant ENHANCED RESPONSE TO ABA8 (ERA8) in white spring wheat (Nakamura and Toyama, 2001; Chono et al., 2006; Nakamura et al., 2011; Lin et al., 2016; Martinez et al., 2016; Torada et al., 2016; Vetch et al., 2019).
Flour quality traits are used to describe the complex physical changes that occur during baking. The impact of these traits can differ depending on the baked product. Water absorption from damaged starch, measured by sodium carbonate solvent retention capacity (NaSRC), and arabinoxylans, measured by sucrose solvent retention capacity (SucSRC), can lead to brittle products that take longer to bake (Souza et al., 2012). Higher gluten strength as measured by lactic acid solvent retention capacity (LASRC) or adjusted lactic acid solvent retention capacity (LAAdSRC) binds water during leavening to better hold dough together (dough strength) and is useful for rising pound cake or biscuits (Wilderjans et al., 2008; Ma and Baik, 2018). Increased flour protein content (FlProt) also increases dough strength although lower protein is desired by the baking industry for most soft winter wheat baked products besides crackers (Deng et al., 2021). All above-mentioned solvent retention capacity (SRC) measurements interact to affect water absorption in flour measured by water absorption solvent retention capacity (WatSRC) (Souza et al., 2012). Water absorption along with flour yield (FlYld) and softness equivalence (SftEqv) can impact milling and baking performance as well. SftEqv, SucSRC, and WatSRC have been previously correlated with whole wheat cookie diameter (CkDia). Measurement of CkDia after baking of cookie dough made from flour of different wheat varieties but of the same weight and shape estimates cookie spread during baking. Texture measured by cookie top grade (CkTpGr) and consistent spread are important to the baking industry for uniform products (Souza et al., 2011).
Several previous studies have defined PHS resistance or flour quality traits on nearly every chromosome, but most are of small effect and used modern breeding lines (Gao et al., 2013; Cabral et al., 2014; Lin et al., 2016). PHS and flour quality traits have infrequently been bred into soft winter wheat due to labor, cost, and the need for time-consuming analyses. One study not only improved PHS resistance by breeding a variety related to Tom Thumb into a more modern variety but also affected height (Bhatt et al., 1977). With the advent of marker-assisted selection in plant breeding, providing reliable trait-improving markers could allow flour quality and PHS resistance improvement while maintaining agronomically important qualities.
To study the effects of many traits, a population encompassing diversity of breeding lines from the early 1800s to the early 2000s was developed based on previous milling and baking performance using a long-flow Allis-Chalmers milling system by the USDA-Agricultural Research Service Soft Wheat Quality Lab. Included varieties represent variation in important crop traits when grown in eastern North America and were available for study (Souza et al., 2012). The population displayed similar structure and more variation than a much larger modern elite set of breeding lines (Cabrera et al., 2014). One factor apparent in the population structure of the diversity breeding lines was age of the release (Cabrera et al., 2014). Few studies in soft winter wheat have investigated the impact of selective breeding to genetic changes in such a diverse set of traits available in this population. The selection of yield in elite breeding lines may have inadvertently reduced overall diversity of important but untested quality traits (Cabrera et al., 2014). Wide values for many diverse traits in this population allowed discovery of chromosome regions for improvement and overlap with other traits. A completed sequence and sets of markers in wheat allowed for the performance of a genome-wide association study (GWAS) for trait mapping (IWGSC, 2014). The GAPIT3 platform contains several GWAS models with slightly varied assumptions and regression models that could impact the association of significant markers with a phenotype (Wang and Zhang, 2021). The population size of 188 with 1,978 markers provided an opportunity to test how different GWAS models would perform in a real population of soft winter wheat rather than a simulated population without a burdensome amount of computation.
The goal of this research was to find regions of the genome that could provide resistance to PHS, an important abiotic threat to wheat crops. Also, this study sought to determine if resistance to PHS may overlap with other important flour quality or agronomic traits. This would indicate if breeding efforts over the past ~200 years impacted previously unmeasured PHS and flour quality traits in pursuit of other traits such as yield, pests, and disease resistance. With the pressures of an increasing population requiring more food of higher quality at a time when climate change can cause disruptions in the food supply, it has become more important to protect against abiotic stresses such as PHS without impacting agronomic traits or flour quality in wheat.
2 Materials and methods
2.1 Plant material
Sixty grams of seed per plot for each of the 188 members of the historic diversity population described in Souza et al. (2012) and available individually (https://www.ars-grin.gov or http://wheatpedigree.net) as listed in Supplemental File S1 were planted in 3 m × 3 m plots of six rows across a plot to achieve a full block of plants with 0.6-m alleys between each 3 m × 3 m plot at Schaffter, Snyder, and/or King farms at Ohio State University. Two sets of plots were grown each year, one for artificial irrigation of the field and one left in the field and harvested after a natural rain event. Fields were grown in successive years until at least 3 years of data could be obtained as not every year featured a natural rain event within the harvest window. Fields were tilled and pre-dressed with 28 kg/hectare of nitrogen prior to planting in October of each year. Nitrogen was reapplied at 101 kg/hectare in April of each year followed by 1 L/hectare of Huskie broadleaf herbicide. A border of >6 m of wheat or rye was used as a buffer.
For irrigated PHS tests, plots were treated in the field with overhead sprinklers simulating 2.5–5.0 cm of rain for 2 days (Irr). A second set of plots was left in the field until a natural rain event (Nat) occurred (Sneller et al., 2010). Each whole plot was harvested by a Wintersteiger Classic combine (www.wintersteiger.com). For the artificial spike wetting test (Art), ~30 spikes were hand harvested from the non-irrigated plot at maturity (no green on the spikes) in brown paper bags. Spikes were dried in the bag at ~29˚C for 5 days, and then placed upright ~2.5 cm apart in Styrofoam blocks by their stems and subjected to 95% humidity in a growth chamber with half sodium/half metal halide lights at 200–300 μmol m−2 s−1 and 20˚C for 16 h and no lights and 15˚C for eight h. Spikes were soaked with water from a spray bottle every 12 h. After 3 days, spikes were dried in the greenhouse for 1–2 days, hand threshed, placed in heavy manila envelopes, and stored at −20˚C. All collected seeds were cleaned of debris by blowing air to remove the lighter chaff. All PHS treatments were performed for 3–4 years (harvested 2018–2021 for Art, 2019–2021 for Irr, and 2018, 2019, and 2022 for Nat).
2.2 Chemical and physical tests
Grain and flour quality tests for FlProt, test weight (TstWt), WatSRC, NaSRC, SucSRC, LASRC, LAAdSRC, SftEqv, FlYld, CkDia, and CkTpGr were performed on sound grain (not exposed to PHS) and calculated as described previously (Souza et al., 2012; Cabrera et al., 2015). Briefly, FlProt was determined by near-infrared reflectance on a Unity Spectra-Star 2200 (Columbia, MD), TstWt was determined by weight of 1,000 grains, all five SRCs were determined by American Association of Cereal Chemists (AACC) method 56-11.02 (AACC, 2010a), CkDia and CkTpGr were determined by AACC method 10-52.02 (AACC, 2010b), and FlYld and SftEqv were determined based on milling grain tempered to 15% moisture through a modified Brabender Quadramat Junior flour mill (Cabrera et al., 2015).
HD was recorded visually walking the field daily and recording the first day 50% of wheat spikes had fully emerged from the boot jack. Values were calculated by assigning the first date a variety emerged as day 1 and adding the appropriate number of days for each variety that emerged afterward. Hght was recorded in centimeters at maturity, except the first year collected in inches and converted to centimeters.
Soft winter wheat has been primarily bred for improvement of traits such as yield, height, heading days, or pathogen and pest resistance over ~200 years. Other traits may have been altered because they were located near genes for selected traits. To observe if PHS or flour quality traits may have been impacted, a comparative measure for how long ago the variety was released by breeders was developed. If this measure co-located with PHS or flour quality traits, it would indicate their inadvertent selection, especially if they also co-locate with agronomic traits. Age of breeding line (Age) was calculated by subtracting the release year of the line from 2020. Awnless or bearded (Awns) and red or white seed color (SdClr) was determined by visual observation and accession reports.
PHS treatments were tested for grain soundness by Falling Number (FN) and AA tests. Threshed and cleaned seeds were ground in a UDY Mill model 3010-014 with a 1-mm screen (www.udyone.com). FN was measured using a Perten Falling Number machine 100 or 1000 (www.perkinelmer.com) in 20 ml of water with grams of ground seeds based on moisture content per manufacturer’s manual Table III (Perten Instruments AB, 2016). Moisture content was measured using AACC method 44-16.01 (1999). The paste was vortexed in a Perten Shakematic (www.perkinelmer.com) for 15 s before placing in the Falling Number machine following AACC method 56-81.03 (1992). Alpha amylase activity was calculated following AACC method 22-02.01 (2002) with an alpha amylase assay kit (Ceralpha Method) by Megazyme (www.megazyme.com) with the following modification. Instead of 7 g of ground seeds in 40 ml of extraction solution in 50-ml tubes, 0.3 g of ground seeds was extracted in 2 ml of extraction solution in a 48-well format (Corning #P-5ML-48-C, www.corning.com). Absorbance was determined colorimetrically in a 96-well plate format (Fisherbrand #21377203, www.fishersci.com) using a BioTek plate reader (www.biotek.com) at 405 nm. Activity was calculated using the manufacturer’s standard method for wheat (www.megazyme.com).
2.3 Genetics and statistical analysis
Multiple years or locations of trait data, three for irrigated alpha amylase activity (IrrAA), irrigated Falling Number (IrrFN), natural alpha amylase activity (NatAA), natural Falling Number (NatFN), TstWt, CkDia, and CkTpGr, and four for artificial alpha amylase activity (ArtAA), artificial Falling Number (ArtFN), FlProt, FlYld, HD, Hght, LAAdSRC, LASRC, NaSRC, SftEqv, SucSRC, and WatSRC were combined into Best Linear Unbiased Predictors (BLUP) using R version 4.0.4 and metafor package version 2.4-0 for samples using means and standard deviations calculated in Microsoft Excel (Viechtbauer, 2010; R Core Team, 2022). Histograms for the BLUP of each trait were plotted using the Histogram data analysis tool in Microsoft Excel and a Shapiro–Wilk normality test was performed using the R command shapiro.test(trait) (R Core Team, 2022).
Principal components analysis (PCA) was performed using pcaMethods version 1.78.0 from Bioconducter version 3.10.1 and factoextra version 1.0.7 in R version 3.6.3 necessary to run the combination of tools (Stacklies et al., 2007; Huber et al., 2015; Kassambara and Mundt, 2020; R Core Team, 2022). Microsoft Excel was used to graph combinations of principal components (PCs) 1, 2, and 3. Pearson’s correlation coefficients and their associated p-values were calculated using the rcorr command from the Hmisc library in R version 3.63, copied into comma delimited files using the write.csv command and merged into one table in Microsoft Excel.
A GWAS was performed with GAPIT3 (GitHub version on 10/14/22 from www.zzlab.net) under R version 4.0.4 running in RStudio version 4.2.1 using six models: general linear model (GLM), mixed linear model (MLM), multiple loci mixed model (MLMM), fixed and random model circulating probability unification (FarmCPU), Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK), and settlement of MLM under progressively exclusive relationship (SUPER) to discover any differences model type might play in locating significant markers for different phenotypes in a soft winter wheat population (Wang et al., 2014a; Tang et al., 2016; Wang and Zhang, 2021; R Core Team, 2022; RStudio Team, 2022). The markers were derived from the 9K iSelect single-nucleotide polymorphism (SNP) array by removing markers that could not be uniquely mapped to a single chromosome using five recombinant inbred line populations with parents in the GWAS population (Cavanagh et al., 2013; Cabrera et al., 2015). In addition to the stacked Manhattan plots and quartile–quartile (QQ) plots generated by GAPIT3, significant results from GAPIT3 for each trait region were reported using Bonferroni significance cutoffs calculated using an alpha of 0.05 divided by the number of markers per chromosome (Wang and Zhang, 2021; Hill et al., 2022). Locations of significant quantitative trait nucleotides (QTNs) were visualized using chromPlot in R with R script in File S2 (Orostica and Verdugo, 2016; R Core Team, 2022).
3 Results
3.1 Statistical analysis
Alpha amylase activity calculated in three separate years for a subset of 20 samples had a linear correlation of 0.98, 0.95, and 0.93 between the standard and modified protocol (data not shown). Means, standard deviations, and number of samples for each trait are in File S1 and histograms of each trait by year and BLUP are in File S5. Shapiro–Wilk test statistics and p-values for normality of all traits are listed in Supplemental Table S1. Relatedness of 22 agronomic, flour quality (on sound grain), and PHS traits was measured by Pearson’s correlation and PCA combining years or locations using BLUPs. R scripts are in File S2. Pearson’s correlations with a p-value < 0.001 were considered significant. FN traits negatively correlated with AA traits. Seed color and all PHS-related tests were significantly correlated with each other. All but IrrAA were correlated with TstWt. Some PHS measures such as ArtFN appeared slightly correlated with flour quality traits including WatSRC, LASRC, LAAdSRC, and SucSRC or CkTpGr. Only ArtFN appeared correlated with HD or Hght. Age (years since release) was positively correlated with Hght, FlProt, and HD, and negatively correlated with CkDia, CkTpGr, and SftEqv. SdClr was positively correlated with TstWt and most SRC and negatively correlated with HD, Hght, CkTpGr, and CkDia. Awns were uncorrelated with other traits. Flour protein was positively correlated with Hght and most SRCs and negatively correlated with other flour quality traits while most SRCs were positively correlated with each other and negatively correlated with other flour quality traits with significant correlations (p-value <0.001) of −0.24 to 0.94 (Table 1).
Table 1 Pearson’s correlations of 22 soft winter wheat traits with significance across top and right correlations with p-values <0.001 in bold.
The first three PCs of all traits represented 31%, 18%, and 12% of variation, respectively, and were considered significant based on a plot of Eigenvalues and cumulative variance (Supplemental Figure S1).
The trait contribution biplot of PC1 vs. PC2 mirrored many of the Pearson’s correlation comparisons, accounting for the positive or negative correlation as grouped in quadrant or in opposite quadrant by PC1, PC2, or both axes. FN and SdClr were grouped together tightly and loosely with TstWt in quadrant 1. Alpha amylase activity measures were tightly grouped in quadrant 3 with HD loosely associated. Non-SRC flour quality traits except FlProt grouped loosely in quadrant 2. The SRCs grouped tightly in quadrant 4. Awns, FlProt, Hght, and Age were not well captured by the first two principal components and appeared separated from the main groups (Figure 1A). The trait contribution biplot of PC1 vs. PC3 showed similar groupings to PC1 vs. PC2 with the exception of CkDia and CkTpGr quality traits being more closely grouped with AA measures and many agronomic factors flipping to the positive PC3 axis (Figures 1A, B). The PC2 vs. PC3 trait contribution biplot revealed a strong grouping of Age, Hght, HD, and FlProt in quadrant 2, agreeing with Pearson’s correlations and the SRC grouping in quadrant 3 between AA measures (Figure 1C, Table 1).
Figure 1 Principal components analysis for all 22 trait contributions. (A) Principal Component 1 vs. Principal Component 2. (B) Principal Component 1 vs. Principal Component 3. (C) Principal Component 2 vs. Principal Component 3. Quadrant numbers shown in corners. Gray arrows point to circles where labels would be too close to read. Age of breeding line (Age), Artificial alpha amylase activity (ArtAA), Artificial falling number (ArtFN), Awnless or bearded (Awns), Cookie Diameter (CkDia), Cookie top grade (CkTpGrd), Flour Protein (FlProt), Flour Yield (FlYld), Heading date (HD), Plant height (Hght), Irrigated alpha amylase activity (IrrAA), Irrigated falling number (IrrFN), Adjusted Lactic Acid SRC (LAAdSRC), Lactic Acid SRC (LASRC), Sodium Carbonate SRC (NaSRC), Natural alpha amylase activity (NatAA), Natural falling number (NatFN), red or white seed color (SdClr), Softness Equivalence (SftEqv), Sucrose SRC (SucSRC), test weight (TstWt), and Water SRC (WatSRC).
3.2 Genome-wide association study
A GWAS for 22 agronomic, flour quality, and PHS traits with combined locations/years by BLUP using six models (Blink, FarmCPU, GLM, MLM, MLMM, and SUPER) was performed in GAPIT3 to test for model performance over diverse traits in a moderate-sized soft winter wheat population. See the R script in File S2. File S3 contains BLUP estimated values. File S4 contains HAPMAP markers with 1,978 markers mapped uniquely to a single genome (A, B, or D) for the 188 genotypes. The analysis uncovered 226 significant QTNs at 102 genomic regions across all 21 chromosomes and four unassigned regions and included 40 markers with multiple significant traits at 26 regions. Figure 2 provides a visual representation of the QTN regions for all 22 traits while Table 2 provides detailed information of the QTN including approximate regions in centiMorgans (cM), largest effect on the trait, smallest p-value for significance, and which markers and models were significant (Supplemental Table S2). Figure 3 shows the stacked Manhattan plots for Age and IrrAA as examples of important traits showing only experiment-wise significant markers. Figure 3 shows several locations where multiple models have the same marker significant at a more stringent experiment-wise Bonferroni cutoff of p-value <0.01 for Age and locations on chromosomes 1A and 4A for IrrAA. The QQ plot for Age showed some deviation from expected values for the GLM beyond the points at the end associated with phenotypic differences while the QQ plot for IrrAA shows all models within expected values except potential markers associated with phenotypic differences (Figure 4). Manhattan and QQ plots generated by GAPIT3 for each trait not in the main article are available along with histograms of the BLUP trait values as File S5. Measured traits IrrFN, FlProt, WatSRC, SucSRC, LASRC, LAADSRC, SftEqv, FlYld, and CkDia were all normally distributed by the Shapiro–Wilk normality test. However, the majority of test statistics fit well into their quartile values, W > 0.9, except ArtAA and NatAA with a limited range of values; Age, which is heavily skewed to newer lines; and Awns and SdClr, which are binary by classification (Supplemental Table S1). For inclusion as a QTN, the marker for the trait had to be significant by a 0.05 Bonferroni cutoff for at least two of the six models (Supplemental Table S2).
Figure 2 Regions of significant QTN for 22 traits. Circles are agronomic and physical traits, squares are pre-harvest sprouting, and triangles are flour quality. Gray bars surrounding symbols indicate a range of significant markers. Legend describes specific traits. Age of breeding line (Age), Artificial alpha amylase activity (ArtAA), Artificial falling number (ArtFN), Awnless or bearded (Awns), Cookie Diameter (CkDia), Cookie top grade (CkTpGrd), Flour Protein (FlProt), Flour Yield (FlYld), Heading date (HD), Plant height (Hght), Irrigated alpha amylase activity (IrrAA), Irrigated falling number (IrrFN), Adjusted Lactic Acid SRC (LAAdSRC), Lactic Acid SRC (LASRC), Sodium Carbonate SRC (NaSRC), Natural alpha amylase activity (NatAA), Natural falling number (NatFN), red or white seed color (SdClr), Softness Equivalence (SftEqv), Sucrose SRC (SucSRC), test weight (TstWt), and Water SRC (WatSRC).
Figure 3 Manhattan plots for genome-wide association of all six models, GLM, MLM, MLMM, FarmCPU, BLINK, and SUPER. Model names are shown on the right. Negative log 10 of p-value for each marker on a chromosome are colored dots. Red horizontal line is the default, more stringent experiment-wise Bonferroni significance threshold in GAPIT3 of alpha = 0.01. Dashed gray vertical lines indicate two models significant for the same marker, and solid gray vertical lines, three or more. (A) Age of breeding line (Age) shown with many very significant markers across several models. (B) Irrigated alpha amylase activity (IrrAA) shown with markers on chromosome 1A and 4A with multiple models showing strong significance.
Figure 4 QQ plots for two genome-wide association traits. Colored circles represent the six different models (GLM, MLM, MLMM, FarmCPU, BLINK, and SUPER) tested for each trait. The red diagonal line indicates where observed and expected results would match. Gray shaded region is confidence interval and colored circles significantly above the line represent deviations that may be significantly associated with phenotype. (A) Age of breeding line (Age) shown with many potentially significant phenotype associations across several models but the majority of GLM appear too far above the line. (B) Irrigated alpha amylase activity (IrrAA) shown with fewer potentially strong significant associations to phenotype.
Five regions on chromosomes 1A, 5A, 6B, and 6D overlapped Age and agronomic or flour quality traits including Hght, HD, Awns, WatSRC, FlProt, CkDia, and CkTpGr and all but Hght and CkTpGr shared common markers with Age (Figure 2, Table 2). Quantitative trait nucleotide 1A-1, 80-81 estimated cM region with Age, FlProt, and Hght had p-values of 2 × 10−5 to 2 × 10−6. Quantitative trait nucleotide 5A-5, 99-109 estimated cM region with Age and FlProt had p-values of 5 × 10−5 to 3 × 10−9. Quantitative trait nucleotide 6B-4, 82 estimated cM region with Age and HD had p-values of 1 × 10−6 to 8 × 10−8. Quantitative trait nucleotide 6B-8, 150-151 estimated cM region with Age, CkDia, WatSRC, FlProt, Hght, and CkTpGr had p-values of 2 × 10−5 to 9 × 10−9. Quantitative trait nucleotide 6D-2, 10 estimated cM region with Age and Awns had p-values of 6 × 10−4 to 4 × 10−5. These regions indicated where potential trait changes over time observed by Pearson’s correlation and PCA may have occurred. For identical markers, the FlProt effect was in the same direction as Age three times, while HD and WatSRC, once. Conversely, Awns and CkDia effects were the opposite of Age where increased Age indicated older breeding lines (Table 2).
Regions of PHS were separate from other traits except for co-located or closely located groups to SdClr on chromosomes 2A and 4A or HD and ArtAA on chromosome 4D. Seed color is QTN 2A-1 at an estimated 83–115 cM and a p-value of 3 × 10−7 while SdClr and NatAA are QTN 2A-2 at an estimated 115 cM and p-values of 1 × 10−5 to 2 × 10−6. Quantitative trait nucleotide 4A-3 is at an estimated 122 cM and p-values of 2 × 10−5 to 5 × 10−5 while QTN 4A-4 is ArtAA, IrrAA, and IrrFN at an estimated 113 cM with p-values of 4 × 10−4 to 2 × 10−9. Quantitative trait nucleotide 4D-4 is at an estimated 53 cM that includes ArtAA and HD with p-values of 4 × 10−3 to 5 × 10−3. Multiple measures of PHS co-located on chromosomes 1A, 2D, 4A, 4B, and 6D. Quantitative trait nucleotide 1A-4 is at an estimated 76 cM for IrrAA and NatAA with p-values of 5 × 10−9 to 8 × 10−10. Quantitative trait nucleotide 2D-3 is at an estimated 90 cM for ArtAA and ArtFN with p-values of 2 × 10−3 to 6 × 10−4. Quantitative trait nucleotide 4A-4 is described above. Quantitative trait nucleotide 4B-3 is at an estimated 72 cM for ArtAA and ArtFN with p-values of 3 × 10−4 to 5 × 10−5. Quantitative trait nucleotide 6D-1 is at an estimated 49 cM for NatAA and NatFN with p-values of 1 × 10−4 to 7 × 10−7. Only one region on chromosome 4A (QTN 4A-4, described above) had an overlap between ArtAA and irrigated PHS measures while irrigated and natural PHS measures were more often co-located. Multiple flour quality traits co-located on chromosomes 1A, 1B, 1D, 2A, 2D, 3A, 5A, 6A, and 6B. Quantitative trait nucleotide 1A-5 was described above. Quantitative trait nucleotide 1A-8 is at an estimated 117 cM including WatSRC and NaSRC with p-values from 3 × 10−4 to 4 × 10−5. Quantitative trait nucleotide 1B-1 is at an estimated 24–27 cM including LASRC and LAAdSRC with p-values from 2 × 10−4 to 7 × 10−12. Quantitative trait nucleotide 1B-2 is at an estimated 30 cM including CkTpGr, FlYld, and SucSRC with p-values of 1 × 10−4 to 7 × 10−7. Quantitative trait nucleotide 1D-1 is at an estimated 14-21 cM including LASRC, LAAdSRC, and SucSRC with p-values of 1 × 10−4 to 8 × 10−5. Quantitative trait nucleotide 1D-2 is at an estimated 89 cM and includes LASRC and LAdSRC with p-values of 1 × 10−6 to 5 × 10−11. Quantitative trait nucleotide 2A-3 is at an estimated 170 cM and includes NaSRC and SucSRC with p-values of 8 × 10−5 to 6 × 10−6. Quantitative trait nucleotide 2D-1 is at an estimated 12 cM and includes FlYld and WatSRC with p-values of 1 × 10−3. Quantitative trait nucleotide 3A-2 is at an estimated 63–64 cM and includes CkTpGr, CkDia, and FlProt with p-values of 2 × 10−4 to 8 × 10−6. Quantitative trait nucleotide 3A-6 is at an estimated 123 cM including LAAdSRC, LASRC, and SdClr with p-values of 1 × 10−4 to 5 × 10−9. Quantitative trait nucleotide 5A-3 is at an estimated 150 cM and includes LAAdSRC and SftEqv with p-values of 1 × 10−5 to 5 × 10−6. Quantitative trait nucleotide 5A-6 is at an estimated 80 cM and includes CkDia, NaSRC, SucSRC, and WatSRC with p-values of 2 × 10−6 to 8 × 10−8. Quantitative trait nucleotide 6A-3 is at an estimated 89–90 cM and includes CkDia, CkTpGr, FlProt, and Hght with p-values of 1 × 10−4 to 6 × 10−8. Quantitative trait nucleotide 6A-5 is at an estimated 114 cM and includes CkDia, FlProt, Hght, and SucSRC with p-values of 1 × 10−5 to 9 × 10−6. Quantitative trait nucleotide 6B-6 is at an estimated 95–97 cM and includes NaSRC, WatSRC, CkTpGr, CkDia, and FlYld with p-values of 1 × 10−4 to 1 × 10−10. Quantitative trait nucleotide 6B-8 is described above with overlap in Age (Figure 2, Table 2).
4 Discussion
The 188-member population of historically diverse soft winter wheat breeding lines was chosen for increased marker and trait diversity compared to elite breeding lines to maximize trait differences in the population. It has a rich history of genetic diversity, which could contain useful genes for PHS and flour quality trait enhancement that may have been bred out of elite lines selected only for limited agronomic traits such as yield and a few resistance genes. Population structure is often quite evident in wheat including the present population, which was previously compared to a larger elite wheat breeding population. Using Structure v 2.2, Cabrera et al. (2014) found five subpopulations of the historic diversity lines compared to six in a larger set of 449 elite parent lines. Three of the 5 historic diversity sub-populations separated mostly by Age, reported as release year. Wright’s fixation index between sub-groups for the historic diversity lines ranged from 0.178 to 0.485 while they ranged from 0.212 to 0.408 for the elite population (Cabrera et al., 2014). Diversity between the historic and much larger elite line set was found to be similar, and the historic diversity population had moderate structure due to recent and historic breeding history as well as red versus white seed color (Cabrera et al., 2014).
PHS is most often measured by FN or AA (Vetch et al., 2019; Patwa and Penning, 2020). The developed micro assay for alpha amylase activity showed strong correlation to the original method in a subset of 20 samples tested in three separate years. Both AA and FN were used in this study although more QTNs were obtained using AA (11) than FN (4). PHS only occurs when conditions such as high humidity or extensive rainfall occur at spike maturity (Vetch et al., 2019; Patwa and Penning, 2020). In addition to waiting for the correct conditions to occur in the field, NatAA and NatFN, spikes were removed and subjected to the correct conditions in a growth chamber, ArtAA and ArtFN, or once all plants reached maturity, they were subjected to overhead water sprinklers to simulate rainfall, IrrAA and IrrFN. Sixty-five potential regions affecting PHS resistance and flour quality were found including a highly significant region for PHS resistance on chromosome 4A and multiple regions for flour quality on chromosomes 6A and 6B (Figure 2, Table 2). Another goal was to observe if selective breeding of a few traits might affect unselected traits in the same chromosomal region. PHS measures showed no correlation to Age by Pearson’s correlation, PCA, or GWAS overlapping only with HD on chromosome 4D at 455 MBP. Thus, PHS resistance does not appear selected over time and could be improved with minimal impact to agronomic traits while providing resistance to an abiotic stress related to climate change. Some PHS resistance has been tied to dormancy genes or areas, so it is not unusual for PHS measures and HD, which can be affected by dormancy, to be associated (Nakamura et al., 2011; Martinez et al., 2016; Torada et al., 2016). Shorter et al. (2005) found that maturity could impact PHS depending on when a rain event occurred. The relationship between seed dormancy, HD, and PHS resistance is complicated. The rice gene, Ghd7, has been linked to pleotropic effects including seed dormancy and HD as well as other effects through the ratio of gibberellins and abscisic acid (Hu et al., 2021). A previous study in rice found a short locus on chromosome 3 for HD, and seed dormancy could be separated into two loci: one for HD, Hd8, and one for seed dormancy, Sdr1, through fine mapping of recombination events within the region (Takeuchi et al., 2003). In barley, two different HvNCED genes that catalyze a key step in abscisic acid synthesis were found to be differentially expressed based on seed developmental age or water imbibition (Chono et al., 2006). Seed dormancy has been shown to have a large effect on PHS resistance but is not the only factor involved (DePauw and McCaig, 1991). Although the literature suggests and PCA indicated some association between HD and AA measures of PHS, only ArtAA was correlated significantly with HD by Pearson’s correlation and had a single co-localization in GWAS. Age was positively correlated with Hght, FlProt, and HD. Examination of trait positive or negative effects at individual markers in the GWAS indicated older breeding lines headed later and had higher flour protein content and greater water absorption but were less likely to have awns and had a smaller CkDia. Quantitative trait nucleotides 1A-5 and 6B-8 had Age, FlProt, and Hght co-localizations while 5A-5 had Age and FlProt co-localizations and 6B-4 had Age and HD co-localizations indicating regions that may have had two agronomic and one flour quality trait selected for over time. Quantitative trait nucleotide 6B-8 also had co-localizations of Age with WatSRC, CkDia, and CkTpGr, indicating the potential impact of other flour quality traits over time. Using Supplementary Data from Wang et al. (2014b), approximate regions in cM were calculated to compare with other studies. Five Hght QTN regions, 2B-1 (Ppd-B1), 3A-3 (qRht.3A), 4B-4 (Rht-B1), and 6A-3 and 6A-5 (Rht24), were 0.7–8.8 cM from other quantitative trait loci (QTL) studies of 410 European winter wheat varieties, a fine mapping set of 110 European winter wheat varieties, and four F2 mapping populations (Würschum et al., 2015; Zhou et al., 2016; Würschum et al., 2017). Markers are listed in Table 3. However, no regions were found on 1D, 2A, 2D, 4D, 5A, 5B, and 7D but additional QTN regions were found on 1A-5, 3D-4, 6B-8, 7A-6, and two unlinked markers. Differences could be attributed to population differences between US and European wheat varieties.
FN negatively correlated with AA by Pearson’s correlation and were at opposite quadrants in PCA. This was expected as higher alpha amylase activity breaks down more starch, reducing hot flour paste thickness and thus FN (Patwa and Penning, 2020). The correlation of TstWt and PHS resistance was observed in both PCA and Pearson’s tests. A near-significant TstWt at the same marker for GWAS models, MLM, MLMM, and GLM as PHS resistance on chromosome 1B at 382 million base pairs (MBPs) was observed (data not shown). Statistical significance has been reported between TstWt and PHS in another population (Ji et al., 2018). Seed color was strongly correlated with PHS measures and PCA indicated a tight positive link between FN and SdClr. Red seed color has been widely reported as significant to PHS resistance and a gene has been located (Lin et al., 2016; Ji et al., 2018; Patwa and Penning, 2020). No QTN for PHS and SdClr on chromosome 3A near MFT was found, but the study had few mostly related white seed color varieties. Detection would be difficult due to the low minor allele frequency and a strong population structure. A highly significant QTN for PHS was observed on chromosome 4A in the region of a known PHS resistance gene, MKK3 (Torada et al., 2016). A total of 36 QTNs at 25 regions related to PHS resistance were observed. The majority of PHS QTN regions were field events, both natural weathering (Nat) and by overhead irrigation (Irr), that were co-located and offer potential new targets to improve PHS including a highly significant QTN on chromosome 1A. None of the PHS regions overlapped with flour quality traits and only one overlapped with HD; thus, improvement of PHS may be obtained without impacting crop or baking performance. Artificial spike wetting (Art) had fewer QTNs and only one overlap on chromosome 4A at 604 MBP with field-treated samples (Irr or Nat). Converting regions to approximate cM, no exact marker matches were found in the literature, but five QTN regions were within 1 to 8 cM. 1A-4, 1B-3, and 4B-3 were compared to a GWAS of 469 club and soft white wheat germplasm from the Pacific Northwest (Martinez et al., 2018). Four QTN regions were observed within 0.3 to 11 cM of a GWAS with 198 elite Eastern US soft white winter wheat lines 1B3, 4A-4, 6D-1, and 7B-1 out of 11 regions (Kulwal et al., 2012). All close matches are listed in Table 3. Thus, population, method of association, and/or environment appeared to play a key role in locating PHS resistance, although some consistent regions were found.
High genetic variation in a number of flour quality traits including FlYld, FlProt, multiple SRCs, SftEqv, and measures of cookie quality has been previously suggested for this population, but these soft wheat lines have maintained relatively stable quality standards without large improvements (Souza et al., 2012). This study has revealed potential changes of flour protein over time, but it remains possible to improve many other flour quality traits utilizing this diverse population of historic breeding lines by finding and understanding the genes underlying flour quality traits. Soft wheat flour is used mostly for baked confections such as cookies and cakes. Depending on the end use of soft wheat flour, a higher or a lower FlProt may be desired. For baking powder biscuits and pound cake, a slightly higher FlProt can increase height (Wilderjans et al., 2008; Ma and Baik, 2018). Higher protein can increase the volume and reduce collapse in pound cake but decreases CkDia and cookie spread factor (Wilderjans et al., 2008; Souza et al., 2011). Co-located flour quality QTNs had expected positive and negative values when considering individual markers. Lower FlProt or gluten strength measured by LASRC and LAAdSRC is considered a positive trait for most soft winter wheat applications except crackers (Souza et al., 2012; Deng et al., 2021). Flour quality traits showed strong overlap among themselves by Pearson’s correlation, PCA, and GWAS, indicating high potential genetic overlap affecting these traits. Solvent retention capacity measures for flour quality were tightly grouped by PCA and were strongly positively correlated with co-localizations on chromosomes 1A, 1D, 2A, 5A, and 6B. This is not surprising considering the close associations of SRCs with overall water absorption (WatSRC) in baking performance (Souza et al., 2012). Similarly, SRCs, SftEqv, and FlProt were correlated, positively or negatively with CkDia and CkTpGr. Negative correlations of SRCs (WatSRC, NaSRC, SucSRC, and LASRC) with FlYld and cookie quality (CkDia and CkTpGr) have long been known but the underlying genes are not yet discovered (Guttieri and Souza, 2003). Multiple flour quality co-localizations occurred on chromosomes 1B, 2D, 3A, 5A, 6A, and 6B. Using a completely different marker set and different GWAS models for the population, overlap with eight QTL reported by Cabrera et al. (2015) was observed (QTN 1A-2, 1B-1, 1B-2, 1B-5, 3B-2, 3B-5, 6A-2, and 6B-6) and additional QTNs were found on 17 of 21 chromosomes (Tables 2, 3). These regions deserve closer investigation with additional populations, marker sets, and/or experiments to eventually isolate genes influencing these traits.
The A and B genomes were well covered with an average of 128 markers per chromosome except chromosome 4B with 55 markers where only one QTN was identified. The D genome had fewer markers per chromosome, averaging 24. Reduced sequence diversity may have been a factor. Single-nucleotide polymorphism markers placed on the D genome were about 20% of those on the A and B genomes for the iSelect SNP array due to lack of sequence differences (Cavanagh et al., 2013). While chromosome-level cutoffs were used as in Hill et al. (2022), most of the QTNs on the A and B genomes were significant at the much more restrictive experiment-wise Bonferroni cutoff while most on the D genome were not. Additional markers from single-nucleotide polymorphisms will be needed to populate the D genome and test available diversity, which should improve chromosome-wise significant regions to experiment-wise significance. Increased markers on the D genome would reduce false positives while potentially improving QTN detection with stronger p-values by having markers closer to the responsible region. This was observed in the A and B genomes at regions 1B-1, 3A-2, 3A-6, 3A-7, 4A-2, 5A-1, 6A-2, and 6B-2 where a more significant peak for a trait was detected along with a less significant outer marker for that same trait often with SUPER and GLM models forming the tail. However, it is impossible in a population of this size to definitively differentiate a less significant tail from two closely linked genes.
GAPIT3 allowed testing of multiple GWAS models. Nine traits were normally distributed by the Shapiro–Wilk test; however, the majority had near-normal-appearing bell curves by histogram and W test statistics > 0.9, indicating that most values for each trait fit in standard normal quartiles, except ArtAA and NatAA with a limited range of values; Age, which is heavily skewed to newer lines; and Awns and SdClr, which were either/or classifications and, thus, binary. The six models tested did not appear to show any abnormal calling of significant locations for SdClr, which, being binary (red or white) with the lowest W statistic and fewer white varieties, would have been the most susceptible to false positives. The detection of Age-related QTN across all models with the exception of MLM was also robust regardless of it being skewed more towards more recent breeding lines (less than 60 years since release) as fewer very old breeding lines are available. Single-locus tests such GLM, a naïve model without population correction, and MLM using VanRaden kinship were tested along with more recently released models. The GLM can be particularly prone to false positives as illustrated by the QQ plot for Age but false positives appear less likely with the QQ plot for IrrAA. The MLMM performs multiple locus tests and is an extension of MLM with the most significant marker fitted stepwise as a co-variate (Wang and Zhang, 2021). The FarmCPU model iterates back and forth with two models, an MLM with VanRaden kinship and a GLM without kinship, reported to be faster and provide higher statistical power (Wang and Zhang, 2021). The BLINK model replaces MLM in FarmCPU with Bayesian-information content (Wang and Zhang, 2021). The SUPER model assesses marker effects like GLM, optimizes a kinship model using maximum likelihood, and tests a third time with a kinship derived from markers not in linkage disequilibrium with the tested marker (Wang et al., 2014a; Wang and Zhang, 2021). The GLM as a naïve model lacking population structure correction overestimated QTN as has been reported previously (Larsson et al., 2013). The SUPER model also reported increased numbers of QTN but often near other QTNs for the same traits significant for multiple models. The MLM and MLMM gave very similar results but appeared to underestimate QTN. Similarly, BLINK, FarmCPU, and SUPER often located the same QTN but a different set from MLM/MLMM. Since QTN differences were found based on model selection, requiring agreement of at least two models boosts confidence in reported QTN. A much stronger case for significance can be made where all models agree. However, owing to the computational intensity of testing six models as more markers or individuals are added, selecting those with the widest differences may prove more efficient. In this population, MLMM, FarmCPU, and BLINK were the most obvious choices as they were the least computationally impactful and covered the most QTN differences with less likelihood for overestimation. The BLINK model was chosen over SUPER as it does not assume causal genes are distributed evenly, provides more diversity to the chosen models, and leads to less overestimation in this population (Wang and Zhang, 2021).
5 Conclusions
Breeding over time appeared to be a factor in some traits, most notably flour protein content, Hght, and HD. Since PHS-related traits had little overlap with others, several new regions were uncovered to increase resistance in wheat under adverse weather conditions brought about by climate change without impacting agronomic performance or flour quality. Flour quality traits showed much greater overlap both among themselves and with agronomic traits making improvement more challenging. Future research to increase marker density should better resolve QTN and improve detection in areas lacking sufficient coverage utilizing a smaller set of GWAS models.
Data availability statement
All data used to generate results during this study are available as Supplementary Information files to this article.
Author contributions
NP performed some of the experiments, wrote and edited some text. BP designed and supervised the experiments, analyzed the data, wrote and edited text. All authors contributed to the article and approved the submitted version.
Funding
This research was supported by the U.S. Department of Agriculture, Agricultural Research Service project # 5082-43440-001-00D.
Acknowledgments
We would like to thank Anne Sturbaum, Cindy Hampton, and MinWoo Lee for their dedicated work to help perform many of the experiments. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. USDA is an equal opportunity provider and employer. The findings and conclusions in this report are those of the authors and should not be construed to represent any official USDA or U.S. Government determination or policy.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1137808/full#supplementary-material
File S1 | Trait data set used to calculate BLUPs. Includes name, accession number, release year and all calculated values of traits including the means, standard deviations, and number of reps for each soft winter wheat variety used to calculate the BLUPs.
File S2 | R Scripts for statistical analyses and GWAS. Includes all R scripts with notes used to generate the Pearson’s correlations, principal components analysis, genome-wide association study, and Figure 2.
File S3 | Trait data as BLUPs used in GAPIT for GWAS. Twenty-two sets of the trait data as BLUPs for 188 soft winter wheat varieties used to perform the genome-wide association, myY in the GWAS R script.
File S4 | Marker data in HAPMAP format used in GAPIT for GWAS. Twenty-two sets of 1978 markers in HAPMAP format for the genotypes used to perform the GWAS, myG in the GWAS R script.
File S5 | Histograms, Manhattan and Q-Q Plots for all traits and models of the GWAS. Twenty-two sets of histograms of BLUPs. Twenty Q-Q plots and stacked Manhattan plots with six models each for all traits except Age and IrrAA in Figures 3, 4.
Glossary
References
AACC (1992). Determination of falling number (56-81.03) (St. Paul, MN, USA: Cereals & Grains Association). doi: 10.1094/AACCIntMethod-56-81.03
AACC (1999). Moisture – air-oven (aluminum-plate) method (44-16.01) (St. Paul, MN, USA: Cereals & Grains Association). doi: 10.1094/AACCIntMethod-44-16.01
AACC (2002). Measurement of alpha-amylase in plant and microbial materials using the ceralpha method (22-02.01) (St. Paul, MN, USA: Cereals & Grains Association). doi: 10.1094/AACCIntMethod-22-02.01
AACC (2010a). Solvent retention capacity profile (56-11.02) (St. Paul, MN, USA: Cereals & Grains Association). doi: 10.1094/AACCIntMethod-56-11.02
AACC (2010b). Baking quality of cookie flour-micro method (10-52.02) (St. Paul, MN, USA: Cereals & Grains Association). doi: 10.1094/AACCIntMethod-10-52.02
Bhatt, G. M., Derera, N. F., McMaster, G. J. (1977). Utilization of tom thumb source of pre-harvest sprouting tolerance in a wheat breeding programme. Euphytica 26, 565–572.
Cabral, A. L., Jordan, M. C., McCartney, C. A., You, F. M., Humphreys, D. G., MacLachlan, R., et al. (2014). Identification of candidate genes, regions and markers for pre-harvest sprouting resistance in wheat (Triticum aestivum l.). BMC Plant Biol. 14, 340. doi: 10.1186/s12870-014-0340-1
Cabrera, A., Guttieri, M., Smith, N., Souza, E., Sturbaum, A., Hua, D., et al. (2015). Identification of milling and baking quality QTL in multiple soft wheat mapping populations. Theor. Appl. Genet. 128, 2227–2242. doi: 10.1007/s00122-015-2580-3
Cabrera, A., Souza, E., Guttieri, M., Sturbaum, A., Hoffstetter, A., Sneller, C. (2014). Genetic diversity, linkage disequilibrium, and genome evolution in soft winter wheat. Crop Sci. 54, 2433–2448. doi: 10.2135/cropsci2013.09.0601
Cavanagh, C. R., Chao, S., Wang, S., Huang, B. E., Stephen, S., Kiani, S., et al. (2013). Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars. Proc. Natl. Acad. Sci. U.S.A. 110, 8057–8062. doi: 10.1073/pnas.1217133110
Chono, M., Honda, I., Shinoda, S., Kushiro, T., Kamiya, T., Nambara, E., et al. (2006). Field studies on the regulation of abscisic acid content and germinability during grain development of barley: molecular and chemical analysis of pre-harvest sprouting. J. Exp. Bot. 57, 2421–2434. doi: 10.1093/jxb/erj215
Deng, L., Hou, G. G., Xu, M., Bock, J. (2021). Identifying functional characteristics of soft white wheat for saltine crackers. Cereal Chem. 98, 296–304. doi: 10.1002/cche.10360
DePauw, R. M., McCaig, T. N. (1991). Components of variation, heritabilities and correlations for indices of sprouting tolerance and seed dormancy in triticum spp. Euphytica 52, 221–229. doi: 10.1007/BF00029399
Gao, X., Hu, C. H., Li, H. Z., Yao, Y. J., Meng, M., Dong, J., et al. (2013). Factors affecting pre-harvest sprouting resistance in wheat (Triticum aestivum l.): a review. J. Anim. Plant Sci. 23, 556–565.
Guttieri, M. J., Souza, E. (2003). Sources of variation in the solvent retention capacity test of wheat flour. Crop Sci. 43, 1628–1633. doi: 10.2135/cropsci2003.1628
Hedden, P. (2003). The genes of the green revolution. Trends Genet. 19, 5–9. doi: 10.1016/s0168-9525(02)00009-4
Hill, M. J., Penning, B. W., McCann, M. C., Carpita, N. C. (2022). COMPILE: a GWAS computational pipeline for gene discovery in complex genomes. BMC Plant Biol. 22, 315. doi: 10.1186/s12870-022-03668-9
Hu, Y., Song, S., Weng, X., You, A., Xing, Y. (2021). The heading-date gene Ghd7 inhibits seed germination by modulating the balance between abscisic acid and gibberellins. Crop J. 9, 297–304. doi: 10.1016/j.cj.2020.09.004
Huber, W., Carey, V. J., Gentleman, R., Anders, S., Carlson, M., Carvalho, B. S., et al. (2015). Orchestrating high-throughput genomic analysis with bioconductor. Nat. Methods 12 (2), 115–121. doi: 10.1038/nmeth.3252
IWGSC (2014). A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. Sci. 345, 6194. doi: 10.1126/science.1251788
Ji, T., Penning, B., Baik, B. K. (2018). Pre-harvest sprouting resistance of soft winter wheat varieties and associated grain characteristics. J. Cereal Sci. 83, 110–115. doi: 10.1016/j.jcs.2018.08.006
Kassambara, A., Mundt, F. (2020). “Factoextra: extract and visualize the results of multivariate data analyses,” in R package version 1.0.7. Available at: https://CRAN.R-project.org/package=factoextra.
King, R. W., Richards, R. A. (1984). Water uptake in relation to pre-harvest sprouting damage in wheat: ear characteristics. Aust. J. Agric. Res. 35, 327–336. doi: 10.1071/AR9840327
Kulwal, P., Ishikawa, G., Benscher, D., Feng, Z., Yu, L. X., Jadhav, A., et al. (2012). Association mapping for pre-harvest sprouting resistance in white winter wheat. Theor. Appl. Genet. 125, 793–805. doi: 10.1007/s00122-012-1872-0
Larsson, S. J., Lipka, A. E., Buckler, E. S. (2013). Lessons from Dwarf8 on the strengths and weaknesses of structured association mapping. PloS Genet. 9, e1003246. doi: 10.1371/journal.pgen.1003246
Lin, M., Zhang, D., Liu, S., Zhang, G., Yu, J., Fritz, A. K., et al. (2016). Genome-wide association analysis on pre-harvest sprouting resistance and grain color in U.S. winter wheat. BMC Genomics 17, 794. doi: 10.1186/s12864-016-3148-6
Ma, F., Baik, B. K. (2018). Soft wheat quality characteristics required for making baking powder biscuits. J. Cereal Sci. 79, 127–133. doi: 10.1016/j.jcs.2017.10.016
Martinez, S. A., Godoy, J., Huang, M., Zhang, Z., Carter, A. H., Garland Campbell, K. A., et al. (2018). Genome-wide association mapping for tolerance to preharvest sprouting and low falling numbers in wheat. Front. Plant Sci. 9. doi: 10.3389/fpls.2018.00141
Martinez, S. A., Tuttle, K. M., Takebayashi, Y., Seo, M., Campbell, K. G., Steber, C. M. (2016). The wheat ABA hypersensitive ERA8 mutant is associated with increased preharvest sprouting tolerance and altered hormone accumulation. Euphytica 212, 229–245. doi: 10.1007/s10681-016-1763-6
Nakamura, S., Abe, F., Kawahigashi, H., Nakazono, K., Tagiri, A., Matsumoto, T., et al. (2011). A wheat homolog of MOTHER OF FT and TFL1 acts in the regulation of germination. Plant Cell 23, 3215–3229. doi: 10.1105/tpc.111.088492
Nakamura, S., Toyama, T. (2001). Isolation of a VP1 homologue from wheat and analysis of its expression in embryos of dormant and non-dormant cultivars. J. Exp. Bot. 52, 875–876. doi: 10.1093/jexbot/52.357.875
Orostica, K. Y., Verdugo, R. A. (2016). chromPlot: visualization tool of genomic data in chromosomal context. Bioinformatics 32, 2366–2368. doi: 10.1093/bioinformatics/btw137
Patwa, N., Penning, B. W. (2020). “Environmental impact on cereal crop grain damage from pre-harvest sprouting and late maturity alpha-amylase,” in Sustainable agriculture in the era of climate change. Eds. Roychowdhury, R., Choudhury, S., Hasanuzzaman, M., Srivastava, S. (Cham: Springer), 23–41. doi: 10.1007/978-3-030-45669-6_2.
Perten Instruments AB (2016). Falling number 1000 operation manual (Release 20161207) Vol. 2016 (Hägersten, Sweden: Perten Instruments AB), 33.
Ray, D. K., Mueller, N. D., West, P. C., Foley, J. A. (2013). Yield trends are insufficient to double global crop production by 2050. PloS One 8, e66428. doi: 10.1371/journal.pone.0066428
R Core Team (2022). R: a language and environment for statistical computing (Vienna, Austria: R Foundation for Statistical Computing). Available at: https://www.R-project.org/.
RStudio Team (2022). “RStudio: integrated development environment for r,” in RStudio (Version 2022.7.1.554) (Boston, MA: PBC). Available at: http://www.rstudio.com/.
Shorter, S. C., Munro, C. A., Hodgkinson, J. (2005). Predicting pre-harvest sprouting susceptibility in new Zealand wheat cultivars. Euphytica 143, 309–312. doi: 10.1007/s10681-005-7888-7
Singh, C., Kamble, U. R., Gupta, V., Singh, G., Sheoran, S., Gupta, A., et al. (2021). Pre-harvest sprouting in wheat: current status and future prospects. J. Cereal Res. 13, 1–22. doi: 10.25174/2582-2675/2021/114484
Sneller, C. H., Paul, P., Guttieri, M. (2010). Characterization of resistance to fusarium head blight in an Eastern U.S. soft red winter wheat population. Crop Sci. 50, 123–133. doi: 10.2135/cropsci2008.09.0569
Souza, E. J., Guttieri, M. J., Sneller, C. (2011). Selecting soft wheat genotypes for whole grain cookies. Crop Sci. 51, 189–197. doi: 10.2135/cropsci2010.05.0263
Souza, E. J., Sneller, C., Guttieri, M. J., Sturbaum, A., Griffey, C., Sorrells, M., et al. (2012). Basis for selecting soft wheat for end-use quality. Crop Sci. 52, 21–31. doi: 10.2135/cropsci2011.02.0090
Stacklies, W., Redestig, H., Scholz, M., Walther, D., Selbig, J. (2007). pcaMethods – a bioconductor package providing PCA methods for incomplete data. Bioinformatics 23, 1164–1167. doi: 10.1093/bioinformatics/btm069
Takeuchi, Y., Lin, S. Y., Sasaki, T., Yano, M. (2003). Fine linkage mapping enables dissection of closely linked quantitative trait loci for seed dormancy and heading in rice. Theor. Appl. Genet. 107, 1174–1180. doi: 10.1007/s00122-003-1364-3
Tang, Y., Liu, X., Wang, J., Li, M., Wang, Q., Tian, F., et al. (2016). GAPIT version 2: an enhanced integrated tool for genomic association and prediction. Plant J. 9 (2). doi: 10.3835/plantgenome2015.11.0120
Torada, A., Koike, M., Ogawa, O., Tadamura, K., Wu, J., Matsumoto, T., et al. (2016). A causal gene for seed dormancy on wheat chromosome 4A encodes a MAP kinase kinase. Curr. Biol. 26, 782–787. doi: 10.1016/j.cub.2016.01.063
Vetch, J. M., Stougaard, R. N., Martin, J. M., Giroux, M. J. (2019). Review: revealing the genetic mechanisms of pre-harvest sprouting in hexaploid wheat (Triticum aestivum l.). Plant Sci. 281, 180–185. doi: 10.1016/j.plantsci.2019.01.004
Viechtbauer, W. (2010). Conducting meta-analyses in r with the metafor package. J. Stat. Software 36 (3), 1–48. doi: 10.18637/jss.v036.i03
Wang, Q., Tian, F., Pan, Y., Buckler, E. S., Zhang, Z. (2014a). A SUPER powerful method for genome wide association study. PloS One 9, e107684. doi: 10.1371/journal.pone.0107684
Wang, S., Wong, D., Forrest, K., Allen, A., Chao, S., Huang, B. E., et al. (2014b). Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array. Plant Biotech. J. 12, 787–796. doi: 10.1111/pbi.12183
Wang, J., Zhang, Z. (2021). GAPIT version 3: boosting power and accuracy for genomic association and prediction. Genomics Proteomics Bioinf. S1672-0229 (21), 00177–00172. doi: 10.1016/j.gpb.2021.08.005
Ward, B. P., Brown-Guedira, G., Tyagi, P., Kolb, F. L., Van Sanford, D. A., Sneller, C. H., et al. (2019). Multienvironment and multitrait genomic selection models in unbalanced early-generation wheat yield trials. Crop Sci. 59, 491–507. doi: 10.2135/cropsci2018.03.0189
Wilderjans, E., Pareyt, B., Goesaert, H., Brijs, K., Delcour, J. A. (2008). The role of gluten in a pound cake system: a model approach based on gluten-starch blends. Food Chem. 110, 909–915. doi: 10.1016/j.foodchem.2008.02.079
Würschum, T., Langer, S. M., Longin, C. F. H. (2015). Genetic control of plant height in European winter wheat cultivars. Theor. Appl. Genet. 128, 865–874. doi: 10.1007/s00122-015-2476-2
Würschum, T., Langer, S. M., Longin, C. F. H., Tucker, M. R., Leiser, W. L. (2017). A modern green revolution gene for reduced height in wheat. Plant J. 92, 892–903. doi: 10.1111/tpj.13726
Yanagisawa, A., Nishimura, T., Amano, Y., Torada, A., Shibata, S. (2005). Development of winter wheat with excellent resistance to pre-harvest sprouting and rain damage. Euphytica 143, 313–318. doi: 10.1007/s10681-005-7889-6
Keywords: wheat, pre-harvest spouting, flour quality, grain quality, genetics, genome-wide association
Citation: Patwa N and Penning BW (2023) Genetics of a diverse soft winter wheat population for pre-harvest sprouting, agronomic, and flour quality traits. Front. Plant Sci. 14:1137808. doi: 10.3389/fpls.2023.1137808
Received: 04 January 2023; Accepted: 02 May 2023;
Published: 06 June 2023.
Edited by:
Carolina Ballen-Taborda, Clemson University, United StatesReviewed by:
Satish Kumar, Indian Council of Agricultural Research (ICAR), IndiaSunil S. Gangurde, University of Georgia, United States
Copyright © 2023 Patwa and Penning. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bryan W. Penning, YnJ5YW4ucGVubmluZ0B1c2RhLmdvdg==
†Present address: Nisha Patwa USDA-ARS Dale Bumpers National Rice Research Center, Stuttgart, AR, United States