- 1Research Centre for Cereal and Industrial Crops (CREA-CI), CREA—Council for Agricultural Research and Economics, Foggia, Italy
- 2Department of Agricultural Sciences, University of Naples Federico II, Portici, Italy
- 3Institute of Biosciences and Bioresources (CNR-IBBR), Bari, Italy
- 4Department of Agriculture and Forest Sciences (DAFNE), University of Tuscia, Viterbo, Italy
Although wheat (Triticum aestivum L.) is the main staple crop in the world and a major source of carbohydrates and proteins, functional genomics and allele mining are still big challenges. Given the advances in next-generation sequencing (NGS) technologies, the identification of causal variants associated with a target phenotype has become feasible. For these reasons, here, by combining sequence capture and target-enrichment methods with high-throughput NGS re-sequencing, we were able to scan at exome-wide level 46 randomly selected bread wheat individuals from a recombinant inbred line population and to identify and classify a large number of single nucleotide polymorphisms (SNPs). For technical validation of results, eight randomly selected SNPs were converted into Kompetitive Allele-Specific PCR (KASP) markers. This resource was established as an accessible and reusable molecular toolkit for allele data mining. The dataset we are making available could be exploited for novel studies on bread wheat genetics and as a foundation for starting breeding programs aimed at improving different key agronomic traits.
Introduction
Bread wheat (Triticum aestivum L. 2n = 6x = 42, AABBDD) is a major staple crop that provides about 20% of daily calories and 21% of protein needs consumed by the world population (FAO 2017, http://www.fao.org/3/a-i6583e.pdf). However, to meet global food demand without expanding acreage, wheat grain production is projected to increase by at least 50% within the next few decades (Tshikunde et al., 2019). This means that the average annual genetic gain of wheat is expected to increase 1.0%–1.7% per year, reaching global production of 1 billion tons in 2050 (Le Mouël et al., 2017). Considering also the need to adopt environmentally sustainable agronomic practices capable of mitigating the effect of climate change on soil degradation and water scarcity (https://www.ipcc.ch/report/srccl/), the next challenges appear to be very demanding for wheat breeders and scientists (De Vita and Taranto, 2019; Senapati et al., 2019; Xiong et al., 2021; The grand challenge of breeding by design, 2022). In addition, the genomic resources of wheat are scarce compared with those of other cereals such as rice and maize, mainly due to its complex and polyploid genome. Polyploidy, in fact, on the one hand confers a high degree of plasticity to the organism, on the other hand it makes genetic analysis more difficult for scientists and breeders aimed at dissecting the molecular basis underlying quantitative and qualitative traits (Dubcovsky et al., 2007; Bevan et al., 2017). So far, most efforts to understand the genetic basis of key traits related to yield, grain quality and adaptability have been made through map-based cloning approaches (summarized by Colasuonno et al., 2021; Soriano et al., 2021; Arrigada et al., 2021). For example, genes involved in vernalization and photoperiod response (Yan et al., 2003, 2004; Díaz et al., 2012), grain protein content (Uauy et al., 2006), grain quality (Jin et al., 2018; Goel et al., 2019; Semagn et al., 2021) stem solidness (Nilsen et al., 2020), male sterility (Ni et al., 2017; Xia et al., 2017), and resistance to fungal diseases (Fu et al., 2009) have been identified and cloned. However, this approach is costly and time-consuming, as multiple steps are required, from developing specific mapping populations, through identifying target loci using co-segregating genetic markers, to sequencing relevant loci.
The availability of the reference genome of the allohexaploid landrace Chinese spring (IWGSC et al., 2018; Zhu et al., 2021) has allowed the application and combination of high-throughput sequencing methods to map, identify and clone candidate genes much faster than in the recent past (Henry et al., 2014; Dong et al., 2020; Martinez et al., 2020; Xie et al., 2020). Genome availability has resulted in a remarkable change in bread wheat genetics, which now has a powerful tool for re-sequencing new accessions and for investigating sequence variations across the entire genome. Despite the availability of a gold-standard reference genome, some target genes can only be found in certain cultivars and be absent in the reference accession. For this reason, several strategies have been successfully developed with the aim of reducing genome complexity and sequencing costs and favoring the discovery of a large number of accession-specific variants in wheat (Krasileva et al., 2017; He et al., 2019). For example, genotyping-by-sequencing (GBS) (Elshire et al., 2011; Poland et al., 2012) has been used in wheat to perform genome-wide association studies (GWAS) and quantitative trait loci (QTL) mapping, as well as to disclose patterns of genetic variation (Bernardo et al., 2015; Juliana et al., 2019; Blackburn et al., 2021). Likewise, whole exome sequencing (WES) allows for the identification of nucleotide variability across the exome, i.e., the exon sequences of all protein-coding genes in a genome (Uavy et al., 2017; Mo et al., 2018; He et al., 2019). WES data of nearly 500 accessions from all over the world has been used to reveal the wheat breeding history (Pont et al., 2019). In a separate study, the exome of ∼900 hexaploid and tetraploid wheat accessions has been selectively captured and sequenced to understand how wild-relative introgression enables adaptation in modern bread wheat (He et al., 2019). WES has also been successfully coupled with bulked-segregant analysis (BSA) to identify candidate genes associated with key agronomic traits, as it dramatically reduces genotyping costs by using selective sampling, and the statistical power in QTL-mapping is comparable to that of full-population analysis (Gardiner et al., 2016; Mo et al., 2018; Martinez et al., 2020). For example, Martinez et al. (2020) combined WES with BSA to map ethylmethanesulfonate mutations and identified a novel allele linked to the wheat ERA8 ABA-hypersensitive germination phenotype. Within this motivating context, this study aimed to determine the efficacy of WES for identifying useful alleles within a recombinant inbred line (RIL) population of 46 individuals plus the two parents selected from a previous work in which the entire RIL population was used to identify genomic regions associated with target traits: plant height (PH), juvenile growth habit (GH), heading date (HD), fertile tiller number (FTN) and total tiller number (TTN) (Vitale et al., 2021). We identified many single nucleotide polymorphisms (SNPs) that were classified based on their genomic location and putative biological effect.
To validate the sequencing results, eight SNP-containing coding sequences were used to develop Kompetitive Allele-Specific PCR (KASP) markers to detect and distinguish specific alleles in the entire population. This resource has been primarily established to be used for allele mining (Kumar et al., 2010) and for BSA. The carefully phenotyped genetic materials can be effectively used to identify QTL and trait-associated genes, develop gene markers, and build genomics-assisted prediction models in bread wheat. Indeed, we believe that data FAIRability is an essential prerequisite to ensure the reuse of data and knowledge for downstream investigations, alone or in combination with newly generated data.
Methods
Plant material and DNA extraction
Forty-six individuals, derived from the tails of a bi-parental recombinant inbred line population (RILF6:7), previously studied for five high-correlated morpho-physiological traits (i.e., plant height, juvenile growth habit, total tiller number, fertile tiller number and heading date) (Vitale et al., 2021), were analyzed. The population of 176 RILs was developed from a cross between Lankaodali and Rebelde bread wheat cultivars, using the single seed descent method by advancing random F2 plants to the F6:F7 generation (Vitale et al., 2021). The two parental accessions were characterized by contrasting quantitative and qualitative agronomic traits (Botticella et al., 2018; Vitale et al., 2021).
Lankaodali is an early bread wheat cultivar of Chinese origin with very large kernels, low tillering ability, and poor qualitative attributes. Rebelde is an Italian cultivar with late flowering, small kernels, high tillering ability and excellent grain quality traits. During the 2020–2021 growing season, the leaves of 14-day-old seedlings of each of 48 individuals were collected and ground using liquid nitrogen. DNA was extracted using the Quick-DNA Plant/Seed Miniprep Kit (Zymo Research, United States) according to the manufacturer’s instructions. DNA quality and quantity were estimated using the NanoDrop ND-1000 spectrophotometer (Thermo Scientific, Wilmington, DE, United States) and the Qubit fluorometer (Invitrogen, Carlsbad, CA, United States), respectively.
Exome capture
In-solution-based hybridization was applied to capture the target loci. Baits (304,327) were designed to specifically capture over 250 Mb of coding DNA sequences (CDS) (myBaits®, Arbor Biosciences, Ann Arbour, MI, United States; http://www.arborbiosci.com). Bait design was based on the Chinese spring wheat genome v1.0 (IWGSC et al., 2018) and was carried out following the manufacturer’s protocol version 3.01 (https://arborbiosci.com/wp-content/uploads/2020/01/myBaitsExpert_WheatExome_Product_Sheet.pdf; accessed 14/07/2022).
As shown in Supplementary Table S1, the baits were evenly distributed along the genome, with the lowest number (10,345 baits) on chromosome 4D and the highest (17,587 baits) on chromosome 2B.
Illumina sequencing and data processing
The extracted DNA (250 ng −1 μg) was subjected to random mechanical shearing to obtain fragments with an average size of 400 bp. The fragments underwent an A-tailing reaction at 3’ of the blunt-end, where barcoded adapters were then ligated. Libraries were paired-end sequenced on an Illumina NovaSeq 6,000 platform. After Illumina sequencing, an average of 153 million raw reads per individual were obtained, ranging from 96 M (sample 21) to 230 M reads (sample 31) (Supplementary Table S2). Overall sequencing quality was assessed by FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and reads with a base quality score below 20 (Q < 20) were removed using Trimmomatic (Bolger et al., 2014) (SLIDINGWINDOW = 6). High quality reads were then aligned to the Chinese spring wheat genome v1.0 (IWGSC et al., 2018) with BWA-MEM (Li et al., 2009) (MINIMUMSEEDLENGTH = 19; BANDWIDTH = 100). SAMtools (Heng et al., 2009) were used to convert SAM files to BAM; the latter were processed by the MarkDuplicates utility of Picard version 1.109, (http://picard.sourceforge.net), to remove duplicate reads. Reads with mapping quality scores below 30 (Q < 30) were also filtered out. QualiMap (Garcia Alcalde et al., 2009) was used to evaluate the effectiveness of the mapping process: the percentage of mapped reads was greater than 99% for each individual with an average error rate of 0.74% (Supplementary Table S2). The reads-to-genome mapping produced over 55% of on-target reads for all individuals except sample 26, which showed the lowest percentage (∼47%) (Supplementary Figure S1). Hybridization capture methods are known to be prone to off-target enrichment and capture (Kaur et al., 2017) and in polyploid species target specificity and efficiency is affected by the presence of homoeologous sequences (King et al., 2015). This reflects the percentage of on-target reads we have achieved which is consistent with what obtained by King et al. (2015).
The CoverageBed utility of the BEDtools package (http://bedtools.readthedocs.org/) was used to derive the depth of coverage of the target regions for each individual. On average, more than 50% of the bases in the bait regions were covered at a depth greater than 20x (Figure 1A). Coverage per individual averaged 13.7x, with most individuals showing a mean coverage between 4x and 35x (Supplementary Table S3). We also estimated the depth of coverage per-base at the gene level (Figure 1B). As an example, Figure 1B shows per-base depth of coverage for the TraesCS2B01G175300 gene in all individuals. As expected, the depth of coverage was fairly uniform across exons and very shallow for introns.
FIGURE 1. (A) Cumulative distribution of coverage depth across target region in 48 wheat individuals. The graph highlights the fraction of bases captured in the target regions covered at a depth ranging between 0x–400x. (B) Per-base depth of coverage for the TraesCS2B01G175300 gene in all individuals. The exon (yellow boxes) -intron (black lines) gene structure is shown above the multi-line graph.
Polymorphism discovery, variant annotation, and biological effect prediction
SNPs were identified using the Genome Analysis Toolkit (GATK) version 4.0 (Van der Auwera et al., 2013), following the best practices recommended by the GATK documentation. The VariantRecalibrator utility was used to distinguish true genetic variants from sequencing or data processing artifacts (false positives) and remove the latter from the raw VCF (Variant Call Format) file. A total of 15,046,465 SNPs with a sequencing and alignment quality score ≥30 (Phred-scaled) and a coverage ≥10 were identified. The VCF-stat utility in VCFtools (King et al., 2015) was used to retrieve the statistics by sample. The number of variants ranged from 509,707 SNPs (sample 21) to 1,939,789 (sample 37), with a mean of 1,062,065 SNPs. To assess the level of residual heterozygosity for each individual, we calculated the percentage of heterozygosity at each locus by dividing the number of heterozygous genotypes for a given locus by the total number of individuals. This number was averaged across all the chromosomes and returned residual heterozygosity equal to 5.85% (Supplementary Figure S2) (Danecek et al., 2011). As expected, transitions (Ts) were the most abundant (∼70%), while transversions (Tv) accounted for ∼30%. Their ratio was 2.11. SnpEff (Cingolani et al., 2012; Velmurugan et al., 2018) was used to classify variants based on genomic location and biological effect (Supplementary Figures S3, S4). As observed by Suren et al. (2016), we found that some of the variants also came from non-target regions, although bait design was exclusively based on annotated exons. Indeed, most SNPs were detected in the intergenic (47%) and upstream gene regions (17%), while approximately 26% fell within exons (Supplementary Figure S3). A stacked bar chart showing SNPs grouped by their impact (low, moderate, and high) is reported in Supplementary Figure S4. SNPs with “moderate” impact on protein functioning were the most abundant (52%). Variants marked as “low” impact accounted, on average, for 46% of all variants (Supplementary Figure S4). Finally, the variants estimated to have a deleterious impact on gene functioning (i.e., “high” impact) accounted for 2% on average. The latter are to be considered the most interesting variants, since they might determine phenotypes of interest.
Data validation and quality control
The conversion of SNP markers into KASP markers is particularly challenging in polyploid crops, due to the presence of homoeologous sequences (Makhoul et al., 2020). To validate the quality of polymorphism discovery and variant annotation generated by exome capture, we designed and tested eight KASP assays that distinguished between alleles for the 48 individuals (Figure 2). SNP markers were selected following different criteria: presence of polymorphism between parental lines, sequencing depth, biological effect, randomness. The KASP primers design was carried out using the commercial KASP assay design service (KASP-by-Design) developed by LGC Biosearch Technologies. For each SNP, 100-bp flanking regions were obtained from the reference genome and converted into KASP to detect the specific parental allele. The LGC Genomics (Hoddeson, United Kingdom) designed two allele-specific forward primers carrying the standard FAM- or VIC-compatible tail (FAM: 5′-GAAGGTGACCAAGTTCATGCT-3′; VIC: 5′-GAAGGTCGGAGTCAACGGATT-3′) with the targeted SNP at the 3′ end (Supplementary Table S4). Almost all primer pairs had perfect match with the target sequences and mismatches at 3’ tail with the other wheat sub-genomes. DNA samples were arrayed into a 96-well PCR plate, each containing ∼5 µl reaction mix (45 ng of dry DNA, 2.5 µl of 1 × KASP master mixture, and 0.1 µl of primer mix). Primer mix included a final concentration of 30 µM of the common primer and 10 µM of each tail primer. PCR experiments were performed using the ABI ViiA7 instrument (Applied Biosystems, Foster City, CA, United States) as follows: initial denaturation at 94°C for 15 min, followed by ten touchdown cycles (94°C for 20 s; touchdown at an initial temperature of 61°C and decreasing by −0, 6°C per cycle for 60 s) reaching a final annealing temperature of 55°C, followed by 26 additional annealing cycles (94°C for 20 s; 55°C for 60 s). Fluorescence readings were performed with a temperature below 37 °C and allelic discrimination plots were drawn using the SNP viewer software (https://www.biosearchtech.com/support/tools/genotyping-software/snpviewer). Individuals with contrasting alleles at each SNP locus (0/0 corresponds to the reference allele, 1/1 to the alternative allele, 0/1 to a heterozygous locus, and “N/A” to a missing data point) were genotyped using the KASP assays (Supplementary Table S5). Results showed that roughly 80% of alleles for all loci were scored identically between WES data and KASP genotyping, including those with low coverage (Supplementary Table S5).
FIGURE 2. Scatter plot for eight Kompetitive Allele-Specifc PCR (KASP) marker assays in 48 wheat varieties. KASP assays showing clustering of individuals on the X-(FAM) and Y-(HEX) axes. Red individuals have the HEX-type allele; blue individuals have the FAM-type allele. In both cases, individuals are homozygous for the reference or alternate allele. Green individuals are heterozygous for the allele. Black dots represent negative control and pink dots uncallable genotypes.
Potential reuse
In addition to sequence data and sequence variations, we will make available the plant material described in this paper under a standard material transfer agreement This will allow the scientific community to identify marker-trait associations linked to key agronomic traits (including those mentioned above) assessed in other environments (year, site, management), and, thus, detect stable QTL based on genotype-environment (GxE) interactions.
Indeed, SNP discovery supports the use of bulked segregant analysis as a powerful tool to accelerate gene identification and QTL mapping cost-effectively.
In addition, the data described here could be used to dissect naturally occurring allelic variation at candidate genes controlling key agronomic traits, identify useful alleles (i.e., loci affecting the traits of interest) and understanding their function.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, Accession Number PRJNA821683; https://figshare.com/, DOI: https://doi.org/10.6084/m9.figshare.19485569.v1.
Author contributions
Conceptualization, PD, FT, GS, ND; genetic materials, DL, FS, GS, and PD; Data curation, SE and ND; Formal analysis, SE; Investigation, SE, ND, FT, and PD; validation, SE and ND; writing—original draft, SE; writing—review and editing, SE, ND, PD, and FT. funding acquisition, PD; All authors have read and agreed to the published version of the manuscript.
Funding
This research was partially funded by the Ministry of Agricultural, Food and Forestry Policies (MIPAAF) within the project CERESBIO “Identification of soft and hard wheat genotypes resistant to common bunt and to foot root diseases”, and by the European Union Next-Generation EU (Piano Nazionale di Ripresa e Resilienza (PNRR)–Missione 4 Componente 2, Investimento 1.4–D.D. 1032 17/06/2022, CN00000022) within the Agritech National Research Center. This manuscript reflects only the authors’ views and opinions, neither the European Union nor the European Commission can be considered responsible for them.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.1058471/full#supplementary-material
References
Arriagada, O., Gadaleta, A., Marcotuli, I., Maccaferri, M., Campana, M., Reveco, S., et al. (2022). A comprehensive meta-QTL analysis for yield-related traits of durum wheat (Triticum turgidum L. var. durum) grown under different water regimes. Front. Plant Sci. 13, 984269. doi:10.3389/fpls.2022.984269
Bernardo, A., Wang, S., Amand, P., and Bai, G. (2015). Using Next Generation Sequencing for multiplexed trait-linked markers in wheat. PLoS ONE 10, e0143890. doi:10.1371/journal.pone.0143890
Bevan, M. W., Uauy, C., Wulff, B. B. H., Zhou, J., Krasileva, K., and Clark, M. D. (2017). Genomic innovation for crop improvement. Nature 543, 346–354. doi:10.1038/nature22011
Blackburn, A., Sidhu, G., Schillinger, W. F., Skinner, D., and Gill, K. (2021). QTL mapping using GBS and SSR genotyping reveals genomic regions controlling wheat coleoptile length and seedling emergence. Euphytica 217, 45. doi:10.1007/s10681-021-02778-z
Bolger, A. M., Lohse, M., and Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. doi:10.1093/bioinformatics/btu170
Botticella, E., Pucci, A., and Sestili, F. (2018). Molecular characterisation of two novel starch granule proteins 1 in wild and cultivated diploid A genome wheat species. J. Plant Res. 131, 487–496. doi:10.1007/s10265-017-1005-6
Cingolani, P., Platts, A., Wang le, L., Coon, M., Nguyen, T., Wang, L., et al. (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118, Fly (Austin), 80–92. 6, iso-2; iso-3. 6.
Colasuonno, P., Marcotuli, I., Gadaleta, A., and Soriano, J. M. (2021). From genetic maps to QTL cloning: An overview for durum wheat. Plants 10, 315. doi:10.3390/plants10020315
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., et al. (2011). The variant call format and VCFtools. Bioinformatics 27, 2156–2158. doi:10.1093/bioinformatics/btr330
De Vita, P., and Taranto, F. (2019). “Durum wheat (Triticum turgidum ssp. durum) breeding to meet the challenge of climate change,” in Advances in plant breeding strategies: Cereals. Editors J. Al-Khayri, S. Jain, and D. Johnson (Heidelberg, Germany: Springer).
Díaz, A., Zikhali, M., Turner, A. S., Isaac, P., and Laurie, D. A. (2012). Copy number variation affecting the Photoperiod-B1 and Vernalization-A1 genes is associated with altered flowering time in wheat (Triticum aestivum). PLoS One 7, e33234. doi:10.1371/journal.pone.0033234
Dong, C., Zhang, L., Chen, Z., Xia, C., Gu, Y., Wang, J., et al. (2020). Combining a new exome capture panel with an effective VARbscore algorithm accelerates BSA-based gene cloning in wheat. Front. Plant Sci. 11, 1249. doi:10.3389/fpls.2020.01249
Dubcovsky, J., and Dvorak, J. (2007). Genome plasticity a key factor in the success of polyploid wheat under domestication. Science 316, 1862–1866. doi:10.1126/science.1143986
Elshire, R. J., Glaubitz, J. C., Sun, Q., Poland, J. A., Kawamoto, K., Buckler, E. S., et al. (2011). A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6, e19379. doi:10.1371/journal.pone.0019379
FAO (2017). The future of food and agriculture: Trends and challenges. Food and Agriculture Organization of the United Nations, Rome, Italy. Available online at:accessed March 28th, 2022] http://www.fao.org/3/a-i6583e.pdf: Fao.
Fu, D., Uauy, C., Distelfeld, A., Blechl, A., Epstein, L., Chen, X., et al. (2009). A kinase-START gene confers temperature-dependent resistance to wheat stripe rust. Science 323, 1357–1360. doi:10.1126/science.1166289
García-Alcalde, F., Okonechnikov, K., Carbonell, J., Cruz, L. M., Götz, S., Tarazona, S., et al. (2012). Qualimap: Evaluating next-generation sequencing alignment data. Bioinformatics 28, 2678–2679. doi:10.1093/bioinformatics/bts503
Gardiner, L. J., Bansept-Basler, P., Olohan, L., Joynson, R., Brenchley, R., Hall, N., et al. (2016). Mapping-by-sequencing in complex polyploid genomes using genic sequence capture: A case study to map yellow rust resistance in hexaploid wheat. Plant J. 87, 403–419. doi:10.1111/tpj.13204
Goel, S., Singh, K., Singh, B., Grewal, S., Dwivedi, N., Alqarawi, A. A., et al. (2019). Analysis of genetic control and QTL mapping of essential wheat grain quality traits in a recombinant inbred population. PLoS One 14, e0200669. doi:10.1371/journal.pone.0200669
He, F., Pasam, R., Shi, F., Kant, S., Keeble-Gagnere, G., Kay, P., et al. (2019). Exome sequencing highlights the role of wild-relative introgression in shaping the adaptive landscape of the wheat genome. Nat. Genet. 51, 896–904. doi:10.1038/s41588-019-0382-2
Heng, L., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. doi:10.1093/bioinformatics/btp352
Henry, I., Nagalakshmi, U., Lieberman, M. C., Ngo, K. J., Krasileva, K. V., Vasquez-Gross, H., et al. (2014). Efficient genome-wide detection and cataloging of ems-induced mutations using exome capture and next-generation sequencing. Plant Cell 26, 1382–1397. doi:10.1105/tpc.113.121590
Jin, J., Duan, S., Qi, Y., Yan, S., Li, W., Li, B., et al. (2020). Identification of a novel genomic region associated with resistance to Fusarium crown rot in wheat. Theor. Appl. Genet. 133, 2063–2073. doi:10.1007/s00122-020-03577-1
Juliana, P., Poland, J., Huerta-Espino, J., Shrestha, S., Crossa, J., Crespo-Herrera, L., et al. (2019). Improving grain yield, stress resilience and quality of bread wheat using large-scale genomics. Nat. Genet. 51, 1530–1539. doi:10.1038/s41588-019-0496-6
Kaur, P., and Gaikwad, K. (2017). From genomes to GENE-omes: Exome sequencing concept and applications in crop improvement. Front. Plant Sci. 19, 2164. doi:10.3389/fpls.2017.02164
King, R., Bird, N., Ramirez-Gonzalez, R., Coghill, J. A., Patil, A., Hassani-Pak, K., et al. (2015). Mutation scanning in wheat by exon capture and next-generation sequencing. PLoS ONE 10, e0137549. doi:10.1371/journal.pone.0137549
Krasileva, K. V., Vasquez-Gross, H. A., Howell, T., Bailey, P., Paraiso, F., Clissold, L., et al. (2017). Uncovering hidden variation in polyploid wheat. Proc. Natl. Acad. Sci. U. S. A. 114, E913-E921–E921. doi:10.1073/pnas.1619268114
Kumar, G. R., Sakthivel, K., Sundaram, R. M., Neeraja, C. N., Balachandran, S. M., Rani, N. S., et al. (2010). Allele mining in crops: Prospects and potentials. Biotechnol. Adv. 28, 451–461. doi:10.1016/j.biotechadv.2010.02.007
Le Mouël, C., and Forslund, A. (2017). How can we feed the world in 2050? A review of the responses from global scenario studies. Eur. Rev. Agric. Econ. 44, 541–591. doi:10.1093/erae/jbx006
Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. doi:10.1093/bioinformatics/btp324
Makhoul, M., Rambla, C., Voss-Fels, K. P., Hickey, L. T., Snowdon, R. J., and Obermeier, C. (2020). Overcoming polyploidy pitfalls: A user guide for effective SNP conversion into KASP markers in wheat. Theor. Appl. Genet. 133, 2413–2430. doi:10.1007/s00122-020-03608-x
Martinez, S. A., Shorinola, O., Conselman, S., See, D., Skinner, D. Z., Uauy, C., et al. (2020). Exome sequencing of bulked segregants identified a novel TaMKK3-A allele linked to the wheat ERA8 ABA-hypersensitive germination phenotype. Theor. Appl. Genet. 133, 719–736. doi:10.1007/s00122-019-03503-0
Mo, Y., Howell, T., Vasquez-Gross, H., de Haro, L. A., Dubcovsky, J., and Pearce, S. (2018). Mapping causal mutations by exome sequencing in a wheat TILLING population: A tall mutant case study. Mol. Genet. Genomics 293, 463–477. doi:10.1007/s00438-017-1401-6
Ni, F., Qi, J., Hao, Q., Lyu, B., Luo, M. C., Wang, Y., et al. (2017). Wheat Ms2 encodes for an orphan protein that confers male sterility in grass species. Nat. Commun. 8, 15121. doi:10.1038/ncomms15121
Nilsen, K. T., Walkowiak, S., Xiang, D., Gao, P., Quilichini, T. D., Willick, I. R., et al. (2020). Copy number variation of TdDof controls solid-stemmed architecture in wheat. Proc. Natl. Acad. Sci. U. S. A. 117, 28708–28718. doi:10.1073/pnas.2009418117
Poland, J. A., Brown, P. J., Sorrells, M. E., and Jannink, J. L. (2012). Development of high-density genetic maps for Barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PLoS ONE 7, e32253. doi:10.1371/journal.pone.0032253
Pont, C., Leroy, T., Seidel, M., Tondelli, A., Duchemin, W., Armisen, D., et al. (2019). Tracing the ancestry of modern bread wheats. Nat. Genet. 51, 905–911. doi:10.1038/s41588-019-0393-z
Semagn, K., Iqbal, M., Chen, H., Perez-Lara, E., Bemister, D. H., Xiang, R., et al. (2021). Physical mapping of QTL associated with agronomic and end-use quality traits in spring wheat under conventional and organic management systems. Theor. Appl. Genet. 134, 3699–3719. doi:10.1007/s00122-021-03923-x
Senapati, N., Stratonovitch, P., Paul, M. J., and Semenov, M. A. (2019). Drought tolerance during reproductive development is important for increasing wheat yield potential under climate change in Europe. J. Exp. Bot. 9, 2549–2560. doi:10.1093/jxb/ery226
Shifting the limits in wheat research and breeding using a fully annotated reference genome. The International Wheat Genome Sequencing Consortium. Science 17, 6403.
Soriano, J. M., Colasuonno, P., Marcotuli, I., and Gadaleta, A. (2021). Meta-QTL analysis and identification of candidate genes for quality, abiotic and biotic stress in durum wheat. Sci. Rep. 11, 11877. doi:10.1038/s41598-021-91446-2
Suren, H., Hodgins, K. A., Yeaman, S., Nurkowski, K. A., Smets, P., Rieseberg, L. H., et al. (2016). Exome capture from the spruce and pine giga-genomes. Mol. Ecol. Resour. 16, 1136–1146. doi:10.1111/1755-0998.12570
Tshikunde, N. M., Mashilo, J., Shimelis, H., and Odindo, A. (2019). Agronomic and physiological traits, and associated quantitative trait loci (QTL) affecting yield response in wheat (Triticum aestivum L.): A review. Front. Plant Sci. 10, 1428. doi:10.3389/fpls.2019.01428
Uauy, C., Distelfeld, A., Fahima, T., Blechl, A., and Dubcovsky, J. (2006). A NAC gene regulating senescence improves grain protein, zinc, and iron content in wheat. Science 314, 1298–1301. doi:10.1126/science.1133649
Uauy, C., Wulff, B. B. H., and Dubcovsky, J. (2017). Combining traditional mutagenesis with new high-throughput sequencing and genome editing to reveal hidden variation in polyploid wheat. Annu. Rev. Genet. 51, 435–454. doi:10.1146/annurev-genet-120116-024533
Van der Auwera, G. A., Carneiro, M. O., Hartl, C., Poplin, R., Del Angel, G., Levy-Moonshine, A., et al. (2013). From FastQ data to high-confidence variant calls: The genome analysis toolkit best practices pipeline. Curr. Protoc. Bioinforma. 43, 11. doi:10.1002/0471250953.bi1110s43
Velmurugan, J., Milbourne, D., Connolly, V., Heslop-Harrison, J. S., Anhalt, U. C. M., Lynch, M. B., et al. (2018). An immortalized genetic mapping population for perennial ryegrass: A resource for phenotyping and complex trait mapping. Front. Plant Sci. 9, 717. doi:10.3389/fpls.2018.00717
Vitale, P., Fania, F., Esposito, S., Pecorella, I., Pecchioni, N., Palombieri, S., et al. (2021). QTL Analysis of five morpho-physiological traits in bread wheat using two mapping populations derived from common parents. Genes 12, 604. doi:10.3390/genes12040604
Xia, C., Zhang, L., Zou, C., Gu, Y., Duan, J., Zhao, G., et al. (2017). A TRIM insertion in the promoter of Ms2 causes male sterility in wheat. Nat. Commun. 8, 15407. doi:10.1038/ncomms15407
Xie, J., Guo, G., Wang, Y., Hu, T., Wang, L., Li, J., et al. (2020). A rare single nucleotide variant in Pm5e confers powdery mildew resistance in common wheat. New Phytol. 228, 1011–1026. doi:10.1111/nph.16762
Xiong, W., Reynolds, M. P., Crossa, J., Schulthess, U., Sonder, K., Montes, C., et al. (2021). Increased ranking change in wheat breeding under climate change. Nat. Plants 7, 1207–1212. doi:10.1038/s41477-021-00988-w
Yan, L., Loukoianov, A., Blechl, A., Tranquilli, G., Ramakrishna, W., SanMiguel, P., et al. (2004). The wheat VRN2 gene is a flowering repressor down-regulated by vernalization. Science 303, 1640–1644. doi:10.1126/science.1094305
Yan, L., Loukoianov, A., Tranquilli, G., Helguera, M., Fahima, T., and Dubcovsky, J. (2003). Positional cloning of the wheat vernalization gene VRN1. Proc. Natl. Acad. Sci. U. S. A. 100, 6263–6268. doi:10.1073/pnas.0937399100
Keywords: wheat, exome capture, target-enrichment, recombinant inbred lines, bulked segregant analysis (BSA), single nucleotide polymorphisms (SNPs)
Citation: Esposito S, D’Agostino N, Taranto F, Sonnante G, Sestili F, Lafiandra D and De Vita P (2022) Whole-exome sequencing of selected bread wheat recombinant inbred lines as a useful resource for allele mining and bulked segregant analysis. Front. Genet. 13:1058471. doi: 10.3389/fgene.2022.1058471
Received: 04 October 2022; Accepted: 07 November 2022;
Published: 22 November 2022.
Edited by:
Satinder Kaur, Punjab Agricultural University, IndiaReviewed by:
Pramod Prasad, ICAR-Indian Institute of Wheat and Barley Research, Regional Station, IndiaRenan Santos Uhdre, Washington State University, United States
Guriqbal Singh Dhillon, Thapar Institute of Engineering and Technology, India
Copyright © 2022 Esposito, D’Agostino, Taranto, Sonnante, Sestili, Lafiandra and De Vita. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Pasquale De Vita, cGFzcXVhbGUuZGV2aXRhQGNyZWEuZ292Lml0