Corrigendum: Triticeae crop genome biology: an endless frontier
- 1State Key Laboratory of Protein and Plant Gene Research, School of Advanced Agriculture Sciences and School of Life Sciences, Peking University, Beijing, China
- 2Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Shandong, China
- 3State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
- 4University of Chinese Academy of Sciences, Beijing, China
- 5CAS-JIC Centre of Excellence for Plant and Microbial Science (CEPAMS), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
- 6State Key Laboratory for Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
- 7Peking-Tsinghua Center for Life Sciences, Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- 8State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, China
Triticeae, the wheatgrass tribe, includes several major cereal crops and their wild relatives. Major crops within the Triticeae are wheat, barley and rye which are important for human consumption, animal feed, and rangeland protection. Species within this tribe are known for their large genomes and complex genetic histories. Powered by recent advances in sequencing technology, researchers worldwide have made progress in elucidating the genomes of Triticeae crops. In addition to assemblies of high-quality reference genomes, pan-genome studies have just started to capture the genomic diversities of these species, shedding light on our understanding of the genetic basis of domestication and environmental adaptation of Triticeae crops. In this review, we focus on recent signs of progress in genome sequencing, pan-genome analyses, and resequencing analysis of Triticeae crops. We also propose future research avenues in Triticeae crop genomes, including identifying genome structure variations, the association of genomic regions with desired traits, mining functions of the non-coding area, introgression of high-quality genes from wild Triticeae resources, genome editing, and integration of genomic resources.
Introduction
Relationship between Triticeae crop
The tribe Triticeae within the subfamily Pooideae includes 27 genera, and 501 diploid and polyploid species (Soreng et al., 2015). Triticeae comprises several major crop species such as barley (Hordeum vulgare L.), rye (Secale cereale L.) and wheat, including bread wheat (Triticum aestivum L. ssp. aestivum) and durum wheat (Triticum turgidum L. ssp. durum). Wheat and barley are the founder crops of the Neolithic Agricultural Revolution in the Fertile Crescent of the Middle East, and they continue to be major cereal crops of temperate regions globally to the present day. Not surprisingly, extensive efforts have been made to study Triticeae crops throughout the world. However, Triticeae crops lagged since crop studies have entered the molecular genetic era, mainly due to the large genome sizes of Triticeae crops. After the completion of rice (Oryza sativa) and maize (Zea mays) genome sequencing, it has been relatively routine to identify causal genes underlying agronomic traits (Liu and Yan, 2019).
The bread wheat genome is substantially larger than barley and has a complex evolutionary history. Like barley, bread wheat also originated in the Fertile Crescent. This hexaploid species contains three subgenomes (A, B, and D) obtained through two hybridization events. In the first round of hybridization, two diploid wild ancestors created wild emmer wheat (T. turgidum L. ssp. dicoccoides) (BBAA) (Haas et al., 2019). The second hybridization event occurred at least 8,800 years ago and involved a cross between free-threshing tetraploid wheat and the donor of the D genome, Aegilops tauschii Coss. (DD) (Salamini et al., 2002; Zhao et al., 2023). Furthermore, the D subgenome possibly resulted from an ancient hybridization event followed by gene loss (Marcussen et al., 2014). Following these duplication events, highly similar genes coexist among the B, A, and D subgenomes. The large Triticeae crop genomes have a high content of transposable elements, which impeded genome assembly. Furthermore, wheat is polyploid, further complicating genome sequencing and assembly.
Development of sequencing technology
The genome of Arabidopsis has assembled two decades ago, which heralds the beginning of plant genome sequencing (Sun et al., 2022a). A reference-level genome provides basic information for plant research. Importantly, well-assembled and annotated genomes are even more important in studies of crop domestication, natural variation, and breeding (Sun et al., 2022a). DNA sequencing started with Sanger sequencing in 1977. Shotgun sequencing emerged in 1982. With the advent of ‘next-generation’ DNA sequencing (NGS) in 2005, the number of genome assemblies increased vastly. The third-generation sequencing technology has been developed for nearly a decade (Shendure et al., 2017). With the development of these sequencing technologies, algorithms, and software for high throughput data analysis were also continuously developed and updated. All these technological advances had made significant contributions to complex genome assembling (Shendure et al., 2017) (Figure 1). To continuously explore and update the huge genome of wheat and barley, International Wheat Genome Sequencing Consortium (IWGSC) and International Barley Genome Sequencing Consortium (IBGSC) were established. The research on functional genomics of wheat lagged far behind that of rice and maize, which restricted the development of gene cloning and molecular design breeding techniques for important agronomic traits. A high-quality reference genome sequence map is the infrastructure to achieve a breakthrough (Brenchley et al., 2012).
Figure 1 Statistics of the published Triticeae crop genomes. The contig N50 (the sequence length of the shortest contig at 50% of the total assembly size) is plotted by the year of publication. The size of each dot is the numerical value of N50. The sequencing platforms are color-coded. The sequencing technologies and the size of N50 have driven a large improvement over the years.
Triticeae crop genome biology
Using the development of sequencing technology, barley has obtained an accurate genome
Barley is the trendsetter in Triticeae genome sequencing. Many technological breakthroughs for Triticeae sequencing are first demonstrated in barley (Table 1). Barley (2n = 2x = 14, HH) is an important cereal crop with a genome size of 5.1 Gb and a repeat sequence ratio of up to 84%. The first effort was to develop a chromosome draft sequence based on flow-sorted chromosomes (Mayer et al., 2011). Following that, a whole-genome shotgun-based draft genome sequence was released for ‘Morex’, a hulled barley cultivar, and 26,159 high-confidence genes were annotated (Consortium TIBGS, 2012). By employing chromosome conformation capture (Hi-C), a highly contiguous chromosome-scale genome assembly was obtained, in which the scaffold N50 length was increased to 1.9 Mb from 1.4 kb in the previous draft genome, and 39,734 high-confidence genes were annotated (Mascher et al., 2017). Subsequently, Genebanks was established through the collection of barley data. Genebanks can provide insight into the global population structure of domesticated barley by analyzing genome-wide genotyping data for almost all barley germplasm (Milner et al., 2018).
The tool TRITEX (Illumina PE/MP, 10 × Genomics and Hi-C technology) was used to obtain a 4.65 Gb barley reference genome (Morex V2), with increased scaffold N50 up to 40.2 Mb, and 32,787 high confidence genes were annotated. The BUSCO evaluation index was increased to 97.8% from 92.5% in the Morex V1 version (Monat et al., 2019). Based on the TRITEX strategy, the Morex V3 barley reference genome with a size of 4.50 Gb was assembled by combining third-generation of sequencing techniques (PacBio long read length and cyclic consensus sequencing data; Nanopore long read length data). The scaffold N50 jumped to 118.9 Mb and 35,827 high-confidence genes were annotated (Mascher et al., 2021).
Genomic assembly of the Tibetan hulless barley and wild barley
The Tibetan hulless barley (Hordeum vulgare L. var. nudum), also called “Qingke” in Chinese, is an important staple food for Tibetans (Zeng et al., 2015). The genome of Tibetan hulless barley “Lhasa Goumang” was assembled using the whole-genome shotgun method. The assembly obtained a 3.89-Gb size genome, accounting for 87% of the Tibetan hulless barley genome, and containing 39,197 protein-coding genes (Zeng et al., 2015). With the development of the latest third-generation sequencing technology, 4.84G genome assembly of another hulless barley cv. Zangqing320 has been completed (Dai et al., 2018).
As the ancestor of cultivated barley, wild barley (Hordeum spontaneum) has important research and application value in the evolution and domestication fields of barley, as well as genetic breeding. Therefore, it is of great significance to study the beneficial genes related to disease resistance and stress tolerance in wild barley. A 4.28 Gb high-quality draft assembly of wild barley accession AWCS276 from Iran was obtained with in-depth sequencing using the whole genome shotgun sequencing (150×), accounting for 93% of the entire genome. A total of 36,395 coding protein genes were predicted using genomic and transcriptome data (Liu et al., 2019). The most effective barley genome for genetic transformation obtained a 4.13-Gb genome by adding Hi-C data to increase the contiguity to full chromosome size (Schreiber et al., 2020). As a wild species of barley and wheat, Hordeum marinum Huds. (2n=2x=14, XaXa) had excellent characteristics such as salt tolerance and resistance to water-logging, and enriched genetic resources for crop improvement. A 3816 Mb reference level genome was assembled using third-generation, second-generation, and Hi-C data, with a contig N50 6.83 Mb (Kuang et al., 2022). The genome information and efficient gene editing system of sea barley had important value for the breeding improvement of Triticeae crops.
Long-sequence sequencing technology propels the progress of the wheat genome
Following barley, the wheat genomes are unmasked step by step (Table 1). The hexaploid bread wheat genome is by far the most conspicuous in the Triticeae tribe. The Chinese landrace Chinese Spring (CS) is chosen for sequencing as the reference first. Due to the extremely complex genome of wheat (2n = 6x = 42 chromosomes), the physical map of the 3B chromosome was first constructed using bacterial artificial chromosome (BAC) (Yu et al., 2008). Six years later, the first reference sequence of the 3B chromosome was finished, providing a proof-of-concept and template for sequencing the remaining chromosomes of wheat (Choulet et al., 2014). The first draft sequence used ~60,000 Illumina-based genic sequences to assemble and putatively assign to subgenomes using Triticum monococcum, Ae. tauschii, and Aegilops speltoides sequence data, a total of 17Gb sequences were assembled, and 94,000-96,000 genes were predicted (Brenchley et al., 2012). Following that, an ordered draft sequence was produced by shotgun sequencing of isolated chromosome arms (International Wheat Genome Sequencing Consortium, 2014). In this version, 124,201 gene loci were identified across homoeologous chromosomes and subgenomes. Subsequently, the assembly length of the wheat genome gradually increased (Chapman et al., 2015; Clavijo et al., 2017; Zimin et al., 2017a). Although high sequence similarity and structural conservation are generally retained between bread wheat subgenomes and corresponding diploid and tetraploid wheat relatives, dynamic and sporadic gene gain, loss, and duplication are also evident. By combining chromosome shotgun sequencing, pseudomolecules with high-density genetic (POPSEQ), BAC-based physical mapping, Hi-C, and Bionano optical mapping, a substantially updated full reference genome release, IWGSC RefSeq v1.0, was obtained. In this version, 97% (14.1 Gb) of the assembled 14.5-Gb genome (N50 of super scaffold = 22.8 Mb) was assigned and ordered to 21 chromosomes (Appels et al., 2018). The classical BAC-by-BAC strategy provides highly ordered, high-confidence, chromosome-level sequences. This approach has been applied to the long arm of chromosome 7D (Feng et al., 2020). In the more recently updated IWGSC RefSeq v2.1 version, using optical maps and PacBio long reads, scaffolds were anchored and corrected, increasing the length of pseudomolecules by 168 Mb (Zhu et al., 2021). The 14.26 Gb genome of the French bread wheat cultivar Renan was assembled by long reads produced on the Oxford Nanopore Technology PromethION device, with a scaffold N50 of 48 Mb (Aury et al., 2022). A 14.7-Gb assembled chromosome of South African bread wheat Kariega was obtained by combining high-fidelity long reading, optical mapping, and chromosome conformation capture (Athiyannan et al., 2022). The resulting assembly sequence has an order of magnitude more continuity than previous wheat sequences.
The genomic research of AABB, AA, DD, other subgenomes, and wild and semi-wild wheat is conducted to accelerate the genomic research of wheat
To decipher the complex wheat genomes, parallel efforts have been made to sequence and assemble diploid wheat ancestors, including Triticum urartu (AA) and Ae. tauschii (DD), tetraploid wild emmer and durum wheat (BBAA), and hexaploid bread wheat (BBAADD) (Table 1). As for barley, these efforts have been accompanied by rapid progress in sequencing technologies and assembly algorithms. As such, reference genome assemblies have been substantially improved over just a few years.
For example, the first released draft of the T. urartu (AA) genome assembly employed an Illumina platform-based whole-genome shotgun sequencing strategy to obtain a total of 4.66 Gb genome sequence with a scaffold N50 length of 63.69 kb (Ling et al., 2013). By combining bacterial artificial chromosome (BAC)-by-BAC sequencing and PacBio long read-based whole-genome shotgun sequencing, the T. urartu genome assembly was subsequently improved, and a genome of 4.86 Gb in size with a scaffold N50 length of 344 kb was obtained (Ling et al., 2018). Similar progress has been made in Ae. tauschii (DD) genome sequencing (Jia et al., 2013; Luo et al., 2017; Zhao et al., 2017; Zimin et al., 2017b; Gaurav et al., 2021; Zhou et al., 2021), and emmer and durum wheat genome sequencing (Avni et al., 2017; Maccaferri et al., 2019). Through the optical maps, the genome sequence of wild emmer wheat was improved (Zhu et al., 2019).
Genomic assembly of the other varieties of hexaploid wheat
The high-quality reference genome of Tibetan semi-wild wheat was obtained with 14.71 Gb in size and 118,078 annotated genes. It is the second reference genome of hexaploid wheat published in the world (Guo et al., 2020). Using PacBio circular consensus sequencing (CCS) with the HiFi approach, the most easily transformed wheat ‘Fielder’ genome was obtained, with an N50 greater than 20 Mb (Sato et al., 2021). By combining the sequencing strategy including DeNovoMAGIC-2, PacBio sequencing, and multiple mapping techniques, a high-quality genome of nitrogen-efficient wheat variety “Kenong 9204” was assembled, with a 14.77 Gb genome size, and the N50 was 21.87 Mb (Shi et al., 2022). AK58 was an elite Chinese common wheat cultivar and had a draft genome at the chromosome level (Jizeng Jia et al., 2021). The N50 of assembling was 715 Mb. A draft genome of Tibetan semi-wild wheat Zang1817 (Triticum aestivum ssp. tibetanum Shao) was assembled with an N50 of 37.62 Mb (Guo et al., 2020).
The rye genome made a significant breakthrough in 2021
By employing these technical advances, the recently finished rye (Secale cereale L.) genome assemblies have reached a high standard (Table 1). Rye is a diploid crop grown extensively as a grain, cover crop, and forage crop especially on marginal land. Rye is exceptionally stress tolerant and has been extensively introgressed into bread wheat to improve grain yield and biotic and abiotic stress tolerance (Crespo-Herrera et al., 2017; Ayalew et al., 2018). Through shotgun sequencing of the whole genome of the winter rye inbred line Lo7, a 2.8-Gb whole-genome draft sequence of rye was obtained (Bauer et al., 2017). This reference sequence represents almost the entire low-copy portion of the rye genome. Rye is an out-crossing crop, and there are more exchanges with wild gene pools. The diploid 7.9 Gb rye genome was much larger than the syntenic diploid barley and bread wheat subgenomes (Li et al., 2021; Rabanus-Wallace et al., 2021). The two released genome assemblies were of high quality, with a scaffold N50 size of 1.04 Gb for the Weining rye genome and 29.44 Mb for the Lo7 rye genome. These high-quality rye genome assemblies reveal a recent long terminal repeat-retrotransposon burst around 0.5 million years ago, explaining the genome size expansion. Notably, a series of genes related to disease resistance, as well as genes involved in seed storage, heading data, frost tolerance, and fertility restoration, have been identified in the rye chromosome arm 1RS. This chromosome arm has been of broad interest as it has translocated to bread wheat chromosome 1BL to form 1B1R translocation lines, which show strong resistance to stripe rust diseases and powdery mildew (Crespo-Herrera et al., 2017).
Pan-genome analysis in Triticeae crop
The release of reference genomes further enabled pan-genome analyses of Triticeae crops. Triticeae cereals are widely cultivated and selected in divergent environments throughout the world. Pan-genome analyses take advantage of the rich genetic resources and century-long germplasm collection efforts for these crops. The initial efforts relied on microarrays that can detect single nucleotide polymorphism (SNP) (Winfield et al., 2012). Following that, Illumina platform-based short reads resequencing had been employed, which did not rely on prior knowledge of the genome polymorphism (Montenegro et al., 2017). However, short reads resequencing cannot identify genome structure variations, and the application of long reads-based resequencing and de novo assembly has further expanded our understanding of genome diversity (Hao et al., 2020; Jayakodi et al., 2020; Walkowiak et al., 2020). In addition to the technical evolution, the sizes of the population being analyzed have also expanded fast (Table 2).
Low-resolution pan-genome analyses uncovered the genetic adaption, domestication, and breeding history of Triticeae crops. Several recent pan-genome studies have been carried out on wheat (Montenegro et al., 2017; Walkowiak et al., 2020) and barley (Jayakodi et al., 2020). A year before the publication of Chinese Spring RefSeq genome v1.0, the first pan-genome containing 18 wheat species was constructed through highly in-depth next-generation sequencing. However, 12,150 genes with variants were not found in Chinese Spring. These genes are associated with stress and defense (Montenegro et al., 2017). Another study reported ten reference-quality pseudomolecule assemblies and five scaffold-level assemblies of hexaploid wheat. These high-quality assemblies facilitated the identification of extensive structural rearrangements and introgressions from wild relatives, which are related to grain yield and quality, resistance to stresses, and adaptation to diverse environments (Walkowiak et al., 2020). Improved long-read sequencing-based assembly held promise to identify large genome structure variations (Jayakodi et al., 2020; Walkowiak et al., 2020). In barley, a study using 20 representative accessions from 19,778 worldwide domesticated varieties not only resolved barley breeding history but also provided detailed views of two frequently identified inversions in current elite barley germplasm (Jayakodi et al., 2020).
Evolutionary genomics analysis in Triticeae crop
Before the whole genome resequencing of wheat was conducted, 500,000 SNPs were obtained through targeted re-sequencing based upon NimbleGen array technology, of which 80% belonged to A, B, and D subgenome variations and 20% belonged to inter variability among varieties (Winfield et al., 2012). Through the whole exome capture and genotyping-by-sequencing approaches, the first haplotype map of wheat was generated, detailing the genetic differences among wheat sample lines worldwide (Jordan et al., 2015). Approximately 4500 years ago, wheat was introduced to China. SNP markers reveal the genetic diversity of wheat varieties in China (Akhunov et al., 2010; Wang et al., 2014). Meanwhile, using wheat Wheat660K gene chip and DArT-seq technology, the possible transmission routes of wheat after its introduction into China from abroad were analyzed (Zhou et al., 2018). The endemic Tibetan semi-wild wheat in China had been acclimated to Tibet varieties (Guo et al., 2020).
Although different landraces and cultivars were selected by different research teams, several key common conclusions can be drawn from bread wheat resequencing studies (Balfourier et al., 2019; Hao et al., 2020; Sansaloni et al., 2020; Zhou et al., 2020; Wang et al., 2021b; Zhao et al., 2023). It has long been proposed that after its formation in the Fertile Crescent during the Neolithic period, bread wheat germplasm spread via two routes along with ancient human migration. Genome sequence analyses provided solid evidence for this hypothesis, as population structure separated European and New World varieties from Asian varieties (Balfourier et al., 2019). After their dissemination, bread wheat populations adapt to local environments to become landraces. A notable finding was that Indian dwarf wheat (T. aestivum ssp. sphaerococcum), endemic to southern Pakistan and northwestern India, was likely a geographically isolated and low-diversity taxon (Zhou et al., 2020). A genome-wide genetic variation map of wheat (VMap 1.0) was constructed by resequencing 414 wheat materials (Zhou et al., 2020). Subsequently, this map was upgraded using whole genome sequencing data from 795 wheat materials (VMap 1.1) (Zhao et al., 2023). A large fraction of genome variations present in landraces had been lost in cultivars (Sansaloni et al., 2020). In the process of origin and domestication of polyploid wheat, interploidy introgression played an important role in restoring population genetic diversity (Wang et al., 2022). These works have revealed the original mechanism of domesticated wheat and the evolution law of long-term gradual domestication, accelerating the analysis of wheat evolution (Zhou et al., 2020; Wang et al., 2022; Zhao et al., 2023) (Figure 2). Nevertheless, there have also been reshufflings over time, presumably related to modern breeding programs. For example, modern Chinese landraces possessed significant contributions from European varieties (Balfourier et al., 2019; Hao et al., 2020; Zhou et al., 2020). Through the resequencing of 145 landmark cultivars of wheat, the process of genome remodeling and optimization of wheat cultivars in China in the past 70 years has been revealed. It has been found that there were significant breeding selection asymmetries among the three subgenomes of wheat, providing a scientific basis and ideas for cultivating candidate backbone parents and assembly breeding (Hao et al., 2020). More variations from landraces had been incorporated into cultivars released after the 1960s (Balfourier et al., 2019). In the 1970s, wheat had the highest diversity of varieties in China (Wang et al., 2021b).
Figure 2 Schematic illustration of the domestication of polyploid wheat. Modern hexaploidy wheat is generated through two polyploidization events from diploid ancestors. Diploid AA is derived from wild einkorn and becomes urartu AA through domesticated einkorn. BB, as a maternal parent, hybridizes with AA, which is a paternal parent, to form a tetraploid of BBAA. Modern durum arose through domesticated emmer and free-threshing tetraploids wheat. The hybridization between BBAA and DD resulted in the final hexaploid wheat. During the process from landrace to cultivar, the interpolyploidy introgression of tetraploids from BBAA can increase the diversity of hexaploid wheat. The blue rectangle represents the domestication processes, while the gray rectangle shows different ploidy.
For the study of barley domestication trajectory, people sequenced the genome of the 6,000-year-old barley, and genetic analysis supports the current hypothesis that barley domestication occurred in the Upper Jordan Valley (Mascher et al., 2016; Russell et al., 2016). Targeted resequencing analysis showed that although the domesticated barley originated from the Fertile Crescent, the candidate domestication genes originated in its eastern and western regions (Pankin et al., 2018). Qingke was introduced to southern Tibet most likely via north Pakistan, India, and Nepal between 4,500 and 3,500 years ago. And Qingke maybe not be the origin of barley of the low genetic diversities (Zeng et al., 2018).
By whole genome sequencing and analyzing the 116 varieties containing wild, weedy, and cultivated rye germplasm resources, it was revealed that cultivated rye was domesticated from weedy rye (Sun et al., 2022b).
Alien introgressions have been identified by further including pan-genome analyses of tetraploid and diploid ears of wheat (Figure 2; Table 2) (Zhou et al., 2020). Tetraploid wheat is the primary contributor to the high genetic diversity of the bread wheat A and B subgenomes (Cheng et al., 2019; He et al., 2019; Zhou et al., 2020). Among them, free-threshing domesticated emmers contribute more diversity than wild emmers or other domesticated emmers (Zhou et al., 2020). By contrast, diploid T. urartu had made very limited introgression contributions to bread wheat, which was distinct from previous assumptions (Zhou et al., 2020). Notably, gene flow from the tetraploid emmers significantly contributed to the genetic diversity of the A and B subgenomes in bread wheat. By contrast, the barrier to gene flow from the diploid T. urartu or Ae. tauschii to bread wheat was much higher, as they cannot cross easily with bread wheat. Gene flow from wild wheat (T. turgium subsp. dicoccoides) had a significant impact on single nucleotide polymorphisms (SNPs) throughout the wheat genome (He et al., 2019; Pont et al., 2019). After the publication of the wheat genome (Appels et al., 2018), the study by Cheng et al. was the first one to analyze genetic variations using resequencing data, and it was found that the genetic diversity of hexaploid wheat originated from intraspecific and interspecific hybridization (Cheng et al., 2019). Gene flow (gene migration) and chromosome rearrangement had important contributions to the formation of genetic diversity in common wheat, and the impact of gene flow on future breeding strategies had been evaluated (Przewieslik-Allen et al., 2021). The whole genome sequencing of 3000-year-old Egyptian emmer wheat revealed genetic similarities between ancient Egyptian wheat and modern Arab and Indian wheat, indicating the early eastward and southward spread of ancient Egyptian wheat (Scott et al., 2019). Using chips and genome resequencing, population genetic analysis was conducted on 168 wild emmer wheat. It was found that the wild emmer wheat utilized different adaptive strategies in the same domain environment, resulting in different evolutionary pathways (Wang et al., 2020b). Similarly, strong genetic selection occurred in wild barley populations for soil drought adaptation (Wang et al., 2018) and soil ecological differences (Bian et al., 2020).
Large genome size and genetic complexity significantly hamper gene function analyses in Triticeae crops through map-based cloning of key regulatory genes. Pan-genome analyses provided an efficient alternative strategy for identifying genes under selection during domestication and breeding (Haas et al., 2019). New inference models and methods have been formulated for identifying selective sweeps and localizing selection target sites throughout the genome. These methods aim at uncovering beneficial alleles from the vast majority of neutral variations. In barley, pan-genome studies had identified genomic regions putatively under selection, which harbored regulatory genes underlying several morphological traits (Milner et al., 2018; Jayakodi et al., 2020). Bristling awns is a beneficial trait for seed dispersal in wild barley but a nuisance for farmers when harvesting and for animals when chewing (Elbaum et al., 2007). Regulatory genes for awn length and awn roughness were found in GWAS peaks (Milner et al., 2018). Pan-genome analyses of hexaploid and diploid wheat populations had identified regions that harbor genes encoding putative regulators of agronomic traits (Walkowiak et al., 2020; Zhou et al., 2021). For example, analysis of the brittle rachis trait revealed convergent adaptation in different wheat populations (Zhou et al., 2020). Domesticated emmer wheat and einkorn wheat harbor independent mutations in orthologs of the barley Btr1 gene, which controls seed shattering.
Identification of selected genes in Triticeae crop
When phenotypic data is available, a genome-wide association study (GWAS) is possible, in which genomes from a population are scanned for genetic variations that can be used to predict phenotypic traits (Liu and Yan, 2019). Using genomic tools to improve wheat is crucial for accelerating the improvement of trait inheritance (Russell et al., 2016; Juliana et al., 2019; Pont et al., 2019; Guo et al., 2020; Zeng et al., 2020; Joynson et al., 2021). It has been reported that 35 key traits had genomic predictability and demonstrated the potential of genomic selection for wheat. A large genome-wide association study is also conducted to identify several important loci for trait associations in 50 traits in South Asia, Africa, and the Americas (Juliana et al., 2019). In an exon sequencing analysis in wheat, 48 and 40 genomic loci were identified that they were significantly related to flowering stage and plant height variation, including known genes that control flowering stage (Ppd, VRN, FDL, WPCL) and Rht genes that affect plant height (Pont et al., 2019). Targeted sequencing data combined with genome-wide association analysis reveal genetic loci related to photosynthesis in wheat (Joynson et al., 2021). In Tibetan semi-wild wheat, important de-domestication sites were identified through association analysis, providing further evidence for de-domestication (Guo et al., 2020). Through exon sequencing analysis and a common garden experiment involving 267 landraces and wild accessions collected from diverse geographical locations, a study in barley shed light on environmental adaption, and identified allelic variation in barley associated with geographical adaptation, such as heading days related to seasonal temperature and plant height related to dryness variables (Russell et al., 2016). Using a metabolite-based genome-wide association study (mGWAS), the metabolic gene base for adapting to high levels of UV-B in highland barley was revealed, providing a reference for the improvement of highland barley varieties and the study of molecular genetic mechanisms of biological and abiotic stresses in other crops (Zeng et al., 2020). In addition, complete genomic information can facilitate the cloning of key genes. For example, after obtaining a reference-quality genome assembly from Aegilops sharonensis, the stem rust resistance gene Sr62 was cloned (Yu et al., 2022). The search for these key genes in development and resistance contributes to the development of Triticeae crop breeding.
The development of the Triticeae crop genome has greatly promoted the progress of Triticeae crop epigenome analysis
The method of mutation accumulation has been used to prove that epigenetic modifications and environmental changes can affect the rate and location of mutations in the genome, and the importance of epigenetic modifications in mutation rate variation has been demonstrated (Habig et al., 2021) (Table 3). Before the release of the IWGSC RefSeq v1.0 genome, the first epigenome map of wheat was drawn, and the results showed that the methylation of the three subgenomes and diploid wheat ancestors Aegilops tauschii is highly conserved (Gardiner et al., 2015). With the improvement of genomic information, more and more epigenetic studies have been conducted (Geng et al., 2019; Li et al., 2019; Concia et al., 2020; Yuan et al., 2020; Habig et al., 2021; Jia et al., 2021; Liu et al., 2021; Wang et al., 2021a). Genomic methylation can affect genome-specific gene expression and specific biological processes, such as affecting plant disease resistance (Geng et al., 2019; Li et al., 2019). Genome merging and segregation in tetraploid to hexaploid wheat cause dynamic and reversible DNA methylation, and these changes were related to changes in gene expression and transposon activity (Yuan et al., 2020). The histone modification of H3K27me3 is relatively stable in different ploidy wheat, but the peak levels of H3K27me2 increased with increasing ploidy levels (Liu et al., 2021). The subgenomic differentiation activity of homologous regulatory elements was related to the dynamic regulation of methylase complexes, demethylases, and H3K27me3 (Wang et al., 2021a).
The physical chromosome organization is vital in the polyploid genome (Concia et al., 2020; Jia et al., 2021; Yuan et al., 2022). Transposons play an important role in stabilizing subgenomic stability (Jia et al., 2021), which influences transcriptional regulation in wheat (Concia et al., 2020). Recent research has used OCEAN-C technology to map open chromatin interactions in different ploidy wheat, integrating chromatin accessibility, histone modification, and transcriptome to deeply analyze the long-distance interactions of open elements during the polyploidy process in hexaploidy wheat (Yuan et al., 2022).
Prospects
Sequencing technology has advanced rapidly in the past decade, inspiring the development of genome mapping, assembly methods, and population genetic algorithms (Mardis, 2017; Shendure et al., 2017; Sun et al., 2022a). For example, the recently developed fast and accurate long-read sequencing by circular consensus sequencing technology holds promise to assemble highly repetitive Triticeae genomes, as the recent pilot experiment in barley suggested (Mascher et al., 2021). In the Triticeae genome, accurate assembly of huge tandem repeats, complex chromosome rearrangements, and translocation regions are extensively existing, which may prevent precise genome assembly. In existing technologies, the assembly results can be corrected through the assistance of Hi-C data. In addition, advanced molecular cytogenetic approaches called the Oligo-FISH painting system, may provide future experimental instead of computational correction or verification of the complex Triticeae assembly (Li et al., 2020). This new technology can accurately identify non-homologous chromosomes in polyploid Triticeae crops.
How to fully exploit the identified genome variations is another challenge. Graphic-pangenomics has been an emerging method to identify regions under selection and introgression. Annotation of the genome structure variations, including coding regions and non-coding (regulatory) regions, is also a key breakthrough to expect. When applied to pan-genome studies, long reads-based resequencing and de novo assembly would identify genome structure variations. Although already applied in barley and wheat, the broader application is an urgent need to identify more large-scale variations, which often affect agronomic traits (Montenegro et al., 2017; Jayakodi et al., 2020). To systematically mine regions contributing to agronomic traits, pan-genome sequences need to be combined with high-throughput phenotypic data to accelerate breeding programs.
The exploration and utilization of key genes play an important role in promoting the genetic improvement of wheat yield. For example, the “Green Revolution” in the middle of the last century significantly reduced the plant height of wheat and achieved an increase in yield by using dwarf genes such as Rht-B1b or Rht-D1b. Similar to the development in rice (Oryza sativa) and maize (Zea mays), it has been beneficial to identify causal genes underlying agronomic traits based on complete genome reference (Liu and Yan, 2019). TaCol-B5 was found by map-based cloning, which had functions in modifying spike architecture and enhancing grain yield in wheat (Zhang et al., 2022). Based on accurate genomic information, differences in amino acid sites result in changes in gene function. A large fragment of approximately 500 Kb was missing on the short arm of chromosome 4B, leading to the loss of three closely linked genes. This site is a semi-dwarf site that synergistically enhances wheat yield and nitrogen utilization efficiency (Song et al., 2023). The comprehensive genomic information provides a foundation for the search of these genes, and the base or sequence changes in their regulatory mechanisms also depend on the accuracy of genomic information.
In addition, wheat breeding has long benefited from distant hybridization with wild relatives. Crossing with wild Triticeae species, such as Aegilops spp., Thinopyrum spp., Elytrigia elongata syn. Lophopyrum elongatum, and Hordeum spp., commonly introduce chromosome fragments into bread wheat to provide enhanced stress tolerance and/or disease resistance. For example, a series of Xiaoyan varieties, derived from the cross between bread wheat and Thinopyrum ponticum, has a high-stress tolerance and enhanced photosynthesis and had been broadly cultivated in China. Sequencing these wild species would also be a frontier in Triticeae genome biology. Recently, sequencing Thinopyrum elongatum has helped the identification of a Fusarium head blight-resistant gene that was introduced from this wild relative into bread wheat through distant hybridization breeding (Wang et al., 2020a). Genome sequencing would accelerate comparable studies to better utilize the rich genetic resource embedded in the entire Triticeae tribe.
Genomic selection (GS) is a breeding strategy of predicting and choosing the superior populations through the construction of statistical models based on the molecular markers and phenotypes of the training population (Meuwissen et al., 2001). Furthermore, the research on GS has received widespread attention and significant progress in maize and rice (Cui et al., 2019; Xiao et al., 2021). However, due to the limitations of the large and complex genome in wheat, there is relatively little progresses in this area. Recently, in wheat, it was found that plant height marker, DTHD and DTMT marker, and disease resistance marker can achieve average prediction efficiency at 74%, 76%, and 85% levels, respectively (Juliana et al., 2019). These results indicate that a small number of molecular markers can be used for GS breeding in wheat, and excellent prediction results can be achieved. These also provide directions for future GS breeding research in wheat. Firstly, due to the limited genetic variation in hexaploid wheat, fewer molecular markers can represent most variations, thus effectively predicting the agronomic traits of hexaploid wheat. Currently, this severely restricted the further improvement of wheat traits. So, expanding the GS genetic variation population, including tetraploid and distant hybridizations, is an effective approach. Secondly, the current variations in the wheat populations were got from GBS (Genotyping-by-Sequencing) method in the wheat population. It will more effectively improve the prediction efficiency of the model by capturing and modeling many variations in the wheat population, as well as structural variation. The ongoing development of third-generation sequencing combined with next-generation sequencing can assist in this process. Lastly, due to wheat being a widely cultivated crop worldwide, environmental plasticity is one of the key traits in GS, which is also an important direction for the wheat improvement of environmental adaptability. At present, there is relatively little research that integrates environmental factors into the GS models, which will also be an important research direction for applying the GS model to the production of higher yield and stronger stress resistance wheat varieties.
Author contributions
HH and YJ designed the scope of the manuscript. ZG, JB, FL, YJ, and HH together wrote the manuscript. ZG, YJ, and HH prepared the figures. All authors contributed to the article and approved the submitted version.
Funding
The work of the authors is funded by the Strategic Priority Research Program of the Chinese Academy of Sciences XDA24020203 (YJ) and XDA24020201 (FL), by Shandong Provincial Natural Science Foundation grants ZR2021ZD30 and ZR2022ZD22 (HH and YJ), and by the National Natural Science Foundation of China, grants 31921005 (YJ and FL), and 31825002 (YJ).
Acknowledgments
We apologize to all colleagues whose work could not be cited due to space limitations.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Akhunov, E. D., Akhunova, A. R., Anderson, O., Anderson, J. A., Blake, N., Clegg, M. T., et al. (2010). Nucleotide diversity maps reveal variations in diversity among wheat genomes and chromosomes. BMC Genomics 11, 1–22. doi: 10.1186/1471-2164-11-702
Appels, R., Eversole, K., Stein, N., Feuillet, C., Keller, B., Rogers, J., et al. (2018). Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science 361, 1–13. doi: 10.1126/science.aar7191
Athiyannan, N., Abrouk, M., Boshoff, W. H. P., Cauet, S., Rodde, N., Kudrna, D., et al. (2022). Long-read genome sequencing of bread wheat facilitates disease-resistance gene cloning. Nat. Genet. 54, 227–231. doi: 10.1038/s41588-022-01022-1
Aury, J. M., Engelen, S., Istace, B., Monat, C., Lasserre-Zuber, P., Belser, C., et al. (2022). Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding. Gigascience 11, 1–18. doi: 10.1093/gigascience/giac034
Avni, R., Nave, M., Barad, O., Baruch, K., O. Twardziok, S., Gundlach, H., et al. (2017). Wild emmer genome architecture and diversity elucidate wheat evolution and domestication. Science 357, 93–97. doi: 10.1126/science.aan0032
Ayalew, H., Kumssa, T. T., Butler, T. J., Ma, X. F. (2018). Triticale improvement for forage and cover crop uses in the southern great plains of the united states. Front. Plant Sci. 9, 1–9. doi: 10.3389/fpls.2018.01130
Balfourier, F., Bouchet, S., Robert, S., De Oliveira, R., Rimbert, H., Kitt, J., et al. (2019). Worldwide phylogeography and history of wheat genetic diversity. Sci. Adv. 5, 1–10. doi: 10.1126/sciadv.aav0536
Bauer, E., Schmutzer, T., Barilar, I., Mascher, M., Gundlach, H., Martis, M. M., et al. (2017). Towards a whole-genome sequence for rye (Secale cereale l.). Plant J. 89, 853–869. doi: 10.1111/tpj.13436
Bian, J., Cui, L., Wang, X., Yang, G., Huo, F., Ling, H., et al. (2020). Genomic and phenotypic divergence in wild barley driven by microgeographic adaptation. Adv. Sci. 7, 1–13. doi: 10.1002/advs.202000709
Brenchley, R., Spannagl, M., Pfeifer, M., Barker, G. L., D'Amore, R., Allen, A. M., et al. (2012). Analysis of the bread wheat genome using whole-genome shotgun sequencing. Nature 491, 705–710. doi: 10.1038/nature11650
Chapman, J. A., Mascher, M., Buluç, A., Barry, K., Georganas, E., Session, A., et al. (2015). A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome. Genome Biol. 16, 26. doi: 10.1186/s13059-015-0582-8
Cheng, H., Liu, J., Wen, J., Nie, X., Xu, L., Chen, N., et al. (2019). Frequent intra- and inter-species introgression shapes the landscape of genetic variation in bread wheat. Genome Biol. 20, 136. doi: 10.1186/s13059-019-1744-x
Choulet, F., Alberti, A., Theil, S., Glover, N., Barbe, V., Daron, J., et al. (2014). Structural and functional partitioning of bread wheat chromosome 3B. Science 345, 1–7. doi: 10.1126/science.1249721
Clavijo, B. J., Venturini, L., Schudoma, C., Accinelli, G. G., Kaithakottil, G., Wright, J., et al. (2017). An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations. Genome Res. 27, 885–896. doi: 10.1101/gr.217117.116
Concia, L., Veluchamy, A., Ramirez-Prado, J. S., Martin-Ramirez, A., Huang, Y., Perez, M., et al. (2020). Wheat chromatin architecture is organized in genome territories and transcription factories. Genome Biol. 21, 104. doi: 10.1186/s13059-020-01998-1
Consortium TIBGS. (2012). A physical, genetic, and functional sequence assembly of the barley genome. Nature 491, 711–716. doi: 10.1038/nature11543
Crespo-Herrera, L. A., Garkava-Gustavsson, L., Ahman, I. (2017). A systematic review of rye (Secale cereale l.) as a source of resistance to pathogens and pests in wheat (Triticum aestivum l.). Hereditas 154, 14. doi: 10.1186/s41065-017-0033-5
Cui, Y., Li, R., Li, G., Zhang, F., Zhu, T., Zhang, Q., et al. (2019). Hybrid breeding of rice via genomic selection. Plant Biotechnol. J. 18, 57–67. doi: 10.1111/pbi.13170
Dai, F., Wang, X., Zhang, X.-Q., Chen, Z., Nevo, E., Jin, G., et al. (2018). Assembly and analysis of a qingke reference genome demonstrate its close genetic relation to modern cultivated barley. Plant Biotechnol. J. 16, 760–770. doi: 10.1111/pbi.12826
Elbaum, R., Zaltzman, L., Burgert, I., Fratzl, P. (2007). The role of wheat awns in the seed dispersal unit. Science 316, 884–886. doi: 10.1126/science.1140097
Feng, K., Cui, L., Wang, L., Shan, D., Tong, W., Deng, P., et al. (2020). The improved assembly of 7DL chromosome provides insight into the structure and evolution of bread wheat. Plant Biotechnol. J. 18, 732–742. doi: 10.1111/pbi.13240
Fradgley, N., Gardner, K. A., Cockram, J., Elderfield, J., Hickey, J. M., Howell, P., et al. (2019). A large-scale pedigree resource of wheat reveals evidence for adaptation and selection by breeders. PloS Biol. 17, 1–20. doi: 10.1371/journal.pbio.3000071
Gardiner, L.-J., Quinton-Tulloch, M., Olohan, L., Price, J., Hall, N., Hall, A. (2015). A genome-wide survey of DNA methylation in hexaploid wheat. Genome Biol. 16, 273. doi: 10.1186/s13059-015-0838-3
Gaurav, K., Arora, S., Silva, P., Sánchez-Martín, J., Horsnell, R., Gao, L., et al. (2021). Population genomic analysis of aegilops tauschii identifies targets for bread wheat improvement. Nat. Biotechnol. 40, 422–431. doi: 10.1038/s41587-021-01058-4
Geng, S., Kong, X., Song, G., Jia, M., Guan, J., Wang, F., et al. (2019). DNA Methylation dynamics during the interaction of wheat progenitor aegilops tauschii with the obligate biotrophic fungus blumeria graminis f. sp. tritici. New Phytol. 221, 1023–1035. doi: 10.1111/nph.15432
Guo, W., Xin, M., Wang, Z., Yao, Y., Hu, Z., Song, W., et al. (2020). Origin and adaptation to high altitude of Tibetan semi-wild wheat. Nat. Commun. 11, 5085. doi: 10.1038/s41467-020-18738-5
Haas, M., Schreiber, M., Mascher, M. (2019). Domestication and crop evolution of wheat and barley: genes, genomics, and future directions. J. Integr. Plant Biol. 61, 204–225. doi: 10.1111/jipb.12737
Habig, M., Lorrain, C., Feurtey, A., Komluski, J., Stukenbrock, E. H. (2021). Epigenetic modifications affect the rate of spontaneous mutations in a pathogenic fungus. Nat. Commun. 12, 5869. doi: 10.1038/s41467-021-26108-y
Hao, C., Jiao, C., Hou, J., Li, T., Liu, H., Wang, Y., et al. (2020). Resequencing of 145 landmark cultivars reveals asymmetric sub-genome selection and strong founder genotype effects on wheat breeding in China. Mol. Plant 13, 1733–1751. doi: 10.1016/j.molp.2020.09.001
He, F., Pasam, R., Shi, F., Kant, S., Keeble-Gagnere, G., Kay, P., et al. (2019). Exome sequencing highlights the role of wild-relative introgression in shaping the adaptive landscape of the wheat genome. Nat. Genet. 51, 896–904. doi: 10.1038/s41588-019-0382-2
International Wheat Genome Sequencing Consortium (2014). A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. Science 345, 1–11. doi: 10.1126/science.1251788
Jayakodi, M., Padmarasu, S., Haberer, G., Bonthala, V. S., Gundlach, H., Monat, C., et al. (2020). The barley pan-genome reveals the hidden legacy of mutation breeding. Nature 588, 284–289. doi: 10.1038/s41586-020-2947-8
Jia, J., Xie, Y., Cheng, J., Kong, C., Wang, M., Gao, L., et al. (2021). Homology-mediated inter-chromosomal interactions in hexaploid wheat lead to specific subgenome territories following polyploidization and introgression. Genome Biol. 22, 26. doi: 10.1186/s13059-020-02225-7
Jia, J., Zhao, S., Kong, X., Li, Y., Zhao, G., He, W., et al. (2013). Aegilops tauschii draft genome sequence reveals a gene repertoire for wheat adaptation. Nature 496, 91–95. doi: 10.1038/nature12028
Jizeng Jia, K. W., Zhao, G., Yan, X., Xu, S., Li, D., Jiao, Y., et al. (2021). Draft genome in chromosome level for AK58. Available at: https://ngdc.cncb.ac.cn/gwh/Assembly/23313/show.
Jordan, K. W., Wang, S., Lun, Y., Gardiner, L. J., MacLachlan, R., Hucl, P., et al. (2015). A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes. Genome Biol. 16, 48. doi: 10.1186/s13059-015-0606-4
Joynson, R., Molero, G., Coombes, B., Gardiner, L. J., Rivera-Amado, C., Pinera-Chavez, F. J., et al. (2021). Uncovering candidate genes involved in photosynthetic capacity using unexplored genetic variation in spring wheat. Plant Biotechnol. J. 19, 1537–1552. doi: 10.1111/pbi.13568
Juliana, P., Poland, J., Huerta-Espino, J., Shrestha, S., Crossa, J., Crespo-Herrera, L., et al. (2019). Improving grain yield, stress resilience and quality of bread wheat using large-scale genomics. Nat. Genet. 51, 1530–1539. doi: 10.1038/s41588-019-0496-6
Kuang, L., Shen, Q., Chen, L., Ye, L., Yan, T., Chen, Z.-H., et al. (2022). The genome and gene editing system of sea barleygrass provide a novel platform for cereal domestication and stress tolerance studies. Plant Commun. 3, 1–12. doi: 10.1016/j.xplc.2022.100333
Li, Z., Wang, M., Lin, K., Xie, Y., Guo, J., Ye, L., et al. (2019). The bread wheat epigenomic map reveals distinct chromatin architectural and evolutionary features of functional genetic elements. Genome Biol. 20, 139. doi: 10.1186/s13059-019-1746-8
Li, G., Wang, L., Yang, J., He, H., Jin, H., Li, X., et al. (2021). A high-quality genome assembly highlights rye genomic characteristics and agronomically important genes. Nat. Genet. 53, 574–584. doi: 10.1038/s41588-021-00808-z
Li, G., Zhang, T., Yu, Z., Wang, H., Yang, E., Yang, Z. (2020). An efficient oligo-FISH painting system for revealing chromosome rearrangements and polyploidization in triticeae. Plant J. 105, 978–993. doi: 10.1111/tpj.15081
Ling, H. Q., Ma, B., Shi, X., Liu, H., Dong, L., Sun, H., et al. (2018). Genome sequence of the progenitor of wheat a subgenome triticum urartu. Nature 557, 424–428. doi: 10.1038/s41586-018-0108-0
Ling, H. Q., Zhao, S., Liu, D., Wang, J., Sun, H., Zhang, C., et al. (2013). Draft genome of the wheat a-genome progenitor triticum urartu. Nature 496, 87–90. doi: 10.1038/nature11997
Liu, M., Li, Y., Ma, Y., Zhao, Q., Stiller, J., Feng, Q., et al. (2019). The draft genome of a wild barley genotype reveals its enrichment in genes related to biotic and abiotic stresses compared to cultivated barley. Plant Biotechnol. J. 18, 443–456. doi: 10.1111/pbi.13210
Liu, H. J., Yan, J. (2019). Crop genome-wide association study: a harvest of biological relevance. Plant J. 97, 8–18. doi: 10.1111/tpj.14139
Liu, Y., Yuan, J., Jia, G., Ye, W., Jeffrey Chen, Z., Song, Q. (2021). Histone H3K27 dimethylation landscapes contribute to genome stability and genetic recombination during wheat polyploidization. Plant J. 105, 678–690. doi: 10.1111/tpj.15063
Luo, M. C., Gu, Y. Q., Puiu, D., Wang, H., Twardziok, S. O., Deal, K. R., et al. (2017). Genome sequence of the progenitor of the wheat d genome aegilops tauschii. Nature 551, 498–502. doi: 10.1038/nature24486
Maccaferri, M., Harris, N. S., Twardziok, S. O., Pasam, R. K., Gundlach, H., Spannagl, M., et al. (2019). Durum wheat genome highlights past domestication signatures and future improvement targets. Nat. Genet. 51, 885–895. doi: 10.1038/s41588-019-0381-3
Marcussen, T., Sandve S, R., Heier, L., Spannagl, M., Pfeifer, M., International Wheat Genome Sequencing Consortium, et al. (2014). Ancient hybridizations among the ancestral genomes of bread wheat. Science 345, 1–4. doi: 10.1126/science.1250092
Mardis, E. R. (2017). DNA Sequencing technologies: 2006-2016. Nat. Protoc. 12, 213–218. doi: 10.1038/nprot.2016.182
Mascher, M., Gundlach, H., Himmelbach, A., Beier, S., Twardziok, S. O., Wicker, T., et al. (2017). A chromosome conformation capture ordered sequence of the barley genome. Nature 544, 427–433. doi: 10.1038/nature22043
Mascher, M., Schuenemann, V. J., Davidovich, U., Marom, N., Himmelbach, A., Hübner, S., et al. (2016). Genomic analysis of 6,000-year-old cultivated grain illuminates the domestication history of barley. Nat. Genet. 48, 1089–1093. doi: 10.1038/ng.3611
Mascher, M., Wicker, T., Jenkins, J., Plott, C., Lux, T., Koh, C. S., et al. (2021). Long-read sequence assembly: a technical evaluation in barley. Plant Cell 33, 1888–1906. doi: 10.1093/plcell/koab077
Mayer, K. F., Martis, M., Hedley, P. E., Simkova, H., Liu, H., Morris, J. A., et al. (2011). Unlocking the barley genome by chromosomal and comparative genomics. Plant Cell 23, 1249–1263. doi: 10.1105/tpc.110.082537
Meuwissen, T. H. E., Hayes, B. J., Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829. doi: 10.1093/genetics/157.4.1819
Milner, S. G., Jost, M., Taketa, S., Mazón, E. R., Himmelbach, A., Oppermann, M., et al. (2018). Genebank genomics highlights the diversity of a global barley collection. Nat. Genet. 51, 319–326. doi: 10.1038/s41588-018-0266-x
Monat, C., Padmarasu, S., Lux, T., Wicker, T., Gundlach, H., Himmelbach, A., et al. (2019). TRITEX: chromosome-scale sequence assembly of triticeae genomes with open-source tools. Genome Biol. 20, 284. doi: 10.1186/s13059-019-1899-5
Montenegro, J. D., Golicz, A. A., Bayer, P. E., Hurgobin, B., Lee, H., Chan, C. K. K., et al. (2017). The pangenome of hexaploid bread wheat. Plant J. 90, 1007–1013. doi: 10.1111/tpj.13515
Pankin, A., Altmuller, J., Becker, C., von Korff, M. (2018). Targeted resequencing reveals genomic signatures of barley domestication. New Phytol. 218, 1247–1259. doi: 10.1111/nph.15077
Pont, C., Leroy, T., Seidel, M., Tondelli, A., Duchemin, W., Armisen, D., et al. (2019). Tracing the ancestry of modern bread wheats. Nat. Genet. 51, 905–911. doi: 10.1038/s41588-019-0393-z
Przewieslik-Allen, A. M., Wilkinson, P. A., Burridge, A. J., Winfield, M. O., Dai, X., Beaumont, M., et al. (2021). The role of gene flow and chromosomal instability in shaping the bread wheat genome. Nat. Plants 7, 172–183. doi: 10.1038/s41477-020-00845-2
Rabanus-Wallace, M. T., Hackauf, B., Mascher, M., Lux, T., Wicker, T., Gundlach, H., et al. (2021). Chromosome-scale genome assembly provides insights into rye biology, evolution and agronomic potential. Nat. Genet. 53, 564–573. doi: 10.1038/s41588-021-00807-0
Russell, J., Mascher, M., Dawson, I. K., Kyriakidis, S., Calixto, C., Freund, F., et al. (2016). Exome sequencing of geographically diverse barley landraces and wild relatives gives insights into environmental adaptation. Nat. Genet. 48, 1024–1030. doi: 10.1038/ng.3612
Salamini, F., Ozkan, H., Brandolini, A., Schafer-Pregl, R., Martin, W. (2002). Genetics and geography of wild cereal domestication in the near east. Nat. Rev. Genet. 3, 429–441. doi: 10.1038/nrg817
Sansaloni, C., Franco, J., Santos, B., Percival-Alwyn, L., Singh, S., Petroli, C., et al. (2020). Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints. Nat. Commun. 11, 4572. doi: 10.1038/s41467-020-18404-w
Sato, K., Abe, F., Mascher, M., Haberer, G., Gundlach, H., Spannagl, M., et al. (2021). Chromosome-scale genome assembly of the transformation-amenable common wheat cultivar 'Fielder'. DNA Res. 28, 1–7. doi: 10.1093/dnares/dsab008
Schreiber, M., Mascher, M., Wright, J., Padmarasu, S., Himmelbach, A., Heavens, D., et al. (2020). A genome assembly of the barley ‘Transformation reference’ cultivar golden promise. G3-Genes Genom. Genet. 10, 1823–1827. doi: 10.1534/g3.119.401010
Scott, M. F., Botigue, L. R., Brace, S., Stevens, C. J., Mullin, V. E., Stevenson, A., et al. (2019). A 3,000-year-old Egyptian emmer wheat genome reveals dispersal and domestication history. Nat. Plants 5, 1120–1128. doi: 10.1038/s41477-019-0534-5
Shendure, J., Balasubramanian, S., Church, G. M., Gilbert, W., Rogers, J., Schloss, J. A., et al. (2017). DNA Sequencing at 40: past, present and future. Nature 550, 345–353. doi: 10.1038/nature24286
Shi, X., Cui, F., Han, X., He, Y., Zhao, L., Zhang, N., et al. (2022). Comparative genomic and transcriptomic analyses uncover the molecular basis of high nitrogen-use efficiency in the wheat cultivar kenong 9204. Mol. Plant 15, 1440–1456. doi: 10.1016/j.molp.2022.07.008
Song, L., Liu, J., Cao, B., Liu, B., Zhang, X., Chen, Z., et al. (2023). Reducing brassinosteroid signalling enhances grain yield in semi-dwarf wheat. Nature 617, 118–124. doi: 10.1038/s41586-023-06023-6
Soreng, R. J., Peterson, P. M., Romaschenko, K., Davidse, G., Zuloaga, F. O., Judziewicz, E. J., et al. (2015). A worldwide phylogenetic classification of the poaceae (Gramineae). J. Syst. Evol. 53, 117–137. doi: 10.1111/jse.12150
Sun, Y., Shang, L., Zhu, Q. H., Fan, L., Guo, L. (2022a). Twenty years of plant genome sequencing: achievements and challenges. Trends Plant Sci. 27, 391–401. doi: 10.1016/j.tplants.2021.10.006
Sun, Y., Shen, E., Hu, Y., Wu, D., Feng, Y., Lao, S., et al. (2022b). Population genomic analysis reveals domestication of cultivated rye from weedy rye. Mol. Plant 15, 552–561. doi: 10.1016/j.molp.2021.12.015
Walkowiak, S., Gao, L., Monat, C., Haberer, G., Kassa, M. T., Brinton, J., et al. (2020). Multiple wheat genomes reveal global variation in modern breeding. Nature 588, 277–283. doi: 10.1038/s41586-020-2961-x
Wang, X., Chen, Z. H., Yang, C., Zhang, X., Jin, G., Chen, G., et al. (2018). Genomic adaptation to drought in wild barley is driven by edaphic natural selection at the tabigha evolution slope. Proc. Natl. Acad. Sci. USA 115, 5223–5228. doi: 10.1073/pnas.1721749115
Wang, Z., Hao, C., Zhao, J., Li, C., Jiao, C., Xi, W., et al. (2021b). Genomic footprints of wheat evolution in China reflected by a Wheat660K SNP array. Crop J. 9, 29–41. doi: 10.1016/j.cj.2020.08.006
Wang, M., Li, Z., Zhang, Y., Zhang, Y., Xie, Y., Ye, L., et al. (2021a). An atlas of wheat epigenetic regulatory elements reveals subgenome divergence in the regulation of development and stress responses. Plant Cell 33, 865–881. doi: 10.1093/plcell/koab028
Wang, H., Sun, S., Ge, W., Zhao, L., Hou, B., Wang, K., et al. (2020a). Horizontal gene transfer of Fhb7 from fungus underlies fusarium head blight resistance in wheat. Science 368, 1–7. doi: 10.1126/science.aba5435
Wang, Z., Wang, W., Xie, X., Wang, Y., Yang, Z., Peng, H., et al. (2022). Dispersed emergence and protracted domestication of polyploid wheat uncovered by mosaic ancestral haploblock inference. Nat. Commun. 13, 3891. doi: 10.1038/s41467-022-31581-0
Wang, S., Wong, D., Forrest, K., Allen, A., Chao, S., Huang, B. E., et al. (2014). Characterization of polyploid wheat genomic diversity using a high-density 90,000 single nucleotide polymorphism array. Plant Biotechnol. J. 12, 787–796. doi: 10.1111/pbi.12183
Wang, H., Yin, H., Jiao, C., Fang, X., Wang, G., Li, G., et al. (2020b). Sympatric speciation of wild emmer wheat driven by ecology and chromosomal rearrangements. Proc. Natl. Acad. Sci. USA 117, 5955–5963. doi: 10.1073/pnas.1920415117
Winfield, M. O., Wilkinson, P. A., Allen, A. M., Barker, G. L., Coghill, J. A., Burridge, A., et al. (2012). Targeted re-sequencing of the allohexaploid wheat exome. Plant Biotechnol. J. 10, 733–742. doi: 10.1111/j.1467-7652.2012.00713.x
Xiao, Y., Jiang, S., Cheng, Q., Wang, X., Yan, J., Zhang, R., et al. (2021). The genetic mechanism of heterosis utilization in maize improvement. Genome Biol. 22, 1–29. doi: 10.1186/s13059-021-02370-7
Yu, H., Braun P, A., Yıldırım, M., Lemmens, I., Venkatesan, K., Sahalie, J., et al. (2008). B-genome-A physical map of the 1-gigabase bread wheat chromosome 3B. Science 322, 101–104. doi: 10.1126/science.1161847
Yu, G., Matny, O., Champouret, N., Steuernagel, B., Moscou, M. J., Hernández-Pinzón, I., et al. (2022). Aegilops sharonensis genome-assisted identification of stem rust resistance gene Sr62. Nat. Commun. 13, 1–13. doi: 10.1038/s41467-022-29132-8
Yuan, J., Jiao, W., Liu, Y., Ye, W., Wang, X., Liu, B., et al. (2020). Dynamic and reversible DNA methylation changes induced by genome separation and merger of polyploid wheat. BMC Biol. 18, 171. doi: 10.1186/s12915-020-00909-x
Yuan, J., Sun, H., Wang, Y., Li, L., Chen, S., Jiao, W., et al. (2022). Open chromatin interaction maps reveal functional regulatory elements and chromatin architecture variations during wheat evolution. Genome Biol. 23, 34. doi: 10.1186/s13059-022-02611-3
Zeng, X., Guo, Y., Xu, Q., Mascher, M., Guo, G., Li, S., et al. (2018). Origin and evolution of qingke barley in Tibet. Nat. Commun. 9, 5433. doi: 10.1038/s41467-018-07920-5
Zeng, X., Long, H., Wang, Z., Zhao, S., Tang, Y., Huang, Z., et al. (2015). The draft genome of Tibetan hulless barley reveals adaptive patterns to the high stressful Tibetan plateau. Proc. Natl. Acad. Sci. USA 112, 1095–1100. doi: 10.1073/pnas.1423628112
Zeng, X., Yuan, H., Dong, X., Peng, M., Jing, X., Xu, Q., et al. (2020). Genome-wide dissection of co-selected UV-b responsive pathways in the UV-b adaptation of qingke. Mol. Plant 13, 112–127. doi: 10.1016/j.molp.2019.10.009
Zhang, X., Jia, H., Li, T., Wu, J., Nagarajan, R., Lei, L., et al. (2022). TaCol-B5 modifies spike architecture and enhances grain yield in wheat. Science 376, 180–183. doi: 10.1126/science.abm0717
Zhao, X., Guo, Y., Kang, L., Yin, C., Bi, A., Xu, D., et al. (2023). Population genomics unravels the Holocene history of bread wheat and its relatives. Nat. Plants 9, 403–419. doi: 10.1038/s41477-023-01367-3
Zhao, G., Zou, C., Li, K., Wang, K., Li, T., Gao, L., et al. (2017). The aegilops tauschii genome reveals multiple impacts of transposons. Nat. Plants 3, 946–955. doi: 10.1038/s41477-017-0067-8
Zhou, Y., Bai, S., Li, H., Sun, G., Zhang, D., Ma, F., et al. (2021). Introgressing the aegilops tauschii genome into wheat as a basis for cereal improvement. Nat. Plants 7, 774–786. doi: 10.1038/s41477-021-00934-w
Zhou, Y., Chen, Z., Cheng, M., Chen, J., Zhu, T., Wang, R., et al. (2018). Uncovering the dispersion history, adaptive evolution and selection of wheat in China. Plant Biotechnol. J. 16, 280–291. doi: 10.1111/pbi.12770
Zhou, Y., Zhao, X., Li, Y., Xu, J., Bi, A., Kang, L., et al. (2020). Triticum population sequencing provides insights into wheat adaptation. Nat. Genet. 52, 1412–1422. doi: 10.1038/s41588-020-00722-w
Zhu, T., Wang, L., Rimbert, H., Rodriguez, J. C., Deal, K. R., De Oliveira, R., et al. (2021). Optical maps refine the bread wheat triticum aestivum cv. Chinese spring genome assembly. Plant J. 107, 303–314. doi: 10.1111/tpj.15289
Zhu, T., Wang, L., Rodriguez, J. C., Deal, K. R., Avni, R., Distelfeld, A., et al. (2019). Improved genome sequence of wild emmer wheat zavitan with the aid of optical maps. G3-Genes Genom. Genet. 9, 619–624. doi: 10.1534/g3.118.200902
Zimin, A. V., Puiu, D., Hall, R., Kingan, S., Clavijo, B. J., Salzberg, S. L. (2017a). The first near-complete assembly of the hexaploid bread wheat genome, triticum aestivum. GigaScience 6, 1–7. doi: 10.1093/gigascience/gix097
Keywords: Triticeae, wheat, barley, rye, genome sequencing, pan-genome
Citation: Gao Z, Bian J, Lu F, Jiao Y and He H (2023) Triticeae crop genome biology: an endless frontier. Front. Plant Sci. 14:1222681. doi: 10.3389/fpls.2023.1222681
Received: 15 May 2023; Accepted: 04 July 2023;
Published: 20 July 2023.
Edited by:
Xiaozeng Yang, Beijing Academy of Agricultural and Forestry Sciences, ChinaReviewed by:
Zujun Yang, University of Electronic Science and Technology of China, ChinaLiangliang Gao, Kansas State University, United States
Copyright © 2023 Gao, Bian, Lu, Jiao and He. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hang He, aGVoYW5nQHBrdS5lZHUuY24=; Yuling Jiao, eXVsaW5nLmppYW9AcGt1LmVkdS5jbg==
†These authors have contributed equally to this work