Genotyping by Sequencing Advancements in Barley

Rajendran, Nirmal Raj; Qureshi, Naeela; Pourkheirandish, Mohammad

doi:10.3389/fpls.2022.931423

REVIEW article

Front. Plant Sci. , 08 August 2022

Sec. Plant Breeding

Volume 13 - 2022 | https://doi.org/10.3389/fpls.2022.931423

This article is part of the Research Topic Use of Barley and Wheat Reference Sequences: Downstream Applications in Breeding, Gene Isolation, GWAS and Evolution, Volume II View all 17 articles

Genotyping by Sequencing Advancements in Barley

Nirmal Raj Rajendran¹

Naeela Qureshi²

Mohammad Pourkheirandish¹^*

¹Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Parkville, VIC, Australia
²International Maize and Wheat Improvement Center (CIMMYT), El Batan, Texcoco, Estado de Mexico, Mexico

Barley is considered an ideal crop to study cereal genetics due to its close relationship with wheat and diploid ancestral genome. It plays a crucial role in reducing risks to global food security posed by climate change. Genetic variations in the traits of interest in crops are vital for their improvement. DNA markers have been widely used to estimate these variations in populations. With the advancements in next-generation sequencing, breeders could access different types of genetic variations within different lines, with single-nucleotide polymorphisms (SNPs) being the most common type. However, genotyping barley with whole genome sequencing (WGS) is challenged by the higher cost and computational demand caused by the large genome size (5.5GB) and a high proportion of repetitive sequences (80%). Genotyping-by-sequencing (GBS) protocols based on restriction enzymes and target enrichment allow a cost-effective SNP discovery by reducing the genome complexity. In general, GBS has opened up new horizons for plant breeding and genetics. Though considered a reliable alternative to WGS, GBS also presents various computational difficulties, but GBS-specific pipelines are designed to overcome these challenges. Moreover, a robust design for GBS can facilitate the imputation to the WGS level of crops with high linkage disequilibrium. The complete exploitation of GBS advancements will pave the way to a better understanding of crop genetics and offer opportunities for the successful improvement of barley and its close relatives.

Introduction

Barley, Hordeum vulgare, is one of the earliest domesticated crop species that has played a critical role in the development of human civilization (Pankin and Von Korff, 2017). It is the fourth most important cultivated cereal crop globally (Pham et al., 2019), produced mainly for livestock feeding and the brewing industry (Ullrich, 2010). Barley is an ideal model crop for the tribe Triticeae due to its diploid nature, the low number of chromosomes (n = 7), close relationship with wheat, and a wide diversity with no crossing barriers between cultivated forms and their wild progenitors (Pourkheirandish and Komatsuda, 2007; Romero et al., 2018; Harwood, 2019). There are approximately 400,000 Hordeum accessions registered in various genebanks around the world that serve as a valuable resource for crop breeding to solve the important global challenges in agriculture (Knüpffer, 2009; FAO, 2010; Galluzzi et al., 2020). A robust barley reference genome with high-quality annotation and a pan-genome assembly with novel genetic variations is now publicly available, enabling the analysis of high-throughput sequencing data (Mascher et al., 2017; Jayakodi et al., 2020).

In the wake of unprecedented climate change and decline in the area of arable land, crop improvement has become an uphill task for plant breeders (Wang et al., 2019). Crop management and plant breeding strategies offer large potential to tackle present and future challenges of crop production (Wang et al., 2015). Breeding for combining desirable traits is the most sustainable, economical and efficient way of crop improvement. Integrating precise high-throughput phenotyping approaches with genome sequencing allows the identification of critical genomic regions controlling important agronomic traits. The selection of cultivars for traits of interest based on DNA markers is known as marker-assisted selection (MAS). With the development and easy access to a wide range of DNA markers and genetic maps, it is now possible to carry out MAS for traits governed by major as well as minor genes called quantitative trait loci (QTL) (Babu et al., 2004). Molecular markers are not affected by environmental factors, and selection can be performed at the early stages of plant development (Hasan et al., 2021). For the successful application of MAS, the precise location of QTL, the availability of closely linked markers and the effect of QTL within the genome must be studied in advance (Wang et al., 2016). Once a close linkage disequilibrium between a trait and molecular markers is established, the markers can be used to select desirable traits during breeding cycles (Vafadar Shamasbi et al., 2017; Bhavani et al., 2021). MAS becomes highly unreliable when dealing with complex quantitative traits regulated by many minor QTLs. The effects of these minor QTLs determined by linkage mapping and genome-wide association mapping is limited due to its reliance on statistical power, hence require a very large population to identify the desirable QTL combinations (Bhat et al., 2016).

Molecular markers can be categorized into three classes based on their mechanism of detection: hybridization, polymerase chain reaction (PCR) and sequencing. Hybridization-based molecular markers require a segment of DNA known as a probe to identify an individual. In plants, restriction fragment length polymorphism (RFLP) was the first-generation of hybridization-based molecular markers. The application of RFLP was very limited due to its low-throughput nature, difficulties in performing hybridization and the low polymorphism ratio due to the use of restriction enzymes at random. The hybridization-based markers were gradually replaced by the more efficient and inexpensive PCR-based tags, in which a specific segment of DNA is multiplied millions of times (Singh and Singh, 2015). There are many types of PCR-based markers, including randomly amplified polymorphic DNA (RAPD), sequence characterized amplified region (SCAR), cleaved amplified polymorphic sequence (CAPS), inter-simple sequence repeats (ISSRs), amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSRs). Among these, SSRs have proved to be one of the most efficient markers as they are present in abundance to detect a high level of polymorphism (Deschamps et al., 2012). Most PCR-based molecular markers were traditionally designed or identified based on a minimal set of sequence data. Recently, sequencing-based markers have gained traction since variations at single-nucleotide resolution can be directly selected and effectively utilized as genetic markers, owing to their abundance in all species (Garrido-Cardenas et al., 2018).

Scientists employed Sanger Sequencing method in many projects advancing the era of sequencing-based markers, including expressed sequence tags (ESTs) which proved to be a steppingstone to identify single-nucleotide polymorphisms (SNPs) in the expressed region of the genome (Wang et al., 1998). However, due to high cost and low throughput, Sanger sequencing is replaced by high throughput and more cost-efficient approaches including hybridization-based microarrays and next-generation sequencing (NGS) technologies. The hybridization-based microarrays are performed by designing thousands of probes based on the available sequence data on a small chip to detect polymorphisms. Microarray-based high-throughput SNP genotyping platforms have been developed over the years and many commercial platforms are now available such as, Illumina Infinium II, Axiom BioBank and Ion Torrent (Gupta et al., 2008; Phan and Sim, 2017). In barley, an array-based platform with 1,572 SNP markers was first introduced in 2006 with the development of Illumina GoldenGate assays (Close et al., 2009). The 9 k Illumina Infinium iSelect BeadChip succeeded the GoldenGate assays in 2009 (Comadran et al., 2012). The latest addition to this line of high-throughput genotyping platform is the 50 k Illumina Infinium iSelect SNP array (Bayer et al., 2017). These SNP markers are now being widely used in genome-wide studies (Nielsen et al., 2011).

Researchers often use NGS techniques on a large scale to assess genetic variations in a wide range of crops (Poland and Rife, 2012). Genome sequencing protocols have been developed in combination with bioinformatics procedures, including SNP marker discovery and genotyping. The most popular technologies are multiplexed shotgun sequencing (MSG), restriction enzyme based genotyping by sequencing (GBS) and whole genome sequencing (WGS) to access sequence variation in the studied species (Yang et al., 2017). In general, GBS has been referred to all those approaches aiming to construct a reduced genome representation libraries for sequencing. There has been a growing preference for using reduced genome sequencing protocols rather than whole genome sequencing, especially for crops with large genome size and high levels of linkage disequilibrium due to cost effectiveness, with an efficient sequence analysis performed in crops even without a robust reference sequence, often referred to as de novo sequencing (Lu et al., 2015; Darrier et al., 2019). However, the application of GBS over other genotyping approaches such as, array-based genotyping and WGS depends upon the research objectives, resource availability, timeframe of project and the skillset of the personnel’s involved. The cost effectiveness can be mainly attributed to the simplified genome, the inexpensive barcoding system used for multiplexing in every GBS strategy (He et al., 2014), and the increased sequencing capacity per run in platforms. The SNP data generated by the GBS approach can now be easily analyzed using plenty of well-established bioinformatics pipelines to filter SNP markers with precision (Raman et al., 2014). High-density genetic maps constructed with such SNP markers are better than those built by traditional microsatellite markers (Mayer et al., 2012). SNPs of interest generated by the GBS platforms can be verified by converting them into PCR-based tags, followed by marker validation, which facilitates the genotyping for large populations without the need for repeating the whole GBS assay (Yang et al., 2017). Given the size of the barley genome (5.5GB), it is essential to optimize the sequencing protocols to overcome genome complexity. The primary focus of this review article is to put forward ideas about the GBS based protocols that are currently used and have the potential to be used for barley improvement in the future.

Barley Genomic Resources

Germplasm Collections

Barley germplasm collection, including cultivars, landraces, advanced breeding materials, genetic stocks, hybrids, induced mutant lines and wild relatives, established in various plant Genebanks, provides precious genetic resources for breeding and research programs worldwide (Table 1). The total number of accessions conserved in Genebanks can be found by browsing through online databases, such as FAO WIEWS¹ and Genesys.² Barley accessions from collected materials, ranks second among the cereal crops, after wheat (Kant et al., 2016). As of March 2022, in the world’s largest Svalbard Global seed vault, managed by Nordic Genetic Resource Center (NordGen), Sweden, there are 106,887 accessions of the genus Hordeum.³ The most extensive collection of barley germplasm accessions is in Plant Gene Resources of Canada, followed by the National Small Grains Germplasm Research Facility in the United States. Another main center of worldwide barley collection is in the International Centre for Agricultural Research in Dry Areas (ICARDA), Lebanon, which maintains wild and cultivated barley accessions originating from different regions across the globe (Kant et al., 2016). Barley genetic resources representing the South and East Asian regions are available in Japan. The University of Okayama in Japan maintains a subset of international barley core collection with 380 lines (Sato, 2020). Additionally, there are mapping populations registered to be used by researchers. For example, three different mapping populations viz., Steptoe × Morex doubled haploids (DHs), Harrington × Morex DHs and Oregon Wolfe Barley DHs (Kleinhofs et al., 1993; Marquez-Cedillo et al., 2000; Szűcs et al., 2009) developed under the North American Barley Genome Mapping Project (NABGMP) have been deposited in the Okayama University. These mapping populations are crucial for the development of high-density consensus genetic maps and identifying QTLs for prioritized traits in barley (Sato, 2020). Apart from these conventional approaches, researchers have produced a cross between cultivated barley and Hordeum bulbosum, a wild self-incompatible relative from the secondary gene pool of barley to develop a set of H. bulbosum introgression lines (Pickering, 1984; Pickering et al., 1995). These introgression lines harbor a wide range of resistant traits that have been used to identify resistant genes for various diseases like leaf rust (Yu et al., 2018), mild mosaic virus (Ruge-Wehling et al., 2006), yellow dwarf virus (Scholz et al., 2009) and powdery mildew (Hoseinzadeh et al., 2020).

TABLE 1

Table 1. Barley accessions present in major gene banks (FAO WIEWS, 2022; Genesys, 2022).

The research on barley mutations started in1928 (Ohnoutkova, 2019). The mutants produced are preserved at the Nordic Genetic Resource Centre and the United States Department of Agriculture (USDA) National Small Grain Collection (Lundqvist and Franckowiak, 2003). The mutants are grouped into different phenotypic categories, labeled with a UL prefix (Lundqvist and SvalöfWeibull, 2005), and can be accessed through the International Database for Barley Genes and Barley Genetic Stocks.⁴ This database currently lists 754 barley genetic stocks with detailed description. Several researchers have used these mutants in genetic studies and for gene identification (Komatsuda et al., 2007; Li et al., 2013, 2017).

Genomic Resources

Since the beginning of 21st century, EST sequencing projects have gained momentum in crops, including barley, allowing functional characterization of genomic sequences (Sato, 2020). High-resolution EST maps with 1,032 EST based loci and 2890 EST based loci were constructed by Stein et al. (2007) and Sato et al. (2009), respectively. The six-row American malting cultivar known as Morex (reference genotype) has been utilized to construct bacterial artificial chromosome (BAC) libraries, that have been helpful in creating the physical map with few gaps (Yu et al., 2000; Schulte et al., 2011). The International Barley Genome Sequencing Consortium (IBSC) was established in 2006 to construct a high-quality genome sequence by consolidating all the available genomic resources (Schulte et al., 2009), and later the Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)⁵ launched a search tool to run BLAST against the barley genome sequence compiled by IBSC. Similarly, the EnsemblPlants portal also allows researchers to browse and blast search on the target sequence. The first generation of chromosome-scale barley pan-genome assembly have been obtained from a core set of 20 diverse barley genotypes (Jayakodi et al., 2020) and this has given the researchers some insights on the effect of structural variants at single base resolution in barley evolution (Kamal et al., 2022). A complete transcriptome profile for the Morex reference genome, which is essential for the annotation of available genome assemblies, is also available, in addition to its genomic sequences. A reference transcript dataset (BaRTv1.0) consisting of 60,444 genes covering various organs and tissues was also developed (Rapazote-Flores et al., 2019) and a set of transcript sequences from RNA-Seq is catalogued in an open-access genome database barleyGenes.⁶ Recently, a barley expression database known as EORNA⁷ provided a single-window platform for researchers to visualize variations in gene expression profiles across different barley genotypes (Milne et al., 2021).

Genotyping by Sequencing

GBS has become a widely adopted high-throughput sequencing approach to mine the potential high value SNPs and small insertion/deletion variations in species with complex genomes, such as wheat and barley when compared to whole genome sequencing and the array-based SNP genotyping (Bajgain et al., 2016). Though SNPs are efficient, the application of whole genome sequencing for their detection is costly and cumbersome, especially in crops with a complex genome structure. A complex genome has highly repetitive regions, duplicative DNA sequence and polyploidy, posing problems during sequence assembly and SNP detection (Mardis, 2008; Mammadov et al., 2012). Among cereals, wheat (16 GB) and barley (5.5 GB) have large complex genomes with over 80% of repetitive elements (Wicker et al., 2009). Hence, sequencing has not always been a straightforward approach for these crops, and thus a higher sequencing depth is required to achieve SNP calling (Dou et al., 2012; Jighly, 2022). To overcome these challenges in SNP identification, the genome size reduction was introduced as a low-depth sequencing strategy (Huang et al., 2009; Scheben et al., 2017). Also, exponential growth in NGS have simplified GBS protocols to mine thousands of SNPs covering a substantial portion of the barley genome (Alqudah et al., 2020).

The reduced genome representation strategies being successfully adopted in barley are generally divided into two classes. The first class involves techniques that rely on restriction enzyme (RE) cut sites to generate fragments for the construction of sequencing libraries. Genome reduction based on RE cut sites ensures consistency in the sequenced portion of genome in individuals as RE sites are generally conserved across species (Chung et al., 2017). Moreover, the use of methylation-sensitive REs in GBS provides a high SNP coverage in gene-rich regions of the genome rather than in repetitive inter-genic regions (Fellers, 2008). For example, methylation-sensitive REs cannot cleave methylated cytosine residues, and gene-rich regions exhibit very low levels of cytosine methylation in plants (Zhang et al., 2010). In gene-rich regions, SNP markers are usually preferred due to the nature of unique sequences in genic regions and the probability of finding a desirable QTL in the vicinity (Pootakham et al., 2016). The second class of GBS for genome reduction is sequencing targeted regions in the genome, that could involve various approaches, such as multiplex PCR assay for genes or genomic subsets (Tewhey et al., 2009), hybridization-based sequence capture (Gnirke et al., 2009), molecular inversion probes (MIPs) (Hardenbol et al., 2005) RNA sequencing (RNA-Seq) for transcriptome profiling, exome capture to isolate the coding sequence (exon) variants, deoxyribonuclease I (DNase I)-hypersensitive site sequencing (DNase-Seq) to determine chromatin accessibility and chromatin immunoprecipitation sequencing (ChIP-Seq) to analyze protein-DNA interactions. Several target enrichment methods for NGS have also been developed, which provide the highest degree of control for the identification of targeted genomic regions for SNP discovery in a cost-effective and time-efficient manner.

The reduced genome GBS approaches based on REs can also be applied in species without a reference genome (de novo) and also without any prior SNP information (Rasheed et al., 2017). In de novo approaches, first, GBS sequence data is processed based on identical raw reads; a bioinformatic pipeline would then identify other highly similar reads that are probably from the same location. The rest of the sequence reads without high similarity are usually discarded from the analysis. However, in the presence of a robust reference sequence assembly, GBS data is directly aligned against the reference sequence, and most sequence reads could be used to discover polymorphisms (Kim et al., 2019). Currently, reference quality genomes are available for many crops, including barley, alongside extensive data from well characterized collections of SNPs (Chung et al., 2017; Alqudah et al., 2020); thus, making this approach widely applicable for crop genetics and plant breeding research, with high confidence (Figure 1).

FIGURE 1

Figure 1. Application of GBS approaches for various genetics and plant breeding studies. The high-quality SNPs derived from GBS are being used for diversity analysis, genetic map construction, genome-wide association studies and genomic selection.

Genotyping by Sequencing Based on REs

The first step in GBS is to construct a robust library, the most critical step before any sequencing process, which determines the overall quality and coverage of reads and affects the achieved sequencing depth. The construction of the GBS library involves two major steps, including the RE based genome reduction, followed by multiplexing samples using barcode adapters or oligos. This approach is simple, fast, unique and reproducible that can even reach the genomic regions of interest, which were previously inaccessible to sequence capture approaches (Elshire et al., 2011). The importance of restriction site associated genomic DNA sequencing for SNP discovery and genotyping was first demonstrated by Baird et al. (2008). Multiplex sequencing by adding inexpensive DNA barcodes in species with small genomes like rice (0.43 GB), with a genome size 10 times smaller than that of barley, was suggested by Craig et al. (2008). Since then, it has become a standard practice in most Illumina supported NGS applications. Simple multiplexing is easily accessible for crops with small genome sizes, whereas in crops with large genome sizes, using a combination of two techniques, involving RE for genome reduction and subsequent multiplexing is required to achieve high-throughput and reliable sequencing (Elshire et al., 2011). The RE targets low-copy genomic regions, thereby minimizing repetitive sequences from sequence reads. The GBS based on REs has several versions according to the number of restriction enzymes used, along with the type of adapters ligated to DNA fragments (Figure 2).

FIGURE 2

Figure 2. Schematic overview of Restriction Enzyme based GBS methodology. (A) Tissue sample collection is followed by DNA isolation from the crop. (B,D,F) DNA digestion by; One enzyme, ApeKI, that makes a cut to produce overhangs (B), classical Two enzyme, a rare cutter PstI and a common cutter MspI produces overhangs (D), and new version of Two enzyme called tGBS, using a different set of restriction enzymes, NspI and BfuCI capable of better digestion produces overhangs in opposite directions (F). (C,E,G) Adapter ligation to the digested sample; One enzyme, ligation of barcode and common adapter (C), classical Two enzyme, ligation of forward barcode and reverse Y adapter (E), tGBS, ligation of barcode and universal oligo. Nucleotide sequence in red possesses the matching bases in selective primer for target specificity (G). (H) Multiplexing; a unique barcode used for each accession allows the pooling of DNA samples before the amplification step. (I) The mixed libraries, uniquely barcoded, and amplified samples will be run in a next-generation sequencer. The sequence reads will be analyzed in various bioinformatic pipelines based on the availability of reference sequence.

One Enzyme GBS

One enzyme GBS approach is one of the first-generation techniques for genome reduction developed by Elshire et al. (2011), with a straightforward protocol widely used in breeding and trait mapping (Goddard et al., 2019). This method was first applied in both maize and barley, with the selection of REs as a key factor in determining coverage. A methylation-sensitive 5 bp cutter, ApeKI was suggested as an ideal RE, which significantly reduced the repetitive sequence in barley and maize genomes (Figure 2B). The overhangs generated by the RE would anchor the adapter ligation to the intended DNA sequence. A double-stranded universal and barcoded adapter were ligated to digested DNA samples (Figure 2C). A different barcoded adapter is considered for each sample, allowing multiplexing of individuals per sequencing lane during a single run to significantly reduce genotyping cost. The adapters were designed to avoid restriction enzyme recognition sites and no regeneration of the recognition site should occur after ligation of genomic DNA fragments. Unlike earlier RE based library construction, where RE generated DNA fragments were too large for sequencing; thus, requiring further fragmentation and size selection, one enzyme GBS generates fragments suitable for NGS. This allows digestion and adapter ligation to proceed in a single well and direct sequencing (Elshire et al., 2011; Wickland et al., 2017). Digested fragments from the libraries were amplified in a PCR reaction with primer sets that are complementary to the ligated adapters before sequencing. The sequencing of the GBS library can be performed in a variety of NGS platforms, such as Illumina Genome Analyzer II (48/96 plex). Using the one enzyme approach, 24,186 sequence tags were mapped onto the Oregon Wolfe Barley genetic map (Elshire et al., 2011). With the consensus read sequence associated with restriction sites as reference from this GBS approach, the mapping can be performed to uncover genetic variations even without a reference genome. In the one enzyme GBS approach, a non-uniform library was constructed due to biases associated with a single RE targeting a single cut site, and there was a scope for further reduction in genome complexity.

Two Enzyme GBS

To construct a library with a greater degree of genome reduction and uniformity, the two enzyme GBS approach was introduced as a successor, which was successfully demonstrated in both barley and wheat (Poland et al., 2012). In this approach, a rare cutter with six base recognition sites (PstI) in combination with a common cutter with four base recognition sites (MspI) were used. Two adapters were designed for PstI and MspI restriction overhangs (Figure 2D); the forward adapter with a barcode for the rare cut site and the universal reverse Y adapter for the common cut site (Figure 2E). In this approach, each fragment in the library has two different adapters, forward and reverse, in the same orientation as the cut sites. The digestion by two enzymes would result in DNA fragments with both rare and common cut sites whereas, fragments with only common cut sites and rare cut sites would be in extremely low frequency. To avoid the amplification of fragments with two common cut sites and adapter dimers, the universal Y shaped adapter was designed as a reverse adapter. The Y adapters introduced by Illumina ensures that each DNA fragment is only ligated to different adapter pairs (Bentley et al., 2008). During the first round of PCR amplification, only the forward primer can anneal to the forward barcode adapter, whereas the reverse primer cannot anneal during this time as the tail sequence of the reverse Y adapter is not a complement of the reverse primer. The complementary sequence of the reverse primer was only generated at the end of the first round of PCR, ensuring the DNA amplification of only PstI–MspI fragments. The fragments with the same adapters on each end of the fragment would inhibit the formation of library clusters on the sequencing flow cell as they cannot undergo bridge amplification, leading to the loss of sequencing efficiency for segments. The two enzyme approach could ensure the uniformity of sequences from forward to reverse direction but not vice versa, which led to an increase in efficiency while representing the subset of a whole genome. The analysis of 82 Oregon Wolfe Barley DH populations using the Illumina Genome Analyzer II (48 plex) identified approximately 34,000 SNPs and 24,159 tags with less than 20% missing data (Poland et al., 2012).

Both one enzyme and two enzyme GBS approaches have been frequently used for linkage mapping analysis to identify QTLs for agronomic traits (Scheben et al., 2017) due to their simplicity, efficiency and cost effectiveness. However, these approaches produced highly skewed coverage of genomic positions (Beissinger et al., 2013). Moreover, they produce low read depth per locus when common cutting REs are used to produce small fragments (Pootakham et al., 2016) and fragmentation is confined to RE sites, with some genomic regions not being sequenced, it leads to more missing data (missing a genotype value at loci). Low read depths are likely responsible for the ineffective representation of heterozygous loci (Torkamaneh et al., 2016). To correct the errors associated with heterozygous loci, SNP calls from these approaches were considered as missing data or as dominant markers during GBS data analysis. However, these errors do not cause major problems in barley or wheat since they are self-pollinated. The GBS missing data had a significant impact during genetic mapping, as specific SNPs were placed in more than one position on a map. Such variable proportions of shared loci would result in low call rates per sample (Dacosta and Sorenson, 2014). One way to deal with missing data is to avoid SNP loci with missing data altogether; however, this would lead to reduction in the number of available SNPs. Another viable option is to use data imputation, in which a predicted allelic dosage would be substituted with the missing data (Torkamaneh and Belzile, 2015). It is interesting to note that though both one and two enzyme GBS approaches appear to have certain drawbacks associated with low read depth, such as missing data and high error rates at heterozygous loci. However, their robustness would increase with the availability of more efficient imputation algorithms to fill in missing data where the algorithms can function well in the presence or absence of reference genomes (Manching et al., 2017; Munyengwa et al., 2021).

Tunable GBS

The shortcomings of both one and two enzyme approaches were addressed by the most recent GBS technology, known as tunable GBS (tGBS^®). This approach allows a researcher to manipulate the number of targeted genomic sites by merely modifying a single primer in the protocol. Such flexibility in tGBS libraries resulted in higher read depths for each target site compared to the previously mentioned GBS protocols (Ott et al., 2017). Two new sets of REs capable of generating overhangs in opposite directions on the same target strand were employed: a 4 bp cutter, BfuCI, producing a 5′ overhang, and a 5 bp cutter, NspI, generating a 3′ overhang. These REs achieved a better representation of predicted target sites, and most of the restriction sites were digested in maize (Ott et al., 2017). Two complementary oligos (single-stranded DNA) would be ligated; the oligo complementary to the 3′ overhang possesses a sample-specific barcode sequence, whereas the oligo complementary to the 5′ overhang is universal (Figures 2F,G). After ligation, the samples were subjected to two PCR steps namely, selective PCR and final enrichment PCR. Two primers (selective and non-selective) were specifically designed to ensure their specific annealing to genomic fragments produced during RE digestion. The selective primer was designed to be the reverse complement of the ligated universal oligo, and it can extend up to three nucleotides in the insert sequence at the 3′ end, referred to as selective bases. These selective bases match the digested fragment sequence and enable the primer to precisely select target sites. The non-selective primer was designed to match the region preceding the barcode present in the oligo at the 5′ end. This strategy of ligating unique oligos to overhangs ensure that only the adapter ligated double digested fragments enter the sequencing phase. The replacement of double-stranded adapters with single-stranded oligos in the tGBS^® approach enhanced the reliability of library preparation process, as the accurate quantification of oligos is much easier in this method than the quantification of adapters in previous methods. During the final PCR, the sequencing platform-specific primers gets annealed to produce the sequencing library (Ott et al., 2017). To assess the efficiency of tGBS^®, a study was conducted using both the maize inbred and segregating populations, and sequencing of the constructed library was performed on the Ion Torrent platform. The results revealed that the tGBS^® approach was highly efficient in many aspects, especially for the accuracy of SNP calling (>97–98%) at both homozygous and heterozygous sites, compared to the previous GBS approaches. The same DNA library from tGBS® was sequenced in Illumina platform for comparative analysis by using Illumina-specific oligos, which resulted in a similar level of accuracy as reported with the use of the Ion Torrent platform (Ott et al., 2017).

Nano-GBS

The advancement of sequencing technologies reduced the cost of high-throughput genotyping (Wetterstrand, 2021); however, the construction of GBS libraries is still the costly phase. A miniaturized protocol for GBS library construction, known as nano-GBS was developed with a reduced cost (Torkamaneh et al., 2020). The new improvisation exploited the non-contact liquid transfer technique (acoustic droplet ejection), which facilitated liquid transfer on a nano-liter scale. The study involved nine libraries constructed with three different combinations of REs (ApeKI, PstI/MspI, and SbfI/MspI) and three different multiplexing levels (96 plex, 384 plex, and 768 plex). The library preparation protocol involving one enzyme (ApeKI) and two enzyme (PstI/MspI and SbfI/MspI) was based on the works by Elshire et al. (2011) and Abed et al. (2019), respectively. The cost of sequencing was reduced by more than half for genotyping of 96 soybean lines in Ion torrent platform using nano-GBS. Out of three RE combinations, PstI/MspI produced sufficient number of SNPs with a low percentage of missing data. However, the nano-transfer, automation, and the use of multiple combinations of REs are yet to be standardized in many species. The initial installation cost for the nano-transfer equipment is also very high, and the wider adoption of this method could be limited.

Genotyping by Sequencing Based on Sequence Enrichment

The target enrichment approach offers more specificity and replicable results than the RE based GBS approach (Malmberg et al., 2018). However, prior knowledge of genomic regions of interest is required to design suitable primers/probes for this approach. The most straightforward targeted sequencing protocol for enrichment of small sized genomic targets is the PCR-based method. Moreover, this method provides high efficiency when the targeted regions have highly conserved sequences across cultivars at the primer binding sites (Kaur and Gaikwad, 2019). Nowadays, multiplex PCR products are used to exploit the high-throughput nature of NGS; however, the primer design for a large genome is cumbersome, and amplification often requires specific commercial DNA polymerase enzymes capable of amplifying long genomic DNA (~30 kb) (Ostezan et al., 2021). Additionally, target enrichment can be achieved through hybrid capture by using oligonucleotide probes complementary to the genomic area of interest (Fu et al., 2010). It is also a technically demanding and time-consuming method, which requires library construction before hybridization. The efficiency of this method drops drastically when the targeted regions are small, and the increase in sample size makes the process more time-consuming (Niedzicka et al., 2016). The two methods based on PCR and hybrid capture were combined to develop a hybrid protocol known as molecular inversion probes (MIPs), overcoming the deficiency in both methods. MIPs are single-stranded DNA molecules containing sequences at both ends that are complementary to regions flanking the target sequence and linked by a highly repetitive sequence (linker sequence). First, the hybridization of MIP to the target sequence takes place, followed by gap filling with dNTPs and ligation to form a circular DNA molecule (Figure 3). This molecule is used as a template for PCR reaction using the primers complementary to the linker sequence. Apart from the primers, sequencer-specific adapters and index sequences are also introduced during the PCR process. Now, the amplicons were sequenced for SNP detection. MIPs can target genomic regions ranging in size from 1 kb to 5 Mb, and they are more suited for targeted resequencing of thousands of short genomic regions in species with even partial genomic-level information (Niedzicka et al., 2016). These targeted sequencing approaches based on PCR, hybrid capture and MIPs have limitations as they do not detect any novel variants outside the targeted region and fail to differentiate similar genomic regions (Deschamps et al., 2012; Winfield et al., 2012).

FIGURE 3

Figure 3. Schematic representation of Molecular Inversion Probe to capture a specific target for sequencing. (A) Model structure of a molecular inversion probe (MIP) with a ligation and extension arm connected by a 30 bp linker. Both ligation and extension arms are designed to complement the target region. (B) These complementary arms at the end of linker pairs with the target region, which is followed by gap filling and ligation (Circular DNA molecule). (C) Digestion of exogenous host DNA and probes with the help of exonucleases. (D) Amplification of the captured segments using the universal primers complementary to the linker sequence of MIP along with sample-specific index sequences and sequencer-specific adapters. (E) Libraries will go through the sequencing followed by data curation.

RNA-Seq capturing the whole transcriptome, is another targeted sequencing approach that can reliably identify genetic variations in genic regions (Figure 4). It has been a reliable method for evaluating the potential genome and transcriptome level polymorphisms for genome-wide association studies. The major advantage of RNA-Seq is that no pre-requisite knowledge of genome sequence is required. However, the over-representation of abundant transcripts and challenges in the detection of sequences with low expression level remains a roadblock while assembling gene expression profiles (Ostezan et al., 2021), and splice variants can also sometimes add to the complexity. The assembly of these transcripts can be improved by adopting targeted RNA-Seq, which focuses only on the amplification of the genes of interest by designing probes complementary to the targeted exons (Levin et al., 2009; Mercer et al., 2014). This could also reduce the cost as there is no need for whole transcriptome sequencing (Arts et al., 2017). The whole exome capture (Figure 5) approach, using probes designed from target genic regions to hybridize genomic DNA, resolved the limitations related to the proportion of transcripts present or the crop’s developmental stage in RNA-Seq based approach (Kaur and Gaikwad, 2017). The selective sequencing of exome can be performed by three approaches: PCR-based capture, hybrid capture and array-based capture. One well-known example of selective exome capture is about a specific gene family with a nucleotide binding site/leucine-rich repeat (NLR) domain that triggers signaling in plants during pathogen invasion. NLR baits were designed to study this gene family and its molecular role during the plant-microbe interaction (Dinh et al., 2020).

FIGURE 4

Figure 4. Schematic representation of RNA-Seq to sequence the transcriptome. (A) Poly A tailed messenger RNA (mRNA) is isolated among other RNA types using oligo dT magnetic beads. (B) Double-stranded complementary DNA (cDNA) library is constructed by employing reverse transcriptase on the isolated mRNA. (C) Enzymatic fragmentation of double-stranded cDNA is carried out to construct uniform library followed by end repair and an A nucleotide is added-to facilitate adapter ligation. These fragments are subjected to PCR amplification. (D) The amplicons are sequenced and differentially expressed genes are identified by mapping the reads with a quality reference sequence comparing the contrasting genotypes.

FIGURE 5

Figure 5. A generalized flowchart for exome capture to isolate the exon variants. It can be broadly divided into two phases viz., target enrichment and DNA sequencing. In enrichment phase high-quality genomic DNA is isolated and specific probes are designed for hybridization followed by capturing of hybridized probes. In sequencing phase, the raw exome sequence data is filtered and aligned to find potential variants.

DNase I hypersensitive sites (DHSs) in the genome usually harbors cis-regulatory DNA elements (CREs). CREs interact with transcription factors and guide the transcription process (Wang and Wang, 2021). DHSs are sensitive to the endonuclease DNase I, which can cleave these regions. The digested segments are used for sequencing to identify the important CREs and TFs (Mathelier et al., 2015). DHS mapping combined with NGS is popularly known as DNase-Seq (Figure 6). Another targeted region sequencing approach, which has regulatory importance is the chromatin immunoprecipitation sequencing (ChIP-Seq) that identifies the binding sites of DNA associated proteins. It refers to mapping of the binding proteins and their bio-chemical modifications across the genome, which play a vital role in replication, transcription, RNA processing, and DNA repair mechanisms (Furey, 2012). The unique feature of this protocol is that it involves the initial crosslinking step to fix the protein-DNA binding in vivo, allowing the capture of native interactions. Antibodies specific to the protein of interest was used to immuno-precipitate the crosslinked DNA and protein. The extracted fragments were sequenced, and the resulting reads were mapped to the genome (Figure 7).

FIGURE 6

Figure 6. A schematic representation of DNase Seq to determine chromatin accessibility. (A) Nuclei isolation followed by DNase I digestion is a critical step to collect the chromatin. (B) Targeted fragments will be selected and isolated from gel. (C) Library construction followed by next-generation sequencing and data analysis.

FIGURE 7

Figure 7. An overall workflow for ChIP Seq to analyze protein interactions with DNA. (A) Isolation of genomic DNA with conserved binding proteins. (B) Crosslinking and DNA fragmentation to access the DNA-protein complex. (C) Immunoprecipitation with protein-specific antibodies to separate the DNA-protein complex. (D) DNA purification is followed by adaptor ligation to prepare the sequencing library. (E) Discovery of DNA biding segments using next-generation sequencing platform. (F) Data analysis and alignment to reference genome will identify targeted DNA sequences that interact with the protein. This figure is adapted from “ChIP sequencing,” by BioRender.com (2022). Retrieved from https://app.biorender.com/biorender-templates.

SNP Variant Calling

The analysis of GBS sequence data can be complex; hence, there is a need for bioinformatic pipelines with advanced computational efficiency to sort, classify based on sequence barcodes, filter out poor-quality reads, score individuals based on their genotypes, and align the sequenced reads to the given reference sequence for SNP variant calling (Torkamaneh et al., 2016).

Bioinformatic pipelines for SNP variant calling are mainly categorized into two groups: with or without a reference genome. The commonly used pipelines which require a reference genome for data analysis are TASSEL-GBS (v.1 and v.2), IGST, and Fast-GBS, whereas the pipeline that does not require a reference genome is UNEAK (Bradbury et al., 2007; Lu et al., 2013; Sonah et al., 2013; Glaubitz et al., 2014; Torkamaneh et al., 2017). The pipeline Stacks can be used with or without a reference genome for SNP detection (Catchen et al., 2013), although the confidence of allele calling varies. The UNEAK and Stacks pipelines would identify pairs of nearly identical reads, which could potentially represent alternative alleles at a single locus (Catchen et al., 2013; Lu et al., 2013). The TASSEL-GBS or Fast-GBS can map sequencing reads onto the reference genome to identify SNPs, which is a more robust approach (Li and Durbin, 2009; Nielsen et al., 2011). The imputation procedure of missing data generated from GBS is more accurate when a reference sequence is present (Torkamaneh and Belzile, 2015). Therefore, TASSEL-GBS has been widely utilized in association analysis in barley for which the reference genome is available (Pasam et al., 2012; Scheben et al., 2017). Also, it can handle large volumes of low coverage data (Scheben et al., 2017). Although, TASSEL-GBS has been used in many studies with barley to generate SNPs, the alternative Fast-GBS is emerging as a more powerful pipeline providing more thorough data analysis. The efficiency of Fast-GBS was demonstrated in soybean lines by comparing with other pipelines and sequencing platforms. The study found that the SNP calls from a single sequencing technology using different pipelines (Fast-GBS, Stacks, UNEAK, TASSEL-GBS, IGST) had a common SNP overlapping percentage ranging from 72 to 92%, whereas very low overlapping percentage ranging from 50 to 70% was observed while using a single pipeline with the data generated from two different sequencing technologies (Illumina and Ion Torrent). Such variations in overlapping percentage occur due to different variant calling algorithms and read mapping in each pipeline (O’Rawe et al., 2013). Fast-GBS is more flexible and accurate than other pipelines such as, TASSEL-GBS v.2 and Stacks, as demonstrated in soybean (Torkamaneh et al., 2016). In Fast-GBS, various data filtering parameters such as, quality scores for reads, number of reads required, and missing data allowed, can be altered based on the requirements of the study, irrespective of the sequencing platform used. This flexibility makes Fast-GBS less prone to errors. In barley, Fast-GBS mined out more SNPs than the UNEAK pipeline and it has proved to be far more user-friendly (Abed et al., 2019). In addition to these pipelines, researchers have also been using several inhouse bioinformatic pipelines using different programming languages.

Irrespective of the pipelines used, it is necessary to always perform additional quality checks and filtration on datasets (Abed et al., 2019). A well-defined protocol for filtering high-quality data using variant call format (VCF) tools has been demonstrated in crops, including barley (Danecek et al., 2011; Abed et al., 2019). For filtering, parameters such as, heterozygosity, sequencing depth, SNP quality, call rate, missing data percentage and minor allele frequency range (false SNPs) are usually preferred. In order to obtain a complete SNP catalogue, missing genotypic data can be inferred by imputation (Abed et al., 2019) or it can be excluded from downstream genetic analysis (Nazzicari et al., 2016). For data imputation, several software programs are available, including BEAGLE (Browning and Browning, 2007), FImpute (Sargolzaei et al., 2014), and LinkImpute (Money et al., 2015). BEAGLE and FImpute are the most commonly used packages, which exploit the linkage disequilibrium, haplotype information, and pedigree relationships to achieve accurate imputation of missing data (Shi et al., 2017). LinkImpute is optimized for the imputation of missing data for unrelated germplasm; thus, selection of the statistical software package for imputation is based on the objective of the study and the type of germplasm used. The fastPHASE package was used in barley to impute missing data of a low-density experimental panel based on data from a high-density reference panel (Iwata and Jannink, 2010). Data imputation is only optional, and any necessity to do it only depends on the research objectives.

Applications of GBS in Barley

GBS technologies have allowed breeders to access a greater range of genetic variations to construct genetic maps, conduct linkage and genome-wide association studies (GWAS), execute genomic selection (GS), and assess genetic diversity in germplasm (Table 2).

TABLE 2

Table 2. Practical application of GBS in barley.

Genetic Mapping and QTL Detection

GBS platform has enhanced the ability to produce high-density molecular maps in cereals, a pre-requisite for trait-marker linkage studies (Chung et al., 2017). The development of genetic materials and effective implementation of high-throughput phenotyping will remain the main challenges in using this genomic data for crop improvement. High-density molecular maps combined with precise phenotype from bi-parental and multi-parental populations, has allowed to uncover the genetic architecture underlying Mendelian traits and complex traits by genetic mapping analysis (Poland and Rife, 2012). For agronomic traits, GBS based genetic maps were generated in barley recombinant inbred lines to identify three QTLs for plant height on chromosome 2H, 3H and 5H, where the QTL on 2H was positioned near a locus conferring for spike architecture (Vrs1), and the QTL on 5H was in close proximity to a dwarfing gene locus (Ari-e) (Liu et al., 2014). Similarly, DH lines developed from reciprocal crosses involving three parents were utilized to identify 17 and 18 QTLs for grain plumpness and yield, respectively, under drought-prone environments (Obsa et al., 2017). With a high-density genetic map of 3,662 SNP markers generated by GBS approach, a total of five loci and a regulatory factor related to flavonoid synthesis were identified for the economically important purple seed coat trait (Yao et al., 2018).

For disease resistance, various GBS based genetic maps have successfully identified critical QTLs related to powdery mildew and spot blotch. A RIL population with a genetic map length of 1,000 cM was used to identify four candidate genes for powdery mildew resistance on chromosome 2H (Hoseinzadeh et al., 2019). In another study, a susceptible QTL on chromosome 1H for powdery mildew resistance was identified by GBS approach (Goddard et al., 2019). Two QTLs on chromosome 1H and 7H for spot blotch susceptibility were identified from a high-density map and the QTL on chromosome 1H (Qsbs-1H-PI) was found to be a novel allele (Leng et al., 2020).

The one enzyme GBS approach (Elshire et al., 2011) was used to develop high-density genetic maps for the DH population of Oregon Wolfe Barley and Morex × Barke. These genetic maps have been used as a genetic framework to develop the barley physical map, a reference assembly for various breeding and genetic research (Mayer et al., 2012). Similar reference quality anchoring of SNP related contigs was also carried out, which increased the genetically anchored contigs by three times (Mascher et al., 2013).

Genome-Wide Association Studies

GWAS can provide high-resolution mapping by making use of multiple recombination events over many generations (Yu and Buckler, 2006). It is considered as one of the most powerful tools for identifying marker-trait associations using large populations. GWAS have high resolution due to the exploitation of historical recombination events that are limited in bi-parental populations (Zheng et al., 2008). One of the most significant setbacks in GWAS could be the need for large numbers of markers (in millions), depending on the extent of linkage disequilibrium, but GBS is well suited to provide the required high-density markers (Chung et al., 2017). However, for self-pollinated crops like barley and wheat, a few thousand SNPs are usually sufficient to cover the whole genome, given that they have large linkage disequilibrium blocks (Jighly et al., 2015). Another challenge for GWAS is the occurrence of false positives, especially when dealing with complex population structures. However, these false positives can be corrected to some extent by fitting the population structure as covariates using different models like the mixed linear model, which is capable of handling a single locus at a given time, whereas models such as, ISIS EM-BLASSO (Tamba et al., 2017), LASSO (Xu et al., 2017) and FarmCPU (Liu et al., 2016) are capable of handling multiple locus, simultaneously (Kaler and Purcell, 2019). Despite these challenges, GWAS remains a powerful method to understand the genetic architecture of traits.

Plant Genetics and Crop Plant Research institute in Germany (IPK) applied a two enzyme GBS approach to identify genomic regions conferring row type, hull adherence, awn roughness on a barley core collection. With the GWAS approach, QTL for awn roughness and lateral spikelet fertility were identified precisely on chromosome 1H, 5H and 7H (Milner et al., 2019). A study was conducted using GBS data from advanced barley lines and found that adopting an integrated approach involving both single and multi-locus GWAS had a better performance while detecting QTLs for complex traits (Abed and Belzile, 2019).

Genomic Selection

Genomic selection (GS) is a rapidly evolving approach in breeding to predict the genetic value of individuals for selection. This technique was developed to tackle the challenges occurring in both linkage mapping and GWAS while dealing with complex traits particularly governed by minor QTLs (Srivastava et al., 2020) and traits showing high genotype by environment interaction (Jighly et al., 2021). In other words, GS is a more comprehensive version of marker-assisted selection (MAS), where DNA markers covering the whole genome are utilized to select superior genotypes. Once the genomic selection model is established, genotypic data can be used to select for or against lines without phenotyping (Heffner et al., 2009). In GS, a prediction model is developed by using both phenotypic and genotypic information collected from a training or reference population, and then this model is validated by independent testing populations. The double-checked trained population is utilized to formulate a standardized genomic estimated breeding value (GEBV), estimated as the sum of all the genotyped marker effects. The standard GEBV value of the trained model can be used to calculate the GEBV of an untrained population for selection without phenotyping (Contaldi et al., 2021). The high-density, genome-wide data achieved by GBS method is well suited to calculate the sum of marker effects. The genetic maps constructed using GBS provide a guide for the genomic selection of economically important traits and expand the capacity to detect genetic loci with minor effects on a phenotype. Raw GBS reads were retrieved from the European Nucleotide Archive, and 532,253 pan-genome sequence anchors were identified based on the presence/absence of tags. These tags are essential to identify desirable genomic regions within in a species and this information can be effectively used in genomic selection and improvement of barley (Gao et al., 2020).

Diversity Studies

Analysis of DNA sequence variation between individuals is the most efficient way to study the genetic distance among them. This study allows comparisons across different species, or different lines of the same subspecies. Genetic diversity studies based on GBS simplify the complex alignment and related computational challenges that researchers often encounter while working on species with high genetic diversity (Elshire et al., 2011). The genetic relationship and geographical distribution of 16 diverse barley landraces were analyzed using the Roche 454 GS FLX titanium technology and GBS. The application of NGS has provided new information on the available barley genomic resources and revealed the diversity present between the barley landraces from both eastern (Zagros mountains and further east) and western (Fertile Crescent and further west) regions (Fu and Peterson, 2011). In another study, GBS data derived from 21,405 accessions from IPK collection were utilized to understand the population structure of domesticated barley and selected a core set of 1,000 genotypes (Milner et al., 2019). The same core set was used to evaluate the efficiency and effectiveness of both GBS and SNP array platforms. Both platforms were equally good in detecting informative SNPs; however, GBS held an edge over SNP array for the detection of rare alleles in the germplasm collection (Darrier et al., 2019).

Conclusion

In the ever-evolving field of genomics, GBS platforms have taken a special place among other sequencing techniques to deliver high-density molecular maps, which are the essential pre-requisite for many advanced breeding studies, such as GWAS and GS, at reduced cost. The concept of genome reduction is the central principle underlying the wider acceptance of GBS, which is highly applicable in crops with large complex genomes, including wheat and barley. GBS works well with crops without any prior reference sequence information, but the availability of a high-quality reference sequence would increase the accuracy manifolds during sequence data curation and allow access to a higher proportion of sequence reads. In the future, standardization of multiple combinations of REs to construct libraries will provide opportunities to increase sequencing coverage. With regard to data analysis, GBS data generated from the existing protocols can be used in the future with the evolution of the computing power of GBS pipelines. The target enrichment based GBS methods have also gained traction; however, the technical complexity associated with these techniques could act as a hurdle in their wider adoption. Moreover, in barley, robust QTL, GWAS and GS studies have been conducted using the data generated from various platforms including GBS; nevertheless, meta-analysis presenting results by combining data from these individual studies are still very limited and need to be extended to encompass a broader scope in coming years. In the wake of climate change, it is now imperative to overcome the bottlenecks in barley breeding by exploiting every advance made in the post-NGS era.

Author Contributions

NR, NQ, and MP contributed to the conception and design of the manuscript. NR wrote the manuscript drafts. NQ and MP revised the manuscript and compiled a final draft. All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.

Funding

This work was supported by the School of Agriculture and Food at the University of Melbourne to MP.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We are grateful to Melissa Bain and Abdulqader Jighly for their critical reading and valuable comments on this manuscript. We thank the Research Training Program (RTP) International PhD scholarship for providing financial support to NR. All the figures were created with BioRender.com.

Footnotes

1. ^https://www.fao.org/wiews/data/domains/monitoring-framework/en/

2. ^www.genesys-pgr.org

3. ^https://seedvault.nordgen.org/Search

4. ^https://www.nordgen.org/bgs/

5. ^http://plants.ensembl.org/Hordeum_vulgare

6. ^https://ics.hutton.ac.uk/barleyGenes/

7. ^https://ics.hutton.ac.uk/eorna/index.html

References

Abed, A., and Belzile, F. (2019). Comparing single-SNP, multi-SNP, and haplotype-based approaches in association studies for major traits in barley. Plant Genome 12:190036. doi: 10.3835/plantgenome2019.05.0036

Genotyping by Sequencing Advancements in Barley

Introduction

Barley Genomic Resources

Germplasm Collections

Genomic Resources

Genotyping by Sequencing

Genotyping by Sequencing Based on REs

One Enzyme GBS

Two Enzyme GBS

Tunable GBS

Nano-GBS

Genotyping by Sequencing Based on Sequence Enrichment

SNP Variant Calling

Applications of GBS in Barley

Genetic Mapping and QTL Detection

Genome-Wide Association Studies

Genomic Selection

Diversity Studies

Conclusion

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

Acknowledgments

Footnotes

References

94% of researchers rate our articles as excellent or good