- 1Key Laboratory of Tobacco Improvement and Biotechnology, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, China
- 2Zhengzhou Tobacco Research Institute, China Tobacco Gene Research Centre, Zhengzhou, China
- 3School of Agriculture, Yunnan University, Kunming, China
Agronomic traits such as plant height (PH), leaf number (LN), leaf length (LL), and leaf width (LW), which are closely related to yield and quality, are important in tobacco (Nicotiana tabacum L.). To identify quantitative trait loci (QTLs) associated with agronomic traits in tobacco, 209 recombinant inbred lines (RILs) and 537 multiparent advanced generation intercross (MAGIC) lines were developed. The biparental RIL and MAGIC lines were genotyped using a 430 K single-nucleotide polymorphism (SNP) chip assay, and their agronomic traits were repeatedly evaluated under different conditions. A total of 43 QTLs associated with agronomic traits were identified through a combination of linkage mapping (LM) and association mapping (AM) methods. Among these 43 QTLs, three major QTLs, namely qPH13-3, qPH17-1, and qLW20-1, were repeatedly identified by the use of various genetically diverse populations across different environments. The candidate genes for these major QTLs were subsequently predicted. Validation and utilization of the major QTL qLW20-1 for the improvement of LW in tobacco were investigated. These results could be applied to molecular marker-assisted selection (MAS) for breeding important agronomic traits in tobacco.
Introduction
Tobacco (Nicotiana tabacum L.) is an economically important species, and it is widely planted worldwide (Xiang et al., 2016). Tobacco leaves are made into various kinds of tobacco products for human consumption. Leaf-related traits such as plant height (PH), leaf number (LN), leaf length (LL), and leaf width (LW) determine not only tobacco yield but also the quality and marketability of tobacco products (White et al., 1979; Xu et al., 2000). Improvements to these important agronomic traits are the primary targets of tobacco breeders.
These important agronomic traits are considered typical quantitative traits regulated by multiple genes, and they are affected by numerous environmental factors (Xu et al., 2000; Xiao et al., 2006). To improve these traits, the use of quantitative trait locus (QTL) mapping together with molecular marker-assisted selection (MAS) is a good strategy (Mohan et al., 1997; Wang and Guo, 2010). The development of molecular markers that are tightly linked to these important traits greatly increase the efficiency of breeding ideal cultivars. For a long time, QTL mapping and the development of markers in tobacco were mainly performed on the basis of simple sequence repeat (SSR) markers (Gholizadeh et al., 2012; Tong et al., 2016). Several QTL mapping studies of tobacco have been reported, including studies on resistance to diseases (e.g., black shank, brown spot, and powdery mildew) (Stavely et al., 1984; Shah et al., 2018; Sun et al., 2018; Zhang et al., 2018) as well as to the yield and quality of tobacco (e.g., agronomic traits and chemical components) (Julio et al., 2006; Vijay et al., 2010; Tan et al., 2012; Cheng et al., 2015). However, to date, only a few studies involving QTL mapping of important agronomic traits of tobacco have been reported. Using SSR markers based on a recombinant inbred line (RIL) population, Tong et al. (2012) performed QTL mapping of seven agronomic traits. A total of three dynamic QTLs related to the number of leaves and leaf area were identified during the three developmental stages by the F2 population (Song et al., 2019).
Traditional genetically different populations for QTL mapping are constructed via two parents exhibiting different target traits and include F2, backcross (BC), doubled haploid (DH), and RIL populations. However, these populations based on biparental populations have several limitations. First, only two alleles between two parents can be compared, while the diversity of target trait alleles with the germplasm is ignored. Second, because recombination is based on F1 meiosis, and most progenies harbor large fragment recombination, the accuracy of QTL detection is limited (Kearsey and Farquhar, 1998).
Compared with biparental populations, multiparent advanced generation intercross (MAGIC) populations offer significant advantages in terms of analyzing multiple alleles and providing increased recombination and mapping resolution (Cavanagh et al., 2008; Holland, 2015). To date, MAGIC populations have been widely used in QTL mapping in crop species (Bandillo et al., 2013; Li et al., 2013; Meng et al., 2016; Bossa-Castro et al., 2018; Ogawa et al., 2018). The first application of the MAGIC population on plants was reported in 2009 for Arabidopsis thaliana when a QTL associated with flowering time was detected using a 19-way MAGIC population (Kover et al., 2009). In rice, different types of MAGIC populations were constructed to assess QTLs for important traits such as biotic/abiotic stress, yield, and quality traits (Bandillo et al., 2013; Meng et al., 2016). In Solanaceae crop species, two eight-way MAGIC populations were constructed, and QTLs related to agronomic traits and resistance to biotic stress were identified (Pascual et al., 2014; Campanelli et al., 2019). However, the construction and utilization of MAGIC populations in tobacco have not been reported.
In this study, the biparental population and the first eight-way MAGIC population of tobacco were produced. The QTLs associated with PH, LN, LL, and LW were identified by the use of the biparental RIL population and the eight-way MAGIC population across different environments. Using a BC population and 211 tobacco accessions, we further verified the genomic location of the major QTL. These results add to the knowledge concerning the genetic basis of different agronomic traits and offer an opportunity to improve target traits via MAS in tobacco.
Materials and Methods
Plant Material
The cigar line Beinhart1000-1 (hereafter referred to as BH) from the USA and the flue-cured tobacco variety Xiaohuangjin1025 (hereafter referred to as XHJ) from China were used as parental lines for a biparental population. Two hundred and nine F8 RILs were developed by the single-seed descent method. Eight different types of tobacco accessions were used to construct an eight-way MAGIC population. BH and Florida301 are cigar-type tobacco varieties. Vam is a burley type tobacco. Basma and Samsun are oriental-type tobacco varieties. Xiaohuaqing (hereafter referred to as XHQ) and Tangpeng (hereafter referred to as TP) are sun-cured tobacco varieties from China. Honghuadajinyuan (hereafter referred to as HD) is a flue-cured tobacco variety. The eight parents were crossed pairwise to produce four two-way hybrids, and these four two-way hybrids were intercrossed in pairs to obtain six four-way crosses. One hundred and thirty-two eight-way crosses were developed by intercrossing between the six 4-way crosses. The progeny derived from the eight-way crosses was mated at random for two generations. In each generation, a minimum of 200 crosses were made between 400 different individuals. Furthermore, three individual progenies per eight-way cross were maintained. Finally, more than eight hundred eight-way MAGIC homozygous lines were generated by the single-seed descent method.
Phenotyping
The two parents and 209 biparental RILs were planted in three different environments in accordance with a randomized complete block design with two replications. A single trial was carried out at the Xichang Experimental Station, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Xichang (27.8°N, 101.5°E, 1,500.0m elevation), Sichuan, China in 2017; the other trials were conducted at the Zhucheng Experimental Station, Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Zhucheng (36.4°N, 119.1°E, 19.3m elevation), Shandong, China in 2017 and 2018. The eight parents and 537 MAGIC lines were planted in Zhucheng (36.4°N, 119.1°E, 19.3m elevation), Shandong, China, in 2019 and Longshan (28.5°N, 109.3°E, 600.2m elevation), Hunan, China in 2019 and 2020 in accordance with a randomized complete block design with two replications. In all the trials, one replication of each line consisted of two 10-plants rows with a row length of 10m, a row space of 1.2m, and a plant distance of 0.5 m.
At the flowering stage, five plants in the middle of each plot were randomly sampled for phenotypic evaluation, and the average value of five plants per plot was used for analysis. Four important agronomic traits were measured, including PH (calculated as the height of the stem from the soil surface to stem apex), LN, LL, and LW. Pearson correlation coefficients between the traits were calculated using R/4.05 software. The analysis of variance (ANOVA) functionality in QTL IciMapping version 4.1 (Meng et al., 2015) was used for ANOVA on the phenotypic traits and estimation of heritability.
Genotyping and Construction of a High-Density Genetic Linkage map
At the Zhengzhou Tobacco Research institute, a 430 K tobacco single-nucleotide polymorphism (SNP) array composed of 432,362 markers was used to genotype the different materials (which included nine parents, 209 RIL, and 537 MAGIC lines) at the Zhengzhou Tobacco Research Institute. The SNP genotyping procedure was performed as described by Zhang et al. (2017). The genotypes for one SSR marker (PT30174) were identified using the PCR method according to the results reported by Bindler et al. (2011). To construct a high-density SNP genetic linkage map, SNPs exhibiting polymorphism between BH and XHJ were selected. SNP markers showing significant segregation distortion (X2 test, P < 0.01, df = 1), distortion with poor quality, or distortion with more than 10% of missing data were excluded from the map construction. IciMapping 4.1 software was used to group and order the markers. The nnTwoOpt algorithm was used to determine the preliminary order and positions of linked markers. The Rippling algorithm (with a window size of five markers) was used to adjust the linkage map. The Kosambi centimorgan function was ultimately used to calculate map distances.
QTL Analysis
For linkage mapping (LM), inclusive composite interval mapping was implemented using IciMapping 4.1 software for QTL mapping. An empirical logarithm of odds (LOD) threshold of 3.40 was used for declaring significant QTLs based on 1,000 runs and a type I error of 5% (Meng et al., 2015). The genetic maps and QTLs were drawn using MapChart 2.0 software (Voorrips, 2002).
For association mapping (AM), a quality check was carried out according to several standards: First, the genetic positions of the markers were clearly mapped onto linkage groups (LG) via LM. Second, all the markers with a minor allele frequency of <5% were removed. Finally, SNP markers with more than 10% missing data were excluded. A total of 3,282 SNP markers were selected for the genome-wide association study (GWAS). The genotypic data from those markers and the phenotypic data across the three environments were analyzed via a mixed linear model (MLM) analysis by TASSEL 5.0 software (Bradbury et al., 2007). According to the adjusted Bonferroni method, a p-value of 1.52 × 10−5 (P = 0.05/Ne, Ne = 3,282) was determined to be the threshold for declaring a significant association (Li C. et al., 2019).
Expression Levels of Candidate Genes via RNA-seq Analysis
To analyze the expression levels of candidate genes, plants of tobacco varieties, BH and XHJ, were grown in 12 cm-diameter plastic pots in an artificial climate chamber with a 12/12 h light/dark photoperiod at 25°C with 70–85% relative humidity. There were three repetitions, each with five plants in one replication. Leaves of the same five plants from each replication were collected for RNA-seq analysis during the flowering stage. Total RNA was isolated from the leaves using the TRIzol reagent (Life Technologies, USA) according to the manufacturer's instructions. Transcriptome sequencing was carried out by Novogene (Beijing, China) on an Illumina NovaSeq platform using the 150 bp paired-end read mode. The expression quantity was calculated as fragments per kilobase of exon per million fragments.
Resequencing of the Different Parental Lines
To compare the difference in coding sequence regions of candidate genes, we randomly broke the genomic DNA of 9 parental lines (XHJ, BH, Florida301, Vam, Basma, Samsun, TP, XHQ, and HD) at 350 bp and used them to construct library using the TruSeq Library Construction Kit. The constructed library was sequenced on the Illumina platform by Novogene (Beijing, China), and 150 bp paired-end reads were generated. The raw sequencing data were filtered using Trimmomatic. After data filtering, we used the clean data for subsequent analyses. A total of 20-fold mean genome coverage for each sample was achieved. Furthermore, SNPs in the encoding sequence regions of the candidate genes between different parental lines were analyzed based on the resequencing results.
Validation and Utilization of the Major QTL
A total of 211 tobacco accessions were collected and genotyped based on the marker bin20-185. The phenotypic values were measured at the Zhucheng (Shandong Province) and Xichang (Sichuan Province) sites between 2014 and 2015. The same designs as previously mentioned were used.
One hundred thirty-three BC4F3 introgression lines (ILs) derived from the flue-cured tobacco K326 (recurrent parent, GG genotype) and the oriental tobacco Samsun (donor parent, AA genotype) were generated. The phenotype of each BC4F3 line was measured at the Xishuangbanna Experimental Station, Xishuangbanna (22.1°N, 101.3°E, 1000.8m elevation), Yunnan, China. The same designs as those discussed previously were used. The BC4F3 ILs with the major QTL qLW20-1 were selected based on genotypes at the locus bin20-185 via PCR.
Results
Phenotypic Analysis of the Parents and Populations
The phenotypic frequency distributions of four agronomic traits across different environments are shown in Figure 1. In the biparental RIL population, the two parental lines, Beinhart1000-1 (BH) and Xiaohuangjin1025 (XHJ), exhibited significantly different agronomic trait phenotypes in different environments. While the difference between the two parents varied according to the environment, XHJ had consistently greater LL, while BH had greater PH, LW, and LN in all the environments. Transgressive segregations of the four traits were observed in the RIL population, and their distributions were approximately normal. For the MAGIC population, the eight parental lines—Vam, XHQ, BH, Basma, Samsun, Florida301, TP, and HD—exhibited significantly different agronomic trait phenotypes in different environments. Basma and TP had lower PH values than the other six parents in the different environments. Moreover, compared with the other seven parents, Vam presented higher LNs in the different environments. The different parents showed different variations in LL and LW in the different environments. The MAGIC population showed large phenotypic variances across different environments for the four agronomic traits, with ranges of 55.83–235.00 cm for PH, 6.17–36.67 for LN, 27.08–67.50 cm for LL, and 8.17–39.50 cm for LW.
In addition, significant correlations were found between PH and the other traits in the RIL population (Supplementary Figure 1). However, the correlation between LL and LW was not significant, suggesting that the leaf shape (indicated by the LL/LW ratio) varied greatly within this population. As shown in Table 1, the variations in the four traits were all determined by genotype and environment. According to the plot, all traits had a moderate heritability of ~50%. These results indicated that the four agronomic traits were all quantitatively inherited in tobacco.
Table 1. Variance components and heritability for the four traits in the tobacco biparental population and MAGIC population in different environments.
Construction of a High-Density Linkage map
Of the 56,693 polymorphic SNP markers, 17,788 SNP markers with <10% missing data were used to construct a linkage map. A total of 3,934 SNP markers were ultimately mapped onto the 24 high-density genetic LGs of tobacco (Supplementary Figure 2 and Supplementary Table 1). The total length of the LGs was 3920.43 cm, with a mean distance of 1.01 cm between adjacent markers. The LG length ranged from 100.09 cm to 255.58 cm. The most saturated LG was LG14, which had an intermarker distance of 0.81 cm, whereas LG18 had the largest distance of 1.34 cm between adjacent markers. The largest LG, LG01, harbored 252 markers, with a mean intermarker distance of 1.01 cm. The smallest LG, LG23, contained 91 markers, with a mean distance of 1.32 cm between adjacent markers (Supplementary Table 2).
QTL Analysis
LM With a Biparental RIL Population
A total of 35 QTLs associated with the four agronomic traits were mapped onto 16 LGs in three different environments (Table 2). Among them, nine, nine, nine, and eight QTLs were found to be associated with PH, LL, LW, and LN, respectively. For PH, a total of nine QTLs (qPH2-1, qPH4-1, qPH8-1, qPH11-1, qPH13-1, qPH13-2, qPH17-1, qPH18-1, and qPH19-1) were identified under different conditions. They were distributed across LGs 2, 4, 8, 11, 13, 17, 18, and 19 and explained 3.49% to 47.04% of the total phenotypic variance. The BH alleles at all loci except qPH4-1, qPH13-1, and qPH18-1 increased the trait values. One major QTL, qPH17-1, was repeatedly detected under different conditions and accounted for 7.06%−47.04% of the phenotypic variance. Nine QTLs (qLL2-1, qLL4-1, qLL8-1, qLL12-1, qLL16-1, qLL17-1, qLL18-1, qLL20-1, and qLL22-1) were detected for LL under different conditions, which were distributed across LGs 2, 4, 8, 12, 16, 17, 18, 20, and 22 and explained 5.39% to 19.81% of the total phenotypic variance. The BH alleles at all loci except qLL2-1, qLL4-1, qLL18-1, and qLL20-1 increased the trait values. Another major QTL, qLL17-1, was repeatedly detected under different conditions and accounted for 5.67%−19.81% of the phenotypic variance. For LW, a total of nine QTLs (qLW2-1, qLW4-1, qLW9-1, qLW11-1, qLW13-1, qLW17-1, qLW20-1, qLW21-2, and qLW23-1) were identified under different conditions. They were distributed across LGs 2, 4, 9, 11, 13, 17, 20, 21, and 23 and explained 2.78% to 31.47% of the total phenotypic variance. The BH alleles at all loci, except qLW11-1, qLW13-1, qLW17-1, and qLW20-1, reduced the trait values. Another major QTL, qLW20-1, was repeatedly detected under different conditions and accounted for 19.24–31.47% of the total phenotypic variance. For LN, a total of eight QTLs (qLN1-1, qLN4-1, qLN12-1, qLN12-2, qLN13-1, qLN17-1, qLN18-1, and qLN22-1) were identified under different conditions. They were distributed across LGs 1, 4, 12, 13, 17, 18, and 22 and explained 4.79% to 20.16% of the total phenotypic variance. The BH alleles at all loci, except qLN4-1 and qLN22-1, increased the trait values.
Table 2. Summary of QTLs affecting the four important agronomic traits in tobacco across different environments using the LM method.
As shown in Figure 2, many regions were found to harbor two or more closely linked QTLs for different traits. For instance, qPH17-1, qLL17-1, qLW17-1, and qLN17-1 were mapped to the adjacent region based on LG17, and these QTLs were tightly linked to markers bin17-48 and bin17-70. These results were in agreement with those of a correlation analysis of phenotypic traits in the RIL population. Two major QTLs (qPH17-1 and qLW20-1) were repeatedly identified using the LM method in the three environments. For PH, one major QTL, named qPH17-1, was mapped to the same chromosomal region from bin17-66 to bin17-67. This QTL explained 23.13% to 47.04% of the phenotypic variation with positive additive effects under different conditions. For LW, the other major QTL, named qLW20-1, was mapped to a similar chromosomal region from bin20-183 to bin20-188. This QTL explained 19.24% to 31.47% of the phenotypic variation with positive additive effects under different conditions.
Figure 2. The positions of QTL affect the four important agronomic traits in tobacco by the linkage mapping method. The black box represents QTL detected at Sichuan in 2017; the red box represents QTL detected at Shandong in 2017; the green box represents QTL detected at Shandong in 2018; and the yellow box represents QTL detected for BLUE.
AM via a MAGIC Population
To investigate the genetic relationship among MAGIC lines, we performed a genetic analysis based on 3,282 SNP markers (Supplementary Table 3). The results suggested that the eight-way populations showed no clear population structure (Supplementary Figure 3). Then, using the MAGIC population, we carried out GWAS of the four agronomic traits, namely, PH, LW, LL, and LN. A total of nine QTLs for PH, LW, and LL were identified by the MLM method (Figure 3, Table 3, and Supplementary Figure 4). Among these QTLs, three association signals were identified for PH, especially qPH13-3, which was repeatedly detected for PH in all environments. Another two QTLs, qPH17-2 and qPH24-1, were identified only in Shandong in 2019 and in Hunan in 2020, respectively. For LW, four QTLs, namely, qLW20-1, qLW14-1, qLW13-2, and qLW9-2, were repeatedly identified in at least two environments (Shandong in 2019 and 2020), and qLW20-1 was identified in all the environments. Additionally, two QTLs (qLL13-1 and qLL14-1) for LL were mapped onto LGs 13 and 14, respectively.
Figure 3. Association mapping for the four important agronomic traits in tobacco using the MAGIC population. The x- and y-axes represent the genetic position of the 12 linkage groups, respectively, as well as the negative log10 P-value. The horizontal solid line indicates the genome-wide significance threshold P = 1.52 × 10−5.
Table 3. Summary of QTLs affecting the four important agronomic traits in tobacco in different environments using the AM method.
Candidate Gene Prediction of Major QTLs
The candidate genes for the three major QTLs (qPH13-3, qPH17-1, and qLW20-1) were predicted based on the physical positions of the tightly linked markers. The qPH13-3 was tightly linked to the marker bin13-161 by the AM method; this QTL was determined to be located on Nt17:170,496,603 bp based on the K326 reference genome reported by Edwards et al. (2017). Comparative mapping of the K326 reference genome predicted the existence of a total of 26 genes from 169.99Mb to 170.99Mb on Nt17. The polymorphisms of the 26 candidate genes were analyzed between the different parents using the resequencing method. As shown in Supplementary Table S4, non-synonymous mutations existed in six genes among the 26 candidate genes. In sum, one gene (Nitab4.5_0002347g0190.1) encoding a protein with high homology to gibberellin 2-beta-dioxygenase 8 (GA2OX8) in Arabidopsis thaliana (Li Y. et al., 2019) was considered as the candidate gene for qPH13-3. Another QTL involved in PH, qPH17-1, was mapped by the AM method to LG 17 between bin17-66 and bin17-67. Further analysis indicated that the markers bin17-66 and bin17-67 were located at Nt17: 147,923,432 bp and Nt17: 150,216,340 bp. There were 68 candidate genes in this target region. For LW, the major QTL qLW20-1 was tightly linked to the marker bin20-185 (Nt23: 997,702 bp). There were 13 candidate genes from 0.50Mb to 1.50Mb on chromosome 23. The expression level and sequence polymorphisms of the 13 candidate genes were analyzed between the different parents. There was no difference between parental lines, XHJ and BH, for the expression of the 13 candidate genes, and non-synonymous mutations existed in four of the 13 candidate genes. Among these genes, one gene (Nitab4.5_0000798g0140.1) encoding a protein with high homology to auxin response factor 9 (ARF9) in Arabidopsis thaliana (Remington et al., 2004) was considered the candidate gene for qLW20-1 (Supplementary Table 4).
Validation and Utilization of the Major QTL qLW20-1 for Improvement of LW in Tobacco
A major QTL, qLW20-1, associated with the LW trait in tobacco, was repeatedly detected in all environments and was tightly linked to the SNP marker bin20-185 (Tables 2, 3). The 211 tobacco accessions were genotyped at the qLW20-1 locus and were divided into two groups (AA and GG) based on the genotypes of the flanking marker bin20-185 (Supplementary Table 5). The accessions in the AA group presented the same genotypes with XHJ at the qLW20-1, while the accessions in the GG group presented the same genotypes with BH at the qLW20-1. The values for LW in three different environments were obtained for all the genotyped accessions. As shown in Figures 4A,B, there was a significant difference (P < 0.0001) in the LW trait between the GG group (comprising 114 tobacco accessions) and the AA group (comprising 97 tobacco accessions). The average values of LW were significantly higher in the GG group than in the AA group in different environments. These results indicated a strong correlation between the genotype of the marker bin20-185 and the LW trait. A new BC4F3 population derived from the flue-cured tobacco K326 (recurrent parent and AA genotype) and the oriental tobacco Samsun (donor parent and GG genotype) was constructed. As shown in Figures 4C,D, tobacco accessions with GG genotypes at the bin20-185 site exhibited significantly greater LW than the recurrent parent K326.
Figure 4. Improvement of LW in tobacco by MAS based on the major QTL qLW20-1. (A) A comparison of upper LW between different genotypes in the tobacco collection in different environments. (B) A comparison of middle LW between different genotypes in the tobacco collection in different environments. (C) The performance of upper and middle leaves between the donor parent K326 and the line anchored to the major QTL controlling tobacco, LW. (D) A comparison of upper and middle LWs between the donor parent K326 and the line that anchored the major QTL controlling tobacco LW. Statistical analysis of LW between different lines based on different genotypes (AA and GG) at the locus bin20-185. The error bars indicate SD. Significant tests are carried out using student's t-tests.
Discussion
Mapping for Complex Traits in Tobacco
In comparison to other important crop species, only a few studies of QTL mapping for complex traits in tobacco have been reported due to the following reasons. First, as a member of the Solanaceae family, tobacco is an allopolyploid species (2n = 48) with a large genome (4.5 Gb) (Sierro et al., 2014). During its evolution, tobacco experienced a genetic bottleneck, resulting in the very low diversity of tobacco accessions present today. Second, suitable genetically different populations for complex trait mapping in tobacco are lacking. To date, only a few studies involving biparental permanent populations, such as RIL and DH populations, have been reported for QTL mapping. Third, many QTL studies are dependent on traditional molecular markers, such as SSRs, amplified fragment length polymorphisms (AFLPs), and restriction fragment length polymorphisms (RFLPs), which have a lower density than SNPs, resulting in insufficient resolution during genetic mapping (Julio et al., 2006; Tan et al., 2012; Cheng et al., 2015; Sun et al., 2018; Zhang et al., 2018). In this study, we constructed a high-density SNP linkage map of tobacco based on a BH/XHJ RIL for genome-wide QTL mapping. Then, major QTLs related to four important agronomic traits were accurately identified by the use of a biparental RIL and a MAGIC population. Finally, the LW of the main tobacco variety K326 was successfully improved via MAS. Our results indicate that combining a high-density SNP linkage map with a different genetic population can be a highly successful strategy for the improvement of complex traits in tobacco.
Comparison of QTL Mapping Results Between the MAGIC Population and the Biparental Population in Tobacco
MAGIC populations are considered to be more effective and powerful for QTL mapping, gene discovery, and breeding of many crop species (Cavanagh et al., 2008; Bandillo et al., 2013; Meng et al., 2016; Campanelli et al., 2019). To date, QTL mapping involving MAGIC populations has not been reported in tobacco. In this study, QTL mapping of four agronomic traits in tobacco was performed by the use of a biparental population and a MAGIC population. Compared to the biparental population, the MAGIC population showed wider phenotypic variances (Figure 1). This suggested that the MAGIC population provides breeders with a greater opportunity to select elite lines for breeding. However, the number of QTLs identified in the biparental population was greater than that in the MAGIC population. There are several plausible reasons for this. First, in our study, the P-value (Padjusted = 0.05/number of markers) of the GWAS was adjusted by the Bonferroni correction method, which is rather strict for multiple comparisons and makes it easier to ignore meaningful sites. To reduce the false-negative discovery rate, we adopted a more relaxed threshold, especially for complex traits (Li C. et al., 2019). Second, the use of more parents for developing a population usually implies that the frequencies of different alleles are lower and uneven across alleles, which reduces the power for detecting QTLs with weak effects (Meng et al., 2016). Third, the SNP markers for AM were derived from these polymorphic markers and mapped to LGs using a biparental population instead of the mapping of all eligible markers in the MAGIC population. The MAGIC population has lower detection power than the biparental population for minor-effect QTLs given similar numbers of markers. However, the MAGIC population is very effective for major QTL detection in tobacco. One major QTL, qLW20-1, identified via the biparental population, was also repeatedly detected via the MAGIC population, and one major QTL, qPH13-3, was identified specifically in the different environments via the MAGIC population. Therefore, it is necessary for the MAGIC population to improve its mapping power and resolution by expanding marker coverage in the future.
Major QTLs for PH and LW in Tobacco
According to the results of LM and AM, a total of 43 QTLs were detected for four important agronomic traits in the different populations in different environments. Among these QTLs, two or more QTLs for different traits were closely linked and located in similar chromosomal regions. These results implied that some agronomic traits were closely related to each other in tobacco. Three major QTLs, qPH13-3, qPH17-1, and qLW20-1, were repeatedly detected under different conditions. Using 614 SSR markers, Tong et al. (2012) reported four QTLs associated with PH, internode length, and the width of the largest waste leaf, explaining 15%−20% of the phenotypic variance. According to AM, qPH13-3 in this study colocalized with qPH17-1 and qLWL17 reported by Tong et al. (2012) and was tightly linked to the marker bin13-161. This marker (Nt17:170,496,603 bp) was located in the first intron region of the gene Nitab4.5_0002347g0190.1. This gene encodes a protein with high homology to gibberellin 2-beta-dioxygenase 8 (GA2OX8) in Arabidopsis thaliana, and this protein acts specifically on C-20 gibberellins (Li C. et al., 2019). Therefore, Nitab4.5_0002347g0190.1 is considered a candidate gene for qPH13-3 and may play an important role in determining tobacco height. For LW, qLW20-1 may be a novel major QTL controlling leaf development in tobacco because it was identified in similar genomic regions in the different environments through combinations of LM and AM methods. In this study, qLW20-1 was indicated to be tightly linked to marker bin20-185. In the target region, one gene (Nitab4.5_0000798g0140.1) encoded a protein with high homology to auxin response factor 9 (ARF9) in Arabidopsis thaliana (Remington et al., 2004) and was considered the candidate gene for qLW20-1. In the Arabidopsis thaliana, the orthologous gene (AT4G23980) encodes a transcription factor that controls the expression of the large set of auxin-dependent genes that mediate hormone-dependent growth and development. Therefore, it is possible that Nitab4.5_0000798g0140.1 controls the expression of auxin-related genes, which govern LW in tobacco. Identification of candidate genes and functional research of these stably expressed QTLs are being carried out.
Potential of qLW20-1 for Breeding in Tobacco
The discovery of new QTLs can provide more choices for breeding new varieties using MAS (Cheng et al., 2013; Wang et al., 2014; Yin et al., 2015; Zhang et al., 2017). In the current study, we found that 114 varieties with the same allele (GG) as BH at the major QTL qLW20-1 showed significant improvements in the LW trait. The results indicated that the GG genotype at the major gene locus played an important role in controlling the development of LW, and the responsible alleles have been widely incorporated into modern cultivated tobacco varieties. To improve the LW of K326 (a major flue-cured tobacco variety grown in China and the USA), we developed a set of introgression lines in the K326 genetic background from a cross between K326 and Samsun. Based on the information from the flanking SNP marker bin20-185, we selected 71 introgression lines (GG genotype). The average values of the upper LW and middle LW of the GG genotype introgression lines were significantly higher than those of the AA genotype. Therefore, the identification of the tightly linked molecular marker bin20-185 provides an opportunity for utilizing this genetic region for tobacco breeding more extensively and effectively.
Conclusion
Overall, in this study, using the biparent and MAGIC populations, we mapped QTLs for PH, LW, LL, and LN in tobacco. A total of 43 QTLs, including three major QTLs related to different agronomic traits, were detected by the combination of LM and AM. These QTLs and markers tightly linked to the traits of interest in this study may be useful for the improvement of agronomic traits in tobacco via MAS.
Data Availability Statement
The data presented in the study are deposited in the Genome Variation Map in National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences, accession number GVM000327 RIL and GVM000326 MAGIC.
Author Contributions
LC and AY designed the experiment. YL, GY, YS, ZJ, DL, CJ, XP, JY, ZL, JZ, YP, LW, ZX, and QF performed the experiment. HS, LC, and MR analyzed experimental data. LC, HS, and YL wrote the manuscript. All authors read and approved the manuscript.
Funding
This work was supported by grants from the Agricultural Science and Technology Innovation Program (ASTIP-TRIC01).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
The authors are thankful to the National Infrastructure for Crop Germplasm Resources (Tobacco, Qingdao) for the supply of germplasm resources.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.878267/full#supplementary-material
Supplementary Figure 1. Pairwise correlation values among four agronomic traits in the RIL population of tobacco.
Supplementary Figure 2. The high-density SNP linkage map of tobacco based on a BH/XHJ recombinant inbred line population.
Supplementary Figure 3. Phylogenetic trees of the MAGIC lines.
Supplementary Figure 4. Q-Q plots for four agronomic traits across different environments.
Supplementary Table 1. The information from SNP markers was mapped on 24 linkage groups of tobacco.
Supplementary Table 2. The statistics of the high-density single nucleotide polymorphism-based genetic maps in the BH/XHJ population from the 430K SNP genotyping assay.
Supplementary Table 3. The information of SNP markers for association mapping using MAGIC population.
Supplementary Table 4. The list of candidate genes for the three major QTLs is qPH13-3, qPH17-1, and qLW20-1.
Supplementary Table 5. The list of tobacco accessions used for validation and utilization of the major QTL qLW20-1.
References
Bandillo, N., Raghavan, C., Muyco, P. A., Sevilla, M. A. L., Lobina, I. T., Dilla-Ermita, C. J., et al. (2013). Multiparent advanced generation inter-cross (MAGIC) populations in rice: progress and potential for genetics research and breeding. Rice 6, 11. doi: 10.1186/1939-8433-6-11
Bindler, G., Plieske, J., Bakaher, N., Gunduz, I., Ivanov, N., Vand, H. R., et al. (2011). A high density genetic map of tobacco (Nicotiana tabacum L.) obtained from large scale microsatellite marker development. Theor. Appl. Genet. 123, 219–230. doi: 10.1007/s00122-011-1578-8
Bossa-Castro, A. M., Tekete, C., Raghavan, C., Delorean, E. E., Dereeper, A., Dagno, K., et al. (2018). Allelic variation for broad-spectrum resistance and susceptibility to bacterial pathogens identified in a rice MAGIC population. Plant Biotechnol. J. 16, 1559–1568. doi: 10.1111/pbi.12895
Bradbury, P. J., Zhang, Z., Kroon, D. E., Casstevens, T. M., Ramdoss, Y., and Buckler, E. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635. doi: 10.1093/bioinformatics/btm308
Campanelli, G., Sestili, S., Acciarri, N., Montemurro, F., Palma, D., Leteo, F., et al. (2019). Multi-parental advances generation inter-cross population, to develop organic tomato genotypes by participatory plant breed. Agron. J. 9, 119. doi: 10.3390/agronomy9030119
Cavanagh, C., Morell, M., Mackay, I., and Powell, W. (2008). From mutations to MAGIC: resources for gene discovery, validation and delivery in crop plants. Curr. Opin. Plant Biol. 11, 215–221. doi: 10.1016/j.pbi.2008.01.002
Cheng, H., Luo, H. Y., Du, W. J., Wang, S. K., Chang, S., Dong, S. F., et al. (2013). Effects of different ecological factors on aroma component contents in flue-cured tobacco K326 in yunnan. Chin. Tobacco Sci. 34, 70–73. doi: 10.3969/j.issn.1007-5119.2013.03.14
Cheng, L. R., Yang, A. G., Jiang, C. H., Ren, M., Zhang, Y., Feng, Q. F., et al. (2015). Quantitative trait loci mapping for plant height in tobacco using linkage and association mapping methods. Crop Sci. 55, 641–647. doi: 10.2135/cropsci2014.05.0404
Edwards, K. D., Fernandez-Pozo, N., Drake-Stowe, K., Humphry, M., Evans, A. D., Bombarely, A., et al. (2017). A reference genome for Nicotiana tabacum enables map-based cloning of homeologous loci implicated in nitrogen utilization efficiency. BMC Genom. 18, 448. doi: 10.1186/s12864-017-3791-6
Gholizadeh, S., Darvishzadeh, R., Mandoulakani, B. A., Bernousi, I., Alavi, S. R., and Masouleh, A. K. (2012). Molecular characterization and similarity relationships among flue-cured tobacco (Nicotiana tabacum L.) genotypes using simple sequence repeat markers. Not. Bot. Horti. Agrobo. 40, 247–253. doi: 10.15835/nbha4027169
Holland, J. B. (2015). MAGIC maize: a new resource for plant genetics. Genome Biol. 16, 163. doi: 10.1186/s13059-015-0713-2
Julio, E., Denoyes, R. B., Verrier, J. L., and Borne, F. D. (2006). Detection of QTLs linked to leaf and lmoke properties in Nicotiana tabacum based on a study of 114 recombinant inbred lines. Mol. Breed. 18, 69–91. doi: 10.1007/s11032-006-9019-0
Kearsey, M. J., and Farquhar, A. G. (1998). QTL analysis in plants; where are we now? Heredity 80, 137–142. doi: 10.1046/j.1365-2540.1998.00500.x
Kover, P. X., Valdar, W., Trakalo, J., Scarcelli, N., Ehrenreich, I. M., Purugganan, M. D., et al. (2009). A multiparent advanced generation inter-cross to fine-map quantitative traits in Arabidopsis thaliana. PLoS Genet. 5, e1000551. doi: 10.1371/journal.pgen.1000551
Li, C., Zheng, L. L., Wang, X. N., Hu, Z. B., Zheng, Y., Chen, Q. H., et al. (2019). Comprehensive expression analysis of Arabidopsis GA2-oxidase genes and their functional insights. Plant Sci. 285, 1–13. doi: 10.1016/j.plantsci.2019.04.023
Li, X. F., Liu, Z. X., Lu, D. B., Liu, Y. Z., Mao, X. X., Li, Z. X., et al. (2013). Development and evaluation of multi-genotype varieties of rice derived from MAGIC lines. Euphytica 192, 77–86. doi: 10.1007/s10681-013-0879-1
Li, Y., Cao, K., Zhu, G. R., Fang, W. C., Chen, C. W., Wang, X. W., et al. (2019). Genomic analyses of an extensive collection of wild and cultivated accessions provide new insights into peach breeding history. Genome Biol. 20, 36. doi: 10.1186/s13059-019-1648-9
Meng, L., Li, H. H., Zhang, L. Y., and Wang, J. K. (2015). QTL IciMapping: integrated software for genetic linkage map construction and quantitative trait locus mapping in biparental populations. Crop J. 3, 269–283. doi: 10.1016/j.cj.2015.01.001
Meng, L. J., Zhao, X. Q., Ponce, K., Ye, G. Y., and Leung, H. (2016). QTL mapping for agronomic traits using multiparent advanced generation inter-cross (MAGIC) populations derived from diverse elite indica rice lines. Field Crops Res. 189, 19–42. doi: 10.1016/j.fcr.2016.02.004
Mohan, M., Nair, S., Bhagwat, A., Krishna, T. G., Yano, M., Bhatia, C. R., et al. (1997). Genome mapping, molecular markers and marker-assisted selection in crop plants. Mol. Breed. 3, 87–103. doi: 10.1023/A:1009651919792
Ogawa, D., Yamamoto, E., Ohtani, T., Kanno, N., Tsunematsu, H., Nonoue, Y., et al. (2018). Haplotype-based allele mining in the Japan-MAGIC rice population. Sci. Rep. 8, 4379. doi: 10.1038/s41598-018-22657-3
Pascual, L., Desplat, N., Huang, B. E., Desgroux, A., Bruguier, L., Bouchet, J. P., et al. (2014). Potential of a tomato MAGIC population to decipher the genetic control of quantitative traits and detect causal variants in the resequencing era. Plant Biotechnol. J. 13, 565–577. doi: 10.1111/pbi.12282
Remington, D. L., Vision, T. J., Guilfoyle, T. J., and Reed, J. W. (2004). Contrasting modes of diversification in the Aux/IAA and ARF gene families. Plant Physiol. 135, 1738–1752. doi: 10.1104/pp.104.039669
Shah, L., Rehman, S., Ali, A., Yahya, M., Riaz, M. W., Si, H. Q., et al. (2018). Genes responsible for powdery mildew resistance and improvement in wheat using molecular marker-assisted selection. J. Plant Dis. Prot. 125, 145–158. doi: 10.1007/s41348-017-0132-6
Sierro, N., Battey, J. N. D., Ouadi, S., Bakaher, N., Bovet, L., Willig, A., et al. (2014). The tobacco genome sequence and its comparison with those of tomato and potato. Nat. Commun. 5, 3833. doi: 10.1038/ncomms4833
Song, J., Liu, G. X., Tong, Y., Wang, Y. Y., Li, Y., Zhang, X. W., et al. (2019). Dynamic QTL analysis of leaf number and leaf area in tobacco at different developmental stages. Mol. Plant Breed. 18, 6047–6052. doi: 10.13271/j.mpb.017.006047
Stavely, J. R., Chaplin, J. F., and Gwynn, G. R. (1984). Registration of bel921 brown spot resistant flue-cured tobacco germplasm. Crop Sci. 24, 830–831. doi: 10.2135/cropsci1984.0011183X002400040063x
Sun, M. M., Cheng, L. R., Jiang, C. H., Zhu, C. G., Ren, M., Zhang, Y. S., et al. (2018). Identification of a major QTL affecting resistance to brown spot in tobacco (Nicotiana tabacum L.) via linkage and association mapping methods. Euphytica 214, 195. doi: 10.1007/s10681-018-2244-x
Tan, X. L., Xu, X. H., and Wang, N. C. (2012). QTLs analysis of the easy curing potential in flue-cured tobacco. Mol. Plant Breed. 10, 201–206. doi: 10.3969/mpb.010.000201
Tong, Z. J., Jiao, F. C., Wu, X. F., Wang, F. Q., Chen, X. J., et al. (2012). Mapping of quantitative trait loci underlying six agronomic traits in flue-cured tobacco (Nicotiana tabacum L.). Acta Agronomica Sinica 38, 1407–1415. doi: 10.3724/SP.J.1006.2012.01407
Tong, Z. J., Xiao, B. G., Jiao, F. C., Fang, D. H., Zeng, J. M., Wu, X. F., et al. (2016). Large-scale development of SSR markers in tobacco and construction of a linkage map in flue-cured tobacco. Breed. Sci. 66, 381–390. doi: 10.1270/jsbbs.15129
Vijay, V., Danehower, D. A., Tyler, S., Moon, H. S., and Lewis, R. S. (2010). Analysis of a Nicotiana tabacum L. genomic region controlling two leaf surface chemistry traits. J. Agric. Food Chem. 58, 294–300. doi: 10.1021/jf903256h
Voorrips, R. E. (2002). MapChart: software for the graphical presentation of linkage maps and QTLs. J. Hered. 93, 77–78. doi: 10.1093/jhered/93.1.77
Wang, M., Gu, X. F., Miao, H., Liu, S. L., Wang, Y., Wehner, T. C., et al. (2014). Molecular mapping and candidate gene analysis for heavy netting gene (H) of mature fruit of cucumber (Cucumis sativus L.). Sci. Agricul. Sin. 47, 1550–1557. doi: 10.3864/j.issn.0578-1752.2014.08.011
Wang, Y. R., and Guo, S. D. (2010). Insect-resistance and high-yield transgenic tobacco obtained by molecular breeding technology. African J. Biotechnol. 9, 6626–6631. doi: 10.5897/AJB09.1605
White, F. H., Pandeya, R. S., and Dirks, V. A. (1979). Correlation studies among and between agronomic, chemical, physical and smoke characteristics in flue-cured tobacco (Nicotiana tabacum L.). Can. J. Plant Sci. 59, 111–120. doi: 10.4141/cjps79-016
Xiang, P. H., Guo, W., Shan, X. H., Huang, Y. Z., and Long, S. P. (2016). Effects of tobacco-rice continuous cropping years on soil physicochemical properties and tobacco yield and quality. J. Agr. Sci. Tech-Iran. 17, 2668–2671+2676. doi: 10.16175/j.cnki.1009-4229.2016.11.055
Xiao, B. G., Zhu, J., Lu, X. P., Bai, Y. F., and Li, Y. P. (2006). Genetic and correlation analysis for agronomic traits in flue-cured tobacco (Nicotiana tabacum L.). Hereditas 3, 317–323. doi: 10.16288/j.yczz.2006.03.013
Xu, M. H., Wang, M. Y., and Long, W. H. (2000). Analysis of genetic effects of major agronomic and quality characters in tobacco (Nicotiana tabaccum L.). Hereditas 6, 395–397. doi: 10.16288/j.yczz.2000.06.013
Yin, C. B., Li, H. H., Li, S. S., Xu, L. D., Zhao, Z. G., and Wang, J. K. (2015). Genetic dissection on rice grain shape by the two-dimensional image analysis in one japonica × indica population consisting of recombinant inbred lines. Theor. Appl. Genet. 128, 1969–1986. doi: 10.1007/s00122-015-2560-7
Zhang, J. F., Luo, Z. P., He, S. B., Jin, J. J., Li, Z. F., Xu, Y. L., et al. (2017). Genetic diversities of 24 tobacco cultivars analyzed by SNP. Tobacco Sci. Technol. 50, 1–8. doi: 10.16135/j.issn1002-0861
Keywords: agronomic traits, QTL, MAGIC population, RIL population, tobacco
Citation: Liu Y, Yuan G, Si H, Sun Y, Jiang Z, Liu D, Jiang C, Pan X, Yang J, Luo Z, Zhang J, Ren M, Pan Y, Sun K, Meng H, Wen L, Xiao Z, Feng Q, Yang A and Cheng L (2022) Identification of QTLs Associated With Agronomic Traits in Tobacco via a Biparental Population and an Eight-Way MAGIC Population. Front. Plant Sci. 13:878267. doi: 10.3389/fpls.2022.878267
Received: 17 February 2022; Accepted: 21 April 2022;
Published: 06 June 2022.
Edited by:
Shengjun Li, Qingdao Institute of Bioenergy and Bioprocess Technology (CAS), ChinaReviewed by:
Zhiming Zhang, Shandong Agricultural University, ChinaDonghai Mao, Institute of Subtropical Agriculture (CAS), China
Copyright © 2022 Liu, Yuan, Si, Sun, Jiang, Liu, Jiang, Pan, Yang, Luo, Zhang, Ren, Pan, Sun, Meng, Wen, Xiao, Feng, Yang and Cheng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lirui Cheng, Y2hlbmdsaXJ1aSYjeDAwMDQwO2NhYXMuY24=; Aiguo Yang, eWFuZ2FpZ3VvJiN4MDAwNDA7Y2Fhcy5jbg==
†These authors have contributed equally to this work