Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 04 August 2021
Sec. Genomics of Plants and the Phytoecosystem
This article is part of the Research Topic The Development and Application of Multi-Omics Integration Approaches to Dissecting Complex Traits in Plants View all 25 articles

Identification of Candidate Genes Regulating the Seed Coat Color Trait in Sesame (Sesamum indicum L.) Using an Integrated Approach of QTL Mapping and Transcriptome Analysis

\r\nChun Li,Chun Li1,2Yinghui Duan,Yinghui Duan1,2Hongmei Miao,Hongmei Miao1,2Ming Ju,Ming Ju1,2Libin WeiLibin Wei1Haiyang Zhang,*Haiyang Zhang1,2*
  • 1Henan Sesame Research Center, Henan Academy of Agricultural Sciences, Zhengzhou, China
  • 2Henan Key Laboratory of Specific Oilseed Crops Genomics, Henan Sesame Research Center, Henan Academy of Agricultural Sciences, Zhengzhou, China

Seed coat color is an important seed quality trait in sesame. However, the genetic mechanism of seed coat color variation remains elusive in sesame. We conducted a QTL mapping of the seed coat color trait in sesame using an F2 mapping population. With the aid of the newly constructed superdense genetic linkage map comprised of 22,375 bins distributed in 13 linkage groups (LGs), 17 QTLs of the three indices (i.e., L, a, and b values) of seed coat color were detected in seven intervals on four LGs, with a phenotype variance explanation rate of 4.46–41.53%. A new QTL qSCa6.1 on LG 6 and a QTL hotspot containing at least four QTLs on LG 9 were further identified. Variants screening of the target intervals showed that there were 84 genes which possessed the variants that were high-impact and co-segregating with the seed coat color trait. Meanwhile, we performed the transcriptome comparison of the developing seeds of a white- and a black-seeded variety, and found that the differentially expressed genes were significantly enriched in 37 pathways, including three pigment biosynthesis related pathways. Integration of variants screening and transcriptome comparison results suggested that 28 candidate genes probably participated in the regulation of the seed coat color in sesame; of which, 10 genes had been proved or suggested to be involved in pigments biosynthesis or accumulation during seed formation. The findings gave the basis for the mechanism of seed coat color regulation in sesame, and exhibited the effects of the integrated approach of genome resequencing and transcriptome analysis on the genetics analysis of the complex traits.

Introduction

Sesame (Sesamum indicum L.) is a specific and important oilseed crop with a long cultivation history (Bedigian, 2003). At present, sesame is mainly grown in Asia, Africa, and Latin America. Sesame seeds possess abundant nutritional substances, such as unsaturated fatty acids, proteins, digestible fiber, and beneficial antioxidants (such as lignans) (Namiki, 2007; Kumar et al., 2013; Ha et al., 2017; Keskin, 2019). For sesame seeds, seed coat color varies from white, shallow yellow, yellow, golden, brown, gray, reddish brown, and other medium colors, to black. In particular, the white- and the black-seeded seeds are more popular for consumption in the world. Black sesame seeds are shown to possess abundant nutritional facts (Ganesan and Xu, 2017; Khoo et al., 2017). Some studies showed that the contents of some substances, such as phenolics, vitamins, and phloretin, in black sesame seeds are high (Ha et al., 2017; Wang et al., 2018). In China, black sesame seeds have also been used as a traditional Chinese medicine in the last 3,000 years (Wang et al., 2018; Zhang H. et al., 2019).

In sesame, seed coat color trait is considered as a quantitative trait (Zhang H. et al., 2013; Wang L. et al., 2016). Zhang H. et al. (2013) firstly performed the genetic background and QTL mapping analysis of sesame seed coat color with an F2 hybridization population using simple sequence repeat (SSR), amplified fragment length polymorphism (AFLP), and random selective amplification of microsatellite polymorphic loci (RSAMPL) markers. Four QTLs were located on three LGs with the phenotypic variation from 9.6 to 39.95% (Zhang H. et al., 2013). Subsequently, Wang L. et al. (2016) investigated the seed coat color indices of the L, a, and b values of a recombinant inbred lines (RIL) population and determined four QTLs on a genetic map constructed by restriction-site associated DNA sequencing (RAD-Seq). The four QTLs were found to be on three LGs and with the phenotypic variation from 3 to 46%. Wei et al. (2015) performed a genome-wide association study (GWAS) for the seed coat color and some other traits using a panel of 705 sesame accessions and determined seven loci associated with the seed coat color. Meanwhile, Du et al. (2019) detected seven QTLs related with seed coat color on three LGs using specific length amplified fragment sequencing (SLAF-seq) and an F2 population. Wang et al. (2020) identified 20 candidate genes associated with pigment biosynthesis using transcriptome data from a black and white sesame variety. However, only the SiPPO (polyphenol oxidase) gene was determined for the seed coat color in sesame until now (Wei et al., 2015).

For angiosperm plants, the seed coat color trait is predominantly determined by the content of various pigments, which mainly belong to the secondary flavonoid metabolites, such as flavonols, anthocyanins, and proanthocyanidins (PAs), and are synthesized through the flavonoid biosynthesis pathway (Winkel-Shirley, 2001). At present, dozens of genes regulating the seed coat color regulation have been detected in crops; of which, most genes encode the enzymes related to flavonoid biosynthesis (such as C2 in maize, and TT6 and TT7 in Arabidopsis), transfer proteins (ZmMRP3 in maize and TT12 and AHA10 in Arabidopsis), or regulatory factors (TT1 and TT10 in Arabidopsis) (Pourcel et al., 2005; Marinova et al., 2007; Owens et al., 2008; Badone et al., 2010; Han et al., 2010; Appelhagen et al., 2011, 2015; Eloy et al., 2017). Flavonoid biosynthesis pathway is highly conserved in different plant species (Dong et al., 2001; Tohge et al., 2017). To date, numerous transcription factors, such as MYBs, basic helix-loop-helix (bHLHs), Transparent Testa Glabra1 (TTG1), and MYB-bHLH-WDR (MBW) ternary complexes, have been found to be participating in the regulation of flavonoid biosynthesis pathway in a tissue-specific manner (Gonzalez et al., 2009; Xu et al., 2014). In addition, post-translation modification and microRNA have also been proven to be related to the flavonoid biosynthesis pathway (Wang H. et al., 2016).

In this study, in order to identify the major effect of genes underlying sesame seed coat color, we performed a QTL mapping for the sesame seed coat color trait based on a superdense genetic linkage map constructed using an F2 population and genome resequencing technology. The transcriptome comparison analysis of a black- and white-seeded variety during seed development was also carried out. As a result, a total of 28 candidate genes were determined for the seed coat color trait in sesame according to the integration of QTL mapping and transcriptome analysis. The findings provide a deep understanding of the genetic mechanism underlying the seed coat color regulation in sesame.

Materials and Methods

Materials and Data

Two varieties, Yuzhi DS899 (white-seeded, mutated from a sibling of Yuzhi 11) and JS012 (black-seeded), and their 120 F2 individuals were used for the QTL mapping in this study (Zhang et al., 2016). The F2 population and both parents were grown in Sanya (109°50′E and 18°25′N), Hainan, China in 2014 for seed coat color trait evaluation. For the transcriptome analysis, two varieties with a contrasting seed coat color, i.e., Yuzhi 11 and the black-seeded parent JS012, were cultivated in Yuanyang (113°97′E and 35°05′N), China in 2020. Twenty capsules from 10 plantlets of each variety at the time points 5, 10, 15, 20, and 25 DAF, respectively, were collected. The developing seeds were then peeled from the capsules, frozen in liquid nitrogen, and stored at –80°C for RNA extraction. Three biological replicates were set for RNA sequencing. All the materials above were chosen from the sesame germplasm reservoir of Henan Sesame Research Center, Henan Academy of Agricultural Sciences (HSRC, HAAS).

The genome resequencing data (reserved in NCBI Bioproject PRJNA315474) of Yuzhi DS899 and JS012 and their 120 F2 individuals (Zhang et al., 2016) were used for the genetic map construction and QTL mapping in this study. The sesame genome of var. Yuzhi11 (white-seeded; version 2) (Zhang et al., 2016) was used as the reference genome for genotyping, variant digging, and candidate gene screening analysis.

Seed Coat Color Trait Evaluation

Seed coat color of the F2 family was investigated based on a single plant. For each of the sequenced 120 F2 individuals above, the seeds were harvested after the plant was totally matured. The seeds from each individual were randomly divided into three equal pieces (used as three replicates) after the unmatured seeds were discarded. As to evaluate the seed coat color, the seeds were put into a quartz cell, and then scanned under the Colorflex EZ spectrophotometer (Hunter Associates Laboratory, United States). Three indices (i.e., L, a, and b values) were measured and recorded. Of which, “L” defines the luminosity of the seed coat, with a range from –100 for black to 100 for white; “a” and “b” indicate the shade of color pairs, with a range from –60 for green to +60 for red, and from –60 for blue to +60 for yellow, respectively. For each of the indices, the mean value of the three replicates was finally used.

Construction of a Superdense Genetic Linkage Map for Sesame

Sequencing reads of the parents Yuzhi DS899 and JS012 and their 120 F2 individuals (Zhang et al., 2016) were firstly filtered using the Trimmomatic 0.33 software (Bolger et al., 2014), and then mapped to the reference sesame genome (cv. Yuzhi 11) (Zhang et al., 2016) using the Burrows Wheeler Aligner (BWA) 0.7.17 software with the default settings (Li and Durbin, 2009). Joint variant calling was performed using the GATK 4.0 program following its best practice (Van der Auwera et al., 2013). Genotype coding was performed using the GenosToABH plugin in TASSEL 5.2.43 (Glaubitz et al., 2014). The homozygous genotype of the parent Yuzhi DS899 was coded as “A,” the parent JS012 as “B,” and the heterozygous genotype of their hybrid as “H.” Since several consecutive heterozygotes (“H”) would be called as identical homozygotes (“A” or “B”) within a very short distance due to the low reads coverage, the coded genotype was processed according to Miao et al. (2018) using the following procedures to improve the accuracy of linkage map construction: (1) combining the consecutive and redundant (with identical information for all individuals in the progeny) markers within a short region into a representative marker, and the minimum length between two continuous markers was set to 100 bp (base pair); (2) correcting the genotype of representative markers based on a sliding window of 15 markers; (3) based on the corrected markers, grouping the adjacent and redundant markers into bins; and (4) filtering out the bins of distorted segregation with the number of plants with “A” or “B” genotypes <20, and the number of plants with H genotypes <40. The threshold for distorted segregation loci was selected based on a pilot study of linkage map construction, in which we found that at least 80 individuals (with ideal segregation ratio 1:2:1 for genotype “A,” “B,” and “H”) were needed to form 13 linkage groups (corresponding 13 chromosomes of sesame).

The genetic map was constructed based on the filtered genotype matrix using the Lep-MAP3 program with the default parameters and a logarithm of the odds (LOD) of 12.0 to separate chromosomes (Rastas, 2017). LGs with less than 100 markers were discarded. The Kosambi function was applied to transfer the recombinant rate into the genetic distance (Morgan).

QTL Mapping and Comparison

QTL mapping was performed using the QTL Cartographer 2.5 (Wang et al., 2012), QTL.gCIMapping.GUI v1.0 (Wen et al., 2019; Zhang Y. W. et al., 2019), and QTL IciMapping 4.2.53 (Meng et al., 2015), respectively. The Composite interval mapping (CIM) method in the QTL Cartographer, the Genome-wide composite interval mapping (GCIM) method in the QTL.gCIMapping.GUI, and the Inclusive composite interval mapping (ICIM) and Interval mapping (IM) methods in the QTL IciMapping were individually used to detect the QTLs and to estimate the effects. For the CIM method, the map was scanned in 2 cM (centimorgan) intervals with a window size of 10 cM, and the LOD was determined by running 1,000 permutation tests. For the GCIM method, random model, critical LOD score = 2.5, and walk step for genome wide scanning = 1 cM were used. For the ICIM and IM methods, LOD was determined by running 1,000 permutation tests at type I error equals 0.05, and walk step was set at 1 cM. The ratio of phenotypic variance explained by genotype was taken from the marker at the peak QTL position. QTL physical position was determined by the flanking markers.

To make a distinction between the new QTLs identified in this study and from the reported results, QTL position comparison was performed with flanking sequences using the MUMER 4 program (Marcais et al., 2018). The flanking sequences of the QTLs or the associated loci were applied as the queries for alignment against the sesame reference genome (cv. Yuzhi 11) (Zhang et al., 2016).

RNA-Seq and Analysis

Total RNA of each sample was extracted using the RNAiso Plus Reagents (TaKaRa, Dalian, China). Genomic DNA was removed by DNase I treatment. RNA-seq libraries were constructed and sequenced on the Illumina platform HiSeq 2500 (Illumina, Inc., San Diego, CA, United States). The clean reads were mapped to the reference sesame genome using the STAR 2.7.0 (Dobin et al., 2013). The reads mapped to each genomic feature were counted using the featureCounts 2.0.1 (Yang et al., 2014). Default parameters were used for both read mapping with STAR and read counting with featureCounts. Differentially expressed genes (DEG) were screened and analyzed using the DESeq 2 software with the default parameters (Love et al., 2014). False discovery rate (FDR) adjusted p-values < 0.05 and absolute value of log2 (fold change) > 1 were set as the cutoff for the DEGs.

Gene Function Annotation, Homologous Gene Screening, and Variant Effect Prediction

Gene function annotation was performed using the eggNOG-Mapper 5.0.1 (Huerta-Cepas et al., 2017). Gene ontology (GO) plotting was conducted in the WEGO 2.0 (Ye et al., 2018). Homologous gene screening in the target intervals was conducted using the BLAST + 2.2.31 with the E-value < 1e-10. The effect of gene variant was estimated using the SnpEff 4.1 (Cingolani et al., 2012), and the variants that caused frame shift, stop gain, stop lost, start lost, splice acceptor, splice donor, exon loss, frame-shift, or nonsense mutation were regarded as high-impact and screened according to Cingolani et al. (2012). Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was performed using the R package clusterProfiler 3.12 with the default parameters (Yu et al., 2012), and a q-value < 0.05 was set as the cutoff.

Results

Seed Coat Color Genetic Analysis of the Mapping Population

In order to clarify the genetic background of the seed coat color in sesame, we firstly investigated the phenotypic variation of the seed coat color trait of the F2 population in 2016 (Supplementary Table 1 and Supplementary Figure 1). L, a, and b values of the white-seeded parent Yuzhi DS899 were 62.81, 5.50, and 19.62, respectively, while those of the black seeded parent were 21.62, 6.73, and 7.27, respectively. All the F1 seeds were black, whereas F2 plants exhibited an obvious variation in seed coat color with various values of L, a, and b (Supplementary Figure 1). As to the 120 F2 plants, “L” values ranged from 16.23 to 62.81, with a standard deviation of 8.40; the “a” values ranged from 1.20 to 9.34, with a standard deviation of 2.67; while “b” varied from 1.42 to 19.62, with a standard deviation of 4.27 (Supplementary Table 1). In particular, we found some F2 plants that presented a darker seed coat color than the black parent (Supplementary Figure 1). However, the color of all the F2 plantlets was darker than the white-seeded parent.

We further performed the pairwise correlation analysis of the three indicators of the seed coat color in the population (Supplementary Table 1). The results showed that the L, a, and b values were significantly correlated with each other. The L, a, and b values of the F2 population were all out of the normal distribution (Figure 1 and Supplementary Table 1), indicating the existences of a few major QTLs controlling the three indicators.

FIGURE 1
www.frontiersin.org

Figure 1. Distribution of the seed coat color indices presented by the L, a, and b values in the F2 population. (A)L” value; (B)a” value; (C)b” value.

Construction of a Superdense Genetic Linkage Map

Before determining the QTLs for the seed coat color trait, we firstly constructed a superdense genetic map using the F2 population above and the genome resequencing data. Based on the reference genome (var. Yuzhi 11), a total of 1,264,646 SNP/InDel variants for the two parents and their 120 F2 plants were identified (Supplementary Table 2). The number of unique variants for the two parents Yuzhi DS899 and JS012 was 122,361 and 528,671, respectively. After filtered, a total of 425,011 variants were successfully genotyped into “A,” “B,” or “H” types in the population (Supplementary Table 2).

Then, we constructed the matrix consisting of the above 425,011 genotype codes for the 120 F2 individuals and fed it to the linkage map construction pipeline (Table 1). As the bins with the distorted segregation (the numbers of plants having A/B genotypes < 20 and H genotypes < 40) were excluded, a total of 22,375 bins containing 380,544 SNP/InDel markers were obtained for the genetic map construction (Table 1 and Supplementary Table 3). As a result, a genetic map with 13 LGs was successfully constructed (Figure 2). The total length of the genetic map was 1,576.14 cM, and the length per LG ranged from 96.96 to 157.96 cM. The average marker interval was 0.82 cM per bin marker. The marker density was 14.20 bins or 241.44 SNP/InDel markers per cM. In addition, 34 gaps with the length ≥ 5.0 cM remained in the genetic map, with the largest gap size of 18.27 cM (Table 1).

TABLE 1
www.frontiersin.org

Table 1. Characteristics of the superdense genetic linkage map for sesame.

FIGURE 2
www.frontiersin.org

Figure 2. The superdense genetic map constructed using an improved pipeline and the F2 population derived from a cross between Yuzhi DS899 and JS012.

QTL Mapping of the Seed Coat Color Trait in the F2 Population

We then performed a QTL mapping of the seed coat color in the crossing population using four methods (GCIM, ICIM, IM, and CIM) (Table 2 and Supplementary Table 4). The results from the four methods were similar, and QTLs were found on LG3, 5, 6, and 9 for the three seed coat color indicators L, a, and b (Table 2 and Supplementary Table 4). Of these, the QTL intervals observed using the CIM method were usually wider, while the intervals from the GCIM method were narrower. The overlapping relationship and boundaries between the QTLs were obvious (Supplementary Table 4). Therefore, based on the QTL interval boundaries, we summarized the QTL mapping resulting into 17 QTLs, including 5, 4, and 8 QTLs for L, a, and b, respectively (Table 2). The genetic effect (the explanation rate of phenotype variance or VG/VP value) of the QTLs on the seed coat color trait ranged from 4.66% (for qSCb5.2 estimated by GCIM) to 41.53% (for qSCL9.3 estimated by ICIM).

TABLE 2
www.frontiersin.org

Table 2. QTL distribution of the seed coat color using L, a, and b values in sesame.

Due to the overlapping positions of the QTLs controlling L, a, and b, a total of seven non-overlapping intervals, with one on LG3 (43.75–47.09 cM), one on LG5 (20.44–22.11 cM), one on LG6 (78.80–84.23 cM), and four on LG9 (9.17–20.4, 20.4–21.23, 21.23–27.07, and 28.73–30.40 cM), were identified for the sesame seed coat color in this study (Table 2). As the QTLs intervals on LG9 were adjacent to each other, we thus named the QTL hotspot “qSC cluster” for the seed coat color trait. By further merging this qSC cluster into one interval, a total of four intervals (hereinafter referred to as “qSC intervals”) on four LGs were thus harvested for the seed coat color trait in sesame, i.e., LG3: 43.75–47.09 cM, LG5: 20.44–22.11 cM, LG6: 78.80–84.23 cM, and LG9: 9.17–30.40 cM (Table 2).

Genes in the qSC Intervals

We further screened the genes in the four qSC intervals according to the fine sesame reference genome (Table 3). The results showed that the above qSC intervals were located into six contigs (i.e., C34, C_2_7, C4, U0167, C3, and C_2_3) in the reference genome. The physical size of the six contig segments varied from 26.80 Kb (kilobases) (in contig U0167) to 2,090.72 Kb (in contig C_2_3). A total of 625 genes were detected in the six contig segments, and the number of genes in each of the contig segments varied from one on contig C4 to 235 on C_2_3 (Table 3 and Supplementary Table 5).

TABLE 3
www.frontiersin.org

Table 3. Genes and variants in the four merged qSC intervals based on the sesame genome data.

In order to determine the candidate genes, we firstly compared the two parents and determined that a total of 10,719 SNP/InDel variants existed in the six target contig segments between the parents (Table 3 and Supplementary Table 6). Of which, 9,393 variants were unique in JS012 (black-seeded) compared with Yuzhi DS899 (white-seeded) and Yuzhi 11 (the variety used for reference genome sequencing, white-seeded), and thus, co-segregated with the seed coat color trait. Annotation results of SnpEff indicated that 393 variants in 84 genes caused frame shift, stop gain, stop lost, start lost, splice acceptor, splice donor, exon loss, frame-shift, or nonsense mutation, and were named as high-impact variants (Table 3 and Supplementary Table 6).

DEGs of White- and Black-Seeded Varieties

To further explore the characteristics of the genes related with the seed coat color trait, we compared the expression profiles of the developing seeds in a white variety (Yuzhi 11) and a black variety (JS012) (Supplementary Figure 2). A total of 1,210.93 million reads (181.64 Gbases), with an average of 43.25 million clean reads, for the samples were harvested, and the mapping rates to the reference were all above 96.03% (Supplementary Table 7). The number of DEGs at the stages 5, 10, 15, 20, and 25 DAF between the two varieties were 2,148 (838 up and 1,310 down expressed in JS012), 5,176 (3,019 up and 2,157 down), 3,725 (1,598 up and 2,127 down), 2,984 (1,092 up and 1,892 down), and 5,115 (2,115 up and 3,000 down), respectively (Supplementary Table 8). GO annotation showed that the DEGs from the five stages presented a similar functional profile (Supplementary Figure 3). Most DEGs were involved in metabolic and cellular biological processes and the molecular functions of catalytic activity and binding. We further performed the KEGG enrichment analysis and found that the DEGs were significantly enriched in 37 pathways (q_value < 0.05), including pigment biosynthesis related phenylpropanoid biosynthesis (KEGG: ko00940, at the stages 10, 15, 20, and 25 DAF), flavonoid biosynthesis (KEGG: ko00941, at the stages 10, 15, and 25 DAF), and anthocyanin Biosynthesis (KEGG: ko00942, at the stage 10 DAF) pathways (Supplementary Table 9 and Figure 3). Notably, the three pigment biosynthesis related pathways above were enriched after the stage 5 DAF, and no genes in the three pathways were located in our qSC intervals (Supplementary Table 9).

FIGURE 3
www.frontiersin.org

Figure 3. Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis of the differentially expressed genes (DEGs) identified at the stages 5, 10, 15, 20, and 25 DAF between JS012 and Yuzhi 11.

Seed Coat Color Candidate Genes Identification in White- and Black-Seeded Varieties

In order to screen the candidate genes related with the seed coat color regulation in sesame, we compared the expression profiles of the 625 genes in the qSC intervals during the seed development in the two varieties (Supplementary Table 10). Of the 625 genes, 62, 138, 101, 83, and 135 were differentially expressed at the stages 5, 10, 15, 20, and 25 DAF, respectively (Supplementary Table 10). Considering that the pigment was accumulated in JS012 gradually from 8 DAF and that no enrichment of the seed coat color formation related pathways were observed at the stage 5 DAF, we took the stage 5 DAF as the control and the other stages as the case to search for the candidate genes, and a Venn diagram consisting of the genes that were DEGs at the stages 10, 15, 20, and 25 DAF and not-DEGs at the stage 5 DAF was created (Figure 4A). We found 223 genes (the sum of the red numbers in Figure 4) that were DEGs in at least at one of the stages 10, 15, 20, and 25 DAF but not at 5 DAF. By selecting the intersection of the 223 genes and the 84 genes containing high-impacts variants (and co-segregated with the seed coat color trait; Supplementary Table 6), 28 genes were identified (Figure 4B).

FIGURE 4
www.frontiersin.org

Figure 4. Venn diagram of genes in the qSC intervals. (A) Venn diagram of the DEGs after 5 DAF and not-DEGs at the stage 5 DAF; (B) Venn diagram of the genes from DEG filtered (the genes that were DEGs in at least one of the stages 10, 15, 20 and 25 DAF but not at 5 DAF), and high-impact related (the genes containing high-impacts variants and co-segregated with the seed coat color trait).

The 28 genes were selected as the highly-potential candidate genes for the sesame seed coat color. Of the 28 candidates, seven were located on LG3, 2 on LG5, and 19 on LG9, while no genes were found on LG6 (Table 4). The 28 genes had various functions, including E3 ubiquitin-protein ligase (C_2_3.773 and C_2_3.787), oxidase (C_2_7.207 and C34.291), transcription factor (C_2_3.819 and C34.285), and so on (Table 4). Of the 28 genes, 14 were expressed predominantly in JS012 (Supplementary Figure 4A), 11 expressed predominantly in Yuzhi 11 (Supplementary Figure 4B), and the other three genes presented the hybridized expression pattern (Supplementary Figure 4C).

TABLE 4
www.frontiersin.org

Table 4. The 28 candidate genes related to seed coat color in sesame.

Discussion

The Multiple-Fold Improvement of the Sesame Genetic Linkage Map

In this study, a superdense genetic linkage map was constructed for sesame by genome resequencing and an updated map construction pipeline. Compared with the published genetic maps (Zhang Y. et al., 2013; Zhang et al., 2018; Wu et al., 2014; Uncu et al., 2016; Wang L. et al., 2016; Mei et al., 2017; Du et al., 2019; Asekova et al., 2021; Liang et al., 2021), the new genetic linkage map contains the highest number of both SNP/InDel markers (380,544) and bins (22,375) for sesame to date.

During the genetic map construction, the genotype data which resulted from genome resequencing usually have many noisy genotypes and always lead to inaccurate co-segregating marker (bins) grouping. To overcome the problem above, we developed a genetic map construction pipeline in 2016 and updated the method in this study. The previous pipeline basically consists of five steps: (1) calling variants individually; (2) choosing the variant loci that were called in either parent and in at least 20 progenies; (3) filtering out the loci that were heterozygous in either parent or that have a read depth under five in any progeny; (4) genotype coding; and (5) genotype correction (Zhang et al., 2016). In this study, the pipeline was improved, and the genotype matrix for map construction was prepared as follows: (1) calling variants jointly; (2) performing genotype coding; (3) combining the consecutive same markers within a short region; (4) genotype correction; and (5) filtering out the loci of distorted segregation. In the updated pipeline, joint variant calling was applied with a higher efficiency than the previous single-sample variant calling and loci filtration strategy. Joint variant calling shared the information across all samples in a cohort and therefore had greater sensitivity, particularly for the site with a low sequence coverage or misalignment (Van der Auwera et al., 2013). As to the loci filtration strategy, in order to “rescue” the deleted but informative sites, we just filtered out the loci with distorted segregation and omitted the filtration step before genotype correction. Thus, more makers and bins could be applied for map construction. In the study, the located SNP/InDel makers increased from 30,193 to 380,544 in the genetic map, with more than an 11-fold increase. The number of bins in the new map increased from 3,041 to 22,375, with more than a six-fold increase. Furthermore, to improve the accuracy of the pipeline, we had two times of merging involved in the pipeline, i.e., combining the consecutive and redundant markers within a short region into a representative marker, and grouping the adjacent and redundant markers into bins. The involvement of merging twice lead to an average of 17 markers per bin in the new map, and, thus, the new map should be of a greater accuracy.

The Superdense Genetic Map Had More Robustly Dissected the Genetic Mechanism Underlying the Seed Coat Color Trait

Seed coat color is one of the earliest traits studied in crops for genetics analysis and Mendel’s law discovery. In sesame, seed coat color seems to be correlated with the species evolution and the beneficial effects on seed nutrition (Namiki, 2007; Zhang et al., 2012). Based on our investigation, the sesame seed coat color is stable under various environments, and is mainly influenced by the seed ripening level. With the aid of a molecular genetic map, Zhang H. et al. (2013) detected four QTLs for the seed coat color on three LGs using an F2 population derived from a cross between a white-seeded parent and a black-seeded parent. Wei et al. (2015) identified seven loci on three LGs for the trait by GWAS using a natural population. Wang L. et al. (2016) identified four QTLs on three LGs for the trait using a population consisting of 430 RILs. Lately, Du et al. (2019) found seven seed coat color related QTLs on three LGs. The results suggest that the seed coat color trait is controlled by a few major QTLs in sesame, and our phenotype analysis also agrees with this. In the present study, 17 QTLs in seven non-overlapping intervals on four LGs were determined for sesame coat color. Comparison of the results indicated that our QTLs covered all the previous published QTLs, except two loci, i.e., qSCL-8.1 identified by Wang L. et al. (2016) and qsccZ12 identified by Du et al. (2019). High consistency reflects the reliability of the QTL mapping results in this study. Furthermore, we found that the specific QTL qSCL-8.1 reported by Wang L. et al. (2016) was close to the qSC cluster in this study. We proposed that qSCL-8.1 should be another member of the cluster. The QTL qsccZ12 was identified based on the color space index Z, which is different from the indices used in this study, and had the lowest phenotypic variation (5.58%) among the seven QTLs reported by Du et al. (2019). This may be the reason why we missed it in this study. Comparing with these previous studies, we (1) identified a new QTL for the seed coat color, i.e., qSCa6.1, and (2) successfully distinguished more QTLs than the previous studies.

Candidate Genes for the Seed Coat Color in Sesame

In this study, in order to clarify the expression profiles of the candidate genes above, we also analyzed and compared the seed transcriptome during the seed development in the white- and black-seeded varieties. Compared to the stage 5 DAF, the DEGs at the following developing stages seem more important for the sesame seed coat color formation. Previous investigation indicated that pigment accumulation was usually observed approximately from 8 DAF in the black variety JS012 (data not shown). In this study, we determined that the pigment formation related pathways were not enriched at the time point 5 DAF. We thus used the DEGs at the stage 5 DAF to filter the other stages for candidate gene identification. Similarly, Wang et al. (2020) did a detailed research using RNA-seq data from a black- and white-sesame, and proposed that the genes controlling the sesame seed coat color were mainly expressed after 5 DAF, and thus took the 5 DAF as the initiated control and grouped 11–20 DAF to identify the candidate genes linked with pigment biosynthesis.

By integrating the QTL mapping and transcriptome analysis, we determined 28 genes as the candidate genes for the seed coat color in sesame (Table 4). Of the 28 candidate genes, the gene SiPPO (C_2_7.207) has been proved to determine the seed coat color in sesame (Wei et al., 2015), while the genes C_2_3.819 and C34.285 were annotated as transcription factor MYB-like and bHLH-like, respectively (Table 4). MYB and bHLH domain proteins act as a key regulator in PA accumulation in a developing seed (Nesi et al., 2000, 2001). Meanwhile, the gene C34.291 was annotated to encode a pyridoxine/pyridoxamine 5′-phosphate oxidase (PPOX) gene, which was suggested to be related to pigment biosynthesis in both plant (Sang et al., 2011) and human (Frank et al., 1999). The gene C_2_7.220 encodes a dioxygenase gene that was proven to be involved in plant pigments biosynthesis (Polturak and Aharoni, 2018). The gene C_2_3.683 encodes a xyloglucan endotransglucosylase/hydrolase 5 (XTH5) genes, and XTH5 has been reported to be involved in fruit pigment accumulation in the ripening processes in tomato (Wang et al., 2019). In addition, we found that the genes C_2_3.773 and C_2_3.787 were annotated as E3 ubiquitin-protein ligases, which were suggested to play important roles in tomato pigment accumulation (Tang et al., 2015). The genes C_2_3.774 and C_2_3.850 encode triose phosphate/phosphate translocator and cytochrome P450 gene, respectively, and were also suggested to be related to pigments biosynthesis in many plants (Tanaka and Brugliera, 2013; Hilgers et al., 2018). To sum up, 10 out of the 28 candidate genes were proved or suggested to be involved in pigments biosynthesis or accumulation. The results exhibited the high confidence of the candidate gene detection. However, we should remember that some SNP and InDels existed in the upstream and downstream sequences of the genes that have been omitted in the study. In the future, the function of the candidate genes and other genes in the target intervals should be further validated to clarify the regulation for the seed coat color trait in sesame.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, PRJNA315474 and PRJNA742439.

Author Contributions

CL performed the data analysis and drafted the manuscript. YD conducted the main experiments and participated in the data analysis. HM guided the experiments and drafted the manuscript. MJ conducted the cross-section. LW performed the genetic experiments. HZ conceived and designed the experiments, and guided manuscript correction for publishing. All authors have seen and approved the final version of the manuscript.

Funding

This work was supported by the Key Laboratory of Specific Oilseed Crops Genomics of Henan Province. This work was financially supported by the earmarked fund for China Agriculture Research System (CARS-14); the Key Project of Science and Technology of Henan province (201300110600); Zhongyuan Science and Technology Innovation Leading Talent Plan (214200510020), the Henan Province Specific Professor Position Program (SPPP2020), the Zhongyuan Scientist Workshop Construction (ZSWC2019 and 214400510026), and the Innovation Scientists and Technicians Troop Construction Projects of Henan Province (ISTTCPHP2016).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.700469/full#supplementary-material

Supplementary Figure 1 | Phenotypes of seed coat color of the two parents, F1 and some F2 progenies. (A) Yuzhi DS899; (B) JS012; (C) F1 seeds of the cross between Yuzhi DS899 and JS012; (D) seeds of some F2 progenies.

Supplementary Figure 2 | The heatmap of the samples used for DEG screening between white- and black-seeded varieties.

Supplementary Figure 3 | GO annotation of the DEGs identified between JS012 and Yuzhi 11 at the stages 5, 10, 15, 20, and 25 DAF, respectively.

Supplementary Figure 4 | The expression pattern of the 28 candidate genes. The X-axis indicates the time points at 5, 10, 15, 20, and 25 DAF, respectively; the Y-axis indicates the expression level of the genes; Indicates significantly differentially expressed; the orange line stands for the gene expression level in JS012, while the blue line stands for in Yuzhi 11. (A) the genes expressed predominantly in JS012; (B) the genes expressed predominantly in Yuzhi 11; (C) the genes with hybridized expression pattern.

Supplementary Table 1 | Statistics of the L, a, and b values of the seed coat color trait in the F2 population derived from the cross between Yuzhi DS899 and JS012.

Supplementary Table 2 | The variant counts of the two parents and their F2 population harvested from joint variant calling.

Supplementary Table 3 | Distribution of bin markers in the superdense genetic map.

Supplementary Table 4 | QTL mapping of seed coat color traits in sesame using the GCIM, ICIM, IM, and CIM methods.

Supplementary Table 5 | The genes located in the qSC intervals.

Supplementary Table 6 | The SNPs or InDels in the six contig segments between Yuzhi DS899 and JS012.

Supplementary Table 7 | Statistics of the RNA-seq datasets used in the study.

Supplementary Table 8 | The DEGs identified between JS012 and Yuzhi 11 at the stages 5, 10, 15, 20, and 25 DAF, respectively.

Supplementary Table 9 | The enriched KEGG pathways in the DEGs between JS012 and Yuzhi 11 at the stages 5, 10, 15, 20, and 25 DAF, respectively.

Supplementary Table 10 | The DEGs in the qSC intervals.

References

Appelhagen, I., Lu, G. H., Huep, G., Schmelzer, E., Weisshaar, B., and Sagasser, M. (2011). TRANSPARENT TESTA1 interacts with R2R3-MYB factors and affects early and late steps of flavonoid biosynthesis in the endothelium of Arabidopsis thaliana seeds. Plant J. 67, 406–419. doi: 10.1111/j.1365-313X.2011.04603.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Appelhagen, I., Nordholt, N., Seidel, T., Spelt, K., and Koes, R. (2015). TRANSPARENT TESTA 13 is a tonoplast P3A -ATPase required for vacuolar deposition of proanthocyanidins in Arabidopsis thaliana seeds. Plant J. 82, 840–849. doi: 10.1111/tpj.12854

PubMed Abstract | CrossRef Full Text | Google Scholar

Asekova, S., Oh, E., Kulkarni, K. P., Siddique, M. I., Lee, M. H., Kim, J. I., et al. (2021). An integrated approach of QTL mapping and genome-wide association analysis identifies candidate genes for phytophthora blight resistance in sesame (Sesamum indicum L.). Front. Plant Sci. 12:604709. doi: 10.3389/fpls.2021.604709

PubMed Abstract | CrossRef Full Text | Google Scholar

Badone, F. C., Cassani, E., Landoni, M., Doria, E., Panzeri, D., Lago, C., et al. (2010). The low phyticacid1-241 (lpa1-241) maize mutation alters the accumulation of anthocyanin pigment in the kernel. Planta 231, 1189–1199. doi: 10.1007/s00425-010-1123-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Bedigian, D. (2003). Evolution of sesame revisited: domestication, diversity and prospects. Genet. Resour. Crop Evol. 50, 779–787. doi: 10.1023/A:1025029903549

CrossRef Full Text | Google Scholar

Bolger, A. M., Lohse, M., and Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. doi: 10.1093/bioinformatics/btu170

PubMed Abstract | CrossRef Full Text | Google Scholar

Cingolani, P., Platts, A., Wang, L., Coon, M., Nguyen, T., Wang, L., et al. (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118, iso-2, iso-3. Fly (Austin) 6, 80–92. doi: 10.4161/fly.19695

PubMed Abstract | CrossRef Full Text | Google Scholar

Dobin, A., Davis, C. A., Schlesinger, F., Drenkow, J., Zaleski, C., Jha, S., et al. (2013). STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21. doi: 10.1093/bioinformatics/bts635

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, X., Braun, E. L., and Grotewold, E. (2001). Functional conservation of plant secondary metabolic enzymes revealed by complementation of Arabidopsis flavonoid mutants with maize genes. Plant Physiol. 127, 46–57. doi: 10.1104/pp.127.1.46

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, H., Zhang, H., Wei, L., Li, C., Duan, Y., and Wuang, H. (2019). A high-density genetic map constructed using specific length amplified fragment (SLAF) sequencing and QTL mapping of seed-related traits in sesame (Sesamum indicum L.). BMC Plant Biol. 19:588. doi: 10.1186/s12870-019-2172-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Eloy, N. B., Voorend, W., Lan, W., Saleme, M. L., Cesarino, I., Vanholme, R., et al. (2017). Silencing CHALCONE SYNTHASE in maize impedes the incorporation of tricin into lignin and increases lignin content. Plant Physiol. 173, 998–1016. doi: 10.1104/pp.16.01108

PubMed Abstract | CrossRef Full Text | Google Scholar

Frank, J., McGrath, J. A., Poh-Fitzpatrick, M. B., Hawk, J. L., and Christiano, A. M. (1999). Mutations in the translation initiation codon of the protoporphyrinogen oxidase gene underlie variegate porphyria. Clin. Exp. Dermatol. 24, 296–301. doi: 10.1046/j.1365-2230.1999.00484.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Ganesan, K., and Xu, B. (2017). A critical review on polyphenols and health benefits of black soybeans. Nutrients 9:455. doi: 10.3390/nu9050455

PubMed Abstract | CrossRef Full Text | Google Scholar

Glaubitz, J. C., Casstevens, T. M., Lu, F., Harriman, J., Elshire, R. J., Sun, Q., et al. (2014). TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One 9:e90346. doi: 10.1371/journal.pone.0090346

PubMed Abstract | CrossRef Full Text | Google Scholar

Gonzalez, A., Mendenhall, J., Huo, Y., and Lloyd, A. (2009). TTG1 complex MYBs, MYB5 and TT2, control outer seed coat differentiation. Dev. Biol. 325, 412–421. doi: 10.1016/j.ydbio.2008.10.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Ha, T. J., Lee, M. H., Seo, W. D., Baek, I. Y., Kang, J. E., and Lee, J. H. (2017). Changes occurring in nutritional components (phytochemicals and free amino acid) of raw and sprouted seeds of white and black sesame (Sesamum indicum L.) and screening of their antioxidant activities. Food Sci. Biotechnol. 26, 71–78. doi: 10.1007/s10068-017-0010-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, Y., Vimolmangkang, S., Soria-Guerra, R. E., Rosales-Mendoza, S., Zheng, D., Lygin, A. V., et al. (2010). Ectopic expression of apple F3′H genes contributes to anthocyanin accumulation in the Arabidopsis tt7 mutant grown under nitrogen stress. Plant Physiol. 153, 806–820. doi: 10.1104/pp.109.152801

PubMed Abstract | CrossRef Full Text | Google Scholar

Hilgers, E. J. A., Schöttler, M. A., Mettler-Altmann, T., Krueger, S., Dörmann, P., Eicks, M., et al. (2018). The combined loss of triose phosphate and xylulose 5-phosphate/phosphate translocators leads to severe growth retardation and impaired photosynthesis in Arabidopsis thaliana tpt/xpt double mutants. Front. Plant Sci. 9:1331. doi: 10.3389/fpls.2018.01331

PubMed Abstract | CrossRef Full Text | Google Scholar

Huerta-Cepas, J., Forslund, K., Coelho, L. P., Szklarczyk, D., Jensen, L. J., Mering, C. V., et al. (2017). Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol. Biol. Evol. 34, 2115–2122. doi: 10.1093/molbev/msx148

PubMed Abstract | CrossRef Full Text | Google Scholar

Keskin, H. (2019). “Active Compounds, Health Effects, and Extraction of Unconventional Plant Seed Oils,” in Plant and Human Health, eds M. Ozturk and K. Hakeem (Cham: Springer), 245–285.

Google Scholar

Khoo, H. E., Azlan, A., Tang, S. T., and Lim, S. M. (2017). Anthocyanidins and anthocyanins: colored pigments as food, pharmaceutical ingredients, and the potential health benefits. Food Nutr. Res. 61:1361779. doi: 10.1080/16546628.2017.1361779

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, N., Mudgal, J., Parihar, V. K., Nayak, P. G., Kutty, N. G., and Rao, C. M. (2013). Sesamol treatment reduces plasma cholesterol and triacylglycerol levels in mouse models of acute and chronic hyperlipidemia. Lipids 48, 633–638. doi: 10.1007/s11745-013-3778-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. doi: 10.1093/bioinformatics/btp324

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang, J., Sun, J., Ye, Y., Yan, X., Yan, T., Rao, Y., et al. (2021). QTL mapping of PEG-induced drought tolerance at the early seedling stage in sesame using whole genome re-sequencing. PLoS One 16:e0247681. doi: 10.1371/journal.pone.0247681

PubMed Abstract | CrossRef Full Text | Google Scholar

Love, M. I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15:550. doi: 10.1186/s13059-014-0550-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Marcais, G., Delcher, A. L., Phillippy, A. M., Coston, R., Salzberg, S. L., and Zimin, A. (2018). MUMmer4: a fast and versatile genome alignment system. PLoS Comput. Biol. 26:14. doi: 10.1371/journal.pcbi.1005944

PubMed Abstract | CrossRef Full Text | Google Scholar

Marinova, K., Pourcel, L., Weder, B., Schwarz, M., and Barron, D. (2007). The Arabidopsis MATE transporter TT12 acts as a vacuolar flavonoid/H+ -antiporter active in proanthocyanidin-accumulating cells of the seed coat. Plant Cell 19, 2023–2038. doi: 10.1105/tpc.106.046029

PubMed Abstract | CrossRef Full Text | Google Scholar

Mei, H., Liu, Y., Du, Z., Wu, K., Cui, C., Jiang, X., et al. (2017). High-density genetic map construction and gene mapping of basal branching habit and flowers per leaf axil in sesame. Front. Plant Sci. 8:636. doi: 10.3389/fpls.2017.00636

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, L., Li, H., Zhang, L., and Wang, J. (2015). QTL IciMapping: integrated software for genetic linkage map construction and quantitative trait locus mapping in bi-parental populations. Crop J. 3, 269–283. doi: 10.1016/j.cj.2015.01.001

CrossRef Full Text | Google Scholar

Miao, C., Fang, J., Li, D., Liang, P., and Zhang, X. (2018). Genotype-Corrector: improved genotype calls for genetic mapping in F2 and RIL populations. Sci. Rep. 8:10088. doi: 10.1038/s41598-018-28294-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Namiki, M. (2007). Nutraceutical functions of sesame: a review. Crit. Rev. Food Sci. Nutr. 47, 651–673. doi: 10.1080/10408390600919114

PubMed Abstract | CrossRef Full Text | Google Scholar

Nesi, N., Debeaujon, I., Jond, C., Pelletier, G., Caboche, M., and Lepiniec, L. (2000). The TT8 gene encodes a basic helix-loop-helix domain protein required for expression of DFR and BAN genes in Arabidopsis siliques. Plant Cell 12, 1863–1878. doi: 10.1105/tpc.12.10.1863

PubMed Abstract | CrossRef Full Text | Google Scholar

Nesi, N., Jond, C., Debeaujon, I., Caboche, M., and Lepiniec, L. (2001). The Arabidopsis TT2 gene encodes an R2R3 MYB domain protein that acts as a key determinant for proanthocyanidin accumulation in developing seed. Plant Cell 13, 2099–2114. doi: 10.1105/tpc.010098

PubMed Abstract | CrossRef Full Text | Google Scholar

Owens, D. K., Crosby, K. C., Runac, J., Howard, B. A., and Winkel, B. S. (2008). Biochemical and genetic characterization of Arabidopsis flavanone 3beta-hydroxylase. Plant Physiol. Biochem. 46, 833–843. doi: 10.1016/j.plaphy.2008.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Polturak, G., and Aharoni, A. (2018). “La Vie en Rose”: biosynthesis, sources, and applications of betalain pigments. Mol. Plant 11, 7–22. doi: 10.1016/j.molp.2017.10.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Pourcel, L., Routaboul, J. M., Kerhoas, L., Caboche, M., and Lepiniec, L. (2005). TRANSPARENT TESTA10 encodes a laccase-like enzyme involved in oxidative polymerization of flavonoids in Arabidopsis seed coat. Plant Cell 17, 2966–2980. doi: 10.1105/tpc.105.035154

PubMed Abstract | CrossRef Full Text | Google Scholar

Rastas, P. (2017). Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data. Bioinformatics 33, 3726–3732. doi: 10.1093/bioinformatics/btx494

PubMed Abstract | CrossRef Full Text | Google Scholar

Sang, Y., Locy, R. D., Goertzen, L. R., Rahotte, A. M., and Si, Y. (2011). Expression, in vivo localization and phylogenetic analysis of a pyridoxine 5′-phosphate oxidase in Arabidopsis thaliana. Plant Physiol. Biochem. 49, 88–95. doi: 10.1016/j.plaphy.2010.10.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Tanaka, Y., and Brugliera, F. (2013). Flower colour and cytochromes P450. Philos. Trans. R. Soc. Lond. B Biol. Sci. 368:20120432. doi: 10.1098/rstb.2012.0432

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, X., Miao, M., Niu, X., Zhang, D., Cao, X., Jin, X., et al. (2015). Ubiquitin-conjugated degradation of golden 2-like transcription factor is mediated by CUL4-DDB1-based E3 ligase complex in tomato. New Phytol. 209, 1028–1039. doi: 10.1111/nph.13635

PubMed Abstract | CrossRef Full Text | Google Scholar

Tohge, T., de Souza, L. P., and Fernie, A. R. (2017). Current understanding of the pathways of flavonoid biosynthesis in model and crop plants. J. Exp. Bot. 68, 4013–4028. doi: 10.1093/jxb/erx177

PubMed Abstract | CrossRef Full Text | Google Scholar

Uncu, A., Fray, A., Karlovsky, P., and Doganlar, S. (2016). High-throughput single nucleotide polymorphism (SNP) identification and mapping in the sesame (Sesamum indicum L.) genome with genotyping by sequencing (GBS) analysis. Mol. Breeding 36:173. doi: 10.1007/s11032-016-0604-6

CrossRef Full Text | Google Scholar

Van der Auwera, G. A., Carneiro, M. O., Hart, C., Poplin, R., and Del Angel, G. (2013). From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 43, 11.10.1–11.10.33. doi: 10.1002/0471250953.bi1110s43

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, A., Chen, D., Ma, Q., Rose, J. K., Fei, Z., Liu, Y., et al. (2019). The tomato HIGH PIGMENT1/DAMAGED DNA BINDING PROTEIN 1 gene contributes to regulation of fruit ripening. Hortic Res. 6:15. doi: 10.1038/s41438-018-0093-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, D., Zhang, L., Huang, X., Wang, X., Yang, R., Mao, J., et al. (2018). Identification of nutritional components in black sesame determined by widely targeted metabolomics and traditional Chinese medicines. Molecules 23:1180. doi: 10.3390/molecules23051180

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, H., Ma, S., Xue, L., Li, Y., Wang, J., He, X., et al. (2016). miR-488 determines coat pigmentation by down-regulating the pigment-producing gene pro-opiomelanocortin. Cell Mol. Biol. 62, 37–43. doi: 10.14715/cmb/2016.62.12.7

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Dossou, S., Wei, X., Zhang, Y., Li, D., Yu, J., et al. (2020). Transcriptome dynamics during black and white sesame (Sesamum indicum L.) seed development and identification of candidate genes associated with black pigmentation. Gene. 11:1399. doi: 10.3390/genes11121399

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Xia, Q., Zhang, Y., Zhu, X., and Zhu, X. (2016). Updated sesame genome assembly and fine mapping of plant height and seed coat color QTLs using a new high-density genetic map. BMC Genomics 17:31. doi: 10.1186/s12864-015-2316-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, S., Basten, C. J., and Zeng, Z. B. (2012). Windows QTL Cartographer 2.5. Raleigh: North Carolina State University.

Google Scholar

Wei, X., Liu, K., Zhang, Y., Feng, Q., Wang, L., Zhao, Y., et al. (2015). Genetic discovery for oil production and quality in sesame. Nat. Commun. 6:8609. doi: 10.1038/ncomms9609

PubMed Abstract | CrossRef Full Text | Google Scholar

Wen, Y. J., Zhang, Y. W., Zhang, J., Feng, J. Y., Dunwell, J. M., and Zang, Y. M. (2019). An efficient multi-locus mixed model framework for the detection of small and linked QTLs in F2. Brief. Bioinform. 20, 1913–1924. doi: 10.1093/bib/bby058

PubMed Abstract | CrossRef Full Text | Google Scholar

Winkel-Shirley, B. (2001). Flavonoid biosynthesis. A colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiol. 126, 485–493. doi: 10.1104/pp.126.2.485

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, K., Liu, H., Yang, M., Tao, Y., and Ma, H. (2014). High-density genetic map construction and QTLs analysis of grain yield-related traits in Sesame (Sesamum indicum L.) based on RAD-Seq technology. BMC Plant Biol. 14:274. doi: 10.1186/s12870-014-0274-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, W., Grain, D., Bobet, S., Le, G. J., and Thévenin, J. (2014). Complexity and robustness of the flavonoid transcriptional regulatory network revealed by comprehensive analyses of MYB-bHLH-WDR complexes and their targets in Arabidopsis seed. New Phytol. 202, 132–144. doi: 10.1111/nph.12620

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, L., Gordon, K. S., and Wei, S. (2014). featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930. doi: 10.1093/bioinformatics/btt656

PubMed Abstract | CrossRef Full Text | Google Scholar

Ye, J., Zhang, Y., Cui, H., Liu, J., Wu, Y., Cheng, Y., et al. (2018). WEGO 2.0: a web tool for analyzing and plotting GO annotations, 2018 update. Nucleic Acids Res. 46, W71–W75. doi: 10.1093/nar/gky400

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L. G., Han, Y., and He, Q. Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 6, 284–287. doi: 10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, H., Miao, H., and Ju, M. (2019). “Potential for adaptation to climate change through genomic breeding in sesame,” in Genomic Designing of Climate-Smart Oilseed Crops, ed. C. Kole (Cham: Springer), 371–440.

Google Scholar

Zhang, H., Miao, H., Li, C., Wei, L., Duan, Y., Ma, Q., et al. (2016). Ultra-dense SNP genetic map construction and identification of SiDt gene controlling the determinate growth habit in Sesamum indicum L. Sci. Rep. 6:31556. doi: 10.1038/srep31556

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, H., Miao, H., Li, C., Wei, L., and Ma, Q. (2012). Analysis of sesame karyotype and resemblance-near coefficient. Chin. Plant Bull. 47, 602–614. doi: 10.3724/SP.J.1259.2012.00602

CrossRef Full Text | Google Scholar

Zhang, H., Miao, H., Wei, L., Li, C., Zhao, R., and Wang, C. (2013). Genetic analysis and QTL mapping of seed coat color in sesame (Sesamum indicum L.). PLoS One 8:e63898. doi: 10.1371/journal.pone.0063898

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., Wang, L., Gao, Y., Li, D., and Yu, J. (2018). Genetic dissection and fine mapping of a novel dt gene associated with determinate growth habit in sesame. BMC Genet. 19:38. doi: 10.1186/s12863-018-0614-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., Wang, L., Xin, H., Li, D., and Ma, C. (2013). Construction of a high-density genetic map for sesame based on large scale marker development by specific length amplified fragment (SLAF) sequencing. BMC Plant Biol. 13:141. doi: 10.1186/1471-2229-13-141

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y. W., Wen, Y. J., Dunwell, J. M., and Zhang, Y. M. (2019). QTL.gCIMapping.GUI v2.0: an R software for detecting small-effect and linked QTLs for quantitative traits in bi-parental segregation populations. Comput. Struct. Biotechnol. J. 18, 59–65. doi: 10.1016/j.csbj.2019.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: seed coat color, sesame, genetic linkage map, genome re-sequencing, RNA-Seq

Citation: Li C, Duan Y, Miao H, Ju M, Wei L and Zhang H (2021) Identification of Candidate Genes Regulating the Seed Coat Color Trait in Sesame (Sesamum indicum L.) Using an Integrated Approach of QTL Mapping and Transcriptome Analysis. Front. Genet. 12:700469. doi: 10.3389/fgene.2021.700469

Received: 26 April 2021; Accepted: 05 July 2021;
Published: 04 August 2021.

Edited by:

Shang-Qian Xie, Hainan University, China

Reviewed by:

Hong-Bin Zhang, Texas A&M University, United States
Xingwei Zheng, Shanxi Academy of Agricultural Sciences, China
Xiaobiao Zhu, Anhui Agricultural University, China

Copyright © 2021 Li, Duan, Miao, Ju, Wei and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Haiyang Zhang, zhanghaiyang@zzu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.