Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 16 July 2024
Sec. Plant Breeding

Metabolic, transcriptomic, and genetic analyses of candidate genes for seed size in watermelon

Xiqing Wang,*Xiqing Wang1,2*Wen Yan,Wen Yan1,2Núria RealNúria Real3Yunhe Jia,Yunhe Jia1,2Yongkai Fu,Yongkai Fu1,2Xuejun ZhangXuejun Zhang4Haibo You,Haibo You1,2Yi Cai,Yi Cai1,2Bin LiuBin Liu5
  • 1Horticultural Branch of Heilongjiang Academy of Agricultural Sciences, Harbin, China
  • 2Center for Research in Vegetable Engineering Technology of Heilongjiang, Harbin, China
  • 3Plant Pathology, IRTA Cabrils, Cabrils, Spain
  • 4Hainan Sanya Crops Breeding Trial Center of Xinjiang Academy Agricultural Sciences, Sanya, China
  • 5Hami-Melon Research Center, Xinjiang Academy of Agricultural Sciences, Urumqi, China

Seed size (SS) constitutes a pivotal trait in watermelon breeding. In this study, we present findings from an examination of two watermelon accessions, namely, BW85 and F211. Seeds from BW85 exhibited a significant enlargement compared to those of F211 at 13 days after pollination (DAP), with the maximal disparity in seed length and width manifesting at 17 DAP. A comprehensive study involving both metabolic and transcriptomic analyses indicated a significant enrichment of the ubiquinone and other terpenoid-quinone biosynthesis KEGG pathways. To detect the genetic region governing seed size, a BSA-seq analysis was conducted utilizing the F2 (BW85 × F211) population, which resulted in the identification of two adjacent QTLs, namely, SS6.1 and SS6.2, located on chromosomes 6. SS6.1 spanned from Chr06:4847169 to Chr06:5163486, encompassing 33 genes, while SS6.2 ranged from Chr06:5379337 to Chr06:5419136, which included only one gene. Among these genes, 11 exhibited a significant differential expression between BW85 and F211 according to transcriptomic analysis. Notably, three genes (Cla97C06G113960, Cla97C06G114180, and Cla97C06G114000) presented a differential expression at both 13 and 17 DAP. Through annotation, Cla97C06G113960 was identified as a ubiquitin-conjugating enzyme E2, playing a role in the ubiquitin pathway that mediates seed size control. Taken together, our results provide a novel candidate gene influencing the seed size in watermelon, shedding light on the mechanism underlying seed development.

1 Introduction

Seeds, as a unique organ in plants, play a pivotal role in their life cycle. Seed size and weight are agronomically important traits that not only influence plant adaptation to environmental stress but also impact overall yield and quality (Wang et al., 2019; Guo et al., 2020; Miao et al., 2020). Seed development involves the regulation of numerous genes responding to developmental and environmental signals (Figueiredo and Köhler, 2014; Li and Li, 2016). Recent studies have revealed some key genes and regulatory pathways controlling plant seed size (Li and Li, 2016; Li et al., 2019), including G protein signal transduction, the ubiquitin–proteasome pathway, mitogen activation protein kinase (MAPK) signaling, several transcription factors, and the perception and homeostasis of plant hormones (Guo et al., 2020).

G protein signal transduction affects multiple physiological activities during plant growth and development. Functional G proteins are heterotrimeric complexes composed of three subunits, Gα, Gβ, and Gγ. In Arabidopsis, a Gγ (AGG3)-deficient mutant presents reduced seed size (Chakravorty et al., 2011; Li et al., 2012). The regulation of grain length by these Gγs is mainly dependent on Gα (RGA1), RGB1, and the transcription factor MADS1 (Liu et al., 2018; Sun et al., 2018).

The ubiquitin receptor realization plays a crucial role in seed growth regulation; several genes of the ubiquitin–proteasome system were reported to regulate seed size, such as DA1, DA2, DAR1, and GW2 (Xiong et al., 2023)—for instance, in Arabidopsis, the ubiquitin receptor DA1 together with the E3 ubiquitin ligases DA2 and BB/EOD1 regulates seed growth (Xia et al., 2013). AtDAR1 is a homologous gene of DA1; the seed and organ size, respectively, were increased in DA1 and DAR1 double-mutant (Dong et al., 2017). This collaborative mechanism is mirrored in the DA1 homologue (GW2) in wheat and maize, which has also been implicated in determining kernel size (Li et al., 2010; Bednarek et al., 2012). In Setaria italica, a RING-type E3 ligase SGD1 and its E2 partner SiUBC32 are involved in regulating grain weight, grain size, and panicle size (Tang et al., 2023).

The plant mitogen activator polypeptide (MAPK) cascade, consisting of MAPK, MAPK split (MAPKK) and MAPKK split (MAPKKK), plays an important role in various signal transductions. In A. thaliana, the mkk4/mkk5 double-mutant exhibits shortened seeds (Zhang et al., 2017). This cascade’s importance extends to rice, where OsMKKK10, OsMKK4, and OsMAPK6 positively control grain size (Xu et al., 2018). Additionally, regulatory factors such as regulators, regulatory coactivators, and chromatin regulators have also been investigated. Members of the SQUAMOSA promoter binding regulator family, OsSPL13 and OsSPL16, positively impact rice grain size through the regulation of SRS5 and GL7 expression, respectively (Wang et al., 2015; Segami et al., 2017). Similarly, OsmiR396, OsGRF4, and OsGIF have been identified as positive regulators of rice grain size (Li and Li, 2016). In Arabidopsis, the B3 domain tracker factors NGAL and KLU effectively regulate seed size (Zhang et al., 2017).

Watermelon exhibits significant variability in seed size, with seed length (SL) ranging from 4.2 to 16.8 mm (Kim et al., 2009). Watermelon seeds are classified into six representative types based on size: giant, big, medium, small, tiny, and tomato seed (Kim et al., 2009). Watermelons designated for seed consumption typically feature larger seeds, while those intended for flesh consumption necessitate relatively smaller seeds (Gong et al., 2022). Therefore, seed size emerges as a crucial agronomic trait in watermelon breeding. Pioneering work by Poole et al. (Porter, 1937) identified small seeds as dominant to medium-sized ones, with the seeds’ length controlled by l and s genes. Subsequent studies pinpointed four major effects: stable QTLs (ClSS2.2, ClSS6.1, ClSS6.2, and ClSS8.2) governing seed size and weight (Prothro et al., 2012; Meru and McGregor, 2013), affirming the quantitative nature of seed size variation in watermelon (Prothro et al., 2012; Hui-Wen et al., 2016; Gong et al., 2022). Utilizing the CRISPR/Cas9 system, Wang et al. (2021) generated watermelon Clbg1 mutants, revealing that reduced ABA levels in the clb1 mutant led to diminished seed size and weight due to a decrease in cell number.

In this study, we employed a BSA-Seq strategy to map the candidate genes responsible for seed size in watermelon, contributing to the enrichment of genetic knowledge regarding seed size control.

2 Results

2.1 Seeds from BW85 are bigger than those from F211

As shown in Figure 1A, a clear disparity in seed size between watermelon BW85 and F211 was evident at 21 days after pollination (DAP). Seeds from BW85 exhibited a significant increase in size compared to those from F211 at 13 DAP, with the most pronounced difference in seed length and width observed at 17 DAP (Figures 1B, C).

Figure 1
www.frontiersin.org

Figure 1 Phenotypic analysis of seed size (SS) of BW85 and F211. (A) Seed size of BW85 and F211 at 21 days after pollination (DAP). (B) Seed length of BW85 and F211. Error bars represent the standard deviations (SDs; n = 20). Asterisks indicate significant differences in seed length between BW85 and F211 at different DAPs (P < 0.05). (C) Seed width of BW85 and F211. Error bars indicate the standard deviations (SDs; n = 20). Asterisks indicate significant differences in seed width between BW85 and F211 at different DAP (P < 0.05).

2.2 Ubiquinone biosynthesis compounds are different between BW85 and F211 seeds

To investigate the metabolic differences of seed development between BW85 and F211, a metabolic analysis was performed on watermelon seeds at 13 DAP. The results revealed a total of 32 upregulated and 19 downregulated compounds between BW85 and F211 at 13 DAP (Supplementary Table S1). A KEGG classification analysis illustrated that a majority of these compounds belong to the metabolism clade (Figure 2A). Furthermore, a comprehensive KEGG analysis uncovered enrichment in 40 pathways (Supplementary Table S2). To highlight the most significant pathways, the top 20 pathways were selected based on reliable enrichment (Figure 2B). Among the top pathways exhibiting the highest enrichment were biosynthesis of secondary metabolites, ABC transporters, phenylpropanoid biosynthesis, biosynthesis of amino acids, and ubiquinone and other terpenoid-quinone biosynthesis (Figure 2B; Supplementary Table S3).

Figure 2
www.frontiersin.org

Figure 2 Metabolic analysis of seed development between BW85 and F211 at 13 days after pollination (DAP). (A) KEGG classification analysis of different metabolic compounds between BW85 and F211. (B) Statistics of KEGG enrichment analysis of different metabolic compounds between BW85 and F211.

2.3 Differently expressed genes between BW85 and F211 enriched at ubiquinone and other terpenoid-quinone biosynthesis pathways

Following the observation of distinct metabolic pathways enriched between BW85 and F211, we aimed to investigate their associated regulatory mechanisms. To accomplish this, a transcriptome analysis was performed for watermelon seeds at 13 and 17 DAP for BW85 and F211, respectively. The analysis at 13 DAP revealed a substantial expression difference, with 1,909 upregulated and 1,839 downregulated genes between BW85 and F211 (Figure 3A; Supplementary Table S4). The KEGG analysis of the differentially expressed genes (DEGs) unraveled enrichment in 121 pathways (Supplementary Table S5). The top pathways with the highest enrichment included the following KEGG pathways: plant hormone signal transduction (73 DEGs), starch and sucrose metabolism (65 DEGs), carbon metabolism (64 DEGs), biosynthesis of amino acids (54 DEGs), and ribosome (46 DEGs) (Figure 3B). At 17 DAP, a total of 646 upregulated and 687 downregulated genes were identified between BW85 and F211 (Figure 3C; Supplementary Table S6). The subsequent KEGG pathway highlighted enrichment in 108 pathways (Supplementary Table S7). The top pathways with the highest enrichment included KEGG pathways: plant hormone signal transduction, starch and sucrose metabolism, biosynthesis of amino acids, amino sugar and nucleotide sugar metabolism, and carbon metabolism (Figure 3D). These pathways featured 24, 21, 14, 14, and 12 DEGs, respectively. Interestingly, the ubiquinone and other terpenoid-quinone biosynthesis pathways exhibited a significant enrichment, consistent with the results from the metabolic analysis.

Figure 3
www.frontiersin.org

Figure 3 Transcriptomic analysis of seed development between BW85 and F211 at different days after pollination (DAP). (A) Volcano plot depicting the transcriptome differences between BW85 and F211 seeds at 13 DAP. (B) KEGG enrichment analysis highlighting pathways associated with differentially expressed genes (DEGs) between BW85 and F211 seeds at 13 DAP. (C) Volcano plot illustrating the transcriptome variance between BW85 and F211 seeds at 17 DAP. (D) KEGG enrichment analysis revealing pathways linked to DEGs between BW85 and F211 seeds at 17 DAP. n = 3.

To elucidate the relationship between gene expression patterns and traits, we performed a weighted gene co-expression network analysis (WGCNA). By constructing an unsigned network based on gene expression, we divided them into 13 modules (Figure 4A). The module turquoise contains the largest number of genes, followed by modules blue, brown, and yellow (Figure 4B). Subsequently, we correlated these modules with the traits of the samples. The results showed that the modules turquoise, tan, green, yellow, black, and magenta were significantly correlated with the development of seeds over time (p < 0.05). The modules blue, pink, and brown were significantly related to seed variety. However, there was no significant correlation between modules and seed length, width, and the ratio of the two (Figure 4C).

Figure 4
www.frontiersin.org

Figure 4 WGCNA of seed development between BW85 and F211 at different DAP. (A) Cluster dendrogram showing the different modules of co-expressed genes. The gray module included genes that did not belong to any other modules. (B) Distribution of the number of genes contained in different modules. (C) Correlation between different modules and sample trait information. The numbers in brackets indicate the p-value.

2.4 Genetic mapping of seed size in watermelon

To genetically map this gene, we conducted a BSA-seq using a population of 200 F2 progeny resulting from the cross between BW85 and F211. This mapping identified two adjacent QTLs, designated as SS6.1 and SS6.2, located on chromosome 6 (Supplementary Table S8; Figure 5A). These QTLs collectively contain 34 annotated genes, according to the watermelon 97103 reference genome sequence (v2.5). More precisely, SS6.1 extends from Chr06:4847169 to Chr06:5163486, encompassing 33 annotated genes, while SS6.2 is confined to the genomic interval Chr06:5379337 to Chr06:5419136, which only contains one annotated gene (Supplementary Table S9).

Figure 5
www.frontiersin.org

Figure 5 QTL mapping of seed size (SS). (A) Distribution of ΔSNP-index in bulked segregant analysis (BSA) for big and small seed sizes. (B) Heatmap of differentially expressed genes (DEGs) in the QTL region between BW85 and F211 at 13 and 17 days after pollination.

Building upon the insights gained from our transcriptomic analysis, we focused our attention on the differentially expressed genes (DEGs) within the QTL region. A total of 10 significant DEGs between BW85 and F211 were identified in the reference watermelon 97103 genome v2.5 (Figure 5B; Supplementary Table S10). Four of these DEGs appear to be particularly relevant to seed size regulation, with two genes belonging to module blue regulating it in the variety dimension, while two genes belonging to module turquoise play a role in seed growth time. In module blue, Cla97C06G113960, annotated as a ubiquitin-conjugating enzyme E2, is implicated in the ubiquitin pathway that mediated the control of seed size (Li and Li, 2014). Cla97C06G114000, an ATP-citrate synthase beta chain protein 2, is required for normal growth, development, and elongation of C18 to either C20 or C24 fatty acids in seeds (www.mybiosource.com/recombinant-protein/atp-citrate-synthase-beta-chain-protein-2-aclb-2/1461067). In module turquoise, Cla97C06G114480 is a MATE efflux family protein, which is known for its involvement in several physiological functions of plant growth and development (Du et al., 2021). Cla97C06G114100, encoding an outer envelope protein 16, impacts metabolic fluxes during ABA-controlled seed development (Pudelski et al., 2012). Other differentially expressed proteins in this region include an uncharacterized protein (Cla97C06G114190), a beta-D-xylosidase 1-like protein (Cla97C06G114170), a calmodulin-binding transcription activator with an unknown function (Cla97C06G114160), a cAMP-regulated phosphoprotein with an unknown function (Cla97C06G114180), enoyl-CoA hydratase (Cla97C06G113970), and bifunctional TENA-E protein isoform X1 (Cla97C06G114070). Importantly, among all of them, Cla97C06G113960, Cla97C06G114180, and Cla97C06G114000 are consistently downregulated at both 13 and 17 DAP (Figure 5B).

2.5 Comparative genomic analysis of candidate genes in the QTL region

Through a comparative analysis of the genomic sequence from BW85 and F211, a total of 352 variants, comprising 238 SNPs and 114 InDels, were identified within the QTL region (Supplementary Table S11). However, all InDels and most of the SNPs were located in the noncoding or intronic regions. Among the 34 candidate genes, only three (Cla97C06G114010, Cla97C06G114030, and Cla97C06G114120) presented non-synonymous (nsSNP) variants between BW85 and F211 (Supplementary Table S11). Interestingly, both Cla97C06G114010 and Cla97C06G114030 encode pectinesterase inhibitors, each harboring a distinct nsSNP (nucleotide-C68T/protein-S23F and Nucleotide-C305T/Protein-S102F, respectively). Similarly, Cla97C06G114120, presents a single nsSNP (Nucleotide-C977A/Protein-T326N).

3 Discussion

Seed size stands as a crucial agronomic trait, impacting the domestication and breeding of plants (Guo et al., 2020). Based on the size, the watermelon seed is classified into six representative types ranging from giant to tomato seed (Kim et al., 2009). Seed size is a complex trait controlled by multiple genes. Historically, Poole and colleagues (1937) identified genes (l, s, Ti, and ts) governing seed sizes, while subsequent studies pinpointed approximately 15 genes associated with seed size have been reported to date and major players on chromosomes 2 and 6 (Porter, 1937; Zhang, 1996; Kim et al., 2009; Prothro et al., 2012; Guo et al., 2020). In the present study, we confirmed a major QTL for seed size on LG6 by BSA-Seq analysis of an F2 population, from a small and medium size line cross. Moreover, our QTL region revealed new candidate genes related to seed size.

Among the 34 annotated genes in the mapping interval, 10 genes were found to be differentially expressed between BW85 and F211. Notably, Cla97C06G113960, Cla97C06G114180, and Cla97C06G114000 were consistently downregulated at both 13 and 17 DAP (Figure 5B). Another interesting candidate gene that appeared from our study is Cla97C06G113960, annotated as a ubiquitin-conjugating enzyme E2. Various components of the ubiquitin–proteasome system play important roles in regulating seed development. These components target specific proteins for ubiquitination, marking them for degradation by the proteasome. The ubiquitination process involves the sequential action of three enzymes: ubiquitin-activating enzyme (E1), ubiquitin-conjugating enzyme (E2), and ubiquitin ligase (E3). E1 activates ubiquitin, transferring it to E2, which collaborates with E3 to specify the substrate (Hershko and Ciechanover, 1998; Moon et al., 2004; Zhang et al., 2023). The involvement of the ubiquitin system in determining seed size was noted when a quantitative trait locus controlling grain size and weight in rice was characterized and found to encode the RING-type E3 ligase GW2 (Linden and Callis, 2020). In Setaria italica, a RING-type E3 ligase SGD1 and its E2 partner SiUBC32 are involved in regulating grain weight, grain size, and panicle size (Tang et al., 2023). SMALL GRAIN 3 (SMG3) and DECREASED GRAIN SIZE 1 (DGS1), an ERAD-related E2–E3 enzyme pair, regulate grain size and weight through the brassinosteroid (BR) signaling pathway in rice (Li et al., 2023). While E3 ubiquitin ligases, like GW2 and DA1, have been extensively studied for their role in seed size regulation, the functions of E1 and E2 remain less explored (Li and Li, 2014).

Our study identified the ubiquitin-conjugating enzyme E2 gene Cla97C06G113960 as a promising candidate, contributing to the expansion of understanding the less-explored components in ubiquitin-mediated seed size control. While direct evidence linking ubiquitin-conjugating enzymes to the biosynthesis of ubiquinone and terpenoid-quinones may be limited, there is potential for indirect regulation through their involvement in cellular processes such as protein turnover, signaling pathways, and metabolic regulation. Furthermore, scrutinizing the non-synonymous SNP variants among the candidate genes, Cla97C06G114010 and Cla97C06G114030 emerged as noteworthy (Supplementary Table S11). Recent studies reported that pectin functions in seed development (Levesque-Tremblay et al., 2015; Xu et al., 2022). Therefore, encoded as pectin esterase inhibitors, Cla97C06G114010 and Cla97C06G114030 have the potential regulatory role in seed size.

In conclusion, our findings expand the understanding of genetic control mechanisms underlying watermelon seed size. The identification of novel genes and the unraveling of their differential expression, coupled with insights into non-synonymous SNPs, open avenues for further research. These insights have implications for advancing plant breeding strategies, facilitating the cultivation of watermelon varieties with optimized seed sizes for various agronomic needs.

4 Materials and methods

4.1 Plant materials

The F2 generation was obtained through the artificial pollination of flowers from the two distinct parental lines, BW85 and F211, which were also used for transcriptome analysis. Notably, BW85 has a larger seed size compared to F211. To ensure controlled conditions, each plant was limited to setting only one fruit.

4.2 Mapping strategy

Young leaves from the mapping population were meticulously collected and stored at -80°C. Genomic DNA was extracted from young leaves of watermelon plants by an improved CTAB method (Liu et al., 2022). SNP markers were developed from the whole-genome sequencing data of BW85 and F211, employing a previously described method (Liu et al., 2019). To identify candidate regions associated with each target trait, all of the SNPs between the test samples were summarized based on the alignment results against the reference genome. Subsequently, the Euclidean distance (ED) (Fekih et al., 2013) and Δindex (Hill et al., 2013; Takagi et al., 2013) were calculated. The final QTLs were determined by the intersection of linked regions defined through both ED and Δindex, in conjunction with SNPs.

4.3 RNA-seq analysis

RNA extraction was performed on watermelon seeds harvested at 13 and 17 DAP. The Agilent 2100 Bioanalyzer system (Agilent Technologies, CA, USA) was employed to assess RNA quality, ensuring a RIN/RQ (RNA integrity number/RNA quality index)≥8. Sequencing libraries were prepared and sequenced on a HiSeq 4000 platform (Illumina, San Diego, CA, USA), and analysis followed a previously described method (Liu et al., 2021). Genes with thresholds of fold change≥1.5 and p-value <0.05 are defined as differentially expressed genes (DEGs). After the logarithmic conversion of FPKM values with a base of 2, the genes with a median absolute deviation (MAD) in the top 75% and greater than 0.01 were used for network analysis. Co-expression gene network analysis was performed using the WGCNA software package v1.72.5 (https://CRAN.R-project.org/package=WGCNA).

4.4 Candidate gene prediction and variant calling

To genetically map this gene, we conducted a BSA-seq using a population of 200 F2 progeny resulting from the cross between BW85 × F211. This mapping identified two adjacent QTLs, designated as SS6.1 and SS6.2, located on chromosome 6 (Supplementary Table S1; Figure 5A). These QTLs collectively contain 34 annotated genes, according to the watermelon 97103 reference genome sequence (v2.5). More precisely, SS6.1 extends from Chr06:4847169 to Chr06:5163486, encompassing 33 annotated genes, while SS6.2 is confined to the genomic interval Chr06:5379337 to Chr06:5419136, which only contains one annotated gene (Supplementary Table S2).

Candidate gene prediction was performed by utilizing the watermelon (97103) v2.5 genome (http://cucurbitgenomics.org/v2/organism/16). The functional annotation of the predicted candidate genes was accomplished using the Dicots PLAZA 4.5 Database (https://bioinformatics.psb.ugent.be/plaza/versions/plaza_v4_5_dicots/) and the BLASTP tool from NCBI (http://blast.ncbi.nlm.nih.gov/Blast.cgi). In addition, variations within the fine mapping interval were identified by analyzing the whole-genome resequencing data of BW85 and F211.

4.5 Metabolite analyses

Metabolite extraction was performed on watermelon seeds harvested at 13 DAP. For sample preparation and extraction of widely targeted metabolomics, vacuum freeze-drying was used; the samples were freeze-dried by using a vacuum freeze-dryer (Scientz-100F). The freeze-dried sample was crushed using a mixer mill (MM 400, Retsch) with a zirconia bead for 1.5 min at 30 Hz. Then, 100 mg of lyophilized powder was dissolved with 1.2 mL 70% methanol solution and vortexed for 30 s every 30 min for six times in total. The sample was placed in a refrigerator at 4°C overnight. Following centrifugation at 12,000 rpm for 10 min, the extracts were filtrated (SCAA-104, 0.22 μm pore size; ANPEL, Shanghai, China, http://www.anpel.com.cn/) before UPLC–MS/MS analysis.

The extracts were analyzed using an UPLC–ESI–MS/MS system (UPLC, SHIMADZU Nexera X2, www.shimadzu.com.cn/; MS, Applied Biosystems 4500 Q TRAP, www.appliedbiosystems.com.cn/). The analytical conditions were as follows: UPLC: column, Agilent SB-C18 (1.8 µm, 2.1 mm × 100 mm). The mobile phase consisted of solvent A, pure water with 0.1% formic acid, and solvent B, acetonitrile with 0.1% formic acid. Sample measurements were performed with a gradient program that employed the starting conditions of 95% A, 5% B. Within 9 min, a linear gradient to 5% A, 95% B was programmed, and a composition of 5% A, 95% B was kept for 1 min. Subsequently, a composition of 95% A, 5.0% B was adjusted within 1.10 min and kept for 2.9 min. The flow velocity was set as 0.35 mL per minute. The column oven was set to 40°C. The injection volume was 4 μL. The effluent was alternatively connected to an ESI-triple quadrupole-linear ion trap (QTRAP)-MS.

For ESI-Q TRAP-MS/MS analysis, the ESI source operation parameters were as follows: source temperature, 550°C; ion spray voltage (IS), 5,500 V (positive ion mode)/–4,500 V (negative ion mode); ion source gas I (GSI), 50 psi; gas II (GSII), 60 psi; curtain gas (CUR), 25 psi. The collision-activated dissociation (CAD) was high. QQQ scans were acquired as MRM experiments with collision gas (nitrogen) set to medium. DP (declustering potential) and CE (collision energy) for individual MRM transitions were done with further DP and CE optimization. A specific set of MRM transitions was monitored for each period according to the metabolites eluted within this period.

Both hierarchical cluster analysis (HCA) and Pearson’s correlation coefficient (PCC) analysis of the samples and metabolites were carried out using the ComplexHeatmap package in R. Variable importance in projection (VIP) scores >1 and |log2FC| ≥1.0 were used to determine differentially abundant metabolites. The VIP values were extracted from the result of the orthogonal projections to latent structures discriminant analysis (OPLS-DA).

5 Conclusion

In summary, our study has successfully identified a major QTL on chromosome 6 that serves as a regulator of watermelon seed size. Within the designated mapping interval, we have cataloged a total of 34 annotated candidate genes. Particularly noteworthy are three promising candidates—Cla97C06G113960 (ubiquitin-conjugating enzyme E2), Cla97C06G114010, and Cla97C06G114030 (pectin esterase inhibitors)—which emerged from a comprehensive transcriptomic and genomic analysis. Future research should focus on the functional verification of these candidates, which opens a valuable opportunity for the development of watermelon varieties with customized seed sizes, catering to diverse consumer preferences and market demands.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author contributions

XW: Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Writing – original draft, Writing – review & editing. WY: Conceptualization, Investigation, Methodology, Software, Writing – review & editing. NR: Writing – review & editing. YJ: Resources, Writing – review & editing. YF: Data curation, Writing – review & editing. XZ: Funding acquisition, Supervision, Writing – review & editing. HY: Supervision, Writing – review & editing. YC: Supervision, Writing – review & editing. BL: Conceptualization, Funding acquisition, Software, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research received financial support from the Ministry of Agriculture of the People’s Republic of China (CARS-25–2023-Z1); Heilongjiang scientific research business expenses project of provincial scientific research institutions (CZKYF2022–1-B016); the fund for Stable Support to Agricultural Sci-Tech Renovation (xjnkywdzc-2022001–6), and the specific research fund of The Innovation Platform for Academicians of Hainan Province.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2024.1394724/full#supplementary-material

Supplementary Table 1 | Different metabolic compounds between BW85 and F211 at 13 DAP.

Supplementary Table 2 | KEGG classification annotation of different metabolic compounds between BW85 and F211 at 13 DAP.

Supplementary Table 3 | KEGG pathways enrichment analysis of different metabolic compounds between BW85 and F211 at 13 DAP.

Supplementary Table 4 | Differentially expressed genes (DEGs) between BW85 and F211 at 13 DAP.

Supplementary Table 5 | KEGG annotation of DEGs between BW85 and F211 at 13 DAP.

Supplementary Table 6 | Differentially expressed genes (DEGs) between BW85 and F211 at 17 DAP.

Supplementary Table 7 | KEGG annotation of DEGs between BW85 and F211 at 17 DAP.

Supplementary Table 8 | QTLs for watermelon seed size.

Supplementary Table 9 | Annotation of genes in the QTL region.

Supplementary Table 10 | DEGs in the QTL region.

Supplementary Table 11 | InDels and SNP annotation in the QTL region.

References

Bednarek, J., Boulaflous, A., Girousse, C., Ravel, C., Tassy, C., Barret, P., et al. (2012). Down-regulation of the TaGW2 gene by RNA interference results in decreased grain size and weight in wheat. J. Exp. Bot. 63, 5945–5955. doi: 10.1093/jxb/ers249

PubMed Abstract | CrossRef Full Text | Google Scholar

Chakravorty, D., Trusov, Y., Zhang, W., Acharya, B. R., Sheahan, M. B., Mccurdy, D. W., et al. (2011). An atypical heterotrimeric G-protein γ-subunit is involved in guard cell K+-channel regulation and morphological development in Arabidopsis thaliana. Plant J. 67, 840–851. doi: 10.1111/j.1365-313X.2011.04638.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, H., Dumenil, J., Lu, F. H., Na, L., Vanhaeren, H., Naumann, C., et al. (2017). Ubiquitylation activates a peptidase that promotes cleavage and destabilization of its activating E3 ligases and diverse growth regulatory proteins to limit cell proliferation in Arabidopsis. Genes Dev. 31, 197–208. doi: 10.1101/gad.292235.116

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, Z., Su, Q., Wu, Z., Huang, Z., Bao, J., Li, J., et al. (2021). Genome-wide characterization of MATE gene family and expression profiles in response to abiotic stresses in rice (Oryza sativa). BMC Ecol. Evol. 21, 1–14. doi: 10.1186/s12862-021-01873-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Fekih, R., Takagi, H., Tamiru, M., Abe, A., Natsume, S., Yaegashi, H., et al. (2013). MutMap+: genetic mapping and mutant identification without crossing in rice. PloS One 8, e68529. doi: 10.1371/journal.pone.0068529

PubMed Abstract | CrossRef Full Text | Google Scholar

Figueiredo, D. D., Köhler, C. (2014). Signalling events regulating seed coat development. Biochem. Soc Trans. 42, 358–363. doi: 10.1042/BST20130221

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, C., Zhao, S., Yang, D., Lu, X., Anees, M., He, N., et al. (2022). Genome-wide association analysis provides molecular insights into natural variation in watermelon seed size. Hortic. Res.-England 9, 11. doi: 10.1093/hr/uhab074

CrossRef Full Text | Google Scholar

Guo, Y., Gao, M., Liang, X., Xu, M., Liu, X., Zhang, Y., et al. (2020). Quantitative trait loci for seed size variation in cucurbits–a review. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.00304

CrossRef Full Text | Google Scholar

Hershko, A., Ciechanover, A. (1998). The ubiquitin system. Annu. Rev. Biochem. 67, 425–479. doi: 10.1146/annurev.biochem.67.1.425

PubMed Abstract | CrossRef Full Text | Google Scholar

Hill, J. T., Demarest, B. L., Bisgrove, B. W., Gorsi, B., Su, Y.-C., Yost, H. J. (2013). MMAPPR: mutation mapping analysis pipeline for pooled RNA-seq. Genome Res. 23, 687–697. doi: 10.1101/gr.146936.112

PubMed Abstract | CrossRef Full Text | Google Scholar

Hui-Wen, Z., Bing-Yang, L., Hong-Yan, M., Peng, G., Fei-Shi, L., Qi-Fan, G., et al. (2016). QTL mapping of watermelon seed traits. Acta Hortic. Sin. 43, 715.

Google Scholar

Kim, Y.-J., Yang, T.-J., Park, Y.-H., Lee, Y.-J., Kang, S.-C., Kim, Y.-K., et al. (2009). Development of near isogenic lines with various seed sizes and study on seed size-related characteristics in watermelon. Korean J. Breed. Sci. 41, 403–411.

Google Scholar

Levesque-Tremblay, G., Müller, K., Mansfield, S. D., Haughn, G. W. (2015). HIGHLY METHYL ESTERIFIED SEEDS is a pectin methyl esterase involved in embryo development. Plant Physiol. 167, 725–737. doi: 10.1104/pp.114.255604

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, N., Li, Y. (2014). Ubiquitin-mediated control of seed size in plants. Front. Plant Sci. 5. doi: 10.3389/fpls.2014.00332

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, N., Li, Y. (2016). Signaling pathways of seed size control in plants. Curr. Opin. Plant Biol. 33, 23–32. doi: 10.1016/j.pbi.2016.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Q., Li, L., Yang, X., Warburton, M. L., Bai, G., Dai, J., et al. (2010). Relationship, evolutionary fate and function of two maize co-orthologs of rice GW2 associated with kernel size and weight. BMC Plant Biol. 10, 1–15. doi: 10.1186/1471-2229-10-143

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, S., Liu, Y., Zheng, L., Chen, L., Li, N., Corke, F., et al. (2012). The plant-specific G protein γ subunit AGG3 influences organ size and shape in Arabidopsis thaliana. New Phytol. 194, 690–703. doi: 10.1111/j.1469-8137.2012.04083.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, N., Xu, R., Li, Y. (2019). Molecular networks of seed size control in plants. Annu. Rev. Plant Biol. 70, 435–463. doi: 10.1146/annurev-arplant-050718-095851

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Zhang, B. L., Duan, P. G., Yan, L., Yu, H. Y., Zhang, L. M., et al. (2023). An endoplasmic reticulum-associated degradation-related E2-E3 enzyme pair controls grain size and weight through the brassinosteroid signaling pathway in rice. Plant Cell 35, 1076–1091. doi: 10.1093/plcell/koac364

PubMed Abstract | CrossRef Full Text | Google Scholar

Linden, K. J., Callis, J. (2020). The ubiquitin system affects agronomic plant traits. J. Biol. Chem. 295, 13940–13955. doi: 10.1074/jbc.REV120.011303

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, B., Guan, D., Zhai, X., Yang, S., Xue, S., Chen, S., et al. (2019). Selection footprints reflect genomic changes associated with breeding efforts in 56 cucumber inbred lines. Hortic. Res.-England 6, 127. doi: 10.1038/s41438-019-0209-4

CrossRef Full Text | Google Scholar

Liu, Q., Han, R., Wu, K., Zhang, J., Ye, Y., Wang, S., et al. (2018). G-protein βγ subunits determine grain size through interaction with MADS-domain transcription factors in rice. Nat. Commun. 9, 852. doi: 10.1038/s41467-018-03047-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, B., Santo Domingo, M., Mayobre, C., Martín-Hernández, A. M., Pujol, M., Garcia-Mas, J. (2022). Knock-out of CmNAC-NOR affects melon climacteric fruit ripening. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.878037

CrossRef Full Text | Google Scholar

Liu, B., Zhao, S., Li, P., Yin, Y., Niu, Q., Yan, J., et al. (2021). Plant buffering against the high-light stress-induced accumulation of CsGA2ox8 transcripts via alternative splicing to finely tune gibberellin levels and maintain hypocotyl elongation. Hortic. Res.-England 8, 10. doi: 10.1038/s41438-020-00430-w

CrossRef Full Text | Google Scholar

Meru, G., McGregor, C. (2013). Genetic mapping of seed traits correlated with seed oil percentage in watermelon. Hortscience 48, 955–959. doi: 10.21273/HORTSCI.48.8.955

CrossRef Full Text | Google Scholar

Miao, C., Wang, D., He, R., Liu, S., Zhu, J. K. (2020). Mutations in MIR 396e and MIR 396f increase grain size and modulate shoot architecture in rice. Plant Biotechnol. J. 18, 491–501. doi: 10.1111/pbi.13214

PubMed Abstract | CrossRef Full Text | Google Scholar

Moon, J., Parry, G., Estelle, M. (2004). The ubiquitin-proteasome pathway and plant development. Plant Cell 16, 3181–3195. doi: 10.1105/tpc.104.161220

PubMed Abstract | CrossRef Full Text | Google Scholar

Porter, D. (1937). Inheritance of certain fruit and seed characters in watermelons. Hilgardia 10, 489–509. doi: 10.3733/hilg.v10n12p489

CrossRef Full Text | Google Scholar

Prothro, J., Sandlin, K., Abdel-Haleem, H., Bachlava, E., White, V., Knapp, S., et al. (2012). Main and epistatic quantitative trait loci associated with seed size in watermelon. J. Am. Soc Hortic. Sci. 137, 452–457. doi: 10.21273/JASHS.137.6.452

CrossRef Full Text | Google Scholar

Pudelski, B., Schock, A., Hoth, S., Radchuk, R., Weber, H., Hofmann, J., et al. (2012). The plastid outer envelope protein OEP16 affects metabolic fluxes during ABA-controlled seed development and germination. J. Exp. Bot. 63, 1919–1936. doi: 10.1093/jxb/err375

PubMed Abstract | CrossRef Full Text | Google Scholar

Segami, S., Takehara, K., Yamamoto, T., Kido, S., Kondo, S., Iwasaki, Y., et al. (2017). Overexpression of SRS5 improves grain size of brassinosteroid-related dwarf mutants in rice (Oryza sativa L.). Breed. Sci. 67, 393–397. doi: 10.1270/jsbbs.16198

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, S., Wang, L., Mao, H., Shao, L., Li, X., Xiao, J., et al. (2018). A G-protein pathway determines grain size in rice. Nat. Commun. 9, 851. doi: 10.1038/s41467-018-03141-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Takagi, H., Abe, A., Yoshida, K., Kosugi, S., Natsume, S., Mitsuoka, C., et al. (2013). QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. Plant J. 74, 174–183. doi: 10.1111/tpj.12105

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, S., Zhao, Z. Y., Liu, X. T., Sui, Y., Zhang, D. D., Zhi, H., et al. (2023). An E2-E3 pair contributes to seed size control in grain crops. Nat. Commun. 14, 17. doi: 10.1038/s41467-023-38812-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, S., Li, S., Liu, Q., Wu, K., Zhang, J., Wang, S., et al. (2015). The OsSPL16-GW7 regulatory module determines grain shape and simultaneously improves rice yield and grain quality. Nat. Genet. 47, 949–954. doi: 10.1038/ng.3352

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, W., Pan, Q., Tian, B., He, F., Chen, Y., Bai, G., et al. (2019). Gene editing of the wheat homologs of TONNEAU 1-recruiting motif encoding gene affects grain shape and weight in wheat. Plant J. 100, 251–264. doi: 10.1111/tpj.14440

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Wang, J., Guo, S., Tian, S., Zhang, J., Ren, Y., et al. (2021). CRISPR/Cas9-mediated mutagenesis of ClBG1 decreased seed size and promoted seed germination in watermelon. Hortic. Res.-England 8, 12. doi: 10.1038/s41438-021-00506-1

CrossRef Full Text | Google Scholar

Xia, T., Li, N., Dumenil, J., Li, J., Kamenski, A., Bevan, M. W., et al. (2013). The ubiquitin receptor DA1 interacts with the E3 ubiquitin ligase DA2 to regulate seed and organ size in Arabidopsis. Plant Cell 25, 3347–3359. doi: 10.1105/tpc.113.115063

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiong, E. H., Qu, X. L., Li, J. F., Liu, H. L., Ma, H., Zhang, D., et al. (2023). The soybean ubiquitin-proteasome system: Current knowledge and future perspective. Plant Genome 16, 15. doi: 10.1002/tpg2.20281

CrossRef Full Text | Google Scholar

Xu, R., Duan, P., Yu, H., Zhou, Z., Zhang, B., Wang, R., et al. (2018). Control of grain size and weight by the OsMKKK10-OsMKK4-OsMAPK6 signaling pathway in rice. Mol. Plant 11, 860–873. doi: 10.1016/j.molp.2018.04.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, Z., Yang, M., Li, Z., Xiao, J., Yang, X., Wang, H., et al. (2022). Tissue-specific pectin methylesterification and pectin methylesterase activities play a role in lettuce seed germination. Sci. Hortic. 301, 111134. doi: 10.1016/j.scienta.2022.111134

CrossRef Full Text | Google Scholar

Zhang, J. (1996). Inheritance of seed size from diverse crosses in watermelon. Report-Cucurbit Genet. Coop. 19, 67–69.

Google Scholar

Zhang, M., Wu, H., Su, J., Wang, H., Zhu, Q., Liu, Y., et al. (2017). Maternal control of embryogenesis by MPK6 and its upstream MKK4/MKK5 in Arabidopsis. Plant J. 92, 1005–1019. doi: 10.1111/tpj.13737

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Zhang, X., Liu, X., Pai, Q., Wang, Y., Wu, X. (2023). Molecular network for regulation of seed size in plants. Int. J. Mol. Sci. 24, 10666. doi: 10.3390/ijms241310666

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: watermelon, seed size, genetic mapping, transcriptomic analysis, metabolic analysis

Citation: Wang X, Yan W, Real N, Jia Y, Fu Y, Zhang X, You H, Cai Y and Liu B (2024) Metabolic, transcriptomic, and genetic analyses of candidate genes for seed size in watermelon. Front. Plant Sci. 15:1394724. doi: 10.3389/fpls.2024.1394724

Received: 02 March 2024; Accepted: 25 June 2024;
Published: 16 July 2024.

Edited by:

Padma Nimmakayala, West Virginia State University, United States

Reviewed by:

Wei Liu, Fujian Agriculture and Forestry University, China
Hao Li, Northwest A&F University, China

Copyright © 2024 Wang, Yan, Real, Jia, Fu, Zhang, You, Cai and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiqing Wang, eGlxaW5nd2FuZzEwMEAxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.