Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 13 September 2023
Sec. Plant Bioinformatics
This article is part of the Research Topic Plant Systems Biology: Integration of System-Wide Studies to Elucidate Central Features In Biological Processes View all 6 articles

Integrative analysis of metabolome, proteome, and transcriptome for identifying genes influencing total lignin content in Populus trichocarpa

Jia ZhaoJia Zhao1Kairui ChaoKairui Chao2Achuan Wang*Achuan Wang1*
  • 1College of Computer and Control Engineering, Northeast Forestry University, Harbin, China
  • 2College of Forestry, Inner Mongolia Agricultural University, Hohhot, China

Lignin, a component of plant cell walls, possesses significant research potential as a renewable energy source to replace carbon-based products and as a notable pollutant in papermaking processes. The monolignol biosynthetic pathway has been elucidated and it is known that not all monolignol genes influence the total lignin content. However, it remains unclear which monolignol genes are more closely related to the total lignin content and which potential genes influence the total lignin content. In this study, we present a combination of t-test, differential gene expression analysis, correlation analysis, and weighted gene co-expression network analysis to identify genes that regulate the total lignin content by utilizing multi-omics data from transgenic knockdowns of the monolignol genes that includes data related to the transcriptome, proteome, and total lignin content. Firstly, it was discovered that enzymes from the PtrPAL, Ptr4CL, PtrC3H, and PtrC4H gene families are more strongly correlated with the total lignin content. Additionally, the co-downregulation of three genes, PtrC3H3, PtrC4H1, and PtrC4H2, had the greatest impact on the total lignin content. Secondly, GO and KEGG analysis of lignin-related modules revealed that the total lignin content is not only influenced by monolignol genes, but also closely related to genes involved in the “glutathione metabolic process”, “cellular modified amino acid metabolic process” and “carbohydrate catabolic process” pathways. Finally, the cinnamyl alcohol dehydrogenase genes CAD1, CADL3, and CADL8 emerged as potential contributors to total lignin content. The genes HYR1 (UDP-glycosyltransferase superfamily protein) and UGT71B1 (UDP-glucosyltransferase), exhibiting a close relationship with coumarin, have the potential to influence total lignin content by regulating coumarin metabolism. Additionally, the monolignol genes PtrC3H3, PtrC4H1, and PtrC4H2, which belong to the cytochrome P450 genes, may have a significant impact on the total lignin content. Overall, this study establishes connections between gene expression levels and total lignin content, effectively identifying genes that have a significant impact on total lignin content and offering novel perspectives for future lignin research endeavours.

1 Introduction

Lignin is the second most abundant complex aromatic polymer in Plantae, following cellulose, and is predominantly found in the woody tissues of plants. As a major component of plant cell walls, lignin plays a crucial role in preventing the invasion of various plant pathogens, contributing significantly to plant defence against external abiotic stresses (Boerjan et al., 2003; Vanholme et al., 2010). Moreover, wood holds promise as a renewable energy source due to its ability to release more heat when used as fuel, offering significant potential for replacing other carbon-based products (de Vries et al., 2021). The monolignol biosynthetic pathway begins with phenylalanine, a product of the phenylalanine pathway. Phenylalanine is catalytically converted into cinnamic acid by phenylalanine ammonia-lyase (PAL). Cinnamic acid is then hydroxylated by cinnamate 4-hydroxylase (C4H) to produce p-coumaric acid. Subsequently, p-coumaric acid is catalysed by 4-coumarate-CoA ligase (4CL) to form p-coumaroyl-CoA, which enters the specific monolignol biosynthesis pathway. Various enzymes, including cinnamate 3-hydroxylase (C3H), hydroxycinnamoyl transferase (HCT), and cinnamoyl-CoA reductase (CCR), catalyse the formation of monolignols. These monolignols are composed of three major subunits: hydroxyphenyl (H), guaiacyl (G), and syringyl (S) monolignols. Finally, through the random combination of various monolignols, polymerization occurs, resulting in lignin with complex structures (Umezawa, 2010; Vanholme et al., 2019; Zhang et al., 2020). Research has shown that laccase (LAC) and peroxidase (POD) play important roles in the polymerization of monolignols into lignin with complex structures (Long et al., 2021; Wen et al., 2022).

Poplar is an important tree species in China, that is known for its fast growth and ease of asexual reproduction, which facilitates the smooth progress of genetic engineering technology. As a model species, the complete genome of Populus trichocarpa was the first woody plant to be sequenced (Tuskan et al., 2006). After four years, Shi et al. (Shi et al., 2010) comprehensively identified monolignol genes in P. trichocarpa and identified 23 wood-specific enzymes involved in ten monolignol gene families, including PAL, 4CL, C3H, C4H, HCT, CCR, CAD (cinnamyl alcohol dehydrogenase), CCoAOMT (caffeoyl-CoA O-methyltransferase), COMT (Catechol-O-methyltransferase) and CAld5H (coniferaldehyde-5-hydroxylase). The monolignol biosynthetic pathway and the intermediate metabolites in P. trichocarpa are depicted in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1 The monolignol biosynthetic pathway in Populus trichocarpa. PtrPAL1-5: PtrPAL1, PtrPAL2, PtrPAL3, PtrPAL4, PtrPAL5; PtrC4H1.2: PtrC4H1, PtrC4H2; Ptr4CL3.5: Ptr4CL3, Ptr4CL5; PtrCAD1.2: PtrCAD1, PtrCAD2; PtrHCT1.6: PtrHCT1, PtrHCT6; PtrCAld5H1.2: PtrCAld5H1, PtrCAld5H2; PtrCCoAOMT1-3: PtrCCoAOMT1, PtrCCoAOMT2, PtrCCoAOMT3; LAC, laccase; POD, peroxidase.

Genetic modifications are valuable methods for regulating metabolic pathways. These modifications alter the expression levels of genes, resulting in changes in the enzyme content that can be used to catalyse key pathway reactions, thereby achieving the regulation of lignin content. In genetic engineering studies of P. trichocarpa, it was found that the genes Ptr4CL, PtrHCL, PtrCCR, and PtrCAD not only reduced the total lignin content but also influenced the lignin composition. On the other hand, the genes PtrC3H, PtrC4H, PtrPAL, and PtrCOMT solely led to a reduction in total lignin content. The gene PtrCAld5H increased the total lignin content and influenced the composition of lignin. However, the gene PtrCCoAOMT had no effect on either the total lignin content or composition (Peng et al., 2016; Van Acker et al., 2017; Wang et al., 2018; De Meester et al., 2020; Tsai et al., 2020). In addition, Kim et al. found that the simultaneous knockdown of the three genes PtrC4H1, PtrC4H2, and PtrC3H3 significantly reduced the total lignin content in P. trichocarpa (Kim et al., 2020). However, there are certain limitations to studying the relationship between gene editing of monolignol genes and lignin content using genetic engineering techniques. The focus of such studies is more on investigating the regulation of the monolignol biosynthetic pathway rather than uncovering the specific genes that influence lignin content. Additionally, editing the same monolignol genes in different species may lead to varying effects on lignin content. Therefore, it is crucial to employ more appropriate analytical methods and explore the genes that impact lignin content in a specific species.

Gene co-expression networks (GCNs) have been widely used for gene identification and weighted gene co-expression network analysis (WGCNA) is the most representative GCN (Zhang and Horvath, 2005). WGCNA entails a clustering analysis of gene expression levels from microarray or RNA-seq data, representing relationships between gene nodes through continuous variable edge weights. Notably, WGCNA addresses the issue of “one-size-fits-all” in traditional GCN construction by using a unique dynamic cut method. Furthermore, highly correlated gene sets (gene modules) obtained through clustering, which have similar functions, can be associated with sample traits by calculating correlation coefficients. Importantly, WGCNA allows for the simultaneous analysis of gene modules and multiple traits related to biological relevance, enabling the identification of biologically significant genes and establishing co-expression relationships among genes within modules and their associations with relevant traits (Wu C. et al., 2021; Kondoh et al., 2022; Petrosyan et al., 2023). Rao et al. (Rao et al., 2019) and Hong et al. (Hong et al., 2021) applied gene co-expression network analysis methods to explore genes and transcription factors related to the monolignol biosynthetic pathway in P. trichocarpa. However, their studies focused on establishing network relationships between genes without specifically analysing the impact of genes on lignin content. Therefore, it is of great importance to construct a co-expression network of genes closely associated with total lignin content. This can be achieved by utilizing data from transcriptome, proteome, and total lignin content measurements following the knockdown of multiple monolignol genes. Such an approach can effectively identify genes that significantly influence the total lignin content.

In this paper, we focus on the genes that influence total lignin content in P. trichocarpa. To achieve this, we conducted a comprehensive analysis of multi-omics data, which includes transcriptome, proteome, and total lignin content, in both transgenic lines with knockdown genes of ten monolignol gene families and control lines. We present a combination of t-tests, differential gene expression analysis, correlation analysis, and WGCNA to identify genes that influence the total lignin content. Specifically, we started by using t-tests to identify 15 transgenic lines with significant differences. Then, we performed differential gene expression analysis on each of these 15 groups individually and selected differentially expressed genes with at least one intersecting gene among the 15 groups, resulting in a total of 5894 genes and 139 samples. To identify important gene modules closely related to lignin, we constructed tightly interconnected gene modules based on the theory that genes with similar functions exhibit similar expression patterns. We also incorporated the absolute protein abundance of the monolignol biosynthetic enzymes and total lignin content data in this process. Next, we performed GO and KEGG enrichment analyses to investigate the functions of genes within specific modules and identify the biological pathways in which the genes may be involved, and have an impact on total lignin content. Furthermore, we conducted correlation analysis between the expression levels of genes in the modules most closely associated with total lignin content and the actual lignin content. Using the 15 genes that showed a strong correlation with total lignin content as the core, we constructed a weighted gene co-expression network to explore the genes influencing total lignin content. This research provides a solid foundation for understanding lignin synthesis and degradation processes, and it offers valuable insights for the development and utilization of lignin.

2 Materials and methods

2.1 Data availability

Wang et al. performed a series of systematic experimental knockdowns of monolignol genes, and the absolute abundances of the monolignol transcripts and proteins were measured using RNA-seq and protein cleavage isotope dilution mass spectrometry (PC-IDMS), respectively. The lignin content was determined following the Klason procedure in P. trichocarpa (Wang et al., 2018). The transcriptomics data is available under GEO accession number GSE78953 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE78953], and the proteomics dataset and total lignin content are accessible on CyVerse [https://datacommons.cyverse.org/browse/iplant/home/shared/LigninSystemsDB].

We analysed the transcriptome, proteome (20 pathway enzymes), and total lignin content data of both transgenic lines with multiple gene knockdowns of ten monolignol gene families (PtrPAL, PtrC3H, PtrC4H, PtrCAD, PtrCCR, PtrHCT, Ptr4CL, PtrCAld5H, PtrCOMT, and PtrCCoAOMT), and corresponding control lines. The annotation information of genes was obtained from two databases, JGI [https://jgi.doe.gov/] and NCBI [https://www.ncbi.nlm.nih.gov/].

2.2 T-test and differential gene analysis

Firstly, we conducted t-tests using the R package t.test on the total lignin content of all different knockdown transgenic and control lines (Table S1). As a result, we identified 15 groups of transgenic lines that displayed significant differences (P<0.05) in total lignin content. We visualized these results using boxplots (Figure 2). These transgenic lines were obtained through various treatments involving the knockdown of specific genes (PtrPAL2.4.5, PtrC3H3, PtrC3H3.C4H1.2, PtrC4H1, PtrHCT1, PtrHCT6, PtrHCT1,6, PtrCAld5H1,2, PtrCCR2, PtrCCoAOMT1.2, PtrCCoAOMT3, PtrCOMT2, Ptr4CL3, Ptr4CL5, and Ptr4CL3.5). Subsequently, we performed average clustering analysis on the 15 groups of transgenic and control lines, eliminating obvious outliers, and generated PCA plots (Figure 3) and volcano plots (Figure 4).

FIGURE 2
www.frontiersin.org

Figure 2 Boxplot of 15 groups. The blue box represent traitements, and the orange box represent corresponding controls. Target genes of 15 traitement groups are listed on the top of the panel. PtrPAL2.4.5: the co-downregulation of PtrPAL2, PtrPAL4, and PtrPAL5; PtrC3H3.C4H1.C4H2: the co-downregulation of PtrC3H3, PtrC4H1, and PtrC4H2; PtrHCT1.6: the co-downregulation of PtrHCT1 and PtrHCT6; PtrCAld5H1.2: the co-downregulation of PtrCAld5H1 and PtrCAld5H2; the co-downregulation of Ptr4CL3.5: Ptr4CL3 and Ptr4CL5; PtrCCoAOMT1.2: the co-downregulation of PtrCCoAOMT1 and PtrCCoAOMT2.

FIGURE 3
www.frontiersin.org

Figure 3 PCA of 15 groups. The red dots represent traitement samples, and the yellow dots represent corresponding control samples. Target genes of 15 traitement groups are listed on the bottom of the panel. The abbreviations for the genes in the treatment group are the same as the caption in Figure 2.

FIGURE 4
www.frontiersin.org

Figure 4 Volcano Plot of 15 groups. The red dots represent significantly upregulated genes, the grey dots represent stable genes, and the blue dots represent significantly downregulated genes. Target genes of 15 traitement groups are listed on the top of the panel. The abbreviations for the genes in the treatment group are the same as the caption in Figure 2.

Furthermore, we performed differential gene analysis using the R package DESeq2 on the data of the 15 groups of transgenic and control lines, with filtering criteria set as log2FC > 1 and Padj< 0.05. To ensure a more accurate identification of genes regulating total lignin content, we selected the differential genes that had at least one intersection among the 15 groups for further analysis. After these steps, a total of 5894 differentially expressed genes from 139 samples were used for WGCNA.

2.3 Data preprocessing

Data preprocessing plays a vital role before conducting WGCNA, as it guarantees data consistency, completeness, and suitability for analysis, ultimately ensuring data quality and enhancing the accuracy and reliability of the results. The transcriptome, proteome, and total lignin data cannot be directly utilized in WGCNA; instead, they necessitate data preprocessing steps, including data filtering, normalization, and standardization. These steps are essential to prepare the data in a suitable format for meaningful and effective network analysis.

2.3.1 Data filtering

The RNA-seq results indicate that not all genes are expressed, and the expression levels of each gene can vary greatly under different biological processes, sampling times, and tissue locations. When analysing gene expression data, genes with low expression levels have lower reliability and need to be filtered out to ensure the reliability of data analysis. To achieve this, we transformed the read counts (reads) aligned to the reference gene fragments into CPM (counts-per-million threshold). Subsequently, we excluded genes with CPM values less than 0.5 in all three sample replicates to effectively remove genes with low expression levels. For this filtering process, we utilized the “cpm” function available in the R package edgeR (Robinson et al., 2010). The formula (1) for calculating CPM is as follows:

CPM=CN106(1)

Where C represents the reads aligned to a specific gene, and N represents the total number of reads aligned to all genes in the sample.

2.3.2 Data normalization

Due to limited experimental space, all experimental samples were obtained in six different batches, and each batch of transgenic lines had corresponding controls. We normalized the counts of genes to their corresponding wild-type data using the ratio-based (arithmetic mean as reference, Ratio-A) approach (Luo et al., 2010; Li and Zhao, 2019). The formula (2) is as follows:

Xij'=Xij-1nl=1nXilr(2)

where r is the reference batch, Xij'  represents the adjusted gene expression value of the i-th gene in the j-th sample, Xij represents the original gene expression value of the i-th gene in the j-th sample, n is the number of control samples in the reference batch, and Xilr represents the gene expression value of the i-th gene in the l-th sample of the reference batch. Additionally, since WGCNA requires input data to be greater than or equal to 0, we added the absolute value of the smallest negative number to all corrected counts in all batches.

2.3.3 Data standardization

Gene expression levels are determined by randomly sampling gene fragments, leading to a higher likelihood of sampling long sequence genes compared to short ones. Additionally, the sequencing depth can impact the number of reads aligned to each gene. Thus, relying solely on raw reads is inadequate to accurately measure gene expression levels. To address this issue, we adopted TPM (transcripts per million) normalization to convert the reads into the number of reads per kilobase per million mapped reads, offering a more reliable measure of gene expression for analysis. The formula (3) for calculating TPM is as follows:

TPM=Ri(sumRili)li106(3)

Where Ri represents the number of reads for the i-th gene, li represents the length of the i-th gene (in kilobases), and sum Rili is the sum of ratios between the reads of the i-th gene and its corresponding gene length. The multiplication by 106 is to convert it into the number of reads per million mapped reads.

2.4 Weighted gene co-expression network analysis

In WGCNA, gene expression data are utilized to calculate the Pearson correlation coefficient between genes, resulting in a gene co-expression similarity matrix through correlation analysis. Traditionally, a hard threshold is employed to determine whether genes are correlated or not. This means that values above the threshold indicate correlation, while values below it indicate no correlation. However, this approach may lead to some information loss. To address this, WGCNA adopts a soft thresholding approach, assuming that there is a correlation between all genes. This approach transforms the gene expression similarity matrix into an adjacency matrix and subsequently into a topological overlap matrix, which measures the interconnectedness among genes. By doing so, the method considers not only the pairwise correlation between two genes but also their correlation with other genes in the network. To identify gene modules, the dynamic tree cut algorithm (Langfelder et al., 2008) are applied. This algorithm mimics the characteristics of real biological networks, providing biologically meaningful results. Overall, the soft thresholding approach and dynamic tree cut algorithm in WGCNA allow for a more comprehensive analysis, capturing the complexity of gene interactions and facilitating the identification of functionally related gene modules.

2.4.1 Construction of gene modules

After preprocessing the gene expression data, the genes were further subjected to detection and filtering using the goodSamplesGenes functions available in the WGCNA package within the R programming environment. The selected genes that met the criteria were used to construct a weighted gene co-expression network using the WGCNA package in R 4.2.2 software. For the specific methods and principles employed in this process, please refer to the research conducted by Zhang B et al. (Zhang and Horvath, 2005).

2.4.2 GO and KEGG enrichment analyses

We conducted functional annotation of genes within specific modules using two important databases: the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG). The GO (Zhao et al., 2020) database [http://geneontology.org/] offers a standardized language to define and describe gene and gene product functions across various biological species. GO terms classify gene functions into three categories: Molecular Function (MF), Biological Process (BP), and Cellular Component (CC). These terms enable researchers to predict the functions of genes or proteins in the same or different species based on the known functions of genes or proteins stored in the database. On the other hand, the KEGG (Chen et al., 2020) database [https://www.kegg.jp/] serves as a knowledge repository for systematic analysis of gene functions. It integrates genomic, chemical, and systemic functional information and contains a vast amount of genomic sequence data, pathway information related to metabolism, regulation, and signal transduction, as well as information on chemical compounds, enzyme molecules, and enzyme reactions. Researchers can leverage this wealth of data to annotate gene functions by linking genomic information with functional information.

For our functional annotation analysis, we employed the enrichGO and enrichKEGG functions available in the clusterProfiler package (Wu T. et al., 2021) within the R programming environment. These functions allowed us to perform GO and KEGG enrichment analysis on the genes within specific modules, aiding us in gaining valuable insights into the functional characteristics and pathways associated with these genes.

3 Results

3.1 Identification of co-expression gene modules

In the curve fitting of the non-scale topology model with the soft threshold β and the fitting degree R2 (Figure 5), we observed that when R2 = 0.86, the gene connectivity curve tended to saturate. This observation indicates that the network conforms to the conditions of a non-scale distribution. The corresponding soft threshold at this point is determined to be β=7.

FIGURE 5
www.frontiersin.org

Figure 5 Analysis of network topology for various soft-thresholding powers. (A) The scalefree fit index (y-axis) as a function of the soft- thresholding power (x-axis). (B) the mean connectivity (degree, y-axis) as a function of the soft-thresholding power (x-axis).

Subsequently, with the parameter β in place, hierarchical clustering was performed on the transcriptomics data obtained from 15 groups of transgenic and control lines. Gene modules were then formed through dynamic cutting, followed by the merging of similar modules. The outcome was the creation of 15 gene modules, excluding the grey module. To visually distinguish these modules, each module was randomly assigned a unique colour (Tables S2, S3). The figure depicting gene clustering and module cutting comprises two parts: the upper part visualizes the clustering dendrogram, demonstrating how genes are grouped based on expression pattern similarities, while the lower part illustrates the results of dynamic cutting for the modules and the subsequent outcome of module cutting after merging similar modules (Figure 6).

FIGURE 6
www.frontiersin.org

Figure 6 The co-expression genes clustering and the module cutting. Each branch represents a gene and each color below represents a gene co-expression module. The dynamic tree cut indicates the modules were divided based on the gene clustering results. The merged dynamic cut indicates the modules were divided by combining modules with similar expression patterns.

3.2 Correlation analysis of modules and traits

We performed correlation analysis between all gene modules and traits (total lignin content, 20 pathway enzymes, and different treatments) to calculate the correlation coefficients (r). Subsequently, we generated a heatmap depicting the relationship between gene modules and traits (Figure 7). Our findings revealed that most monolignol genes (PtrPAL5, PtrC3H3, PtrC4H1, PtrC4H2, Ptr4CL3, PtrCAD2, PtrHCT6, PtrCAld5H1, and PtrCAld5H2) were clustered within the “MElightcyan” module. However, PtrPAL2 and PtrCAD2 were clustered within the “MEorange” and “MEroyalblue” modules, respectively.”

FIGURE 7
www.frontiersin.org

Figure 7 Heatmap of Module-trait relationships. The expression patterns of 15 modules (excluding the grey module) are shown by the heatmap. The module name is shown to the left side of each cell. Numbers in the table report the correlations of the corresponding module genes and trait, with the p-values (*) printed below the correlations in parentheses. Each column corresponds to a specific trait (from left To right: total lignin content, 20 pathway enzymes, and different treatments). The scale bar on the right indicates the range of possible correlations from positive (red) to negative (green).

From the heatmap of the correlation analysis between modules and traits, we observed that the “MElightcyan” module, which contains most of the monolignol genes, exhibited a relatively low correlation with the total lignin content (|correlation coefficient| = 0.18), while the “MElightyellow” module showed the highest correlation with the total lignin content (|correlation coefficient| = 0.47), followed by the “MEviolet” module (|correlation coefficient| = 0.28). Furthermore, we found that the “MElightcyan” module, which includes most of the monolignol genes, exhibited a certain degree of correlation with the majority of pathway enzymes. The “MElightyellow” and “MEviolet” modules were primarily associated with pathway enzymes (PAL1, 4CL3, C3H3, C4H1, etc.) from the gene families of PtrPAL, Ptr4CL, PtrC3H, and PtrC4H monolignol genes. Additionally, we noticed that the treatment with triple gene knockdown of PtrC3H3, PtrC4H1, and PtrC4H2 showed a strong correlation with these two modules, indicating that this treatment had the greatest impact on the total lignin content.

3.3 GO and KEGG enrichment analyses

We conducted GO and KEGG enrichment analyses for the “MElightcyan”, “MElightyellow” and “MEviolet” gene modules. The top 10 results of the GO (focusing on biological processes) and KEGG enrichment analysis results for each module, are shown in Figure 8 (Tables S4, S5).

FIGURE 8
www.frontiersin.org

Figure 8 GO and KEGG enrichment analysis for the “MElightcyan”, “MElightyellow” and “MEviolet” gene modules. (A) GO and enrichment analysis. The top 10 biological process results of GO enrichment were selected to display. (B) KEGG and enrichment analysis. CountRatio: A score, the numerator is the number of genes enriched in this GO/KEGG entry and the denominator is the number of genes in specific module. p.adjust: corrected p value, the range for value of p.adjust from 0 (red) to 0.025 (blue). For the complete GO and KEGG enrichment results, please view Tables S4 and S5.

The “MElightcyan” module, comprising most of the monolignol genes, reflected significant enrichment (P< 0.05) of pathways such as “cell cycle”, “lignin metabolic process”, “phenylpropanoid metabolic process”, and “secondary metabolic process”. Additionally, this module indicated enrichment of pathways such as “flavonoid biosynthesis” and “phenylpropanoid biosynthesis”. In contrast, the “MElightyellow” module, displaying the highest correlation with the total lignin content, indicated significant enrichment in pathways related to “glutathione metabolic process” and “cellular modified amino acid metabolic process,” as well as in the “glutathione metabolism” pathway. The “MEviolet” module reflected significant enrichment in the “carbohydrate catabolic process” and “vitamin B6 metabolism” pathways. The GO and KEGG enrichment results for these three modules, which awere closely associated with lignin, indicate that the total lignin content is influenced not only by genes involved in the “lignin biosynthesis”, “phenylpropanoid biosynthesis” and “flavonoid biosynthesis” pathways but also by genes involved in the “glutathione metabolic process”, “cellular modified amino acid metabolic process” and “carbohydrate catabolic process” pathways.

3.4 Correlation analysis and visualization of the genes co-expression network

We conducted a detailed analysis of the genes within the “MElightyellow” module, which demonstrated the highest correlation with the total lignin content. As a result, we identified eight types of genes present in this module, including cytochrome P450 genes, cinnamyl-alcohol dehydrogenase genes, UDP-glucosyltransferase genes, UDP-glycosyltransferase superfamily protein genes, glutathione-S-transferase genes, wall-associated kinase genes, MATE efflux family protein genes, and indole-3-acetate beta-D-glucosyltransferase genes.

Next, to compare the correlation levels between the genes from the “MElightyellow” module that most correlated with the total lignin content, as well as monolignol genes and the total lignin content, we conducted a correlation analysis using the R package cor (Figure 9; Table S6). It is worth noting that we found that the correlation (highest: |r| = 0.59, lowest: |r| = 0.52) between the top 15 genes in the “MElightyellow” module and the lignin content was much higher than that between monolignol genes and the lignin content (highest: |r| = 0.36, lowest: |r| = 0.02). We have systematically compiled information about the top 15 genes with the strongest correlation to the total lignin in the “MElightyellow” module, as outlined in Table 1. The kME (eigengene connectivity) serves as intermediate data during the analysis of gene co-expression networks (Table S7), and the methodology for calculating kME is detailed in Section 2.4. Specifically, the higher the kME value, the stronger the degree of gene connectivity, indicating greater significance and centrality within the network. “|r|” represents the absolute value of the correlation coefficient.

FIGURE 9
www.frontiersin.org

Figure 9 Correlation analysis of gene expression levels with total lignin content. The genes, including the top 15 genes displaying the strongest correlation with total lignin content, as well as monolignol genes, is shown on the x-axis; The correlation coefficient of genes and total lignin content is shown on the y-axis.

TABLE 1
www.frontiersin.org

Table 1 The top 15 genes displaying the strongest correlation with total lignin content.

Finally, we utilized Cytoscape 3.9.1 software to visualize the co-expression network constructed from genes in the “MElightyellow” module that showed the highest correlation with the total lignin content (Figure 10; Table S8). The 15 genes with the strongest correlation to the total lignin content were selected as central nodes, while the eight types of genes were represented by different colours, and genes not belonging to these types were represented in blue. In the network, nodes represent genes, and the edges represent co-expression relationships between genes, with edge thickness indicating the strength of the co-expression relationship. Specifically, the edges in red and yellow represent the relationships between two UDP-glycosyltransferase superfamily protein genes and eight types of genes, respectively, as well as the co-expression relationships between two UDP-glycosyltransferase superfamily protein genes and other genes beyond these eight types. Notably, we observed that these 15 genes, which had the highest correlation with the total lignin content, exhibited co-expression relationships with nearly all genes within the “MElightyellow” module.

FIGURE 10
www.frontiersin.org

Figure 10 Visualization of weighted genes co-expression network in the “MElightyellow” module. The 15 genes with the strongest correlation to the total lignin content were selected as central nodes, while the eight types of genes were represented by different colors, and genes not belonging to these types were represented in blue. The nodes represent genes, and the edges represent co-expression relationships between genes, with edge thickness indicating the strength of the co-expression relationship. The edges in red and yellow respectively represent the relationships between two UDP-glycosyltransferase superfamily protein genes and eight types of genes, as well as the co-expression relationships between two UDP-glycosyltransferase superfamily protein genes and other genes beyond these eight types. For the complete genes coexpression network results, please view Table S8.

4 Conclusions and discussion

Firstly, we performed weighted gene co-expression network analysis (WGCNA) on a dataset containing 5894 differentially expressed genes from 139 samples. Our analysis revealed that enzymes from the PtrPAL, Ptr4CL, PtrC3H, and PtrC4H gene families may have a closer relationship with the total lignin content(Kim et al., 2023; Wu et al., 2023). In addition, the co-downregulation of three genes, PtrC3H3, PtrC4H1, and PtrC4H2, had the greatest impact on total lignin content, which is consistent with the finding reported by Kim et al. (Kim et al., 2020).

Secondly, by performing GO and KEGG analyses on the lignin-related modules, we discovered that the total lignin content is influenced not only by genes involved in the “lignin biosynthesis”, “phenylpropanoid biosynthesis” and “flavonoid biosynthesis” pathways but also by genes involved in the “glutathione metabolic process”, “cellular modified amino acid metabolic process” and “carbohydrate catabolic process” pathways. Moreover, glutathione peroxidase (GSH-PX) is one of the peroxidases, involved in reactive oxygen species metabolism and lignin metabolism. It can significantly influence the content of total phenols, which are precursors of lignin synthesis, and the phenylalanine ammonia-lyase involved in monolignol biosynthesis, thereby influencing the total lignin content (Yu et al., 2011).

Finally, further analysis of the genes within the “MElightyellow” module, which showed the highest correlation with the total lignin content, revealed that this module mainly contains eight types of genes. Among the top 15 genes with the highest correlation to the total lignin content and expression levels of all genes within the module, four types of genes were identified: 1. three cinnamyl-alcohol dehydrogenase genes, namely CAD1, CADL3, and CADL8 (Kim et al., 2007; Chao et al., 2022; Lee et al., 2022); 2. two UDP-glycosyltransferase superfamily protein genes, Potri.006G007200 and HYR1, showed the closest correlation with the total lignin content, with HYR1 found to be significantly downregulated after exposure to coumarin (Xu et al., 2019); 3. one UDP-glucosyl transferase gene, UGT71B1, was reported to play a critical role in coumarin metabolism and glycosylation in regulating the efficacy of secondary metabolites (Jiang, 2022); and 4. one cytochrome P450 gene, Pt-ACT11.13 (Park et al., 2020). Importantly, among these 15 genes, there are several noteworthy ones. The GLIP1 gene involved in pathogen defence and is regulated by the transcription factor WRKY (Han et al., 2019; Miao et al., 2019), the ERF021 gene directly regulates the expression of monolignol genes (Zeng et al., 2020), the SAUR1 gene is an auxin-inducible gene that promotes plant cell division and elongation (Tong et al., 2022), and the At4g27220 gene functions as a resistance protein (Cortaga et al., 2022). However, genes LOC7486095, LOC7479833, LOC7495227, and Potri.008G017700 have not been previously reported in relation to lignin.

Therefore, we can infer that the monolignol genes PtrC3H3, PtrC4H1, and PtrC4H2, which belong to the cytochrome P450 gene type, may have a significant impact on the total lignin content. Among the top 15 genes with the highest correlation to the total lignin content, three cinnamyl-alcohol dehydrogenase genes were present. This gene type is involved in the final step of the monolignol biosynthetic pathway, suggesting that it may significantly influence the total lignin content. Furthermore, the UDP-glycosyltransferase superfamily protein genes and UDP-glucosyl transferase genes are closely related to coumarin, indicating that they may influence the total lignin content by influencing coumarin metabolism.

However, despite identifying genes that may influence the total lignin content, the correlation between gene modules and total lignin content did not reach the expected level. This lower correlation could be attributed to the fact that the knockdown of monolignol genes did not entirely reduce the total lignin content to zero. Additionally, the differences between transgenic and control lines are often smaller compared to lines under different stress conditions, and these relatively insignificant differences may affect the degree of correlation between modules and traits.

In conclusion, for a single species (P. trichocarpa), utilizing a combination of t-test, differential gene expression analysis, correlation analysis, and WGCNA on multi-omics data, including data related to the transcriptome, proteome, and total lignin content of multiple transgenic lines, allows us to establish a connection between gene expression levels and total lignin content. This approach effectively revealed information about genes that influence total lignin content and facilitates a better understanding of the impact of genes on lignin. Therefore, in future lignin research, we should not only focus on studying genes related to the monolignol biosynthetic pathway and their regulatory factors but also investigate which genes or pathways may have an impact on lignin content. By considering the overall metabolic activities of organisms and utilizing advanced analysis methods, a more comprehensive exploration of lignin biosynthesis can be achieved, ultimately leading to the strategic development and utilization of lignin. This study conducted an initial exploration of gene regulation on total lignin content. In the future, more advanced techniques, such as machine learning and gene inference networks, are expected to be applied to more in-depth research in this field.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: The transcriptome are available under GEO accession number GSE78953 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE78953], while the data sets of proteo- mics and total lignin content are available on CyVerse [https://datacommons.cyverse.org/browse/iplant/home/shared/Lignin SystemsDB]. The annotation information of all genes was acquired from two databases, JGI [https://jgi.doe.gov/] and NCBI [https://www.ncbi.nlm.nih.gov/].

Author contributions

JZ, AW, and KC developed the concept of the article. JZ analyzed and wrote the paper. AW provided guidance on technical analysis methods. All authors contributed to the article and approved the submitted version.

Funding

“Specialized Research Fund for Doctoral Program of Higher Education of China”, grant number 2572018AB32”.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1244020/full#supplementary-material

References

Boerjan, W., Ralph, J., Baucher, M. (2003). Lignin biosynthesis. Annu. Rev. Plant Biol. 54, 519–546. doi: 10.1146/annurev.arplant.54.031902.134938

PubMed Abstract | CrossRef Full Text | Google Scholar

Chao, N., Chen, W.-Q., Cao, X., Jiang, X.-N., Gai, Y., Liu, L. (2022). Plant hormones coordinate monolignol biosynthesis with seasonal changes in Populus tomentosa. Environ. Exp. Botany 195, 104784. doi: 10.1016/j.envexpbot.2022.104784

CrossRef Full Text | Google Scholar

Chen, J., Liu, C., Cen, J., Liang, T., Xue, J., Zeng, H., et al. (2020). KEGG-expressed genes and pathways in triple negative breast cancer Protocol for a systematic review and data mining. Med. 99 (18), e19986. doi: 10.1097/md.0000000000019986

CrossRef Full Text | Google Scholar

Cortaga, C. Q., Latina, R. A., Habunal, R. R., Lantican, D. V. (2022). Identification and characterization of genome-wide resistance gene analogs (RGAs) of durian (Durio zibethinus L.). Journal Genet. Eng. Biotechnol. 20 (1), 29–29. doi: 10.1186/s43141-022-00313-8

CrossRef Full Text | Google Scholar

De Meester, B., Calderon, B. M., de Vries, L., Pollier, J., Goeminne, G., Van Doorsselaere, J., et al. (2020). Tailoring poplar lignin without yield penalty by combining a null and haploinsufficient CINNAMOYL-CoA REDUCTASE2 allele. Nat. Commun. 11 (1), 5020. doi: 10.1038/s41467-020-18822-w

PubMed Abstract | CrossRef Full Text | Google Scholar

de Vries, L., Guevara-Rozo, S., Cho, M., Liu, L. Y., Renneckar, S., Mansfield, S. D. (2021). Tailoring renewable materials via plant biotechnology. Biotechnol. Biofuels 14 (1), 167. doi: 10.1186/s13068-021-02010-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, X., Li, S., Zhang, M., Yang, L., Liu, Y., Xu, J., et al. (2019). Regulation of GDSL lipase gene expression by the MPK3/MPK6 cascade and its downstream WRKY transcription factors in arabidopsis immunity. Mol. Plant-Microbe Interactions 32 (6), 673–684. doi: 10.1094/mpmi-06-18-0171-r

CrossRef Full Text | Google Scholar

Hong, J. Y., Gunasekara, C., He, C., Liu, S. Z., Huang, J. Q., Wei, H. R. (2021). Identification of biological pathway and process regulators using sparse partial least squares and triple-gene mutual interaction. Sci. Rep. 11 (1), 13174. doi: 10.1038/s41598-021-92610-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, Z. (2022). Research status of coumarin biosynthesis pathway and key enzyme genes. Modern hortic. 45 (12), 189–191. doi: 10.14051/j.cnki.xdyy.2022.12.049

CrossRef Full Text | Google Scholar

Kim, J. Y., Cho, K. H., Keene, S. A., Colquhoun, T. A. (2023). Altered profile of floral volatiles and lignin content by down-regulation of Caffeoyl Shikimate Esterase in Petunia. BMC Plant Biol. 23 (1), 210. doi: 10.1186/s12870-023-04203-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, S.-J., Kim, K.-W., Cho, M.-H., Franceschi, V. R., Davin, L. B., Lewis, N. G. (2007). Expression of cinnamyl alcohol dehydrogenases and their putative homologues during Arabidopsis thaliana growth and development: Lessons for database annotations? Phytochem. 68 (14), 1957–1974. doi: 10.1016/j.phytochem.2007.02.032

CrossRef Full Text | Google Scholar

Kim, H., Li, Q. Z., Karlen, S. D., Smith, R. A., Shi, R., Liu, J., et al. (2020). Monolignol Benzoates Incorporate into the Lignin of Transgenic Populus trichocarpa Depleted in C3H and C4H. ACS Sustain. Chem. Eng. 8 (9), 3644–3654. doi: 10.1021/acssuschemeng.9b06389

CrossRef Full Text | Google Scholar

Kondoh, K., Akahori, H., Muto, Y., Terada, T. (2022). Identification of key genes and pathways associated with preeclampsia by a WGCNA and an evolutionary approach. Genes 13 (11), 2134. doi: 10.3390/genes13112134

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., Zhang, B., Horvath, S. (2008). Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics 24 (5), 719–720. doi: 10.1093/bioinformatics/btm563

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, K. H., Lee, K.-L., Nam, K. J., Yang, J.-W., Lee, J. J., Shim, D., et al. (2022). Expression analysis of sweetpotato cinnamyl alcohol dehydrogenase genes in response to infection with the root-knot nematode Meloidogyne incognita. Plant Biotechnol. Rep. 16 (4), 487–492. doi: 10.1007/s11816-022-00773-x

CrossRef Full Text | Google Scholar

Li, S., Zhao, Y. (2019). Research progress on batch effect removal methodsfor gene expression data. J. Nanjing Agric. Univ. 42 (03), 389–397. doi: 10.7685/jnau.201810016

CrossRef Full Text | Google Scholar

Long, G. H., Wu, P. Y., Fu, J. Z., Lu, H. L., Zhang, Y. (2021). Research progress on the regulation of peroxidase on lignin synthesis. Modern Agric. Sci. Technol. 23, 47–49 + 54. doi: 10.3969/j.issn.1007-5739.2021.23.019

CrossRef Full Text | Google Scholar

Luo, J., Schumacher, M., Scherer, A., Sanoudou, D., Megherbi, D., Davison, T., et al. (2010). A comparison of batch effect removal methods for enhancement of prediction performance using MAQC-II microarray gene expression data. Pharmacogenomics J. 10 (4), 278–291. doi: 10.1038/tpj.2010.57

PubMed Abstract | CrossRef Full Text | Google Scholar

Miao, X. y., Qu, H. p., Han, Y. l., He, C. f., Qiu, D. w., Cheng, Z. w. (2019). The protein elicitor Hrip1 enhances resistance to insects and early bolting and flowering in Arabidopsis thaliana. PloS One 14 (4), e0216082. doi: 10.1371/journal.pone.0216082

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, H., Park, G., Jeon, W., Ahn, J.-O., Yang, Y.-H., Choi, K.-Y. (2020). Whole-cell biocatalysis using cytochrome P450 monooxygenases for biotransformation of sustainable bioresources (fatty acids, fatty alkanes, and aromatic amino acids). Biotechnol. Advances 40, 107504. doi: 10.1016/j.bioteChadv.2020.107504

CrossRef Full Text | Google Scholar

Peng, X. P., Wang, B., Wen, J. L., Yang, S. Z., Lu, M. Z., Sun, R. C. (2016). Effects of genetic manipulation (HCT and C3H down-regulation) on molecular characteristics of lignin and its bioconversion to fermentable sugars. Cellulose Chem. Technol. 50 (5-6), 649–658.

Google Scholar

Petrosyan, V., Dobrolecki, L. E., Thistlethwaite, L., Lewis, A. N., Sallas, C., Srinivasan, R. R., et al. (2023). Identifying biomarkers of differential chemotherapy response in TNBC patient-derived xenografts with a CTD/WGCNA approach. iScience 26 (1), 105799–105799. doi: 10.1016/j.isci.2022.105799

PubMed Abstract | CrossRef Full Text | Google Scholar

Rao, X. L., Chen, X., Shen, H., Ma, Q., Li, G. F., Tang, Y. H., et al. (2019). Gene regulatory networks for lignin biosynthesis in switchgrass (Panicum virgatum). Plant Biotechnol. J. 17 (3), 580–593. doi: 10.1111/pbi.13000

PubMed Abstract | CrossRef Full Text | Google Scholar

Robinson, M. D., McCarthy, D. J., Smyth, G. K.. (2010). edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26 (1), 139–140. doi: 10.1093/bioinformatics/btp616

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, R., Sun, Y. H., Li, Q. Z., Heber, S., Sederoff, R., Chiang, V. L. (2010). Towards a systems approach for lignin biosynthesis in populus trichocarpa: transcript abundance and specificity of the monolignol biosynthetic genes. Plant Cell Physiol. 51 (1), 144–163. doi: 10.1093/pcp/pcp175

PubMed Abstract | CrossRef Full Text | Google Scholar

Tong, Z. K., Li, Y., Cai, B., Li, X. Y., Huang, M. J., Huang, H. Q. (2022). Cloning and expression analysis of SAUR gene of impatiens uliginosa franch. Acta Botanica Boreali-Occidentalia Sinica 42 (01), 21–28. doi: 10.7606/j.issn.1000-4025.2022.01.0021

CrossRef Full Text | Google Scholar

Tsai, C. J., Xu, P., Xue, L. J., Hu, H., Nyamdari, B., Naran, R., et al. (2020). Compensatory guaiacyl lignin biosynthesis at the expense of syringyl lignin in 4CL1-knockout poplar(1)(OPEN). Plant Physiol. 183 (1), 123–136. doi: 10.1104/pp.19.01550

PubMed Abstract | CrossRef Full Text | Google Scholar

Tuskan, G. A., DiFazio, S., Jansson, S., Bohlmann, J., Grigoriev, I., Hellsten, U., et al. (2006). The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Sci. 313 (5793), 1596–1604. doi: 10.1126/science.1128691

CrossRef Full Text | Google Scholar

Umezawa, T. (2010). The cinnamate/monolignol pathway. Phytochem. Rev. 9 (1), 1–17. doi: 10.1007/s11101-009-9155-3

CrossRef Full Text | Google Scholar

Van Acker, R., Dejardin, A., Desmet, S., Hoengenaert, L., Vanholme, R., Morreel, K., et al. (2017). Different routes for conifer- and sinapaldehyde and higher saccharification upon deficiency in the dehydrogenase CAD1. Plant Physiol. 175 (3), 1018–1039. doi: 10.1104/pp.17.00834

PubMed Abstract | CrossRef Full Text | Google Scholar

Vanholme, R., Demedts, B., Morreel, K., Ralph, J., Boerjan, W. (2010). Lignin biosynthesis and structure. Plant Physiol. 153 (3), 895–905. doi: 10.1104/pp.110.155119

PubMed Abstract | CrossRef Full Text | Google Scholar

Vanholme, R., De Meester, B., Ralph, J., Boerjan, W. (2019). Lignin biosynthesis and its integration into metabolism. Curr. Opin. Biotechnol. 56, 230–239. doi: 10.1016/j.copbio.2019.02.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J. P., Matthews, M. L., Williams, C. M., Shi, R., Yang, C. M., Tunlaya-Anukit, S., et al. (2018). Improving wood properties for wood utilization through multi-omics integration in lignin biosynthesis. Nat. Commun. 9, 1579. doi: 10.1038/s41467-018-03863-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Wen, J., Liu, Y., Yang, S., Yang, Y., Wang, C. (2022). Genome-wide characterization of laccase gene family from turnip and chinese cabbage and the role in xylem lignification in hypocotyls. Horticulturae 8 (6), 522. doi: 10.3390/horticulturae8060522

CrossRef Full Text | Google Scholar

Wu, T., Hu, E., Xu, S., Chen, M., Guo, P., Dai, Z., et al. (2021). clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb) 2 (3), 100141. doi: 10.1016/j.xinn.2021.100141

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, C., Huang, Z. H., Meng, Z. Q., Fan, X. T., Lu, S., Tan, Y. Y., et al. (2021). A network pharmacology approach to reveal the pharmacological targets and biological mechanism of compound kushen injection for treating pancreatic cancer based on WGCNA and in vitro experiment validation. Chin. Med. 16 (1), 121. doi: 10.1186/s13020-021-00534-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, J., Kong, B., Zhou, Q., Sun, Q., Sang, Y., Zhao, Y., et al. (2023). SCL14 inhibits the functions of the NAC043-MYB61 signaling cascade to reduce the lignin content in autotetraploid populus hopeiensis. Int. J. Mol. Sci. 24 (6), 5809. doi: 10.3390/ijms24065809

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, K., Wang, J. L., Chu, M. P., Jia, C. (2019). Activity of coumarin against Candida albicans biofilms. J. Mycologie Medicale 29 (1), 28–34. doi: 10.1016/j.mycmed.2018.12.003

CrossRef Full Text | Google Scholar

Yu, J. J., Li, L., Jin, Q., Cai, Y. P., Lin, Y., Lv, R. H. (2011). Studies on key enzyme POD types of lignin metabolic pathway during stone cell development of pyrus bretschneideri. Acta Hortic. Sinica 38 (06), 1037–1044. doi: 10.16420/j.issn.0513-353x.2011.06.007

CrossRef Full Text | Google Scholar

Zeng, X., Sheng, J., Zhu, F., Wei, T., Zhao, L., Hu, X., et al. (2020). Genetic, transcriptional, and regulatory landscape of monolignol biosynthesis pathway in Miscanthus x giganteus. Biotechnol. Biofuels 13 (1), 179. doi: 10.1186/s13068-020-01819-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, B., Horvath, S. (2005). A general framework for weighted gene co-expression network analysis. Stat. Appl. Genet. Mol. Biol. 4 (1), 17. doi: 10.2202/1544-6115.1128

CrossRef Full Text | Google Scholar

Zhang, J., Tuskan, G. A., Tschaplinski, T. J., Muchero, W., Chen, J. G. (2020). Transcriptional and post-transcriptional regulation of lignin biosynthesis pathway genes in populus. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.00652

CrossRef Full Text | Google Scholar

Zhao, Y., Wang, J., Chen, J., Zhang, X., Guo, M., Yu, G. (2020). A literature review of gene function prediction by modeling gene ontology. Front. Genet. 11. doi: 10.3389/fgene.2020.00400

CrossRef Full Text | Google Scholar

Keywords: total lignin content, multi-omics, WGCNA, differential genes, enrichment analysis, co-expression network, correlation analysis

Citation: Zhao J, Chao K and Wang A (2023) Integrative analysis of metabolome, proteome, and transcriptome for identifying genes influencing total lignin content in Populus trichocarpa. Front. Plant Sci. 14:1244020. doi: 10.3389/fpls.2023.1244020

Received: 21 June 2023; Accepted: 22 August 2023;
Published: 13 September 2023.

Edited by:

Jeongsik Kim, Jeju National University, Republic of Korea

Reviewed by:

Dhananjay K. Pandey, Amity University, India
Hai Lu, Beijing Forestry University, China

Copyright © 2023 Zhao, Chao and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Achuan Wang, 764005797@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.