ORIGINAL RESEARCH article

Front. Plant Sci., 27 September 2023

Sec. Plant Development and EvoDevo

Volume 14 - 2023 | https://doi.org/10.3389/fpls.2023.1243828

Genome-based identification of the CYP75 gene family in Orchidaceae and its expression patterns in Cymbidium goeringii

  • 1. Key Laboratory of National Forestry and Grassland Admini stration for Orchid Conservation and Utilization at College of Landscape Architecture and Art, Fujian Agriculture and Forestry University, Fuzhou, China

  • 2. College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China

Abstract

With a great diversity of species, Orchidaceae stands out as an essential component of plant biodiversity, making it a primary resource for studying angiosperms evolution and genomics. This study focuses on 13 published orchid genomes to identify and analyze the CYP75 gene family belonging to the cytochrome P450 superfamily, which is closely related to flavonoid biosynthetic enzymes and pigment regulation. We found 72 CYP75s in the 13 orchid genomes and further classified them into two classes: CYP75A and CYP75B subfamily, the former synthesizes blue anthocyanins, while the latter is involved in the production of red anthocyanins. Furthermore, the amount of CYP75Bs (53/72) greatly exceeds the amount of CYP75As (19/72) in orchids. Our findings suggest that CYP75B genes have a more important evolutionary role, as red plants are more common in nature than blue plants. We also discovered unique conserved motifs in each subfamily that serve as specific recognition features (motif 19 belong to CYP75A; motif 17 belong to CYP75B). Two diverse-colored varieties of C. goeringii were selected for qRT-PCR experiments. The expression of CgCYP75B1 was significantly higher in the purple-red variant compared to the yellow-green variant, while CgCYP75A1 showed no significant difference. Based on transcriptomic expression analysis, CYP75Bs are more highly expressed than CYP75As in floral organs, especially in colorful petals and lips. These results provide valuable information for future studies on CYP75s in orchids and other angiosperms.

Introduction

CYP450 (cytochrome P450, also called CYP) is named because it can bind to ferrous carbon monoxide and has a peak light absorption value of 450 nm (Cederbaum, 2015). In 1962, “CYP450” was first named as a colored substance in the cell, and it exists as an enzyme (Omura and Sato, 1962). It is one of the largest protein superfamilies in nature that almost exists in all prokaryotes and eukaryotes, especially with a large number in plants (Kiani and Jabeen, 2019; Li and Wei, 2020). They are a series of self-oxidating heme enzymes involved in various primary and secondary metabolic activities in plants and participate in biosynthetic pathways in flavonoids, aliphatic acids, sterols, phytohormones, terpenes, lignins and other biomolecules (Schuler and Werck-Reichhart, 2003; Kumar et al., 2014). In general, CYPs can be divided into a gene family with amino acid sequence identity greater than 40% and a subfamily with amino acid sequence identity greater than 55% (Nelson et al., 1996). CYP71−CYP99 and CYP701−CYP999 and found in plants (Liu et al., 2020). CYP75 is a crucial member that regulates flavonoid biosynthesis that determines pigmentation of plant tissues (Zhang et al., 2019). CYP75 gene family includes two subfamilies, CYP75A and CYP75B, regulating two key enzymes in the anthocyanin synthesis pathways, including Flavonoid 3’,5’-Hydroxylase (F3’5’H) and Flavonoid 3’-Hydroxylase (F3’H) which are precursors of blue and red anthocyanins, respectively (Tanaka and Brugliera, 2013).

Recent studies have discussed the function of CYP75s in several plants. They were first identified in the Petunia hybrida, wherein the introduction of clonal F3’H cDNA caused an unusual pink color of petunia and their pollen produced paeoniflorin (Holton et al., 1993; Brugliera et al., 1999). In Arabidopsis thaliana, a TT7 gene was identified, which could encode F3’H and its function was validated in flavonoid and anthocyanins synthesis (Schoenbohm et al., 2000). The sF3’H1 of Glycine max is mainly responsible for the alteration of pubescent color from brown to gray (Toda et al., 2002). In Vitis vinifera, CYP75 genes encoding F3’H and F3’5’H are highly expressed in all tissues of the plant, especially in the epidermis of mature red berries that mainly synthesize anthocyanins (Falginella et al., 2010). In Solanum lycopersicum, CYP75A31 encodes a F3’5H, which accepts flavones, flavanones, dihydroflavonols and flavonols as substrates (Olsen et al., 2010). In Epimedium sagittatum, EsF3’H and EsF3’5H genes are highly expressed in colored tissues and their expressions are positively correlated with the pattern of anthocyanin accumulation in leaves (Huang et al., 2012). The CsF3’5’H in Camellia sinensis acts as a key agent controlling trihydroxyflavone-3-alcohol synthesis and effectively converts 4’-hydroxylated flavonoids into 3’4’5’- and/or 3’4’-hydroxylated products (Wang et al., 2014). The F3’H-1 and F3’H-2 in Hordeum vulgare regulates the accumulation of magenta pigments in peels and stems, respectively, and the F3’5’H-1 is closely related to the accumulation of blue pigments in the barley grain paste layers (Vikhorev et al., 2019). With the continuous advancement of molecular experimental research, it is now possible to change plants color by controlling regulatory genes, such as CYP75. For example, the up-regulated of CYP75A and CYP75B genes which encode a critical dihydroflavonol 4-reductase (DFR) in Dendrobium officinale, may lead to anthocyanins accumulation, promoting color change from green to red (Niu et al., 2021). However, the studies confirmed that CYP75B can code F3’H instead of CYP75A, and some species in Asteraceae have F3’5’H with the same amino acid sequences as F3’H which suggests that the F3’5’H in these plants is regulated by CYP75B instead of CYP75A (Seitz et al., 2006). Similar to Callistephus chinensis (F3’5’H: CYP75B5), Osteospermum hybrida (F3’5’H: CYP75B17)and Pericallis cruenta (F3’5’H: CYP75B18), they all lost CYP75A genes to code F3’5’H then reacquired by duplication and neofunctionalization of CYP75B genes (Seitz et al., 2006). This is further proved by partial amino acid exchanges between F3’5’H and F3’H (Seitz et al., 2007).

Orchidaceae is one of the largest monocotyledon families, boasting significant ornamental and economic importance due to its diverse range of colors and distinctive flower shapes (Li et al., 2021). There are more than 28,000 species and 850 genera in Orchidaceae, represents approximately 10% of all flowering plants worldwide and has the largest number of species (Chase et al., 2015). Orchids are remarkable for shedding light on plant evolution, with more complete orchid genomes now available, researchers have gained significant insight into the genetic foundations of orchid biology (Zhang et al., 2021a). Extensive research has been conducted on CYP75s in model plants, but there is currently limited knowledge about the characteristics of these genes in the Orchidaceae. In this study, we performed genome-wide identification, classification, characterization, and expression pattern analysis of CYP75s in 13 orchids, representing four subfamilies of Orchidaceae with considerable diversity to elucidate the evolution of CYP75s in orchids. The findings could provide new insights into the underlying mechanisms that drive the evolution and diversification of organ morphology in orchids and other flowering plants.

Materials and methods

Data sources

To investigate the features of the CYP75 gene family in Orchidaceae, 13 orchids with completed whole-genome sequencing were selected. They include C. goeringii (Chung et al., 2021; Sun et al., 2021), C. sinense (Yang et al., 2021), C. ensifolium (Ai et al., 2021), Gastrodia elata (Yuan et al., 2018; Xu et al., 2021), D. catenatum (Zhang et al., 2016; Niu et al., 2021), D. chrysotoxum (Zhang et al., 2021b), D. huoshanense (Han et al., 2020), Phalaenopsis aphrodite (Chao et al., 2018), P. equestris (Cai et al., 2015) of Epidendroideae, Platanthera guangdongensis (Li et al., 2022), Pl. zijinensis (Li et al., 2022) of Orchidoideae, Vanilla planifolia (Hasing et al., 2020) of Vanilloideae, Apostasia shenzhenica (Zhang et al., 2017) of Apostasioideae. The genome download urls for 13 orchids can be found in Table S1. And four CYP75A (AUB13331.1 from Horgeum vulgare; ABI95365.1 and AAZ79451.1 from D. hybrid; AEB96145.1 from D. moniliforme) and four CYP75B (BAJ93256.1 from H. vulgare; AT5G07990.1 from A. thaliana; XP015613041.1 from Oryza sativa; AF155332.1 from Petunia hybrida) proteins were downloaded from the NCBI (https://www.ncbi.nlm.nih.gov/). All the protein sequences can be found in the Table S2.

Identification and physicochemical properties of CYP75s

It is complicated to identify CYPs due to their extremely high sequence conservation (Mizutani and Ohta, 2010; Babu et al., 2013; Kumar et al., 2014; Reddy et al., 2014). Therefore, eight CYP75 proteins were used as queries to perform a blast search (built-in TBtools; Chen et al., 2018) against 13 orchid genomes with an E-value threshold of e-20 and a requirement for amino acid sequence similarity greater than 50% (Falginella et al., 2010). Pfam verification was performed on all protein sequences in the blast results. The CYP domains (PF00067) built on the hidden Markov model were downloaded from Sanger center (http://pfam.sanger.ac.uk/). The identification of all alternative orchid CYPs was conducted using the Hmmsearch program (built-in Tbtools; Chen et al., 2018). The subsequent screening study included all protein sequences containing CYP domains. To facilitate identification, 14 A. thaliana CYP proteins (AtCYP71A12, AtCYP71A13, AtCYP71B15, AtCYP73A5, AtCYP76C1, AtCYP77B1, AtCYP77A6, AtCYP78A10, AtCYP82G1, AtCYP84A1, AtCYP89A2, AtCYP98A3, AtCYP701A3, AtCYP706A1) belonging to subfamilies other than CYP75 were also downloaded from the TAIR (https://www.arabidopsis.org/). Then, ML phylogenetic trees were constructed for further screening of CYP75 genes based on all CYP genes of each orchid and other species. The phylogenetic analysis was conducted using the maximum likelihood (ML) approach, and the ML tree was constructed using the RAxML on the CIPRES Science Gateway web server (RAxML-HPC2 on XSEDE; Miller et al., 2015) with 1,000 bootstrap iterations. The phylogenetic tree of CYPs for each orchid is detailed in the Figure S1. The target genes were clustered with eight CYP75 query genes. Ultimately, we aligned all candidate CYP75 protein sequences of 13 orchids and further eliminated the incomplete gene annotation sequences (Figure 1). The completed protein sequences of orchid CYP75s can be found in Table S2. The physicochemical properties of CYP75 proteins were predicted by ExPASy database (https://www.expasy.org/) (Artimo et al., 2012). Subcellular localization was predicted by Plant-mPloc (http://www.csbio.sjtu.edu.cn/bioinf/plant-multi/#) (Chou and Shen, 2010).

Figure 1

Phylogenetic analyses

We performed multiple sequence alignment of CYP75 proteins from 13 orchids and other species using MEGA 7.0 software (Kumar et al., 2016). The alignment sequences selected with the ClustalW program, Gap Opening and Gap Extend, are 15 and 6.66, respectively; the DNA Weight Matrix selection is the IUB; other values keep the default. The phylogenetic analysis was conducted using the maximum likelihood (ML) approach, and the ML tree was constructed using the RAxML on the CIPRES Science Gateway web server (RAxML-HPC2 on XSEDE; Miller et al., 2015) with 1,000 bootstrap iterations.The best model (JTT+I+G4) was determined by using the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC) based on modeltest-ng-0.1.3 (Darriba et al., 2019). Keep other settings as default. The output phylogenetic tree file was polished using Evolview (http://www.evolgenius.info/evolview/) (He et al., 2016).

Motif and gene structure analysis

To illustrate the variations of motifs among orchid CYP75 proteins, the MEME motif search tool was employed to detect the conserved motifs in all CYP75 proteins (Bailey et al., 2009). The MEME parameters were optimized as follows: the maximum number of motifs found was set to 20, and the optimal motif width ranged from six to 50 bases. GSDS6 (http://gsds.gao-lab.org/) (Hu et al., 2015) was used for analyzing structure. The protein motifs and gene structures of CYP75s were visualized with TBtools (Chen et al., 2018).

Collinearity and location analysis on chromosome

Utilizing chromosome-level genome assemblies of C. goeringii, C. ensifolium, and D. chrysotoxum, genomic FASTA files were merged pairwise to generate a database for BLASTp queries. The merged BLAST files and modified GFF3 files for each species were analyzed using MCscanX (Wang et al., 2012) to identify collinear blocks of CYP75 genes between C. goeringii and C. ensifolium, C. goeringii and D. chrysotoxum. The dual synteny plotter tool of MCscanX (JCVI kit) was utilized for visualization of the results pertaining to collinearity.

We selected six orchids with chromosome-level genomes to analyze the distribution of CYP75 genes on their chromosomes. According to the genome data and annotation data of C. goeringii, C. ensifolium, D. huoshanense, D. chrysotoxum, Pl. guangdongensis and Pl. zijinensis, the chromosomal localization of five orchid CYP75 genes were visualized through TBtools (Chen et al., 2018).

Prediction of Cis-acting elements

A total of 2,000bp upstream and downstream of CgCYP75s was extracted via TBtools, respectively (Chen et al., 2018). The online software PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/; Lescot, 2002) was used to identify and annotate the cis-acting elements found in the upstream and downstream regions. Cis-acting element number and response function were visualized using TBtools (Chen et al., 2018).

Transcriptome data and expression analysis

For transcriptome analysis, RSEM (Li and Dewey, 2011) was utilized to quantify transcripts and compute the fragment per kilobase of transcript per million mapped reads (FPKM) score for every gene. Expression heatmaps were produced using TBtools with the FPKM matrix (Chen et al., 2018). To confirm the expression patterns of the CYP75s, sepals, petals, labellums (lips), and gynostemiums were sampled from yellow-green C. goeringii (‘YG’) and purple-red C. goeringii (‘PR’), respectively, these were grown at Fujian Agriculture and Forestry University for quantitative real-time PCR (qRT-PCR) experiment. Each tissue type was sampled in three replicates. Total RNA of these tissues was extracted using the FastPure Plant Total RNA Isolation Kit (Vazyme Biotech Co., Ltd., Nanjing, China). First-strand DNA was synthesized with TransScript® All-in-One First-Strand cDNA Synthesis SuperMix for quantitative PCR (TransGen Biotech, Beijing, China). Premier 5 software was used to design primers for candidate genes and internal reference genes for qRT-PCR (Zhao et al., 2022). Gene-specific primers for two chosen genes and their corresponding internal control genes are presented in Table S3. The qRT-PCR was performed to verify the specific expression of CgCYP75A1 and CgCYP75B1 in the floral organs of ‘YG’ and ‘PR’ C. goeringii. All experiments were conducted in triplicate, with each run consisting of three technical replicates. The relative expression of genes was determined using the 2-ΔΔCT method (Wang et al., 2021; Zhao et al., 2023).

Gene ontology analysis

EggNOG-mapper v2 (http://eggnog-mapper.embl.de/) was used to perform a search against the eggNOG5.0 database for gene ontology (GO) functional annotation (Huerta-Cepas et al., 2019). The prediction of orthology was conducted through sequence alignment, while applying bit-score or E-value filtering to improve the quality of orthology assignments. Functional classification was attained by associating the GO annotation terms with the proteins involved in established biological processes. The results of the GO analysis are available in the Table S7 and visualized using Tbtools (Chen et al., 2018).

Results

Identification and protein features of orchid CYP75s

A total of 72 CYP75s were identified from 13 orchids, with the number of CYP75s in each orchid ranging from two to ten (Table 1). Our study reveals that CYP75B subfamily has a significantly higher number (53/72) compared to CYP75A subfamily (19/72). Each of the 13 orchids contains one−four CYP75A genes and between two−six CYP75B genes. Among them, V. planifolia has the largest number of CYP75 genes, with a total of ten (four CYP75A and six CYP75B), while Pl. guangdongensis has the smallest number with only two CYP75B genes and lacks CYP75A gene.

Table 1

SpeciesGene IDNameAAa(aa)pIbMwc(kDa)IIdAIeGRAVYfLocalizationg
P. aphroditePAXXG087010PapCYP75A15069.1456.7449.8799.05-0.069Endoplasmic reticulum
PAXXG006510PapCYP75B15516.6961.3739.4097.64-0.056Endoplasmic reticulum
PAXXG079820PapCYP75B25136.5457.3344.20101.15-0.123Endoplasmic reticulum
PAXXG116530PapCYP75B35067.7356.3837.35103.700.021Endoplasmic reticulum
PAXXG116560PapCYP75B45036.6855.9038.79102.45-0.004Endoplasmic reticulum
PAXXG116590PapCYP75B52696.1430.4945.0193.57-0.138Endoplasmic reticulum
PAXXG172030PapCYP75B65226.6257.2832.68101.650.042Endoplasmic reticulum
P. equestrisPeq013982PeqCYP75A14917.6554.7743.5794.75-0.099Endoplasmic reticulum
Peq002928PeqCYP75B14896.6953.8934.3299.12-0.038Endoplasmic reticulum
Peq006329PeqCYP75B25316.6259.1439.4898.93-0.053Endoplasmic reticulum
Peq008143PeqCYP75B32715.5930.7936.8098.97-0.084Endoplasmic reticulum
Peq008146PeqCYP75B45118.3957.0239.79100.800.015Endoplasmic reticulum
Peq013868PeqCYP75B55166.5457.4845.36102.07-0.111Endoplasmic reticulum
C. goeringiiGL07540CgCYP75A15038.7555.8248.2998.53-0.030Endoplasmic reticulum
GL10771CgCYP75A22354.8326.1041.0393.83-0.076Endoplasmic reticulum
GL07339CgCYP75B15339.2458.6636.5698.26-0.042Endoplasmic reticulum
GL13941CgCYP75B22925.7032.2835.72102.91-0.092Endoplasmic reticulum
GL26796CgCYP75B35078.1955.9739.78101.600.017Endoplasmic reticulum
GL26797CgCYP75B42725.9930.4739.9896.18-0.153Endoplasmic reticulum
GL27961CgCYP75B55176.9157.3146.28103.23-0.048Endoplasmic reticulum
GL27962CgCYP75B62905.6332.2041.2396.59-0.192Endoplasmic reticulum
C. ensifoliumJL017476CeCYP75A15038.7555.8349.3398.73-0.026Endoplasmic reticulum
JL011638CeCYP75B15197.1357.6938.75103.030.022Endoplasmic reticulum
JL016547CeCYP75B25176.9457.3845.00104.16-0.036Endoplasmic reticulum
JL021317CeCYP75B35238.4157.8239.78100.36-0.021Endoplasmic reticulum
C. sinenseMol022332CsCYP75A15038.6455.9548.6198.33-0.040Endoplasmic reticulum
Mol004997CsCYP75B15309.2458.3035.5696.23-0.070Endoplasmic reticulum
Mol005994CsCYP75B25318.4758.9538.51102.17-0.014Endoplasmic reticulum
Mol021869CsCYP75B35156.7557.1443.98104.19-0.036Endoplasmic reticulum
V. planifoliaVpla_KAG0447236.1VplCYP75A15078.4556.2443.53100.890.053Endoplasmic reticulum
Vpla_KAG0447237.1VplCYP75A22705.1329.7432.8191.85-0.002Endoplasmic reticulum
Vpla_KAG0495957.1VplCYP75A34356.1148.3343.0788.85-0.116Endoplasmic reticulum
Vpla_KAG0496007.1VplCYP75A45038.1055.9247.0593.70-0.031Endoplasmic reticulum
Vpla_KAG0451884.1VplCYP75B15236.4858.0535.05101.99-0.030Endoplasmic reticulum
Vpla_KAG0455627.1VplCYP75B25067.0056.2136.6298.08-0.043Endoplasmic reticulum
Vpla_KAG0455628.1VplCYP75B35018.5155.6338.04102.380.025Endoplasmic reticulum
Vpla_KAG0466647.1VplCYP75B45338.7059.7541.1091.82-0.238Endoplasmic reticulum
Vpla_KAG0466648.1VplCYP75B54818.5853.8638.3493.06-0.289Endoplasmic reticulum
Vpla_KAG0497903.1VplCYP75B65426.6360.1338.6699.48-0.052Endoplasmic reticulum
D. chrysotoxumMaker09736DchCYP75A15048.5955.8447.4699.09-0.037Endoplasmic reticulum
Maker93966DchCYP75A25048.3955.8647.4699.27-0.023Endoplasmic reticulum
Maker59722DchCYP75B15318.1259.2341.61105.63-0.019Endoplasmic reticulum
Maker83891DchCYP75B25196.7557.4943.90100.39-0.054Endoplasmic reticulum
Maker109076DchCYP75B36688.7976.2131.78100.70-0.086Endoplasmic reticulum
Maker118630DchCYP75B45527.2861.4537.5794.71-0.121Endoplasmic reticulum
Maker118636DchCYP75B55187.2857.1739.2398.82-0.027Endoplasmic reticulum
D. catenatumDca000941DcaCYP75A15048.0855.8443.51100.260.008Endoplasmic reticulum
Dca000430DcaCYP75B15237.8057.9040.36101.68-0.029Endoplasmic reticulum
Dca008242DcaCYP75B25126.8656.6931.4498.11-0.036Endoplasmic reticulum
Dca013687DcaCYP75B35257.7559.5639.05103.07-0.067Endoplasmic reticulum
Dca013688DcaCYP75B45128.0457.4139.50103.030.006Endoplasmic reticulum
Dca020470DcaCYP75B55398.1260.3238.82105.14-0.027Endoplasmic reticulum
D. huoshanenseDhu000016471DhuCYP75A15047.6355.7543.8499.48-0.002Endoplasmic reticulum
Dhu000016482DhuCYP75A25048.6655.8345.4599.70-0.012Endoplasmic reticulum
Dhu000012876DhuCYP75B15217.2957.7140.21101.69-0.030Endoplasmic reticulum
Dhu000016330DhuCYP75B25176.3157.3533.9796.79-0.052Endoplasmic reticulum
Dhu000019542DhuCYP75B35398.3960.3338.66104.42-0.040Endoplasmic reticulum
Dhu000020018DhuCYP75B45318.1259.2341.61105.63-0.019Endoplasmic reticulum
Pl. guangdongensisPGU007087PguCYP75B15208.6257.7640.3798.65-0.086Endoplasmic reticulum
PGU010950PguCYP75B25228.9658.0652.92101.69-0.071Endoplasmic reticulum
Pl. zijinensisPZI001224PziCYP75A15049.4155.8647.87101.610.031Endoplasmic reticulum
PZI012105PziCYP75B15208.3257.6440.2099.04-0.077Endoplasmic reticulum
PZI015225PziCYP75B25148.8056.4643.1994.36-0.047Endoplasmic reticulum
A. shenzhenicaAsh019093AshCYP75A15086.6356.6747.9098.17-0.026Endoplasmic reticulum
Ash001251AshCYP75B15217.7157.1834.43100.56-0.016Endoplasmic reticulum
Ash004102AshCYP75B25207.2657.7842.6095.44-0.061Endoplasmic reticulum
Ash007554AshCYP75B35207.3156.9446.9198.71-0.036Endoplasmic reticulum
Ash015225AshCYP75B45086.2256.1645.8495.61-0.011Endoplasmic reticulum
G. elataGel004675GelCYP75A15147.1457.2543.06101.01-0.064Endoplasmic reticulum
Gel010620GelCYP75B15128.7956.2449.5896.45-0.081Endoplasmic reticulum
Gel016567GelCYP75B25179.2657.5047.94100.97-0.064Endoplasmic reticulum
Gel005429GelCYP75B35048.3756.2742.41102.640.013Endoplasmic reticulum

A list of CYP75 genes in 13 Orchids, their characteristics, and subcellular localization of proteins.

AAa, Amino acid number; pIb, Theoretical isoelectric point; Mwc (kDa), Molecular weight; IId, Instability index; AIe, Aliphatic index; GRAVYf, Grand average of hydrophobicity; Localizationg, Subcellular localization predicted by Plant-mPloc (Chou and Shen, 2010).

The 72 CYP75 protein sequences range from 235−668 amino acids, with a mean of 493. The molecular weight ranges from 26.10−76.21 kDa, with a mean of 54.80 kDa. Around 88.89% (64/72) of the CYP75 proteins have high isoelectric points (pI>7), with an average of 7.57. The average instability index (II) is 41.39, and 39 CYP75 proteins are below this index, indicating good protein stability. The average aliphatic index (AI) for the 72 CYP75 proteins is 99.40, indicating high thermal stability. Moreover, the calculated mean hydrophilic index (GRAVY) of CYP75 proteins in all orchids is negative, indicating a high degree of hydrophilicity. All CYP75 proteins are localized within the endoplasmic reticulum (ER), as evidenced by subcellular localization results. This localization pattern is consistent with the majority of CYP proteins, which are known to primarily function in the ER (Neve and Ingelman-Sundberg, 2010).

Gene ontology analysis was performed to delineate gene functional classifications of orchid CYP75s and investigate the important biological processes they might be involved in. As a result, GO terms “response to stimulus,” “response to auxin”, “response to organic substance”, “response to endogenous stimulus”, “response to chemical”, “response to hormone”, “secondary metabolic process”, “secondary metabolite biosynthetic process”, “obsolete oxidation-reduction process”, “membrane” and “oxidoreductase activity” constituted the greatest number of genes for GO ontologies “Biological Process”, “Cellular Component,” and “Molecular Function”, respectively (Figure S2; Table S7). The data suggests that the gene ontology of orchid CYP75s is significantly enriched in the “Biological Process”, which is strongly linked to the production of plant metabolites. Additionally, it is heavily concentrated within response elements that relate to the plant’s reaction to external environmental factors.

Phylogenetic analysis of orchid CYP75s

A phylogenetic tree was constructed to analyze the evolutionary patterns of orchid CYP75s (Figure 2). 72 CYP75 proteins were used, and eight query protein sequences were used. The maximum likelihood (ML) method was used for the phylogenetic tree, which was constructed with RAxML on the CIPRES Science Gateway web server (RAxML-HPC2 on XSEDE; Miller et al., 2015). Bootstrap values were 1,000 replicates with the JTT+I+G4 model. The phylogenetic tree indicated that CYP75s belonged to two categories: CYP75A and CYP75B, which is consistent with the previous studies (Tanaka and Brugliera, 2013). The results clearly show that the number of members in the CYP75B subfamily has far exceeded than that of the CYP75A subfamily in the course of evolution. The branch relationships of the eight known CYP75 protein sequences are found to be correct.

Figure 2

Motif variation and gene structure of orchid CYP75s

Motifs of CYP75 proteins in 13 orchids were examined using the online analysis tool MEME, and 20 motifs were set as upper bound (Figure 3A). There are four conserved motifs commonly found in cytochrome P450 enzymes: PERF motif, K-helix region, I-helix region and the consensus sequences of the heme-binding region (Figures 1, 3B), also referred to as the “P450 signature” (Crooks et al., 2004; Qi et al., 2017). As depicted in Figure 1, all CYP75 proteins found in orchids possess the four common CYP motifs referred to earlier. A total of 20 motifs were detected in the CYP75 proteins using the MEME software (Bailey et al., 2009). The number of CYP75 motifs ranges from nine to 18. The results show that most CYP75 proteins have identical sequence beginning with motif 18 and followed by motif 5, 13, 2, 14, 10, 6, 8, 16, 12, 3, 1, 17, 11, 4, 7, 15, ending with motif 9. Furthermore, all protein sequences of CYP75 exhibit the highly conserved motif 12, 3, 1, 17, 11, 4, 7, 15, and 9. (Figure 3A). Among the 20 motifs, motif 1 corresponds to K-helix region, motif 4 to heme-binding region, motif 11 to PERF motif, and motif 12 to I-helix region (Figure 3B). Furthermore, the unique conserved motifs in each subfamily are also found: motifs 19 are specific to the CYP75A subfamily, while motifs 17 are exclusively present in the CYP75B subfamily. This finding will greatly facilitate the identification of subfamilies within the CYP75 gene family. These 72 CYP75s share many common motifs, indicating high conservation. Most of the CYP75 protein sequences are conserved with differences mainly in the N terminus and intermediate regions. However, there are some CYP75 protein sequences containing a small number of motifs but also contain the typical motifs of CYP protein.

Figure 3

To further explore the characteristics of CYP75 genes in orchids, intron-exon structure is analyzed as shown by Figure S3. The results show that the orchid CYP75 family is composed of one−eight exons and one−seven introns, and in DchCYP75B3 and DhuCYP75B4, the exons are split into many small fragments by introns. Most CYP75 protein sequences have relatively long intron regions, while all CYP75A genes contain only one intron, which is a unique feature of the CYP75A subfamily.

Chromosomal localization of orchid CYP75s

As shown in Figure 4A, both CYP75A genes of C. goeringii are located on chromosome 02, while the remaining CgCYP75B genes are scattered on chromosomes 01, 08, 11, and 14. Among them, chromosomes 02, 08 and 11 all have two CgCYP75s located at the same site. The four CYP75s of C. ensifolium are scattered on chromosomes 01, 02, 09 and 11, respectively (Figure 4B). The CYP75 genes of D. huoshanense are evenly distributed across five chromosomes, specifically chromosomes 01, 06, 10, 16, and 18. The two DhuCYP75A genes are located together on chromosome 06 (Figure 4C). In D. chrysotoxum, only DchCYP75B4 and DchCYP75B5 are co-located on chromosome 07, while the remaining DchCYP75 are separately distributed on various chromosomes. Among them, DchCYP75A1 has not been assembled onto the chromosome, but is located on scaffold 787 (Figure 4D). For both Platanthera species, two PguCYP75Bs are co-located on chromosome 02 and 08 of Pl. guangdongensis, while three PziCYP75s are individually located on chromosome 01, 10 and 13 of Pl. zijinensis (Figures 4E, F).

Figure 4

Collinearity analysis of CYP75 gene family in three orchids

To investigate the evolution of CYP75 genes in orchids, the collinear relationship among CYP75s in C. goeringii, C. ensifolium, and D. chrysotoxum was analyzed. Our collinear analysis revealed a one-to-one correspondence among all CYP75 genes in the three orchids, indicating limited reshuffling of CYP75 orthologs and significant genomic rearrangements following the divergence of Dendrobium and Cymbidium lineages (Figure 5). Furthermore, we also examined that CYP75 gene tandem duplication occurred on the chromosomes of both C. goeringii and D. chrysotoxum, which was consistent with the results of chromosome localization (Figure 4). Our results suggest that a small-scale tandem duplication may have led to the expansion of CYP75 gene family in orchids.

Figure 5

Cis-acting regulatory elements of CgCYP75s

To explore the regulatory roles of CYP75s, we retrieved the 2,000 bp upstream and downstream regions of CYP75 genes in C. goeringii to identify potential cis-elements. We identified a total of 1,257 cis-acting elements, including 38 types and ten responsive functions (Figure 6; Table S4). Among these elements, TATA-box made up the most common elements (46.38%), followed by CAAT-box (22.83%) (Table S5). Cis-element functions included phytohormone responsiveness for gibberellin, auxin, methyl jasmonate (MeJA), salicylic acid, and abscisic acid (ABA); stress responses, such as anoxic, anaerobic, low-temperature, and defense; and growth and development elements, such as light response and MYB binding site (Figure 6). Each CgCYP75 gene contained multiple types of elements with light responsiveness as the most occurring element function (Figure 6), supporting that light is one of the most important environmental factors affecting flavonoid biosynthesis in plants (Zoratti et al., 2014). The second and third most abundant types of elements identified were MeJA-responsive and ABA-responsive elements (Table S4). The results suggest that these elements may play a role in modulating these two phytohormones. Of particular note are the MYB binding site elements, which are key transcription factors involved in pigment synthesis. The significant proportion of these elements further emphasizes the critical role of the CYP75 genes in the synthesis of plant pigments.

Figure 6

Expression patterns of orchid CYP75s and qRT-PCR verification of CgCYP75s

The orchid transcriptomic data from nine orchids out of 13 were visualized as heatmaps via TBtools software as shown in Figure 7. Expression analysis was conducted using transcriptome data from nine orchids, including various plant organs, such as sepals, petals, labellums (also known as lips), the gynostemium, inflorescence, stems, tubers, pollinia, and complete flowers. The expression profile showed that CYP75B genes were expressed broadly in flower organs, while the expression of CYP75A genes was more confined to C. ensifolium, D. catenatum and G. elata, particularly in the gynostemiums (Figure 7). In C. goeringii, CYP75A genes showed little expression in all tissues of two varieties, while CgCYP75B1 exhibited an exclusive expression in four organs of ‘PR’ C. goeringii and gynostemium of ‘GY’ C. goeringii (Figure 7A). Furthermore, CeCYP75B1 and CeCYP75B2 exhibited elevated expression levels across all four floral organs (Figure 7B). In D. chrysotoxum, the expression of the DchCYP75B4 gene was significantly higher on the lip than other parts, due to the predominant distribution of its anthocyanin in the red macula of the lip (Figure 7C). A similar situation existed in D. catenatum, DcaCYP75B1 and DcaCYP75B3 were highly expressed in the lip with purple-red spots (Figure 7D). PeqCYP75B1 was expressed prominently in various tissues of P. equestris, with higher expression in the darker colored lip region. On the other hand, PeqCYP75B5 showed significant expression levels in the sepals. (Figure 7E). AshCYP75B3 of A. shenzhenica was expressed in all parts except for the pollinium, with the highest expression in the inflorescence (Figure 7F). In both of the Platanthera species, PziCYP75B1 and PguCYP75B4 exhibited the highest expression levels in fleshy underground tubers (Figure 7G), possibly indicating their involvement in non-biological stress response (Iwashina, 2003; Pourcel et al., 2007; Zhang et al., 2020). Similarly, G. elata was a mycoheterotrophy orchid without anthocyanins in its tissues, and the expression of GelCYP75A and GelCYP75B were comparable in various tissues (Figure 7H). Their functions are not related to anthocyanin synthesis but are possibly associated with physiological activities required for adaptation to its unique habitat (Iwashina, 2003; Pourcel et al., 2007; Zhang et al., 2020).

Figure 7

To further investigate the specific roles of CYP75 gene expression in two C. goeringii varieties, their sepals, petals, lips and gynostemiums were analyzed by qRT-PCR (Figure 8). In two C. goeringii varieties (Figure 8A), the CgCYP75B1 showed extremely high expression in ‘PR’ C. goeringii, especially in petal which matched the transcriptome data, but were barely detected in ‘GY’ C. goeringii, further verifying that CYP75B genes have an anthocyanin-specific expression in the flower organs. As for CgCYP75A1, its expression level was relatively low in four types of tissues of the two C. goeringii plants (Figure 8B). Interestingly, CgCYP75A1 and CgCYP75B1 showed a certain degree of stable expression in the gynostemiums of both C. goeringii plants (Figure 8B). The function of these CYP75 genes may not be related to anthocyanin synthesis, but rather to attracting pollinators to promote reproduction (Samanta et al., 2011).

Figure 8

Discussion

Flower color is one of the key criteria for evaluating the quality of ornamental plants, as well as their horticultural and economic values. Plant flower color is also a major factor in attracting pollinators, which helps to increase the success rate of pollination and plays an important role in the evolution of plants (Whibley et al., 2006; Hopkins and Rausher, 2012; Mu et al., 2017). Orchidaceae contains the most colorful plants in the world with a rich variety of colors and characteristics (Roberts and Dixon, 2008). As one of the earliest identified gene families in the CYP450 superfamily, CYP75 plays an important role in regulating plants’ flavonoids biosynthesis and the synthesis of flower pigments (Ayabe and Akashi, 2006; Tanaka and Brugliera, 2013). How the CYP75 gene family affects the flower color formation process and a series of physiological processes in orchids by regulating flavonoid biosynthesis is an interesting topic. In this research, a total of 72 CYP75 genes were identified from 13 orchids, according to their respective quantities, suggesting that the CYP75 gene family consists of a small number of copies (Table 1). In this study, the amount of CYP75 varied significantly among each orchid, ranging from two to ten. All extant orchids share a common whole-genome duplication (WGD) event in their ancestry, which was followed by varying degrees of gene loss, resulting in the formation of five subfamilies (Zhang et al., 2017). The number of coding genes in each orchid varies substantially, with examples being 21,938 for P. equestris (Cai et al., 2015), 21,743 for A. shenzhenica (Zhang et al., 2017), 30,897 for (Chung et al., 2021; Sun et al., 2021) C. goeringii, and 29,044 for V. planifolia (Hasing et al., 2020). In addition, some orchids such as C. goeringii, D. huoshanense, and D. chrysotoxum have CYP75 genes that repeat in tandem (Figure 4). Collinearity analyses of CYP75 genes in C. goeringii and C. ensifolium, C. goeringii and D. chrysotoxum have also approved this (Figure 5). Furthermore, there were differences in genome assembly quality among the 13 orchids evaluated. Analyses of Benchmarking Universal Single Copy Orthologs (BUSCO) (Simo et al., 2015) revealed a generally low level of genome assembly completeness for orchids (Table S1). Consequently, the number of CYP75 genes in orchids varies widely.

Phylogenetic relationships and protein structure analyses support the division of the CYP75 gene family into two branches, CYP75A and CYP75B (Figure 2) which is in agreement with previous studies (Tanaka and Brugliera, 2013). Notably, no CYP75A gene was found in Pl. guangdongensis, which is similar to A. thaliana lacking CYP75A (Schoenbohm et al., 2000). The main function of the CYP75A genes is to regulate the precursor of blue anthocyanins (Rausher, 2006). Many blue anthocyanins pigments are formed by the presence of the trioxide B-ring of delphinium derivatives (Ayabe and Akashi, 2006). Rausher (2008) documented the macroscopic trend of flower color evolution in angiosperms, finding that flowers evolving from blue to red were more common than in the opposite situation. An illustration of this phenomenon is observed in the grape genus, whereby the peels, which is blue-violet in color, contains a significant concentration of delphinidins. This leads to an increased quantity of the CYP75A genes (Falginella et al., 2010). However, as for Orchidaceae, the red cyanidin color trait is more common than blue delphinidin color trait, this also explains why the CYP75A gene regulating F3’5’H is significantly less than the CYP75B gene regulating F3’H. This may be attributed to the fact that blue flower species are less abundant than red flower species in Orchidaceae. It may suggest that red hue is a favored trait in the evolution of angiosperms and the CYP75B subfamily, which governs the synthesis of anthocyanin red precursors, holds an evolutionary edge over the CYP75A subfamily. Researchers have found that some CYP75B genes can function as CYP75A genes, such as O. sativa and some Asteraceae plants (Lam et al., 2015). Phylogenetic analysis of existing sequences of CYP75s revealed that CYP75A was derived from CYP75B prior to the divergence of angiosperms and gymnosperms (Seitz et al., 2006). It is demonstrated that the CYP75B subfamily has perhaps a more superior evolutionary position in the CYP75 gene family, performing not only in its own function, but sometimes work as compensation for the CYP75A subfamily (Xiao et al., 2021). In this study, all orchid CYP75s were found to be located on the endoplasmic reticulum, as previous research has shown that CYP450 is a membrane protein primarily found on the ER membrane in eukaryotes (Brignac-Huber et al., 2016). It can be deduced that orchid CYP75 genes are likely involved in plant metabolism and response to biotic stress via the membrane system.

All members of the CYP gene family have four identifiable motifs: the PERF motif, K-helix region, I-helix region, and heme-binding region, as shown in Figure 1 (Crooks et al., 2004; Qi et al., 2017). In addition, this study finds specific motifs within two respective subfamilies: motif 19 is exclusive to CYP75A subfamily, while motif 17 exists only in CYP75B subfamily (Figure 3). These results provide more possibilities for accurate recognition of the two subfamilies in CYP75.

Whole-genome sequencing has enabled researchers to uncover variations in the gene structure of gene families across different species. While gene structure tends to be conserved within the same clade, peculiarities have been observed in the CYP75 gene family of orchids, particularly in subfamilies CYP75A and CYP75B. In contrast to the longer introns typically found in orchids, CYP75As has a single intron (Figure S3), which is a unique characteristic. Longer introns are believed to be favored during gene evolution as they increase recombination between adjacent exons, thus promoting natural selection efficiency (Jo and Choi, 2015). This unique feature of Orchidaceae may explain the extraordinary diversity of orchids.

Gene expression is primarily regulated by cis-acting elements of the transcription start site (Hernandez-Garcia and Finer, 2014). This study identified various types of regulatory elements within the upstream and downstream region of CYP75 genes in C. goeringii, which were further classified into plant hormone response elements, stress response elements, and growth and development elements (Figure 6). Among these elements, the number of light response elements was the largest, indicating that light is one of the most important environmental factors influencing flavonoid biosynthesis (Zoratti et al., 2014), which is closely related to anthocyanin synthesis. Additionally, there were also a considerable number of MYB binding sites within the cis-acting elements of orchid CYP75s. MYB is one of the most important transcription factors regulating plant pigments, and it can enhance B-ring hydroxylation by upregulating F3’5’H1 (Ma et al., 2021). Further research is needed to understand the interaction patterns between the two.

Studies have shown that the expression level of the CYP75 genes is positively related to the accumulation of anthocyanins. Based on the RT-qPCR and transcriptome expression analysis in this study, this point of view can be supported. In two C. goeringii varieties with significantly different colors, the expression of the CYP75B genes which regulate the purple-red anthocyanins is generally much higher in ‘PR’ C. goeringii compared to ‘YG’ C. goeringii. Moreover, the expression level of CgCYP75B is higher than that of CgCYP75A in any of the two varieties. Furthermore, in the lip of ‘YG’ C. goeringii without anthocyanin, the CYP75 gene shows relatively stable expression. This suggests that the function of the CYP75 genes is not mainly to regulate anthocyanin synthesis. The two F3’H and F3’5’H enzymes regulated by the CYP75 genes are both flavonoids, and flavonoids are involved in almost all metabolic processes in plants, including attracting pollinators (Wen et al., 2020). Orchidaceae-specific CYP75s probably reflect the adaptive value of efficient attraction of pollinators. These results further refine the theoretical framework of the CYP75 gene family’s relevance to flower color in orchids and provide new insights into the mechanism of color formation. Furthermore, it is now possible to alter plant color through gene silencing or introducing exogenous genes, which may solve the problem of scarce blue flowers and lay the foundation for the breeding and improvement of novel flower colors in orchids.

Conclusion

In this study, we identified 72 members of the CYP75 gene family from the 13 orchid genomes. We analyzed their protein physicochemical properties, subcellular localization, motifs, intron-exon structures, chromosome distribution, promoter elements, expression patterns and gene ontology classification. We found that the CYP75 gene family in Orchidaceae is a low-copy gene family with high conservation. V. planifolia has the highest count of CYP75 genes, with ten, while Pl. guangdongensis has only two genes, the CYP75A gene is missing and only two CYP75B genes are present. We observed a notable discrepancy between the number of genes in the CYP75A and CYP75B subfamilies, which could potentially be attributed to differences in their evolutionary statuses. The characteristic motifs of CYP450 are present in all CYP75 proteins, with a specific motif found in two subfamilies, respectively. We demonstrate that the distinct roles of cis-elements in light response and MYB binding sites are working together with the crucial function of CYP75s in the biosynthesis of anthocyanins. In addition, the expression patterns generated by transcriptomic and RT-qPCR data supported a color-specific expression of CYP75Bs in the flower organs. Our study presents a comprehensive analysis of the functions and expression patterns of CYP75 genes in Orchidaceae. These results build a foundation for deeper understanding regarding the role of CYP75 genes in plant anthocyanin biosynthesis of plants, offering insights into the flexibility of plant pigmentation. A crucial subsequent task will entail conducting functional analysis of CYP75 in non-model plants, in order to discern the further functions of CYP75 in the context of angiosperms evolution.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author contributions

Z-JL and SL conceived and designed the research. XZ and M-MZ prepared the original draft. XH and YH performed the data analysis. YL and SA wrote and edited the language. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Forestry Peak Discipline Construction Project of Fujian Agriculture and Forestry University (72202200205).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1243828/full#supplementary-material

References

  • 1

    AiY.LiZ.SunW.-H.ChenJ.ZhangD.MaL.et al. (2021). The Cymbidium genome reveals the evolution of unique morphological traits. Hortic. Res.8, 255. doi: 10.1038/s41438-021-00683-z

  • 2

    ArtimoP.JonnalageddaM.ArnoldK.BaratinD.CsardiG.de CastroE.et al. (2012). ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res.40, W597W603. doi: 10.1093/nar/gks400

  • 3

    AyabeS.AkashiT. (2006). Cytochrome P450s in flavonoid metabolism. Phytochem. Rev.5, 271282. doi: 10.1007/s11101-006-9007-3

  • 4

    BabuP. R.RaoK. V.ReddyV. D. (2013). Structural organization and classification of cytochrome P450 genes in flax (Linum usitatissimum L.). Gene513, 156162. doi: 10.1016/j.gene.2012.10.040

  • 5

    BaileyT. L.BodenM.BuskeF. A.FrithM.GrantC. E.ClementiL.et al. (2009). MEME SUITE: Tools for motif discovery and searching. Nucleic Acids Res.37, W202W208. doi: 10.1093/nar/gkp335

  • 6

    Brignac-HuberL. M.ParkJ. W.ReedJ. R.BackesW. L. (2016). Cytochrome P450 organization and function are modulated by endoplasmic reticulum phospholipid heterogeneity. Drug Metab. Dispos44 (12), 18591866. doi: 10.1124/dmd.115.068981

  • 7

    BruglieraF.Barri-RewellG.HoltonT. A.MasonJ. G. (1999). Isolation and characterization of a flavonoid 3’-hydroxylase cDNA clone corresponding to the Ht1 locus of Petunia hybrida. Plant J.19, 441451. doi: 10.1046/j.1365-313X.1999.00539.x

  • 8

    CaiJ.LiuX.VannesteK.ProostS.TsaiW.-C.LiuK.-W.et al. (2015). The genome sequence of the orchid Phalaenopsis equestris. Nat. Genet.47, 6572. doi: 10.1038/ng.3149

  • 9

    CederbaumA. I. (2015). Molecular mechanisms of the microsomal mixed function oxidases and biological and pathological implications. Redox Biol.4, 6073. doi: 10.1016/j.redox.2014.11.008

  • 10

    ChaoY.-T.ChenW.-C.ChenC.-Y.HoH.-Y.YehC.-H.KuoY.-T.et al. (2018). Chromosome-level assembly, genetic and physical mapping of Phalaenopsis aphrodite genome provides new insights into species adaptation and resources for orchid breeding. Plant Biotechnol. J.16, 20272041. doi: 10.1111/pbi.12936

  • 11

    ChaseM. W.CameronK. M.FreudensteinJ. V.PridgeonA. M.SalazarG.van den BergC.et al. (2015). An updated classification of Orchidaceae: Updated Classification of Orchidaceae. Bot. J. Linn Soc.177, 151174. doi: 10.1111/boj.12234

  • 12

    ChenC.ChenH.HeY.XiaR. (2018). TBtools, a toolkit for biologists integrating various biological data handling tools with a user-friendly interface. bioRxiv [Preprint]. doi: 10.1101/289660

  • 13

    ChouK.-C.ShenH.-B. (2010). Plant-mPLoc: a top-down strategy to augment the power for predicting plant protein subcellular localization. PLoS One5, e11335. doi: 10.1371/journal.pone.0011335

  • 14

    ChungO.KimJ.BolserD.KimH. M.JunJ. H.ChoiJ. P.et al. (2022). A chromosome-scale genome assembly and annotation of the spring orchid (Cymbidium goeringii). Mol Ecol Resour22, 1168–1177. doi: 10.1111/1755-0998.13537

  • 15

    CrooksG. E.HonG.ChandoniaJ.-M.BrennerS. E. (2004). WebLogo: a sequence logo generator. Genome Res.14, 11881190. doi: 10.1101/gr.849004

  • 16

    DarribaD.PosadaD.KozlovA. M.StamatakisA.MorelB.FlouriT. (2019). ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol.37, 291–294. doi: 10.1093/molbev/msz189

  • 17

    DarribaD.PosadaD.KozlovA. M.StamatakisA.MorelB.FlouriT. (2004). ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol.37, 291294. doi: 10.1093/molbev/msz189

  • 18

    FalginellaL.CastellarinS. D.TestolinR.GambettaG. A.MorganteM.Di GasperoG. (2010). Expansion and subfunctionalisation of flavonoid 3’,5’-hydroxylases in the grapevine lineage. BMC Genomics11, 562. doi: 10.1186/1471-2164-11-562

  • 19

    HanB.JingY.DaiJ.ZhengT.GuF.ZhaoQ.et al. (2020). A chromosome-level genome assembly of Dendrobium huoshanense using long reads and Hi-C data. Genome Biol. Evol.12, 24862490. doi: 10.1093/gbe/evaa215

  • 20

    HasingT.TangH.BrymM.KhaziF.HuangT.ChambersA. H. (2020). A phased Vanilla planifolia genome enables genetic improvement of flavour and production. Nat. Food1, 811819. doi: 10.1038/s43016-020-00197-2

  • 21

    HeZ.ZhangH.GaoS.LercherM. J.ChenW.-H.HuS. (2016). Evolview v2: An online visualization and management tool for customized and annotated phylogenetic trees. Nucleic Acids Res.44, W236W241. doi: 10.1093/nar/gkw370

  • 22

    Hernandez-GarciaC. M.FinerJ. J. (2014). Identification and validation of promoters and cis-acting regulatory elements. Plant Sci.217–218, 109119. doi: 10.1016/j.plantsci.2013.12.007

  • 23

    HoltonT. A.BruglieraF.LesterD. R.TanakaY.HylandC. D.MentingJ. G.et al. (1993). Cloning and expression of cytochrome P450 genes controlling flower colour. Nature366 (6452), 276279. doi: 10.1038/366276a0

  • 24

    HopkinsR.RausherM. D. (2012). Pollinator-mediated selection on flower color allele drives reinforcement. Science335, 10901092. doi: 10.1126/science.1215198

  • 25

    HuW.HouX.HuangC.YanY.TieW.DingZ.et al. (2015). Genome-wide identification and expression analyses of Aquaporin gene family during development and abiotic stress in banana. IJMS16, 1972819751. doi: 10.3390/ijms160819728

  • 26

    HuangW.SunW.WangY. (2012). Isolation and molecular characterisation of flavonoid 3’-hydroxylase and flavonoid 3’, 5’-hydroxylase genes from a traditional Chinese medicinal plant, Epimedium sagittatum. Gene497, 125130. doi: 10.1016/j.gene.2011.11.029

  • 27

    Huerta-CepasJ.SzklarczykD.HellerD.Hernández-PlazaA.ForslundS. K.CookH.et al. (2019). eggNOG 5.0: A hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res.47, D309D314. doi: 10.1093/nar/gky1085

  • 28

    IwashinaT. (2003). Flavonoid function and activity to plants and other organisms. Biol. Sci. Space17, 2444. doi: 10.2187/bss.17.24

  • 29

    JoB.-S.ChoiS. S. (2015). Introns: The functional benefits of introns in genomes. Genomics Inform13, 112. doi: 10.5808/GI.2015.13.4.112

  • 30

    KianiY. S.JabeenI. (2019). Lipophilic metabolic efficiency (LipMetE) and drug efficiency indices to explore the metabolic properties of the substrates of selected cytochrome P450 isoforms. ACS Omega5, 179188. doi: 10.1021/acsomega.9b02344

  • 31

    KumarM. S.BabuP. R.RaoK. V.ReddyV. D. (2014). Organization and classification of cytochrome P450 genes in Castor (Ricinus communis L.). Proc. Natl. Acad. Sci. India Sect. B Biol. Sci.84, 131143. doi: 10.1007/s40011-013-0192-8

  • 32

    KumarS.StecherG.TamuraK. (2016). MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol.33, 18701874. doi: 10.1093/molbev/msw054

  • 33

    LamP. Y.LiuH.LoC. (2015). Completion of tricin biosynthesis pathway in rice: cytochrome P450 75B4 is a unique Chrysoeriol 5’-Hydroxylase. Plant Physiol.168, 15271536. doi: 10.1104/pp.15.00566

  • 34

    LescotM. (2002). PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res.30, 325327. doi: 10.1093/nar/30.1.325

  • 35

    LiB.DeweyC. N. (2011). RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinf.12, 323323. doi: 10.1186/1471-2105-12-323

  • 36

    LiC.DongN.ZhaoY.WuS.LiuZ.ZhaiJ. (2021). A review for the breeding of orchids: Current achievements and prospects. Hortic. Plant J.7, 380392. doi: 10.1016/j.hpj.2021.02.006

  • 37

    LiM.-H.LiuK.-W.LiZ.LuH.-C.YeQ.-L.ZhangD.et al. (2022). Genomes of leafy and leafless Platanthera orchids illuminate the evolution of mycoheterotrophy. Nat. Plants8, 373388. doi: 10.1038/s41477-022-01127-9

  • 38

    LiY.WeiK. (2020). Comparative functional genomics analysis of cytochrome P450 gene superfamily in wheat and maize. BMC Plant Biol.20, 93. doi: 10.1186/s12870-020-2288-7

  • 39

    LiuX.ZhuX.WangH.LiuT.ChengJ.JiangH. (2020). Discovery and modification of cytochrome P450 for plant natural products biosynthesis. SSB5, 187199. doi: 10.1016/j.synbio.2020.06.008

  • 40

    MaD.TangH.ReicheltM.PiirtolaE.-M.SalminenJ.-P.GershenzonJ.et al. (2021). Poplar MYB117 promotes anthocyanin synthesis and enhances flavonoid B-ring hydroxylation by up-regulating the flavonoid 3’,5’-hydroxylase gene. J. Exp. Bot.72, 38643880. doi: 10.1093/jxb/erab116

  • 41

    MillerM. A.SchwartzT.PickettB. E.HeS.KlemE. B.ScheuermannR. H.et al. (2015). A restful API for access to phylogenetic tools via the CIPRES Science Gateway. Evol. Bioinform. Online11, EBO.S21501. doi: 10.4137/EBO.S21501

  • 42

    MizutaniM.OhtaD. (2010). Diversification of P450 genes during land plant evolution. Annu. Rev. Plant Biol.61, 291315. doi: 10.1146/annurev-arplant-042809-112305

  • 43

    MuJ.YangY.LuoY.SuR.NiklasK. J. (2017). Pollinator preference and pollen viability mediated by flower color synergistically determine seed set in an Alpine annual herb. Ecol. Evol.7, 29472955. doi: 10.1002/ece3.2899

  • 44

    NeveE. P.Ingelman-SundbergM. (2010). Cytochrome P450 proteins: Retention and distribution from the endoplasmic reticulum. Curr. Opin. Drug Discovery Dev.13, 7885.

  • 45

    NelsonD. R.KoymansL.KamatakiT.StegemanJ. J.FeyereisenR.WaxmanD. J.et al. (1999). P450 superfamily: update on new sequences, gene mapping, accession numbers and nomenclature. Pharmacogenet Genomics6, 142. doi: 10.1097/00008571-199602000-00002

  • 46

    NiuZ.ZhuF.FanY.LiC.ZhangB.ZhuS.et al. (2021). The chromosome-level reference genome assembly for Dendrobium officinale and its utility of functional genomics research and molecular breeding study. Acta Pharm. Sin. B11, 20802092. doi: 10.1016/j.apsb.2021.01.019

  • 47

    OlsenK. M.HehnA.JugdéH.SlimestadR.LarbatR.BourgaudF.et al. (2010). Identification and characterisation of CYP75A31, a new flavonoid 3’ 5’-hydroxylase, isolated from Solanum lycopersicum. BMC Plant Biol.10, 21. doi: 10.1186/1471-2229-10-21

  • 48

    OmuraT.SatoR. (1962). A new cytochrome in liver microsomes. J. Biol. Chem.237, 13751376.

  • 49

    PourcelL.RoutaboulJ.CheynierV.LepiniecL.DebeaujonI. (2007). Flavonoid oxidation in plants: From biochemical properties to physiological functions. Trends Plant Sci.12, 2936. doi: 10.1016/j.tplants.2006.11.006

  • 50

    QiX.YuX.XuD.FangH.DongK.LiW.et al. (2017). Identification and analysis of CYP450 genes from transcriptome of Lonicera japonica and expression analysis of chlorogenic acid biosynthesis related CYP450s. PeerJ5, e3781. doi: 10.7717/peerj.3781

  • 51

    RausherM. D. (2006). “The evolution of flavonoids and their genes,” in The science of flavonoids (Berlin: Springer), 175211. doi: 10.1007/978-0-387-28822-2_7

  • 52

    RausherM. D. (2008). Evolutionary transitions in floral color. Int. J. Plant Sci.169, 721. doi: 10.1086/523358

  • 53

    ReddyS. M.KumarM. S.RaoK. V.ReddyV. D. (2014). Structural organization, classification and phylogenetic relationship of cytochrome P450 genes in Ctrus cleemtina and Citrus sinensis. Tree Genet Genomes10, 399409. doi: 10.1007/s11295-013-0695-8

  • 54

    RobertsD. L.DixonK. W. (2008). Orchids. Curr Biol18, R325R329. doi: 10.1016/j.cub.2008.02.026

  • 55

    RozewickiJ.LiS.AmadaK. M.StandleyD. M.KatohK. (2019). MAFFT-DASH: integrated protein sequence and structural alignment. Nucleic Acids Res.47, W5W10. doi: 10.1093/nar/gkz342

  • 56

    SamantaA.DasG.DasS. K. (2011). Roles of flavonoids in plants. Int J Pharm Sci Tech6, 1235.

  • 57

    SchoenbohmC.MartensS.EderC.ForkmannG.WeisshaarB. (2000). Identification of the Arabidopsis thaliana flavonoid 3’-hydroxylase gene and functional expression of the encoded P450 enzyme. Biol. Chem.381, 749–753. doi: 10.1515/BC.2000.095

  • 58

    SchulerM. A.Werck-ReichhartD. (2003). Functional genomics of P450s. Annu. Rev. Plant Biol.54, 629667. doi: 10.1146/annurev.arplant.54.031902.134840

  • 59

    SeitzC.AmeresS.ForkmannG. (2007). Identification of the molecular basis for the functional difference between flavonoid 3’-hydroxylase and flavonoid 3’,5’-hydroxylase. FEBS Lett.581, 34293434. doi: 10.1016/j.febslet.2007.06.045

  • 60

    SeitzC.EderC.DeimlB.KellnerS.MartensS.ForkmannG. (2006). Cloning, functional identification and sequence analysis of flavonoid 3’-hydroxylase and flavonoid 3’,5’-hydroxylase cDNAs reveals independent evolution of flavonoid 3’,5’-hydroxylase in the Asteraceae family. Plant Mol. Biol.61, 365381. doi: 10.1007/s11103-006-0012-0

  • 61

    SimoF. A.WaterhouseR. M.IoannidisP.KriventsevaE. V.ZdobnovE. M. (2015). Busco: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics31, 32103212. doi: 10.1093/bioinformatics/btv351

  • 62

    SunY.ChenG.-Z.HuangJ.LiuD.-K.XueF.ChenX.-L.et al. (2021). The Cymbidium goeringii genome provides insight into organ development and adaptive evolution in orchids. Ornamental Plant Res.1, 113. doi: 10.48130/OPR-2021-0010

  • 63

    TanakaY.BruglieraF. (2013). Flower colour and cytochromes P450. Phil. Trans. R. Soc B368, 20120432. doi: 10.1098/rstb.2012.0432

  • 64

    TodaK.YangD.YamanakaN.WatanabeS.HaradaK. (2002). A single-base deletion in soybean flavonoid 3-hydroxylase gene is associated with gray pubescence color. Plant Mol. Biol.50 (2), 187196. doi: 10.1023/a:1016087221334

  • 65

    TroshinP. V.ProcterJ. B.BartonG. J. (2011). Java bioinformatics analysis web services for multiple sequence alignment-JABAWS:MSA. Bioinformatics27, 20012002. doi: 10.1093/bioinformatics/btr304

  • 66

    VikhorevA. V.StryginaK. V.KhlestkinaE. K. (2019). Duplicated flavonoid 3’-hydroxylase and flavonoid 3’, 5’-hydroxylase genes in barley genome. PeerJ7, e6266. doi: 10.7717/peerj.6266

  • 67

    WangY.-S.XuY.-J.GaoL.-P.YuO.WangX.-Z.HeX.-J.et al. (2014). Functional analysis of flavonoid 3’,5’-hydroxylase from tea plant (Camellia sinensis): Critical role in the accumulation of catechins. BMC Plant Biol.14, 347. doi: 10.1186/s12870-014-0347-7

  • 68

    WangQ.-Q.ZhuM.-J.YuX.BiY.-Y.ZhouZ.ChenM.-K.et al. (2021). Genome-wide identification and expression analysis of terpene synthase genes in cymbidium faberi. Front. Plant Sci.12. doi: 10.3389/fpls.2021.751853

  • 69

    WangY. P.TangH. B.DebarryJ. DTanX.LiJ. P.WangX. Y.et al. (2012). MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res.40, e49. doi: 10.1093/nar/gkr1293

  • 70

    WenW.AlseekhS.FernieA. R. (2012). Conservation and diversification of flavonoid metabolism in the plant kingdom. Curr Opin Plant Biol.55, 100–108. doi: 10.1016/j.pbi.2020.04.004

  • 71

    WhibleyA. C.LangladeN. B.AndaloC.HannaA. I.BanghamA.ThébaudC.et al. (2006). Evolutionary paths underlying flower color variation in Antirrhinum. Science313, 963966. doi: 10.1126/science.1129161

  • 72

    XiaoY.WenJ.MengR.MengY.ZhouQ.NieZ.-L. (2021). The expansion and diversity of the CYP75 gene family in Vitaceae. PeerJ9, e12174. doi: 10.7717/peerj.12174

  • 73

    XuY.LeiY.SuZ.ZhaoM.ZhangJ.ShenG.et al. (2021). A chromosome-scale Gastrodia elata genome and large-scale comparative genomic analysis indicate convergent evolution by gene loss in mycoheterotrophic and parasitic plants. Plant J.108, 16091623. doi: 10.1111/tpj.15528

  • 74

    YangF.GaoJ.WeiY.RenR.ZhangG.LuC.et al. (2021). The genome of Cymbidium sinense revealed the evolution of orchid traits. Plant Biotechnol. J.19, 25012516. doi: 10.1111/pbi.13676

  • 75

    YuanY.JinX.LiuJ.ZhaoX.ZhouJ.WangX.et al. (2018). The Gastrodia elata genome provides insights into plant adaptation to heterotrophy. Nat. Commun.9, 1615. doi: 10.1038/s41467-018-03423-5

  • 76

    ZhangP.DuH.WangJ.PuY.YangC.YanR.et al. (2020). Multiplex CRISPR/Cas9-mediated metabolic engineering increases soya bean isoflavone content and resistance to soya bean mosaic virus. Plant Biotechnol. J.18, 13841395. doi: 10.1111/pbi.13302

  • 77

    ZhangT.LiuC.HuangX.ZhangH.YuanZ. (2019). Land-plant phylogenomic and pomegranate transcriptomic analyses reveal an evolutionary scenario of CYP75 genes subsequent to whole genome duplications. J. Plant Biol.62, 4860. doi: 10.1007/s12374-018-0319-9

  • 78

    ZhangG.-Q.LiuK.-W.LiZ.LohausR.HsiaoY.-Y.NiuS.-C.et al. (2017). The Apostasia genome and the evolution of orchids. Nature549, 379383. doi: 10.1038/nature23897

  • 79

    ZhangG.-Q.XuQ.BianC.TsaiW.-C.YehC.-M.LiuK.-W.et al. (2016). The Dendrobium catenatum Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution. Sci. Rep.6, 19029. doi: 10.1038/srep19029

  • 80

    ZhangW.ZhangG.ZengP.ZhangY.HuH.LiuZ.et al. (2021a). Genome sequence of Apostasia ramifera provides insights into the adaptive evolution in orchids. BMC Genomics22, 536. doi: 10.1186/s12864-021-07852-3

  • 81

    ZhangY.ZhangG.-Q.ZhangD.LiuX.-D.XuX.-Y.SunW.-H.et al. (2021b). Chromosome-scale assembly of the Dendrobium chrysotoxum genome enhances the understanding of orchid evolution. Hortic. Res.8, 183. doi: 10.1038/s41438-021-00621-z

  • 82

    ZhaoX.LiuD.-K.WangQ.-Q.KeS.LiY.ZhangD.et al. (2022). Genome-wide identification and expression analysis of the GRAS gene family in Dendrobium chrysotoxum. Front. Plant Sci.13. doi: 10.3389/fpls.2022.1058287

  • 83

    ZhaoX.ZhangM.HeX.ZhengQ.HuangY.LiY.et al. (2023). Genome-wide identification and expression analysis of the SPL gene family in three orchids. IJMS24, 10039. doi: 10.3390/ijms241210039

  • 84

    ZorattiL.KarppinenK.Luengo EscobarA.HäggmanH.JaakolaL. (2014). Light-controlled flavonoid biosynthesis in fruits. Front. Plant Sci.5. doi: 10.3389/fpls.2014.00534

Summary

Keywords

Orchidaceae, cytochrome P450, CYP75 gene family, expression analysis, Cymbidium goeringii

Citation

Li Y, Zhao X, Zhang M-M, He X, Huang Y, Ahmad S, Liu Z-J and Lan S (2023) Genome-based identification of the CYP75 gene family in Orchidaceae and its expression patterns in Cymbidium goeringii. Front. Plant Sci. 14:1243828. doi: 10.3389/fpls.2023.1243828

Received

22 June 2023

Accepted

11 September 2023

Published

27 September 2023

Volume

14 - 2023

Edited by

Dianella G. Howarth, St. John’s University, United States

Reviewed by

Cao Deng, DNA Stories Bioinformatics Center, China; Zhichao Xu, Northeast Forestry University, China

Updates

Copyright

*Correspondence: Zhong-Jian Liu, ; Siren Lan,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics