- Werner Siemens Chair of Synthetic Biotechnology, Department of Chemistry, Technical University of Munich, Munich, Germany
Diterpene synthases catalyze complex, multi-step C-C coupling reactions thereby converting the universal, aliphatic precursor geranylgeranyl diphosphate into diverse olefinic macrocylces that form the basis for the structural diversity of the diterpene natural product family. Since catalytically relevant crystal structures of diterpene synthases are scarce, homology based biomolecular modeling techniques offer an alternative route to study the enzyme's reaction mechanism. However, precise identification of catalytically relevant amino acids is challenging since these models require careful preparation and refinement techniques prior to substrate docking studies. Targeted amino acid substitutions in this protein class can initiate premature quenching of the carbocation centered reaction cascade. The structural characterization of those alternative cyclization products allows for elucidation of the cyclization reaction cascade and provides a new source for complex macrocyclic synthons. In this study, new insights into structure and function of the fungal, bifunctional Aphidicolan-16-ß-ol synthase were achieved using a simplified biomolecular modeling strategy. The applied refinement methodologies could rapidly generate a reliable protein-ligand complex, which provides for an accurate in silico identification of catalytically relevant amino acids. Guided by our modeling data, ACS mutations lead to the identification of the catalytically relevant ACS amino acid network I626, T657, Y658, A786, F789, and Y923. Moreover, the ACS amino acid substitutions Y658L and D661A resulted in a premature termination of the cyclization reaction cascade en-route from syn-copalyl diphosphate to Aphidicolan-16-ß-ol. Both ACS mutants generated the diterpene macrocycle syn-copalol and a minor, non-hydroxylated labdane related diterpene, respectively. Our biomolecular modeling and mutational studies suggest that the ACS substrate cyclization occurs in a spatially restricted location of the enzyme's active site and that the geranylgeranyl diphosphate derived pyrophosphate moiety remains in the ACS active site thereby directing the cyclization process. Our cumulative data confirm that amino acids constituting the G-loop of diterpene synthases are involved in the open to the closed, catalytically active enzyme conformation. This study demonstrates that a simple and rapid biomolecular modeling procedure can predict catalytically relevant amino acids. The approach reduces computational and experimental screening efforts for diterpene synthase structure-function analyses.
Introduction
With more than 50,000 different molecules known to date terpenes are the greatest natural occurring product family found in organisms from bacteria to fungi, mammals, and plants. They are all derived from the isoprene units' dimethylallyl diphosphate and isopentenyl diphosphate. Condensation reactions of this molecules lead to the formation of different length phosphorylated linear terpenes, serving as substrate for terpene synthases. This enzyme family carry out highly stereo complex C-C coupling reactions, resulting in structurally complex macrocycles that contribute to the structural and functional diversity of terpenes (Christianson, 2017). Diterpenes are derived from the linear aliphatic precursor geranylgeranyl diphosphate (GGDP) being cyclized by diterpene synthases. More specifically, diterpene synthases are classified into class I and class II enzymes based on the structural presence of the conserved motifs DDXD or DDXXD/E and NSE/DTE, respectively. While class II reactions perform a protonation initiated cyclization reaction to generate phosphorylated bicyclic structures, class I reactions are initiated by hydrolyses of the GGDP pyrophosphate moiety that is coordinated by a Mg2+-triad thereby generating mono- or poly-cyclic structures.
The natural product Aphidicolin, initially isolated from the fungus Cephalosporium aphidicola, is a hydroxylated, tetracyclic diterpenoid that exhibits a broad range of biological activities and applications (Brundret et al., 1972; Dalziel et al., 1973). More specifically, it is a potent inhibitor of the eukaryotic DNA α-polymerase with a commercial application as a cell synchronization agent. The compound is in pharmaceutical development due anti-tumor, anti-viral, and anti-leishmanial activity (Ikegami et al., 1978; Pedrali-Noy et al., 1980; Kayser et al., 2001; Edwards et al., 2013; Starczewska et al., 2016). Recently, other organisms including the fungus Nigrospora sphaerica and the pathogenic fungus Phoma betae have been identified as natural Aphidicolin producers. Current data suggests that Aphidicolin biosynthesis is exclusive to fungal metabolism and that natural sources for Aphidicolin are limited (Starratt and Loschiavo, 1974; Fujii et al., 2011; Lopes and Pupo, 2011). Nevertheless, elucidation of the responsible Aphidicolin biosynthetic gene cluster in P. betae allowed for the identification of a bifunctional diterpene synthase that contains both a functional class I and class II domain (Oikawa et al., 2001). The Aphidicolan-16-ß-ol synthase (ACS) generates the stereo-chemically demanding Aphidicolan-16-ß-ol (AD)—core structure of Aphidicolin—structure via a two-step reaction as depicted in Figure 1 (Oikawa et al., 2002).
Figure 1. Model of a bifunctional diterpene synthase. In the case of ACS GGDP is initially converted to syn-CDP in the class II active site (located between ß and γ domain). Syn-CDP is further cyclizied to AD in class I active site (α-domain).
Initially, GGDP is rearranged in the class II active site cleft by protonation to the bicyclic syn-copalyl diphosphate (syn-CDP). Subsequently, syn-CDP is elaborated to AD in the class I active site (Adams and Bu'Lock, 1975; Oikawa et al., 2002). As depicted in Figure 2 the cyclization mechanism in the class I active site, initiated by the hydrolysis of the pyrophosphate group, results in 8-ß-pimaradienyl carbocation formation. A subsequent attack of the vinyl group, bridging the C ring, directly undergoes a Wagner-Meerwein rearrangement and results in the formation of the aphidicolenyl carbocation. Eventually, this cation is quenched by water thereby generating AD.
Figure 2. Proposed cyclization mechanism of the ACS class I reaction (Oikawa et al., 2002).
Terpene cyclization mechanisms are conventionally elucidated by radio labeling of protons and carbons (Dickschat, 2017). This substrate specific labeling provides for identification of unusual hydride shifts and rearrangements. Alternatively, the enzyme's cyclization mechanisms can be probed by altering amino acids, trying to terminate the reaction cascade at a specific transition state (Morrone et al., 2008; Janke et al., 2014; Schrepfer et al., 2016; Jia et al., 2017). Therefore, random mutagenesis can be performed but the screening effort for this methodology is elaborate without an efficient high throughput screening options (Lauchli et al., 2013). Biomolecular modeling allows for the rational identification and in silico modulation of amino acid networks that are involved in complex reaction cascades (Pemberton et al., 2015; Schrepfer et al., 2016; Christianson, 2017; Escorcia et al., 2018). This methodology provides for a knowledge based approach of enzyme mutagenesis and screening. Nevertheless, a particular challenge for this strategy is based on the missing structural information for most terpene synthases. However, as their structural elements and domains are highly conserved (Christianson, 2017), homology modeling is a potential route to identify catalytically relevant amino acids despite the low primary sequence identities in this enzyme family (Xu and Li, 2003). Unfortunately, most available crystal structures of terpene synthase are deposited in the open apo-enzyme configuration that is catalytically inactive. This open enzyme conformation presents an additional obstacle when catalytically relevant amino acids have to be identified in silico. At present, only two diterpene synthase structures have been reported in the closed, catalytically active form (Liu et al., 2014; Serrano-Posada et al., 2015). Therefore, automated homology modeling approaches will almost always result in catalytically non-relevant open enzyme configuration. Moreover, while prediction tools can place large cofactors (i.e., FAD, NADH, Heme) correctly in the apo-protein framework, ligand-metal interactions are difficult to predict because of the multiple coordination geometries and the lack of sufficiently accurate force field parameters (Khandelwal et al., 2005). Hence, structure function predictions that depend on the interplay between the amino acids of the protein framework with small metal ions cannot be conducted solely by application of automated software tools. In this context, a rational combination of structural information by superposition and extraction of cofactors is performed to prepare the protein structure for docking studies. Nevertheless, this approach often neglects reliable positioning of the cofactor coordinating amino acids. Additionally, falsely predicted positioning of amino acid side chains in the active site cleft can lead to invalid interpretation of a homology model based protein-ligand complex. To improve this situation, this study elucidated rapid and simple methodologies to refine diterpene homology models for docking studies thereby allowing for reliable structure-function predictions. In this context, an ACS class I homology model of the α-domain was predicted from the primary sequence. Subsequently, these models were compared to catalytically relevant closed terpene synthases structures. The location of metals was refined and fitted against specifically selected structural templates and multiple docking studies were carried out and validated. Our in silico results were experimentally evaluated by ACS mutagenesis studies. This lead to an identification of essential amino acid residue sidechains that are necessary for retaining the enzymes activity. Additionally, we detected amino acid substitutions that abort the catalytic reaction cascade en- route from syn-CDP to AD. Structural analyses and elucidation of these compounds revealed the formation of syn-copalol and a labdane related, non-hydroxylated diterpene by the ACS mutants Y658L and D661A. Our approach of a protein homology model based structure function analysis can be easily adapted for other terpene synthases. This methodology allows for rapid and simple analysis of the catalytically relevant amino acid network that help studying complex reaction cascades and developing new biocatalysts.
Materials and Methods
Materials and Chemicals
All genes used were synthesized by Life technologies GmbH and the codon usage was optimized for E. coli if not stated otherwise. Primers were obtained from Eurofins Genomics GmbH. Strains and plasmids were obtained from Merck KGaA. All chemicals used were obtained at highest purity from Roth chemicals or Applichem GmbH. Enzymes were purchased from Thermo Fisher Scientific.
Software and Web-Tools
RaptorX was applied for homology modeling studies (http://raptorx.uchicago.edu; Källberg et al., 2012). The initial predicted structure was analyzed and further modified in the environment of UCSF Chimera software package (Pettersen et al., 2004; http://www.cgl.ucsf.edu/chimera). Comparative modeling by spatial restraints was performed by MODELLER (Eswar et al., 2006), and all substrate docking studies performed by AutoDock Vina (Trott and Olson, 2010; http://vina.scripps.edu). Chemical structures were drawn by PerkinElmer ChemBioDraw Ultra (http://www.cambridgesoft.com). For ligand preparation the Avogadro (Hanwell et al., 2012; https://avogadro.cc/) software package was used. A syn-CDP toppar stream file was generated by CHARMM General Force Field program version 1.0.0 for use with CGenFF version 3.0.1 (https://cgenff.paramchem.org; Vanommeslaeghe et al., 2010, 2012; Vanommeslaeghe and MacKerell, 2012). Two ns molecular dynamic studies of the docked ACS model B in a water sphere have been performed under CHARMM general force field by NAMD (Phillips et al., 2005; http://www.ks.uiuc.edu/Research/namd/). NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. For high resolution pictures the protein was prepared by Visual Molecular Dynamics (http://www.ks.uiuc.edu/Research/vmd/; Humphrey et al., 1996) and rendered by Tachyon implemented in the VMD software package (Stone, 1998).
Docking
Ligand structures were downloaded from https://pubchem.ncbi.nlm.nih.gov/ available and geometrically optimized by 500 steps of steepest descent under MMFF94 force field parameters included in Avogadro. Protein structures were prepared by Dock Prep, which is part of the Chimera software environment. The AMBER force field (AMBERff14SB) was applied to the receptor while Gasteiger charges were added to the ligand and co-factors. As recently reported, docking can be improved by assigning partial charges to metal ions (Hu and Shelver, 2003). In this context, Mg-ion charges were set to +1. Syn-CDP charge was set to −3. Docking was performed by AutoDock Vina using standard parameters. Docking poses were chosen based on a structural comparison to the pyrophosphate group that is co-crystallized in pdb 5A0J (see Figure S1B). The chosen pose was furthermore validated by re-dock approaches. Therefore, the predicted syn-CDP pose was de novo geometrically optimized by 500 steps of steepest descent under MMFF94 force field parameters included in Avogadro software environment prior to docking repetition (see Figure S1A).
Model Generation
An initial homology model of the ACS α-domain was predicted by RaptorX starting from the amino acid 565. A model based on the pdb crystal structure 5A0J, referring to a labdane related diterpene synthase, was manually selected for further structure function analyses. In order to prepare the model for docking studies, the coordinating Mg2+-ion triad and water molecules were implemented in the structure by different methods. Model A was generated by structural alignment to 5A0J. Cofactor positions were transferred from the structure template to Model A without any further adjustment prior to docking studies. Model B was created by MODELLER implemented in the Chimera software environment using the 5A0J as template structure. In this model hetero atoms and water molecules in the structure environment were computationally implemented. The pyrophosphate group was removed prior to docking with syn-CDP. Model C was prepared analogously to Model B but prior to refinement by MODELLER, syn-CDP was docked into the template structure 5A0J.
Model Validation
The protein ligand complex of Model B was validated by molecular dynamics studies. Therefore, syn-CDP was initially extracted from Model B and parameterized by CHARMM General Force Field program version 1.0.0 for use with CGenFF version 3.0.1. VMD was used to parameterize the protein and for merging ligand and protein. Subsequently, a water sphere was added around the protein-ligand complex. Two nanoseconds of molecular dynamic studies under CHARMM General Force Field was applied to the protein complex by NAMD. The calculated rmsd of the generated frames was plotted over time (Figure S2). A constant rmsd value was chosen as the criteria for an equilibrated protein-ligand complex. The last frame obtained was compared to the initial model B (Figure S3).
Plasmids for Diterpene Production
For all cloning procedures E. coli HMS 174 (DE3) was used. Clones were cultivated at 37°C in Luria-Bertani (LB) medium. Chloramphenicol (34 μg/L) and Kanamycin (50 μg/L) were added as required. For efficient production of the diterpene AD, E. coli's internal 1-deoxy-xylulose-5 phosphate pathway flux was increased by overexpression of deoxy-xylulose 5 phosphate synthase (dxs: GenBank: YP001461602.1), isopentenyl-diphosphate delta isomerase (idi: GenBank: AAC32208.1), and further extended by expressing geranylgeranyl diphosphate synthase (crtE: GenBank: KPA04564.1) and Aphidicolan-16-ß-ol synthase (acs: GenBank: AB049075.1). Therefore, dxs and acs were amplified from original sources by PCR. Polycistronic operons (Table 1) were constructed by BioBrick cloning standard (Shetty et al., 2008).
Site directed mutations of acs were generated by PCR. Forward primers were designed exhibiting the respective mutation at the 5′ end while the corresponding reverse primers were phosphorylated at 5′ end (Table S1). PCR products were ligated by T4 Ligase prior to transformation. All amino acid exchanges were confirmed by sequencing.
Production of Diterpenes
All diterpene production experiments were performed in E. coli BL 21 (DE3). To investigate the product outcome of ACS mutants, pACYC acs plasmids were co-transformed with pAX dic. Cultivation was performed in minimal media supplemented with 6 g/L yeast extract and 30 g/L glycerol at 25°C. After 60 h the culture was extracted with a mixture of hexane, ethanol and ethyl acetate (1:1:1) (v/v/v) for 1 h. The extract was centrifuged at 10,000 g for 2 min. The upper, organic phase was directly analyzed for diterpene products via GC-MS.
Diterpene Analytics
GC-MS analyses of diterpenes was performed by a Trace GC Ultra with DSQII (Thermo Fisher Scientific). Therefore, 1 μL sample was loaded (Split 1/10) by TriPlus AS onto a SGE BPX5 column (30 m, I.D 0.25 mm, Film 0.25 μm). The initial column temperature was set to 160°C and maintained for 5 min before a temperature gradient at 8°C/min up to 320°C was applied. The final temperature was kept for additional 3 min. MS data were recorded at 70 eV (EI) and m/z (rel. intensity in %) as total ion current (TIC). The recorded m/z range was in between 50 to 650.
NMR spectra were recorded in CDCl3 with an Avance III 500 MHz (Bruker) at 300 K. 1H NMR chemical shifts are given in ppm relative to CDCl3 (δ = 7.26 ppm). The 2D experiments (HSQC) were performed using standard Bruker pulse sequences and parameters.
Results and Discussion
Homology Model Refinement
The steady increase in published protein crystal structures provides for an accelerated improvement of computational homology prediction. Especially due to the high structurally conservation of the terpene synthase enzyme families, biomolecular tools can predict structures solely based on the amino acid sequence. In this context, structure prediction of the bifunctional ACS was performed to analyze the highly complex conversion of GGDP via syn-CDP to the tetracyclic AD which is the core structure of the cytostatic compound Aphidicolin. ACS belongs to the diterpene synthase family and we identified three highly structurally conserved domains. The initial conversion from the universal diterpene precursor GGDP to syn-CDP occurs in class II active site, located between the ACS ß- and γ-domain. The subsequent syn-CDP cyclization to AD is then conducted in the class I active site that is positioned in the middle of an α-helical bundle forming the ACS α-domain. Notably, the fungal ACS is structurally highly similar to the previously crystallized plant diterpene synthases Abietadiene (pdb: 3S9V) and Taxadiene synthase (pdb: 3P5R), respectively. Homology prediction based on the full ACS sequence took those two structures into account, but for both, crystals could only be achieved in N-terminal truncated forms. Furthermore, these crystal structures have only been solved in an open conformation that is catalytically inactive. In order to circumvent the consideration of these catalytically inactive templates, only the ACS α-domain sequence was used for homology prediction. A model based on the labdane related diterpene synthases (LRS) (pdb: 5A0J), which is provided in a catalytically active holo-complex (Serrano-Posada et al., 2015), was selected for ACS homology refinement. The structural superposition of Abietadiene (pdb: 3S9V), LRS (pdb: 5A0J), and the ACS model, as depicted in Figure 3, explicitly demonstrates that there is a better fit between the ACS model and the LRS crystal structure. While the structural fit between LRS and the ACS model is visually well apparent, we have not calculated an rmsd value qualifier as structural domains that do not constitute the active site region are highly variable.
Figure 3. Structural alignment of Abietadiene synthase (gray), LRS (blue), and ACS (purple) in complex with Mg2+-ions and syn-CDP.
Co-crystallized cofactors (Mg2+-ion triad) and waters, both provided in the LRS structure, are also involved in the ACS reaction en-route from syn-CDP to AD. Therefore, we differentially adapted both, the positions of the Mg2+ ions and waters into the ACS models that resulted in the generation of three ACS models (A–C). Model A was prepared by adaptation of cofactor positions from the template structure LRS after structural alignment. Initial evaluation of this model indicated that this un-refined modeling method results in clashes of cofactors positions with amino acids side chains. Generally, in homology prediction the active site's cavity is not reserved for the substrate or cofactors specifically. Therefore, we presume that amino acid sidechains occupy this free space due to applied energy minimization optimizations. This is demonstrated in our docking studies of model A, where ACS amino acid Y658 is preventing syn-CDP to completely access the active site cavity. With the MODELLER package, which is based on comparative protein structure modeling by spatial restraints, a protein structure can be refined based on a template structure. Additionally, hetero-atoms and water molecules can be included directly in the model refinement. This refinement methodology applied to our initial model structure lead to the generation of ACS model B. This model B computationally included the three Mg2+-ions, a pyrophosphate group (conventionally derived by Mg2+ based hydrolysis of the phosphorylated substrate [syn-CDP] substrate) and water molecules directly as they are all present in the LRS template structure. Model B provides reliable positioning of the conserved amino acids that constitute the class I diterpene synthase signature DDXXD/E and NSE/DTE motifs in relation to the adapted Mg2+- ions, water and pyrophosphate moieties, respectively. Subsequently, we removed the pyrophosphate group from the model B structure to enable docking with the native syn-CDP substrate. Our docking data indicated that in Model B syn-CDP can completely access the active site's cavity. A specific syn-CDP conformation was selected pointing toward the ACS G-Helix, as this flexible helix is proposed to be involved in terpene cyclization reactions (Yoshikuni et al., 2006; Baer et al., 2014; Jia et al., 2017). This docking pose was validated by multiple re-docking approaches (Figure S1A). Additionally, we validated the pose while the position of the pyrophosphate moiety was compared to the pyrophosphate group co-crystallized in LRS (Figure S1B). Finally, a third approach for structure-function analyses was performed by docking syn-CDP into LRS prior to ACS refinement with MODELLER. Again, a syn-CDP conformation was chosen with close proximity toward the G-Helix. On the basis of this LRS holo-protein complex, an ACS holo-complex model C was generated. This method provided for a protein model that was refined around the substrate and cofactors. This methodology also provided for a precise specification of amino acids involved in the AD cyclization reaction. For all three models amino acids located within a five Ǻ vicinity to the docked substrate syn-CDP (thereby neglecting the pyrophosphate moiety) were analyzed by mutational studies to elucidate their catalytic relevance (see Figure 4, Table S2).
Figure 4. Homology models of ACS synthase refined and prepared for docking of the substrate syn-CDP. Model A results are colored red, Model B results yellow, and Model C results blue.
Mutational Validation of Catalytically Relevant ACS Amino Acids
Due to their stereo-chemical diversity, natural diterpene scaffolds are attractive research leads. The enormous stereo-chemical demand of diterpene macrocycles renders them difficult to access via total chemical synthesis approaches. Therefore, biosynthetic routes to generate these complex structures are currently an intense research focus (Dickschat, 2016; Bian et al., 2017; Jones, 2017). The ability to access new diterpene macrocycles via selective alteration of amino acids in diterpene synthases provides for a highly varied accessible chemical space. For the class I cyclooctat-9-en-7-ol synthase, which naturally generates a tricyclic fusicoccin type diterpene, amino acid mutations in the vicinity of the active site lead to intermittent abortion of the reaction cascade. Hence, alternative macrocyclic structures, such as the bicyclic dolabellane and the monocyclic cembrane, could be generated thereby elucidating the reaction cascade (Görner et al., 2013; Janke et al., 2014). In this study, insights into the class I reaction of the ACS were achieved by mutational studies. In that respect, we intended to quench the reaction from syn-CDP to AD at previously proposed transitional states (Adams and Bu'Lock, 1975; Oikawa et al., 2002). Based on the proposed ACS transitional states we presume that syn-labdatriene and syn-copalol (termination product of the syn-copalyl carbocation), stereoisomers of syn-pimaradiene (termination products of the pimaradienyl carbocation), or aphidicolene and stemodene (termination products of the aphidicolenyl carbocation) are potential abortion products (see Figure 5).
Figure 5. Expected products generated by ACS if the reaction cascade en- route from syn-CDP to AD is prematurely terminated.
For an intermittent abortion of the reaction cascade from syn-CDP to AD, we have selected amino acids within a range of five Ǻ to the docked ligands as prime targets for mutagenesis (see Figure 4). Preliminary studies revealed that sidechain substitutions encompassing amino acids exchanges that inherently change physico-chemical properties frequently resulted in inactive enzyme variants (Janke et al., 2014; Schrepfer et al., 2016). In this context, we focused on changing the size of the respective amino acid sidechain thereby trying to preserve physico-chemical characteristics. Alternatively, we chose amino acid side chain substitutions that would replace polar groups with similar size amino acids (Table S2).
ACS syn-CDP docking results pointed toward a strong interaction between the decalin core and surrounding hydrophobic sidechains. However, as the decalin structure of syn-CDP remains untouched in further cyclization steps most of the implemented mutations near this particular moiety resulted in inactive (I626A, Y923L, F789L) or wildtype activity variants (F629L, Y658F, C831G, C831T, T920G, Y923F). Based on our modeling results, we also identified specific amino acids located in the ACS G-helix that in other studies have been proposed to be of catalytic relevance (Baer et al., 2014; Jia et al., 2017). While mutational changes in the G-Helix of Kaurene synthase like diterpene synthases resulted in alternative product profiles (Jia et al., 2017), our analogous approaches with ACS only provided inactive (A786L, F789L) or wildtype active (A786G, F789Y) variants. Nevertheless, our results support previous findings that propose the G-Helix as an essential flexible motif which is involved in the catalytically relevant structural change from the open to the closed enzyme configuration (Baer et al., 2014).
Only the substitution of ACS Y658L and D661A provided for a varied product outcome. In addition to amino acids that constitute the DXXDD/E and NSE/DTE signature motifs that are responsible for Mg2+-ion coordination, our combined in silico and experimental study identified only a few amino acids (see Figure 6, colored in pink) capable to terminate activity. Our successful mutations (D661A, Y658L) indicated that the unusual cyclization from syn-CDP to AD proceeds in a spatially restricted area of the active site's cleft. Additionally, our data suggests that the pyrophosphate group remains in the active site and coordinates the reaction cascade. This is in accordance to the recently postulated Taxadiene synthase reaction mechanism (Schrepfer et al., 2016).
Figure 6. (A) ACS active sites cleft in complex with Mg2+ and syn-CDP. Amino acid network within five Ǻ to syn-CDP are displayed. (B) ACS active sites cleft in complex with Mg2+ and syn-CDP. Substitution of labeled amino acid (displayed in pink) resulted in inactive enzyme versions or mutants with altered product outcome.
ACS Mutants D661A and Y658L
GC-MS analyses of the ACS mutants Y658L and D661A revealed that this mutations lead to the formation of two unknown diterpene products (see Figure 7). In contrast to the native AD, which had a GC retention time of 17.67 min, these new diterpenes had a retention time of 12.79 and 13.46 min, respectively. The latter product with a retention time of 13.46 min, showed a total mass of 290 m/z. Comparison of the MS spectral data suggests that this was a hydroxylated diterpene with a similar structure to syn-copalol (Hoshino et al., 2011). Subsequently, this compound was isolated and structural characterized by NMR (Figures S4, S5). The results are in accordance to previous spectral data for syn-copalol (Yee and Coates, 1992). One plausible explanation for syn-copalol formation is the quenching of the syn-copalyl carbocation intermediate by water in the active site of the enzyme. The other diterpene product with a retention time of 12.79 min had a total mass of 272 m/z indicating that this structure was not-hydroxylated. While we expected the formation of syn-labda-8(17),12E,14-triene, comparison with published MS-spectra revealed significant differences (Morrone et al., 2011). Unfortunately, due to the low amounts produced and purification issues for this highly hydrophobic compound, we could not conduct NMR analysis. However, we presume that this compound is also originated from the syn-copalyl carbocation and that a labdane related diterpene with high structural similarity to syn-labda-8(17),12E,14-triene was generated by the ACS mutants. The newly generated diterpenes are of great interest as copalol derivatives display various biological activities analogous to aphidicolin (Hanson, 2015).
Figure 7. Analysis of ACS wildtype and ACS D661A mutant product outcome by GC. The MS-patterns for syn-labdatriene, syn-copalol and AD (from right to left) are presented below.
The structural changes (D661A and Y658L) still allowed syn-CDP binding in the active site with subsequent hydrolyses of the pyrophosphate group. The syn-copalyl carbocation was then quenched either by water (release of syn-copalol) or an amino acid side chain (release of non-hydroxylated diterpene). Furthermore, as we did not find other substitution that stopped cyclization at the proposed transitional states and as we could not even detect changes in the byproduct formation of the active mutants, we presume that the ACS cyclization occurs in a spatially restricted area and that the pyrophosphate group remains in the active site, which is in accordance to recent reports (Schrepfer et al., 2016).
Former diterpene centered production processes were limited by low target compound yields. However, optimization of recombinant diterpene production hosts has extensively progressed to provide gram per liter yields (Ajikumar et al., 2010; Schalk et al., 2012). Today, access to novel diterpene lead structures is limited by the effective identification of relevant enzyme systems from large scale genome sequencing projects. Therefore, rational alteration of known terpene synthase product profiles by using a combination of in silico prediction and knowledge based mutagenesis studies can allow for a more rapid and targeted expansion of the desired chemical space.
Conclusion
A model of ACS synthase was computed that required the application of various methods for model refinement to improve the quality of in silico structure function analysis. A model of the catalytically active, closed ACS α-domain complex was generated. Examination of this model provided for the identification of catalytically active amino acid sidechains. The in silico results were confirmed by mutational studies of the ACS. The amino acid substitutions Y658L and D661A in the vicinity of the ACS active site lead to formation of the alternative cyclization products syn-copalol and a minor labdane related diterpene. Formation of these products were delineated by quenching of the syn-copalyl carbocation en-route to AD. Additional mutants leading to inactive enzyme variants (A786L, F789L) provided insights into catalytically relevant amino acid residues within the G-Helix. The cumulative in-silico and experimental data suggests that amino acids constituting the G-loop motif of class I terpene cyclases are involved in the transformation of the open to the closed, catalytically active enzyme conformation. Moreover, as we only obtained a limited number of alternative cyclization products in our mutational screens, we presume that AD formation occurs in a rather confined location of the ACS active site. With respect to our biomolecular modeling approaches, we demonstrated that application of simple and rapid computational methodologies can be employed for prediction and structure function analyses of class I diterpene synthases.
Author Contributions
TB and MF supervised this study. MH initiated this study and performed virtual modeling and docking studies. NM and MM conducted mutagenesis experiments and screening under supervision of MH and MF. Data was analyzed by MH, MF, NM, MM, and TB. All figures were created by MH. All authors verified the data, contributed to the manuscript, and approved the final version.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
MH, MF, and TB would like to acknowledge the financial support of the German ministry for Education and Research (BMBF) with the grant number 031A305A. TB gratefully acknowledges funding by the Werner Siemens foundation for establishing the field of Synthetic Biotechnology at the Technical University of Munich (TUM).
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2018.00101/full#supplementary-material
Abbreviations
ACS, Aphidicolan-16-β-ol synthase; AD, Aphidicolan-16-β-ol; GGDP, geranylgeranyl diphosphate; LRS, labdane related diterpene synthase; syn-CDP, syn-copalyl diphosphate.
References
Adams, M. R., and Bu'Lock, J. D. (1975). Biosynthesis of the diterpene antibiotic, aphidicolin, by radioisotope and 13C nuclear magnetic resonance methods. J. Chem. Soc. Chem. Commun. 389–391. doi: 10.1039/c39750000389
Ajikumar, P. K., Xiao, W.-H., Tyo, K. E., Wang, Y., Simeon, F., Leonard, E., et al. (2010). Isoprenoid pathway optimization for Taxol precursor overproduction in Escherichia coli. Science 330, 70–74. doi: 10.1126/science.1191652
Baer, P., Rabe, P., Fischer, K., Citron, C. A., Klapschinski, T. A., Groll, M., et al. (2014). Induced-fit mechanism in class I terpene cyclases. Angew. Chem. Int. Edn. 53, 7652–7656. doi: 10.1002/anie.201403648
Bian, G., Deng, Z., and Liu, T. (2017). Strategies for terpenoid overproduction and new terpenoid discovery. Curr. Opin. Biotechnol. 48, 234–241. doi: 10.1016/j.copbio.2017.07.002
Brundret, K. M., Dalziel, W., Hesp, B., Jarvis, J. A. J., and Neidle, S. (1972). X-Ray crystallographic determination of the structure of the antibiotic aphidicolin: a tetracyclic diterpenoid containing a new ring system. J. Chem. Soc. Chem. Commun. 1027–1028. doi: 10.1039/c39720001027
Christianson, D. W. (2017). Structural and chemical biology of terpenoid cyclases. Chem. Rev. 117, 11570–11648. doi: 10.1021/acs.chemrev.7b00287
Dalziel, W., Hesp, B., Stevenson, K. M., and Jarvis, J. A. J. (1973). The structure and absolute configuration of the antibiotic aphidicolin: a tetracyclic diterpenoid containing a new ring system. J. Chem. Soc. Perkin Trans. 1, 2841–2851. doi: 10.1039/p19730002841
Dickschat, J. S. (2016). Bacterial terpene cyclases. Nat. Prod. Rep. 33, 87–110. doi: 10.1039/C5NP00102A
Dickschat, J. S. (2017). Modern aspects of isotopic labellings in terpene biosynthesis. Eur. J. Org. Chem. 2017, 4872–4882. doi: 10.1002/ejoc.201700482
Edwards, T. G., Helmus, M. J., Koeller, K., Bashkin, J. K., and Fisher, C. (2013). Human papillomavirus episome stability is reduced by aphidicolin and controlled by DNA Damage response pathways. J. Virol. 87, 3979–3989. doi: 10.1128/JVI.03473-12
Escorcia, A. M., van Rijn, J. P. M., Cheng, G.-J., Schrepfer, P., Brück, T. B., and Thiel, W. (2018). Molecular dynamics study of taxadiene synthase catalysis. J. Comput. Chem. doi: 10.1002/jcc.25184. [Epub ahead of print].
Eswar, N., Webb, B., Marti-Renom, M. A., Madhusudhan, M. S., Eramian, D., Shen, M.-Y., et al. (2006). Comparative protein structure modeling using modeller. Curr. Protoc. Bioinformatics Chapter 5, Unit-5.6. doi: 10.1002/0471250953.bi0506s15
Fujii, R., Minami, A., Tsukagoshi, T., Sato, N., Sahara, T., Ohgiya, S., et al. (2011). Total biosynthesis of diterpene aphidicolin, a specific inhibitor of DNA polymerase α: heterologous expression of four biosynthetic genes in Aspergillus oryzae. Biosci. Biotechnol. Biochem. 75, 1813–1817. doi: 10.1271/bbb.110366
Görner, C., Häuslein, I., Schrepfer, P., Eisenreich, W., and Brück, T. (2013). Targeted engineering of cyclooctat-9-en-7-ol synthase: a stereospecific access to two new non-natural fusicoccane-type diterpenes. ChemCatChem 5, 3289–3298. doi: 10.1002/cctc.201300285
Hanson, J. R. (2015). Diterpenoids of terrestrial origin. Nat. Prod. Rep. 32, 76–87. doi: 10.1039/C4NP00108G
Hanwell, M. D., Curtis, D. E., Lonie, D. C., Vandermeersch, T., Zurek, E., and Hutchison, G. R. (2012). Avogadro: an advanced semantic chemical editor, visualization, and analysis platform. J. Cheminform. 4:17. doi: 10.1186/1758-2946-4-17
Hoshino, T., Nakano, C., Ootsuka, T., Shinohara, Y., and Hara, T. (2011). Substrate specificity of Rv3378c, an enzyme from Mycobacterium tuberculosis, and the inhibitory activity of the bicyclic diterpenoids against macrophage phagocytosis. Org. Biomol. Chem. 9, 2156–2165. doi: 10.1039/C0OB00884B
Hu, X., and Shelver, W. H. (2003). Docking studies of matrix metalloproteinase inhibitors: zinc parameter optimization to improve the binding free energy prediction. J. Mol. Graph. Modell. 22, 115–126. doi: 10.1016/S1093-3263(03)00153-0
Humphrey, W., Dalke, A., and Schulten, K. (1996). VMD: Visual molecular dynamics. J. Mol. Graph. 14, 33–38. doi: 10.1016/0263-7855(96)00018-5
Ikegami, S., Taguchi, T., Ohashi, M., Oguro, M., Nagano, H., and Mano, Y. (1978). Aphidicolin prevents mitotic cell division by interfering with the activity of DNA polymerase-α. Nature 275:458. doi: 10.1038/275458a0
Janke, R., Görner, C., Hirte, M., Brueck, T., and Loll, B. (2014). The first structure of a bacterial diterpene cyclase: CotB2. Acta Crystallogr. D Biol. Crystallogr. 70, (Pt 6), 1528–1537. doi: 10.1107/S1399004714005513
Jia, M., Zhou, K., Tufts, S., Schulte, S., and Peters, R. J. (2017). A pair of residues that interactively affect diterpene synthase product outcome. ACS Chem. Biol. 12, 862–867. doi: 10.1021/acschembio.6b01075
Jones, B. (2017). Diterpenoids: Types, Functions and Research. New York, NY: Nova Science Publishers, Incorporated.
Källberg, M., Wang, H., Wang, S., Peng, J., Wang, Z., Lu, H., et al. (2012). Template-based protein structure modeling using the RaptorX web server. Nat. Protoc. 7, 1511–1522. doi: 10.1038/nprot.2012.085
Kayser, O., Kiderlen, A. F., Bertels, S., and Siems, K. (2001). Antileishmanial activities of aphidicolin and its semisynthetic derivatives. Antimicrob. Agents Chemother. 45, 288–292. doi: 10.1128/AAC.45.1.288-292.2001
Khandelwal, A., Lukacova, V., Comez, D., Kroll, D. M., Raha, S., and Balaz, S. (2005). A Combination of docking, QM/MM methods, and MD simulation for binding affinity estimation of metalloprotein ligands. J. Med. Chem. 48, 5437–5447. doi: 10.1021/jm049050v
Lauchli, R., Rabe, K. S., Kalbarczyk, K. Z., Tata, A., Heel, T., Kitto, R. Z., et al. (2013). High-throughput screening and directed evolution of terpene synthase-catalyzed cylization(). Angew. Chem. Int. Ed. Engl. 52, 5571–5574. doi: 10.1002/anie.201301362
Liu, W., Feng, X., Zheng, Y., Huang, C.-H., Nakano, C., Hoshino, T., et al. (2014). Structure, function and inhibition of ent-kaurene synthase from Bradyrhizobium japonicum. Sci. Rep. 4:6214. doi: 10.1038/srep06214
Lopes, A. A., and Pupo, M. T. (2011). Biosynthesis of aphidicolin proceeds via the mevalonate pathway in the endophytic fungus Nigrospora sphaerica. J. Braz. Chem. Soc. 22, 80–85. doi: 10.1590/S0103-50532011000100010
Morrone, D., Hillwig, M. L., Mead, M. E., Lowry, L., Fulton, D. B., and Peters, R. J. (2011). Evident and latent plasticity across the rice diterpene synthase family with potential implications for the evolution of diterpenoid metabolism in the cereals. Biochem. J. 435, 589–595. doi: 10.1042/BJ20101429
Morrone, D., Xu, M., Fulton, D. B., Determan, M. K., and Peters, R. J. (2008). Increasing complexity of a diterpene synthase reaction with a single residue switch. J. Am. Chem. Soc. 130, 5400–5401. doi: 10.1021/ja710524w
Oikawa, H., Nakamura, K., Toshima, H., Toyomasu, T., and Sassa, T. (2002). Proposed mechanism for the reaction catalyzed by a diterpene cyclase, aphidicolan-16β-ol synthase: experimental results on biomimetic cyclization and examination of the cyclization pathway by ab initio calculations. J. Am. Chem. Soc. 124, 9145–9153. doi: 10.1021/ja025830m
Oikawa, H., Toyomasu, T., Toshima, H., Ohashi, S., Kawaide, H., Kamiya, Y., et al. (2001). Cloning and functional expression of cDNA encoding aphidicolan-16β-ol synthase: a key enzyme responsible for formation of an unusual diterpene skeleton in biosynthesis of aphidicolin. J. Am. Chem. Soc. 123, 5154–5155. doi: 10.1021/ja015747j
Pedrali-Noy, G., Spadari, S., Miller-Faurès, A., Miller, A. O., Kruppa, J., and Koch, G. (1980). Synchronization of HeLa cell cultures by inhibition of DNA polymerase alpha with aphidicolin. Nucleic Acids Res. 8, 377–387. doi: 10.1093/nar/8.2.377
Pemberton, R. P., Ho, K. C., and Tantillo, D. J. (2015). Modulation of inherent dynamical tendencies of the bisabolyl cation via preorganization in epi-isozizaene synthase. Chem. Sci. 6, 2347–2353. doi: 10.1039/C4SC03782K
Pettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C., et al. (2004). UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612. doi: 10.1002/jcc.20084
Phillips, J. C., Braun, R., Wang, W., Gumbart, J., Tajkhorshid, E., Villa, E., et al. (2005). Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 1781–1802. doi: 10.1002/jcc.20289
Schalk, M., Pastore, L., Mirata, M. A., Khim, S., Schouwey, M., Deguerry, F., et al. (2012). Toward a biosynthetic route to sclareol and amber odorants. J. Am. Chem. Soc. 134, 18900–18903. doi: 10.1021/ja307404u
Schrepfer, P., Buettner, A., Goerner, C., Hertel, M., van Rijn, J., Wallrapp, F., et al. (2016). Identification of amino acid networks governing catalysis in the closed complex of class I terpene synthases. Proc. Natl. Acad. Sci. U.S.A. 113, E958–E967. doi: 10.1073/pnas.1519680113
Serrano-Posada, H., Centeno-Leija, S., Rojas-Trejo, S., Stojanoff, V., Rodriguez-Sanoja, R., Rudino-Pinera, E., et al. (2015). Crystallization and X-ray diffraction analysis of a putative bacterial class I labdane-related diterpene synthase. Acta Crystallogr. Sect. F 71, 1194–1199. doi: 10.1107/S2053230X15014363
Shetty, R. P., Endy, D., and Knight, T. F. (2008). Engineering BioBrick vectors from BioBrick parts. J. Biol. Eng. 2:5. doi: 10.1186/1754-1611-2-5
Starczewska, E., Beyaert, M., Michaux, L., Vekemans, M.-C., Saussoy, P., Bol, V., et al. (2016). Targeting DNA repair with aphidicolin sensitizes primary chronic lymphocytic leukemia cells to purine analogs. Oncotarget 7, 38367–38379. doi: 10.18632/oncotarget.9525
Starratt, A. N., and Loschiavo, S. R. (1974). The production of aphidicolin by Nigrospora sphaerica. Can. J. Microbiol. 20, 416–417. doi: 10.1139/m74-063
Stone, J. E. (1998). An Efficient Library for Parallel Ray Tracing and Animation: A Thesis Presented to the Faculty of the Graduate School of the University of Missouri-Rolla in Partial Fulfillment of Requirements for the Degree of Master of Science in Computer Science. Master thesis, University of Missouri-Rolla, Rolla, MO.
Trott, O., and Olson, A. J. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455–461. doi: 10.1002/jcc.21334
Vanommeslaeghe, K., Hatcher, E., Acharya, C., Kundu, S., Zhong, S., Shim, J., et al. (2010). CHARMM General Force Field (CGenFF): a force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. J. Comput. Chem. 31, 671–690. doi: 10.1002/jcc.21367
Vanommeslaeghe, K., and MacKerell, A. D. (2012). Automation of the CHARMM General Force Field (CGenFF) I: bond perception and atom typing. J. Chem. Inf. Model. 52, 3144–3154. doi: 10.1021/ci300363c
Vanommeslaeghe, K., Raman, E. P., and MacKerell, A. D. (2012). Automation of the CHARMM General Force Field (CGenFF) II: assignment of bonded parameters and partial atomic charges. J. Chem. Inf. Model. 52, 3155–3168. doi: 10.1021/ci3003649
Xu, J., and Li, M. (2003). Assessment of RAPTOR's linear programming approach in CAFASP3. Proteins 53, 579–584. doi: 10.1002/prot.10531
Yee, N. K. N., and Coates, R. M. (1992). Total synthesis of (+)-9,10-syn- and (+)-9,10-anti-copalol via epoxy trienylsilane cyclizations. J. Org. Chem. 57, 4598–4608. doi: 10.1021/jo00043a014
Keywords: homology modeling, aphidicolin, diterpene, diterpene synthase, homology model refinement
Citation: Hirte M, Meese N, Mertz M, Fuchs M and Brück TB (2018) Insights Into the Bifunctional Aphidicolan-16-ß-ol Synthase Through Rapid Biomolecular Modeling Approaches. Front. Chem. 6:101. doi: 10.3389/fchem.2018.00101
Received: 10 January 2018; Accepted: 20 March 2018;
Published: 10 April 2018.
Edited by:
Daniela Schuster, Paracelsus Medizinische Privatuniversität, Salzburg, AustriaReviewed by:
Victor Guallar Guallar, Barcelona Supercomputing Center, SpainDharmendra Kumar Yadav, Gachon University of Medicine and Science, South Korea
Arnout Voet, KU Leuven, Belgium
Copyright © 2018 Hirte, Meese, Mertz, Fuchs and Brück. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Monika Fuchs, monika.fuchs@tum.de
Thomas B. Brück, brueck@tum.de