Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 13 April 2021
Sec. Biology of Archaea
This article is part of the Research Topic Mechanisms of Preservation and Change in Archaeal Genomes View all 8 articles

Stringent Primer Termination by an Archaeo-Eukaryotic DNA Primase

\r\nJan Bergsch,&#x;Jan Bergsch1,2†Jean-Christophe Devillier,&#x;Jean-Christophe Devillier1,2†Gunnar JeschkeGunnar Jeschke3Georg Lipps*Georg Lipps1*
  • 1Institute of Chemistry and Bioanalytics, University of Applied Sciences Northwestern Switzerland, Muttenz, Switzerland
  • 2Department of Biology, Institute of Biochemistry, ETH Zurich, Zurich, Switzerland
  • 3Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland

Priming of single stranded templates is essential for DNA replication. In recent years, significant progress was made in understanding how DNA primase fulfils this fundamental function, particularly with regard to the initiation. Equally intriguing is the unique property of archeao-eukaryotic primases to terminate primer formation at a well-defined unit length. The apparent ability to “count” the number of bases incorporated prior to primer release is not well understood, different mechanisms having been proposed for different species. We report a mechanistic investigation of primer termination by the pRN1 primase from Sulfolobus islandicus. Using an HPLC-based assay we determined structural features of the primer 5′-end that are required for consistent termination. Mutations within the unstructured linker connecting the catalytic domain to the template binding domain allowed us to assess the effect of altered linker length and flexibility on primer termination.

Introduction

During cellular DNA replication, the inability of replicative polymerases to initiate synthesis on a single strand requires the activity of DNA primase. Its function is the de novo synthesis of oligonucleotides on single stranded DNA serving as a substrate for a DNA polymerase that in turn extends the primer at its 3′-OH terminus. As a consequence of the unidirectional nature of DNA synthesis, a new primer must be formed for every Okazaki fragment on the lagging strand (Kornberg, 1980). Since primases are more error prone than the replicative polymerases (Sheaff and Kuchta, 1994) the primers have to be replaced after replication by a proofreading enzyme (Hombauer et al., 2011; Liberti et al., 2013). By using RNA, the replication machinery creates a marker for proofreading of primase-generated stretches during Okazaki fragment processing. Archaeal primases that form chimeric or DNA primers (Beck and Lipps, 2007) thus need to terminate reliably at short primer lengths to prevent errors from affecting genome stability.

Primer synthesis can be divided into two main phases, the first phase being the synthesis of the initial dinucleotide while the second phase is the repeated elongation of the dinucleotide/template (Kuchta and Stengel, 2010). Typical DNA primases synthesize primers of a defined length, usually between four and 20 nucleotides. This is in contrast to DNA polymerases whose product length is not intrinsically limited and which even use sliding clamps to increase the processivity (Mizrahi et al., 1985). Primases produce a narrow length distribution of primers. Consequently, a mechanism must be in place to terminate the repeated elongation of the primer-template at certain primer lengths. Interestingly, the termination of primer synthesis at a defined length is not brought about by a signal from another component of the replisome. Primer termination is readily observed in primase assays in the absence of further replicative enzymes. Evidently, the mechanism that limits primers to a defined length is an inherent property of the primase itself.

Two distinct classes of primase enzymes exist, viz. the bacterial and bacteriophage DnaG type primases and the archeoeukaryotic primases (AEP) (Iyer et al., 2005). Both types differ in sequence and structure but carry out the same crucial function in replication (Bergsch et al., 2019).

In the eukaryotic primosome, primase forms a complex with DNA polymerase α (Polα), yet primer termination at a specific length is observed in assays without Polα. For instance, the primers formed by Drosophila melanogaster DNA primase have a typical length of 15 nt (Conaway and Lehman, 1982), whereas human primase forms a primer with a length of nine nucleotides (Baranovskiy et al., 2016a). The specific typical length of primers characteristic of a primase enzyme has been termed a unit length primer (Desogus et al., 1999; Baranovskiy et al., 2016a).

Stability and amenability to structural investigation have contributed to the role of archaeal primases as model enzymes of the AEP superfamily (Keck and Berger, 2001). The primase of the archaeon Methanococcus jannaschii produces primers of a unit length of 7 nt, but can extend them to form products of twice this length or a distribution of longer products (in the manner of a polymerase) depending on the pH (Desogus et al., 1999), pointing toward a role of the protonation of one or multiple residues in the regulation of primer length determination. The crenarchaeon Saccharolobus solfataricus primase generates primers with a length distribution from 14 to 18 nt (Yan et al., 2018).

We have been investigating the primase enzyme from the plasmid pRN1 in Saccharolobus islandicus (pRN1 primase). Homologs of this plasmidal replication enzyme are present in related plasmids from the pRN plasmid family (Peng et al., 2000). This enzyme is a particularly compact, yet highly active and stable primase, allowing for detailed investigation of the primase mechanism by quantitative assays as well as structural approaches (Figure 1).

FIGURE 1
www.frontiersin.org

Figure 1. Domain structure and primer product of the pRN1 replication protein. (A) Domain structure and activities of the pRN1 replication enzyme. The pRN1 primase consists of an N-terminal catalytic domain (Prim_Pol, cd04859 amino acids 40–248, red), a flexible linker (amino acids 249–260) and a C-terminal helix bundle domain (HBD, pfam13010 amino acids 256–370, green). The full-length replication protein also contains a C-terminal helicase domain (blue) which is independent to the primase domain (Beck and Lipps, 2007). This study was performed with deletion mutants comprising the Prim_Pol, linker, and HBD. (B) The enzyme synthesizes an 8 nt long mixed RNA/DNA primer. Ribonucleotides are in lower case, deoxyribonucleotides in capitals. The primase requires a GTG recognition motif in the template, downstream of which the primer is formed.

The primase activity of the replication protein ORF904 from the archaeal plasmid pRN1 resides in the N-terminal half of the enzyme more specifically, the catalytic Prim_Pol domain (aa 40–248; compare Figure 1) and the helix-bundle domain (HBD, aa 256–370) are required for activity (Beck et al., 2010). This enzyme produces chimeric primers with a unit length of 8 nt, of which the first NTP incorporated (the 5′-terminal of the primer) is constitutively a ribonucleotide, while the remaining part of the primer is DNA (Beck and Lipps, 2007).

Although the catalytic domain alone is sufficient to support DNA polymerase activity, the HBD is indispensable for primase activity. The exact contribution of the HBD toward primer synthesis has long remained unclear. Recently, we could show that the HBD is able to bind sequence-specifically the template and two nucleoside triphosphates (Boudet et al., 2019). We therefore suggest that the HBD prepares the initial dinucleotide synthesis by binding and preassembling the three substrates (template, initiating ribonucleotide and elongating deoxyribonucleotide) required for the reaction to occur. Such a contribution to primer synthesis would explain why the HBD is required for de novo synthesis of a primer but not for elongating a primer hybridized to a template. The pRN1 primase enzyme is a single subunit primase in contrast to most eukaryotic primases which are heterodimeric. Noteworthy, primer synthesis is only observed when catalytic domain and HBD are covalently coupled. A mixture of the isolated domains is not enzymatically active, underscoring the importance of the linker region between catalytic domain and HBD (Beck et al., 2010). In the crystal structure of the primase, residues K250 to F260 situated between the catalytic domain and the HBD are flexible (Beck et al., 2010).

The DNA binding interface of the HBD resembles the DNA binding interface of the primase large subunit (PriL) of cellular archaeo-eukaryotic primase (Boudet et al., 2019). The contribution of PriL for primer synthesis is not very well defined on a molecular level either, but increasing evidence suggests that the helical p58C subunit of PriL binds the template and might also bind the initiating nucleotide (Baranovskiy et al., 2016a, b). Thus, PriL and HBD appear to share a similar functional role in primer synthesis by preparing the dinucleotide synthesis. Moreover, human p58C is able to specifically bind the phosphorylated 5′ primer end hybridized to the template, suggesting that the primer/template remains bound to the PriL during the primer synthesis catalyzed by the active site of the catalytic subunit PriS (Baranovskiy et al., 2016a).

Thus, for the archaeo-eukaryotic primases the following mechanistic picture emerges: These primases have two functionally conserved domains, the catalytic domain carrying out the enzymatic reaction of nucleotide condensation and a helical domain responsible for preparing primase initiation, or more specifically, preparing dinucleotide formation (Boudet et al., 2019). While there is meanwhile a rather good understanding of how primer synthesis is initiated, the mechanism of primer termination at specific primer lengths is not well investigated and understood. The following hypotheses explaining termination of primers specifically at unit length have been envisaged:

Firstly, termination of primers at the observed length might be a stochastic effect based on the low processivity of primases (Spiering et al., 2017). This would inevitably lead to a broad distribution of primer lengths.

Secondly, the occurrence of primer length “counting” can be explained by taking into consideration that primase remains bound to the 5′-end of the nascent primer during catalysis of the primer elongating reaction while the 3′-end of the primer is contacted by the active site. The dual interaction allows the enzyme to measure the linear distance between these sites. Termination occurs as soon as a primer length is attained that corresponds to the maximum distance attainable by the malleability resulting from flexible parts of the structure. In fact, specific interaction of the 5′ end with the PriX subunit of S. solfataricus primase lead Yan et al. (2018) to propose a molecular “caliper model” to explain the termination of primer extension in the S. solfataricus primase.

Also based on concomitant binding of both primer ends, a third mechanism of primer termination has been proposed. This model takes into account that formation of the primer-template duplex results in a change from linear single stranded DNA to a double strand, which tends to adopt a helical conformation. Consequently, the nascent helix forces the enzyme domains at the ends of the primer to rotate relative to each other around the axis of the template DNA. This mode of termination has been proposed for the human primosome based on structural evidence of sterical hindrance between p58N and p58C leading to a clash exactly after 9 nt (Baranovskiy et al., 2016a).

The pRN1 primase forms a unit-length primer of eight nucleotides (Lipps et al., 2003). In order to better understand the elongation cycle of the pRN1 primase and its specific termination we first investigated in more detail the substrate requirements of the primase initiation and primase extension activity, building on recent structural insights. Next, we investigated which of the proposed mechanisms of primer termination could apply to pRN1 primase. To this end, we took advantage of the covalent coupling of the catalytic domain and the HBD which allowed us to rigorously test the mechanistic requirements of primer termination by constructing mutants with linker sequences differing in length and flexibility. Finally we built a structure model of pRN1 primase in complex with the substrate DNA.

Materials and Methods

Recombinant Expression and Purification of Proteins

The pRN1 primase and mutants thereof (amino acids 40–370) was expressed recombinantly in Escherichia coli BL21 (DE3) using the expression plasmid pET28c(+) and a c-terminal His6-tag. Point mutants were generated according to the QuickChange PCR protocol involving two mutagenic oligonucleotide primers (synthetic primers were obtained from Microsynth AG) and Pfu polymerase, parental DNA was digested with DpnI. The incorportion of mutations was verified by Sanger sequencing (Microsynth AG).

Cultures were grown in LB medium (Miller) up to OD600 nm = 0.6, then induced by addition of 1 mM IPTG and fermentation was continued for 16 h at 30°C. Cells were collected by centrifugation at 8,500 g for 20 min and the pellet resuspended in 20 mL of lysis buffer (50 mM Na2HPO4, 100 mM NaCl, 10 mM MgCl2 pH 8.0) per 2 L culture to which 0.01% v/v Triton X-100 was added prior to sonication. The whole cell extract was cleared by centrifugation at 34,000 g for 20 min. The supernatant was loaded onto a Co2+ Talon affinity chromatography column (5 mL), eluted with an imidazole gradient. Fractions containing the target proteins (∼ 15 mL) were pooled and dialyzed against storage buffer (50 mM HEPES, 100 mM NaCl, 10 mM MgCl2 pH 7.0). The resulting solution was concentrated by ultrafiltration (10 kDa MWCO) to approximately 2 mL and further purified by size exclusion chromatography on a 60 cm × 16 mm Superdex 75 column (GE healthcare) using storage buffer (50 mM HEPES, 100 mM NaCl, 10 mM MgCl2 pH 7.0) as the mobile phase. Peak fractions were pooled and concentrated by ultrafiltration to approximately 0.5 mM. Protein concentrations were measured using the theoretical extinction coefficient at 280 nm. The preparation typically yielded between 20 and 50 mg protein per liter of culture volume.

Primase Assay

Reactions were performed in 15 mL volume containing 1× NEB-1 buffer (New-England Biolabs; 10 mM Bis-Tris-Propane-HCl, 10 mM MgCl2, 1 mM DTT, pH 7.0), 50 μM of the relevant NTPs, 5 μM ssDNA template, and 0.25 μM of enzyme. Synthetic DNA oligonucleotides used as templates or primers were obtained from Microsynth AG, and were of HPLC-purified quality. Some assays used oligonucleotides produced by pRN1 primase previously isolated by preparative HPLC. If a primer-template duplex DNA was used as a substrate for the primase essay, the strands were annealed by the following temperature program without further purification: 98°C for 2 min; 70°C for 5 min; 50°C for 10 min; 40°C for 5 min; 25°C for 2 min (in reaction buffer).

The assays were incubated at 55°C for 20–120 min, then stopped by addition of 15 mL 15 mM EDTA pH 8.0 and incubated on ice for 5 min. The resulting mixtures were centrifuged 5 min at 16,000 g prior to analysis by HPLC.

Preparative and Analytical HPLC, MS

The products of primase reactions were separated and analyzed on an XBridge C18 3.5 mm (3.0 mm inner diameter 150 mm length) HPLC column (Waters) at 55°C using an Agilent 1100 Series HPLC system. For analytical purposes 20 μL of sample were injected. A gradient elution was carried out using as solvent A 8 mM triethylamine and 200 mM hexafluoroisopropanol in water and as solvent B a mixture of 50% v/v A and 50% v/v methanol. The mobile phase was delivered at a flow rate of 1 mL/min, with a gradient from 5% B to 50% B over 8 min. Analytes were detected by measuring absorbance at 260 nm using a diode array detector (Gilar et al., 2002). Peak areas in HPLC chromatograms were integrated using the program PeakFit v4.12. The amounts of primase products were calculated from the peak areas based on the extinction coefficients of the different oligonucleotides according to the conversion factors given in Supplementary Table 1. Rates of nucleotide incorporation and primer synthesis were calculated in units of mol dNTPs or mol primers per mol enzyme per minute.

For preparative HPLC the injection volumes was increased to 50 or 80 μL and the gradient was extended to 20 min. Peak fractions were manually collected and lyophilized using a Christ Alpha 2-4LD freeze dryer. The identity and purity of products were verified by analytical HPLC and by MALDI-TOF mass spectrometry. Samples for MALDI-TOF analysis were dissolved in water and spotted onto a 2′,4′,6′-Trihydroxyacetophenone (THAP) matrix and were analyzed at the mass spectrometry facility of the University of Bern.

Structure Modeling

For modeling the link between the DNA sections GCAG, bound to the Prim_Pol domain and GTGCTCA, bound to the HBD, we used the backbone pseudo-rotamer library for RNA developed by Humphris-Narayanan and Pyle (2012). The library at 5° resolution (Pyle Lab, 2021) contains 577 backbone conformations for each nucleotide. We first determined which of these conformations superimposed with the given coordinates of the two terminal G nucleotides with a root mean square deviation (RMSD) below a threshold value ΔRMSD. For all link combinations we predicted the coordinates of the P and C4′ atoms of the first nucleotide of the second DNA section by superimposing the pseudo-rotamer from the library with the coordinates of the last nucleotide of the first DNA section. Likewise, we predicted the coordinates of the C4′ atom of the last nucleotide of the first DNA section by superimposing the pseudo-rotamer with the coordinates of the first nucleotide of the second DNA section. This provided two triplets of C4′ (i)-P(i + 1)-C4′ (i + 1) coordinates that we superimposed. Link geometries were accepted at this stage if the superposition RMSD was again below ΔRMSD. From these link geometries we selected those for which the nucleobases did not clash, obtaining 9,046 link geometries at ΔRMSD = 0.25 Å and 53,466 link geometries at ΔRMSD = 0.5 Å. We then removed all link geometries with clashes between the two domains as well as those with a CA-CA distance longer than 49.4 Å between the C-terminal residue of the CAD and the N-terminal residue of the HBD, which are joined by a 12-residue linker (G249-F260). This reduced the number of possible link geometries to 2 at ΔRMSD = 0.25 Å and to 27 at ΔRMSD = 0.5 Å. In both ensembles, the peptide linkers were then constructed by YASARA (Krieger et al., 2009), followed by geometry optimization in YASARA. The two primase conformers at ΔRMSD = 0.25 Å constitute the two alternative models presented. The broader ensemble contained conformations resembling the two models except for relative domain rotations by a few degrees and was therefore not considered further.

Results and Discussion

Depending Upon Substrate pRN1 Primase Works as a Primase or as a DNA Polymerase

In contrast to other archaeo-eukaryotic primases, pRN1 primase strictly uses deoxynucleotides for primer elongation, whilst the first nucleotide of the primer is a ribonucleotide (Figure 1). In addition to its primase activity the pRN1 primase also has DNA polymerase activity. Using circular single-stranded DNA template extension products of several kilobases are observed (Lipps et al., 2003). However, in contrast to the first observation, when a single-stranded template containing the 5′-GTG-3′ motif is offered as substrate, a primer of the length of eight nucleotides is the major product (Beck and Lipps, 2007). It thus appears that the pRN1 primase operates in two distinct modes, the “primase mode” and the “DNA polymerase mode.” We therefore analyzed the products of the pRN1 primase reaction assays via HPLC chromatography using different defined short templates and primer/templates in order to understand how pRN1 primase selects its “operating” mode.

When pRN1 primase is supplied with a GTG containing template (Table 1, substrate GTG) a primer of a length of eight nucleotides is synthesized when ATP, dATP and dTTP are included in the reaction mix. A single base change in the GTG recognition motif (substrate CTG) abrogates primer synthesis in line with previous experiments on the template specificity of the primase activity (Beck and Lipps, 2007) (chromatograms shown in Supplementary Figure 1).

TABLE 1
www.frontiersin.org

Table 1. DNA substrates used in this study.

When incubating pRN1 primase with substrate GTG (A7T6GTGATC) in the absence of dTTP an only six base long primer (pppaA5) is synthesized. Primer synthesis is aborted at this length due to the lack of the cognate nucleotide dTTP (see Supplementary Figure 1 trace B).

Unlike a synthetic DNA primer, such a primer formed by pRN1 primase contains a triphosphorylated ribo-adenosine at the 5′ end (primer substrate pppaA5, Table 1). We isolated ∼ 10 nmol of this primer to serve as substrate for subsequent assays by a preparative HPLC purification and confirmed the identity of the aborted primer product by MALDI mass spectrometry (Supplementary Figure 2).

Next, we tested whether pRN1 primase can restart primer synthesis from a preformed primer/template. For this purpose we hybridized template GTG with the abortion product pppaA5 and alternatively with chemically synthesized oligodeoxynucleotide (Table 1, primer A6). Both primers have the same base sequence but differ at the 5′ terminal backbone sugar and by the 5′ triphosphate group, which is only present in the enzymatically synthesized abortion product.

The chemically synthesized DNA primer/template (substrate A6/GTG) is not extended by the pRN1 primase to form unit length primers; only slow formation of the run-off product through its DNA polymerase activity occurs, albeit at a much lower rate compared to primase activity (see below in Table 2 for the incorporation rates). Thus, an extension product of the primer unit length of eight nucleotides, as expected for a primase reaction, is not observed in the HPLC chromatogram of the assay containing a synthetic DNA primer (Figure 2A).

TABLE 2
www.frontiersin.org

Table 2. Nucleotide incorporation rates of the pRN1 primase with different substrates.

FIGURE 2
www.frontiersin.org

Figure 2. Elongation assay: pRN1 enzyme incubated with different primer-template substrates. (A) Primer-template combination A6/GTG: The synthetic 6 nt primer is not elongated by the primase in presence of dTTP. (B) Primer-template combination pppaA5/GTG (6 nt primer with triphosphorylated ribonucleoside at 5′ terminus): The 6 nt primer is elongated by the primase in presence of dTTP. Incorporation of dTTP by the primase leads to formation of unit length products (primase mode, for comparison of primers formed with this template, refer to Supplementary Figure 1). (C) Primer-template combination pppaA5/CTG: The primer pppaA5 on a template lacking the recognition sequence GTG is not elongated. No unit length products are observed with template CTG instead of GTG.

In contrast, with the enzymatically produced primer hybridized to the same template (substrate pppaA5/GTG) the primer is extended to the unit length primer but not to the run-off extension product (Figure 2B). We refer to this activity of pRN1 enzyme, leading to a unit-length extension product shorter than the available template as the “primase mode.”

To further corroborate this finding we repeated the assay with the substrate pppaA5/CTG. This substrate has a single point mutation in the GTG recognition motif, which is required by pRN1 primase for de novo primer synthesis. With substrate pppaA5/CTG, an eight nucleotide long primer is not observed (Figure 2C) in line with the observation that initiation of primase mode requires the GTG motif in the template strand. While the 30 min incubation time of the primase assay used here results in no detectable extension of the primer, the run-off product is detected after prolonged incubation (120 min instead of 30 min)—an activity of pRN1 primase which we refer to as “DNA polymerase mode.”

Thus, the “primase mode” is only activated when the GTG motif is present in the template and the primer contains the genuine 5′ end structure bearing a 5′ triphosphorylated ribonucleotide. Only under these restrictive conditions is it possible to restart primer synthesis from a preformed primer/template duplex. We conclude that the primase activity depends upon contacts with the 5′ end of the primer and the GTG motif in the template strand during the primer synthesis reaction. This is in line with our structural investigations which demonstrated that the GTG motif and nucleoside triphosphates are bound by the HBD (Boudet et al., 2019).

Templates devoid of the GTG motif are only weakly bound by the HBD. Similarly, a 5′ primer end lacking triphosphorylation is likely to be bound more weakly by the HBD since it is known that HBD harbors two nucleotide binding sites that bind the nucleotides mainly via the triphosphate groups. Consequently, activation of the “primase mode” can only occur if the 5′ primer end hybridized to the GTG containing template is bound and orientated by the HBD. In the absence of this specific activation, the primase remains in “DNA polymerase mode,” resulting in a much slower elongation reaction unable to terminate at a specific unit length. We consider this polymerase activity not physiologically significant if compared with the much more processive cellular DNA polymerases. Absence of proofreading and processivity in the polymerase mode of pRN1 primase suggest this enzyme provides replication initiation at the recognition sequence but not the main polymerase function which is likely provided by cellular enzymes.

Next, we compared the kinetics of nucleotide incorporation by pRN1 primase in “primase” and in “polymerase” mode. To this end, we quantified the appearance of the respective reaction products in the primase assay via HPLC analysis (Table 2). We found that the DNA polymerase activity of pRN1 primase occurs with a relatively low rate (∼0.1 mol dNTP per mol enzyme incorporated per minute), however, when a GTG containing template or a genuine primer/template is offered (“primase mode”), the incorporation rate increases about 20-fold (see Table 2).

Mechanistically this activation is likely caused by binding and orientation of the DNA substrate by the HBD in preparation for the enzymatic reaction occurring at the active center of the catalytic domain as discussed above. We therefore predict that the HBD is not only required to initiate dinucleotide synthesis but that the HBD remains bound to the primer/template throughout the complete primase extension cycle until an eight nucleotide unit length primer has been generated. This raises the question whether the HBD is also involved in the termination of primer synthesis occurring as soon as a unit length has been reached by the catalytic domain. The experimental data presented in Table 2 further show that dNTP incorporation in polymerase mode is slightly more rapid with the construct lacking the C-terminal domain. This observation may be interpreted as a hindrance of the polymerase activity harbored in the N-terminal domain if the HBD binds primer-templates that lack the triphosphate or the recognition sequence and hence do not lead to activation of “primase mode.”

The Interdomain Linker Length Influences Primase Activity and Primase Abortion Rate

The pRN1 primase is a single subunit archaeo-eukaryotic primase offering the unique possibility to study the interaction between the catalytic domain and the HBD in more detail by mutating the linker sequence between both domains. The flexible linker sequence (G249–F260) between the domains is flanked by α-helical secondary structure elements of the catalytic domain and the HBD. Phylogenetically the linker sequence itself is partially conserved (refer to sequence alignment in Figure 3) containing a high fraction of charged amino acids. The linker length varies by up to two residues longer or up to two residues shorter as observed in the close homologs of pRN1 primase.

FIGURE 3
www.frontiersin.org

Figure 3. Alignment of the region between the catalytic domain and the HBD of the pRN1 replication enzyme with homologous archaeal replication proteins, including those encoded by mobile genetic elements (pRN1, pRN2, pMEN7, and pDL10). The sequence of the pRN1 replication protein is given in the first line with helices highlighted with gray background and the beta-strand boxed. The residues in the linker sequence which could not be resolved in the crystal structure (pdb: 31M1) are underlined and bold. The alignment was generated with COBALT at a 4 bit threshold for coloring.

Since the linker sequence is partially conserved we first tested the effect of substituting the linker sequence by an increasing number of glycine-serine repeats. With this approach, we sought to avoid possible complications arising from residues potentially exerting a specific effect on the primase activity. The mutants with the increasing number of serine-glycine repeats were expressed recombinantly in E. coli and the purified protein was assayed using an HPLC-based primase activity assay. In this assay a short oligonucleotide with the primase recognition motif 5′-GTG-3′ is supplied as a template and in the presence of dATP and rATP on average eight (seven to ten) nucleotide long primer solely consisting of adenine bases is synthesized. Amounts of the different primase products were quantified by integrating the corresponding peak areas. We distinguish between the aborted primers (the dinucleotide pppaA as well as the trinucleotide pppaAA; lower case letters ribonucleotide, upper case letters deoxynucleotide), intermediate products with lengths between four and six nucleotides and unit-length primers of 7–10 nucleotides. With the wild-type enzyme the observed products are mainly unit-length primers and aborted products and almost no intermediate products (see Supplementary Figure 1).

The dinucleotide is released if the first phosphodiester bond forming step (the condensation of the initiating rATP with the first elongating dATP) is not followed by further elongation of the dinucleotide/template. Since the dinucleotide/template hybrid is unstable, lack of productive elongation might result whenever repositioning of the dinucleotide/template at the active site of the catalytic domain occurs too slowly, leading to release of the dinucleotide from the enzyme-substrate complex. The same reason might explain why a small amount of the trinucleotide pppaAA is also released as observed in the assay. Further intermediate primer products are barely observed, pointing to a more efficient repositioning as soon as the primer reaches a length of four nucleotides.

In our standard assay with the substrate T8GTGT3 we would expect to observe only primers with the unit-length of eight nucleotides. However, products of the length of seven, nine, and 10 nucleotides are also observed. The same length distribution is observed if the longer template T14GTGT3 is used instead. We explain the occurrence of these longer products as well as the seven nucleotide long primer with a slippage of the primer on the homonucleotide stretch of the template during the primase elongation cycles. This explanation is supported by the observation that a template with a mixed base sequence downstream of the GTG motif leads to the exclusive formation of the 8 nt long primer (see also Figure 4).

FIGURE 4
www.frontiersin.org

Figure 4. HPLC based primer length determination of linker length mutants. Primase assay of wild type primase incubated with template “mixed” produces a unit length primer of 8 nt (ppppaGAAGAAG). The same main product is observed in varying amount with the linker mutants Δ6, P4, (SG)4, +4, +8, and +12. The identity of the product was confirmed to be the ppppaGAAGAAG with MALDI-TOF mass spectrometry (calculated exact mass 2,746.38 gmol–1, found: 2,746.04).

All mutants with the increasing number of serine-glycine repeats show greater activity than the wild-type protein (Table 3). Thus, it appears that a more flexible linker favors primer synthesis. However, the increase of activity is limited to twofold of the wild-type activity, which is already reached if one serine-glycine repeat is incorporated in the linker sequence. A further increase in backbone flexibility does not increase the enzymatic activity further. To substantiate the relationship between backbone flexibility and activity we next substituted the glycine residues of the (SG)4 mutant with prolines in order to obtain linkers with decreased flexibility. As can be seen in Table 3 the decrease of linker flexibility by an increasing numbers of incorporated proline residues is well correlated with a decrease in primase activity. We also note that the fraction of aborted primers is negatively correlated with the enzymatic reaction rate.

TABLE 3
www.frontiersin.org

Table 3. Primase activity of mutants with differing linker backbone flexibility (template T8GTGT3).

Next, we investigated the effect of linker length on enzymatic activity. For this purpose, mutants with different numbers of serine-glycine repeats were produced and tested (Table 4). The highest enzymatic activity was observed with the mutant having four additional amino acids in the linker (two additional serine-glycine repeats). Further extension of the linker decreases primase activity. We attribute this to the reduced local concentration of the HBD relative to the active site. The observed tendency is in agreement with the fact that a mixture of the separate domains does not display any primase activity.

TABLE 4
www.frontiersin.org

Table 4. Primase activity of linker length mutants (template T8GTGT3).

Upon shortening the linker, the enzymatic activity is strongly reduced. Truncating the linker by only one amino acid already reduced the activity about threefold. The even shorter variants display a progressively greater loss of activity. Noteworthy, as with the linker flexibility series (Table 3) we find that the fraction of aborted primers is negatively correlated with the enzymatic reaction rates of the linker length mutants.

The Unit-Length of the pRN1 Primase Primer Is Eight Nucleotides Regardless of Interdomain Linker Length

The mechanism of primer termination of pRN1 primase is not well understood. Our data suggest that in the “primase mode” the primer/template end remains attached to the HBD. Therefore possibly, there should be an upper limit on primer length imposed by the length of the linker between the two domains. The “caliper model,” as suggested for S. solfataricus primase (Yan et al., 2018), would predict the unit-length of the primer to be determined by the maximal attainable distance between the primer 5′ end binding site on the HBD and the active site of the catalytic domain (where the 3′-end must be bound).

To test whether this mechanism of primer termination also applies to pRN1 primase we determined the primer lengths produced by our linker length mutants. As the ssDNA template we used an oligonucleotide with a mixed nucleotide composition upstream of the 5′-GTG-3′ motif. The mixed sequence is intended to prevent slippage of the primer along the template, and indeed we only obtained a single product of eight nucleotides in this variant of the assay (Figure 4). Surprisingly, the linker mutants varying in linker length by 15 amino acid residues (mutant +12 vs. mutant −3) all produce a linker of eight nucleotides. We can therefore exclude the possibility that primer termination proceeds according to the “caliper model.”

How Is Primer Termination Triggered?

The ability to terminate elongation after reaching unit length may be considered a defining property of the primase mechanism as opposed to processive polymerases since it limits the amount of nucleotides incorporated by the error-prone primase to a minimum before a high-fidelity DNA polymerase takes over, thereby minimizing the amount of proofreading required to process Okazaki fragments and speeding the replication process. Our data clearly shows that primer termination is not influenced by the linker length—thus we exclude the “caliper” model.

Our experiments do not support primer length determination by stochastic dissociation from the substrate primer-template either. Firstly, with a template devoid of homonucleotide runs, we strictly observe only primers with a length of 8 nt, whereas spontaneous dissociation would by necessity result in a wider distribution of primer lengths. Secondly, if a preformed primer-template duplex is incubated with primase in the presence of the cognate dNTPs, the stochastic model would predict elongation by the same number of nucleotides as are used in a unit length primer. This is, however, not the case, since elongation occurs only up to the overall unit-length.

The third proposed model of primer termination explains the specific termination at unit length primers through a steric clash resulting from the rotation of the domains which is induced by the helical conformation of the primer-template. This model is in principle compatible with our observations since the clash and ensuing primer termination might occur at a specific angle dictated by the geometry of the protein structures of the two domains and not by the linker length. To gain better structural understanding whether such a termination mechanism is plausible, we modeled the structure of the active primase-substrate complex. Starting point for this model was the NMR structure of the HBD sequence specifically binding the GTG motif in the template ssDNA strand [pdb: 6GVT, (Boudet et al., 2019)]. The initial dinucleotide of primer synthesis is formed complementary to the two bases directly 5′ of the GTG motif. Therefore these two bases of the template must interact with the active site of the Prim_Pol domain. Currently there is no direct structural information of the liganding of template and nucleotides in the active site of pRN1 primase. However the active site of pRN1 primase shares structural homology with the Polymerase Domain (PolDom) from Mycobacterium tuberculosis Ligase D, an enzyme involved in non-homologous end joining. Two crystal structures of this enzyme liganded with template ssDNA and a UTP molecule at the elongating position have been determined [pdb: 3PKY (Brissett et al., 2011) and 4MKY (Brissett et al., 2013), respectively]. We used this information to place UTP and a four base template strand in the crystal structure of the pRN1 primase [pdb: 3M1M, (Beck et al., 2010)]. Next, the two separate domains (Figures 5A,B) underwent a rigid body docking while enforcing the covalent linkage of two the domains via the flexible linker (wild type sequence) and fusing the two template parts into one continuous DNA template. During the docking procedure the two template parts were restrained relative to the interacting protein surfaces and the covalent bonds at the fusion site in the single-stranded DNA backbone were sampled from an RNA rotamer library. With this procedure we arrived at two biochemically and structurally plausible models (Figure 5C). The conformations differ considerably between the two models: In one model (left) the template is bent at the fusion site between the GTG motif (magenta) and the base opposite to the initiating base (blue). In this model the connecting linker (gray) is not fully extended which is in line with our experimental observation that even shortened linkers support primer synthesis (see Table 4). The alternative model (Figure 5C, right) has a nearly linear (but twisted) DNA backbone conformation. In this case the linker is stretched to connect both domains. This structure could more easily explain the recognition of the terminal triphosphate group of the primer by the HBD, which we biochemically observed to be essential for primer synthesis together with the ribose at the 5′ primer end. In the NMR structure of the quaternary complex (Boudet et al., 2019), HBD binds both ATPs via the triphosphates. One triphosphate moiety is in close proximity to the template backbone and the adenine base is close to the template base opposite of the initiating nucleotide (blue). Thus it is possible that the triphosphate of the 5′ primer end (which is part of the initiating nucleotide) could displace the triphosphates of one of the ATPs bound by HBD during primer synthesis initiation.

FIGURE 5
www.frontiersin.org

Figure 5. Structure model of the initiating primase. (A) Solution structure of the HBD domain (green) binding sequence-specifically a template with a GTG motif (pdb: 6GVT), (B) Model of catalytic Prim_Pol domain (red, pdb: 3M1M) complexed with template and UTP at the elongating position (derived from M. tuberculosis PolDom (pdb: 4MKY and 3PKY) having the same conserved active site). (C) Two models of the complete primase during initiation of primer synthesis. The two models were obtained by rigid body docking between the protein complexes of Prim_Pol/GCAG and HBD/GTGCTCA while enforcing a covalent bond between the two terminal guanosines and modeling a flexible linker between both domains. The dinucleotide is formed opposite of the two bases upstream of the GTG motif and constrains the possible conformations of the initiating primase as the GTG remains bound to the HBD domain and dinucleotide formation is catalyzed at the active site of the Prim_Pol domain.

In summary, the well-regulated primer termination was found to be dependent on interaction with the 5′ terminus, yet it is not the result of the flexible interdomain region acting as a molecular caliper to restrict the distance that can be accommodated between the binding sites in the two domains.

The rotational clash model advanced to explain this regulation in eukaryotic primases (Baranovskiy et al., 2016b) suggests primer termination would occur by the relative motion of the domains dictated by the growing double-stranded helical DNA between the Prim_Pol and the HBD. In our models the rotation and translation of the HBD relative to the Prim_Pol domain would first shorten the distance covered by the flexible linker. Thus primer synthesis beyond the dinucleotide would be accommodated. When the primer length approaches eight nucleotides (i.e., ∼24 Å of double-stranded DNA) both domains will be quite distant from each other which renders a rotational clash between both domains doubtful.

It is difficult to fully explain the observed stringent primer termination by the currently proposed mechanisms of primer termination. We found experimentally that the HBD is crucial for primer synthesis and that the covalent linkage between both domains is required for efficient primer synthesis. We argue that the main function of the HBD is to prepare primer synthesis by binding and orientating the substrates. In fact without the HBD, primer initiation is not possible and elongating dNTP incorporation is slowed by about a factor of 20. On this basis we suggest that efficient activity in “primase mode” is stimulated by contacts between the two domains, which thus act cooperatively in this enzyme-subtrate complex. Such specific interaction would depend upon correct arrangement of the domains relative to each other and the nucleic acid substrates, which appears to be determined by the presence of the interdomain linker and the binding of both primer ends as supported by our experiments. Upon exceeding unit length the complex would reach a conformation that no longer permits the activity promoting interaction of the domains either as a consequence of increasing flexibility or due to the increased length of dsDNA separating them.

Our investigations of the primase extension cycle demonstrated that the substrate requirements for genuine primase activity can be explained if the triphosphorylated 5′ primer end together with the motif GTG remains bound to the helix bundle domain HBD. Variation of the flexibility and the length of the interdomain linker between catalytic domain and HBD showed a profound effect on primase activity and primer abortion rate but primer termination continued to occur robustly at eight nucleotides. Capturing the structures of the enzyme-substrate complex during the elongation cycle will be required to decide how termination is triggered by the lack of stimulation of the HBD enforced by the growing primer/template.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

JB performed the experiments reported in Figures 2, 4, Table 2, and Supplementary Figures. JB and J-CD wrote the manuscript. J-CD performed the experiments reported in Tables 3, 4. GJ performed the model calculation reported in Figure 5. GL directed the project, obtained funds and wrote the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the SNF (grant nos.: 310030_163345 and 310030_141160).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The services of Urs Kämpfer at the mass spectrometry facility at University of Berne are gratefully acknowledged.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2021.652928/full#supplementary-material

References

Baranovskiy, A. G., Babayeva, N. D., Zhang, Y., Gu, J., Suwa, Y., Pavlov, Y. I., et al. (2016a). Mechanism of concerted RNA-DNA primer synthesis by the human primosome. J. Biol. Chem. 291, 10006–10020. doi: 10.1074/jbc.M116.717405

PubMed Abstract | CrossRef Full Text | Google Scholar

Baranovskiy, A. G., Zhang, Y., Suwa, Y., Gu, J., Babayeva, N. D., Pavlov, Y. I., et al. (2016b). Insight into the human DNA primase interaction with template-primer. J. Biol. Chem. 291, 4793–4802. doi: 10.1074/jbc.M115.704064

PubMed Abstract | CrossRef Full Text | Google Scholar

Beck, K., and Lipps, G. (2007). Properties of an unusual DNA primase from an archaeal plasmid. Nucleic Acids Res. 35, 5635–5645. doi: 10.1093/nar/gkm625

PubMed Abstract | CrossRef Full Text | Google Scholar

Beck, K., Vannini, A., Cramer, P., and Lipps, G. (2010). The archaeo-eukaryotic primase of plasmid pRN1 requires a helix bundle domain for faithful primer synthesis. Nucleic Acids Res. 38, 6707–6718. doi: 10.1093/nar/gkq447

PubMed Abstract | CrossRef Full Text | Google Scholar

Bergsch, J., Allain, F. H.-T., and Lipps, G. (2019). Recent advances in understanding bacterial and archaeoeukaryotic primases. Curr. Opin. Struct. Biol. 59, 159–167. doi: 10.1016/j.sbi.2019.08.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Boudet, J., Devillier, J.-C., Wiegand, T., Salmon, L., Meier, B. H., Lipps, G., et al. (2019). A Small helical bundle prepares primer synthesis by binding two nucleotides that enhance sequence-specific recognition of the DNA template. Cell 176, 154–166.e13. doi: 10.1016/j.cell.2018.11.031

PubMed Abstract | CrossRef Full Text | Google Scholar

Brissett, N. C., Martin, M. J., Bartlett, E. J., Bianchi, J., Blanco, L., and Doherty, A. J. (2013). Molecular basis for DNA double-strand break annealing and primer extension by an NHEJ DNA polymerase. Cell Rep. 5, 1108–1120.

Google Scholar

Brissett, N. C., Martin, M. J., Pitcher, R. S., Bianchi, J., Juarez, R., Green, A. J., et al. (2011). Structure of a preternary complex involving a prokaryotic NHEJ DNA polymerase. Mol. Cell 41, 221–231.

Google Scholar

Conaway, R. C., and Lehman, I. R. (1982). Synthesis by the DNA primase of Drosophila melanogaster of a primer with a unique chain length. Proc. Natl. Acad. Sci. U.S.A. 79, 4585–4588.

Google Scholar

Desogus, G., Onesti, S., Brick, P., Rossi, M., and Pisani, F. M. (1999). Identification and characterization of a DNA primase from the hyperthermophilic archaeon Methanococcus jannaschii. Nucleic Acids Res. 27, 4444–4450. doi: 10.1093/nar/27.22.4444

PubMed Abstract | CrossRef Full Text | Google Scholar

Gilar, M., Fountain, K. J., Budman, Y., Neue, U. D., Yardley, K. R., Rainville, P. D., et al. (2002). Ion-pair reversed-phase high-performance liquid chromatography analysis of oligonucleotides: Retention prediction. J. Chromatogr. A 958, 167–182. doi: 10.1016/S0021-9673(02)00306-0

CrossRef Full Text | Google Scholar

Hombauer, H., Campbell, C. S., Smith, C. E., Desai, A., and Kolodner, R. D. (2011). Visualization of eukaryotic DNA mismatch repair reveals distinct recognition and repair intermediates. Cell 147, 1040–1053. doi: 10.1016/j.cell.2011.10.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Humphris-Narayanan, E., and Pyle, A. M. (2012). Discrete RNA libraries from pseudo-torsional space. J. Mol. Biol. 421, 6–26. doi: 10.1016/j.jmb.2012.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Iyer, L. M., Koonin, E. V., Leipe, D. D., and Aravind, L. (2005). Origin and evolution of the archaeo-eukaryotic primase superfamily and related palm-domain proteins: structural insights and new members. Nucleic Acids Res. 33, 3875–3896. doi: 10.1093/nar/gki702

PubMed Abstract | CrossRef Full Text | Google Scholar

Keck, J. L., and Berger, J. M. (2001). Primus inter pares (First among equals). Nat. Struct. Mol. Biol. 8, 2–4. doi: 10.1038/82996

PubMed Abstract | CrossRef Full Text | Google Scholar

Kornberg, A. (1980). DNA Replication. ED.1980. San Francisco, CA: Freeman.

Google Scholar

Krieger, E., Joo, K., Lee, J., Lee, J., Raman, S., Thompson, J., et al. (2009). Improving physical realism, stereochemistry and side-chain accuracy in homology modeling: four approaches that performed well in CASP8. Proteins 77, 114–122. doi: 10.1002/prot.22570

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuchta, R. D., and Stengel, G. (2010). Mechanism and evolution of DNA primases. Biochim. Biophys. Acta 1804, 1180–1189.

Google Scholar

Liberti, S. E., Larrea, A. A., and Kunkel, T. A. (2013). Exonuclease 1 preferentially repairs mismatches generated by DNA polymerase α. DNA Repair. 12, 92–96. doi: 10.1016/j.dnarep.2012.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Lipps, G., Rother, S., Hart, C., and Krauss, G. (2003). A novel type of replicative enzyme harbouring ATPase, primase and DNA polymerase activity. EMBO J. 22, 2516–2525.

Google Scholar

Mizrahi, V., Henrie, R. N., Marlier, J. F., Johnson, K. A., and Benkovic, S. J. (1985). Rate-limiting steps in the DNA polymerase I reaction pathway. Biochemistry 24, 4010–4018. doi: 10.1021/bi00336a031

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, X., Holz, I., Zillig, W., Garrett, R. A., and She, Q. (2000). Evolution of the family of pRN plasmids and their integrase-mediated insertion into the chromosome of the crenarchaeon Sulfolobus solfataricus11Edited by J. Karn. J. Mol. Biol. 303, 449–454. doi: 10.1006/jmbi.2000.4160

PubMed Abstract | CrossRef Full Text | Google Scholar

Pyle Lab (2021). Software | Pyle Lab at Yale University. Available online at: https://pylelab.org/software (accessed January 11, 2021).

Google Scholar

Sheaff, R. J., and Kuchta, R. D. (1994). Misincorporation of nucleotides by calf thymus DNA primase and elongation of primers containing multiple noncognate nucleotides by DNA polymerase alpha. J. Biol. Chem. 269, 19225–19231.

Google Scholar

Spiering, M. M., Hanoian, P., Gannavaram, S., and Benkovic, S. J. (2017). RNA primer–primase complexes serve as the signal for polymerase recycling and Okazaki fragment initiation in T4 phage DNA replication. Proc. Natl. Acad. Sci. U.S.A. 114, 5635–5640. doi: 10.1073/pnas.1620459114

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, J., Holzer, S., Pellegrini, L., and Bell, S. D. (2018). An archaeal primase functions as a nanoscale caliper to define primer length. Proc. Natl. Acad. Sci. U.S.A. 115, 6697–6702. doi: 10.1073/pnas.1806351115

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: replication, priming, archaea, genome maintenance, regulation, DNA protein interaction

Citation: Bergsch J, Devillier J-C, Jeschke G and Lipps G (2021) Stringent Primer Termination by an Archaeo-Eukaryotic DNA Primase. Front. Microbiol. 12:652928. doi: 10.3389/fmicb.2021.652928

Received: 13 January 2021; Accepted: 19 March 2021;
Published: 13 April 2021.

Edited by:

Yoshizumi Ishino, Kyushu University, Japan

Reviewed by:

Changyi Zhang, University of Illinois at Urbana–Champaign, United States
Silvia Ayora, Consejo Superior de Investigaciones Científicas (CSIC), Spain

Copyright © 2021 Bergsch, Devillier, Jeschke and Lipps. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Georg Lipps, Z2VvcmcubGlwcHNAZmhudy5jaA==

These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.