|
|
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
RESEARCH PAPER
Department of Molecular and Cell Biology, Division of Biochemistry and Molecular Biology, University of California, Berkeley, California 94720-3204, USA
| Abstract |
|---|
|
|
|---|
[Keywords: Crystal structure E1E2 complex; papillomavirus; helicase structure; AAA+ protein; DNA replication]
Received May 11, 2004; revised version accepted June 16, 2004.
8 ORFs), the viral life cycle depends heavily on interaction with the host cell for enzymes and ancillary factors for its own gene expression and replication programs. The PVs have provided important novel insights into the regulation of cell proliferation and cancer etiology; for example, the targeted degradation of p53 directed by the HPV16 E6 protein first revealed how key cellular regulatory proteins could be specifically ubiquitinated by E3 ligases for destruction by the proteosome (Scheffner et al. 1993
The first two universal steps in the formation of a prereplication complex are origin recognition and subsequent distortion and separation of duplex DNA strands. The simplicity of the PV prereplication complex offers advantages for mechanistic and structural dissection of these steps. Only two PV-encoded proteins, E1 and E2, are necessary for the initiation of viral DNA replication. The E1 protein (for review, see Sverdrup and Myers 1997
) serves as the viral initiator protein that binds to the viral origin and, on DNA melting, converts to a helicase that functions during viral DNA replication. The N-terminal
200 amino acids of E1 are not well conserved between various papillomaviral types except for two boxes corresponding to the nuclear localization signal (Fig. 1A). The central portion E1 comprises the site-specific DNA-binding domain of E1 that associates with a series of inverted-repeat binding sites at the viral origin (Mendoza et al. 1995
; Chen and Stenlund 1998
; Enemark et al. 2002
). The helicase domain of E1 is contained in the C-terminal half of the protein and contains motifs characteristic of Walker-type ATP-binding and hydrolysis proteins. Truncated versions of HPV E1 deleted for the entire site-specific DNA-binding domain are active helicases for templates containing forks or single strands (White et al. 2001
; S.E. Moyer, E.A. Abbate, and M.R. Botchan, unpubl.) but are unable to load onto duplex DNA. The protein thus has one set of domains for non-specific DNA interactions that are critical for helicase activity and another domain for site-specific DNA recognition that is essential for origin targeting. The overall domain architecture of E1, including the spacing between the nuclear localization signals, distinct site-specific DNA recognition domain, and helicase motifs of E1 is similar to that of the SV40 T-antigen (Park et al. 1994
; Mansky et al. 1997
).
|
200 amino acids. This domain is essential for all viral DNA functions including transcription, replication, and viral DNA segregation. The modular activation domain is connected to the C-terminal DNA-binding/dimerization domain by a flexible proline-rich hinge domain (Fig. 1A). E2's role in PV replication is to help target monomers of E1 to the viral origin through cooperative DNA binding and to ensure the orderly conversion of the complex to an active helicase. In this respect, E2 is both an initiation protein and a helicase loader (Mohr et al. 1990
The E2-binding sites lie adjacent to the E1-binding loci at the origin and are critical for formation of an E1 · E2 · ori ternary complex that consists of a dimer of E1 and a dimer of E2 (Chen and Stenlund 1998
). The E1 · E2 assembly serves as the prereplication complex for the virus and is essential for replication in vivo, as E1 alone has low intrinsic DNA-binding specificity. Synergistic interactions between the two factors guides the specificity between the Scylla and Charybdis of tight binding to the viral DNA and nonspecific interactions with chromosomal DNA. Although both proteins bind to the origin through their respective DNA-binding domains, strong interactions between the C-terminal helicase domain of E1 and the activation domain of E2 create and stabilize an E1 · E2 assembly as well (Yang et al. 1991b
; Li et al. 1993
; Yasugi et al. 1997
; Masterson et al. 1998
).
For the bovine papillomavirus (BPV), additional contacts between the DNA-binding domains of each protein have been reported (Chen and Stenlund 1998
). For BPV1, the interaction between the DNA-binding domains of E1 and E2 is believed to be an architectural one that produces a DNA shape that permits the stronger protein: protein interactions in the other domains of the partners. This situation may be restricted to BPV1 because of the especially close proximity of the E2-binding site to the adjacent E1-binding site (three base pairs separate the two) in the BPV origin. When E2 DNA-binding sites are displaced from the E1 assembly site, this particular interaction becomes dispensable (Gillitzer et al. 2000
) and a hybrid E2 with a Gal4 DNA-binding domain can substitute for the DNA-binding tether if Gal4 sites are provided in cis (Winokur and McBride 1996
). In contrast, the activation domain of E2 is specifically required for replication in vivo and that function cannot be replaced with other foreign activation sequences.
Following formation of the E1 · E2 complex at the viral origin, a series of poorly understood reactions ensue by which E1 molecules in the prereplication complex reorganize and convert into an active, doubly hexameric helicase (Fouts et al. 1999
; Lin et al. 2002
). Conversion requires the binding of additional E1 molecules to the origin and is accelerated by ATP (Sanders and Stenlund 1998
). How E2 regulates E1 assembly in this conversion process, as well as its ultimate fate, is not clear. For BPV1, following the binding of additional E1 molecules, E2 is displaced from the viral origin (Lusky et al. 1994
; Sanders and Stenlund 1998
). For HPV-11, it has been reported that E2 remains associated with the DNA during the assembly of E1 molecules, even though E2 might serve as a potential roadblock for extensive unwinding (Lin et al. 2002
). These differences raise several important questions. For example, if E2 leaves the BPV origin on the assembly of bound E1 molecules, is displacement of E2 necessary and, if so, why? Does ATP binding play a role in the assembly process or does nucleotide simply power the helicase to displace a proximal E2? Do differences between the viral systems reflect a fundamental divergence in how DNA replication is initiated?
To address some of these questions and better understand the interplay of E1 and E2 at the viral origin, we have determined the structure of a heterodimer containing the full-length activation domain of E2 in complex with the C-terminal helicase domain of E1. The structure shows that E2 regulates E1 assembly by sterically blocking key E1 oligomerization interfaces. This feature indicates that E2 functions not just as a "chaperone" that shuttles E1 to the origin, but it also acts as a molecular "matchmaker" between E1 and DNA to ensure the proper assembly of the E1 double hexamer. Our structural studies, coupled with biochemical experiments, demonstrate that ATP is an allosteric effector of this process and plays an important role in converting the site-specific DNA binding form of E1 in the prereplication complex to a nonsequence-specific DNA helicase that functions during replication.
| Results |
|---|
|
|
|---|
The E1 · E2 complexes used in crystallization trials were formed by expressing both proteins separately, and then mixing the extracts together. These complexes were stable throughout a three-column purification scheme (see Materials and Methods). The E2 portion of the complex contains the full-length activation domain of HPV18 E2 (amino acids 1-215), which is a longer variant of the HPV18 E2 protein whose uncomplexed structure (amino acids 66-215) was solved previously (Harris and Botchan 1999
) and shown to be homologous to the HPV16 E2 activation domain (Antson et al. 2000
). To find a minimal region of E1 that would be soluble and interact with E2, we constructed a series of N- and C-terminal deletions. A fragment of HPV18 E1 containing residues 428-631 in complex with the E2 activation domain yielded crystals suitable for structure determination. This E1 truncation showed affinities for E2 similar to longer fragments, as measured by glutathione-S-transferase (GST) pull-down experiments, and formed a stable heterodimer with E2 in solution, as measured by gel filtration (data not shown). Crystals of the complex were obtained by standard screening procedures, but extensive optimization of crystal harvesting and freezing conditions was necessary to produce high-resolution diffraction data (Materials and Methods). Phases were determined by using MAD from crystals soaked in erbium chloride. The final structure of the complex has been refined to 2.1 Å resolution (Table 1).
|
The E1 · E2 complex crystallized with six heterodimers in the asymmetric unit, offering six independent views of the assembly. Few structural differences are evident between the heterodimers (average C
RMSD < 0.9 Å). Potential E2 activation domain dimerization interfaces seen in different structures of uncomplexed E2 monomers (Harris and Botchan 1999
; Antson et al. 2000
) are not present in any of the E1 · E2 complexes observed here, even though the three E2 N-terminal helices that form various dimerization contacts in uncomplexed molecules are structurally superposable to those in the E1 · E2 heterodimer. The absence of E2-AD homodimer contacts is not due to E1 binding, because E1 associates with the inner surface of the helical domain of E2 (Fig. 1B), as opposed to the outer surface, which mediates E2 · E2 interactions in the free HPV16 and HPV18 E2 structures. The absence of dimerized E2 molecules in the E1 · E2 complex, as well as in the uncomplexed HPV11 E2 structure (Wang et al. 2004
), supports the notion that the dimerization of E2-AD observed in some crystal forms may be fortuitous. For E2-dependent DNA replication in vivo, a single E2-binding site is sufficient in transient assays (Russell and Botchan 1995
); similarly, an E2 dimer containing a single activation domain functions as wild type in vitro (Lim et al. 1998
). These data all indicate that the E1 · E2 interaction does not require dimers of the E2 activation domain. For other E2 functions, such as transcriptional regulation or plasmid segregation, E2 · E2 activation domain interactions may be important, as suggested (Antson et al. 2000
). No functionally significant E1 · E1 interactions are apparent in the crystal lattice.
The E1 · E2 complex is characterized by an overall "C"-shaped structure with dimensions of 75 x 60 x 40 Å. The top and side of the "C" are formed by the E2 activation domain, whereas the bottom is formed by E1 (Fig. 1B). E2 is formed by a largely
-helical N-terminal domain, which is in turn linked to the C-terminal
-strand domain by an extended linker segment (Fig. 1B). There is no significant change in either the location or organization of the major structural elements of E2 in the E1 complex as compared with its unbound forms. The C
RMSD for E2 in the complex as compared with free E2 is 1.0 Å (over 128 amino acids) and 1.2Å (over 189 amino acids) for the HPV18 and HPV16 activation domains, respectively. The only noteworthy difference between the previously solved E2 structures and E2 in the complex is that there is no clear density for the last
-strand at the C terminus, an element that was visible in prior HPV18 and HPV16 structures. This C-terminal stretch of amino acids, which is distant from the E1 interaction surface, lies proximal to the E2 hinge region (see Fig. 1A). The differences observed here likely reflect a high degree of mobility in this end of the E2 activation domain.
The E1 helicase core is predominantly globular, though long loops extend out from the core at either end. The core contains a central parallel
-sheet comprising five strands that are sandwiched by several
-helices. The
/
topology of E1 is characteristic of Walker-type, P-loop ATPases, consistent with its membership in the AAA+ (ATPases associated with various cellular activities) superfamily of proteins (Neuwald et al. 1999
) and the SF3 family of helicases (Koonin 1993
). The SF3 helicase superfamily contains three characteristic motifs.
The first two motifs include the Walker-A (P-loop) and Walker-B (D(D/E)XX) elements, which are necessary for nucleotide binding and hydrolysis. The P-loop is located between strand
1 and helix
4, whereas the Walker-B motif is formed by two successive aspartate residues at the end of strand
3. Motif "C" of the SF3 family contains an invariant Asn residue, which in E1 maps to the C terminus of
6. This motif, often designated as the "sensor 1" region of AAA+ proteins, is thought to either respond to the hydrolytic state of bound nucleotide or to participate in positioning a water for nucleophilic attack on the
-phosphate of ATP.
The last helix in E1 (
9) is also notable. This element runs through the center of a small, circular constellation of helices formed by helices
1-
3. This cluster appears to be important for the proper folding of this domain, because further N-terminal or C-terminal deletions resulted in E1 proteins that are poorly expressed and largely insoluble (E.A. Abbate and M.R. Botchan, unpubl.). Because the E1 helicase core domain is highly conserved between different variants of the virus (Fig. 2B), we expect the structure to be homologous in other E1 variants.
|
The interaction surface between E1 and E2 requires residues from structural elements of both proteins that are spread out over an extended range of primary amino acid sequence (Figs. 2A,B, 3C,D). This may explain why it has been difficult to define the interaction domain by simple deletion analysis or scanning mutagenesis. In total, 940 Å2 of surface area is buried per protomer on formation of the E1 · E2 complex. The three helices of the N-terminal domain of E2 comprise the major contact point for E1. The helical linker segment connecting the N-terminal and C-terminal domains of E2, as well as the short anti-parallel
-strands between helices
B and
C, also make numerous contacts with E1 (Fig. 3). Interestingly, there is no interaction between the C-terminal
-strand domain structure of E2 and E1, creating an open cavity that gives the complex its overall "C" shape (Fig. 3A). The absence of contacts between the C-terminal domain of E2 and E1 allows this region of E2 to flex slightly with respect to the N-terminal domain, a characteristic reflected in the higher B-factors for residues contained in this region. In addition, superposition of the six dimers in the asymmetric unit reveals that the observed differences between the E2 molecules are due to subtle shifts in the relative orientation of the C-terminal domain of E2 with respect to the N-terminal helical domain. The main interaction surface on E1 is formed by a bundle of helices comprising
2,
3, and
9 (Fig. 3D). E1 also makes additional contacts to E2 through a long extended loop between
7 and
9 (loop-2; Fig. 3A,D).
|
B) of the three N-terminal helices of E2 and forms a buried salt bridge with Arg 454 of E1 (Fig. 4A). Alignment of E1 sequences from HPV 6, 11, 16, and 18 showed that Arg 454 is absolutely conserved in the human variants of the virus (Fig. 2B). In BPV1, however, this arginine is replaced by an asparagine, a substitution that not only shortens the side chain, but also precludes any favorable ionic attraction with the corresponding glutamic acid. Interestingly, it appears that other interactions in this region have evolved to account for this variation in BPV1. For all HPV's, a hydrophobic residue points directly toward the salt bridge (EG, Ile 19 in HPV18), which may strengthen the ionic interaction, whereas in BPV the corresponding hydrophobic residue is replaced by a glutamine. Thus, intimate contact between E2 and E1 at this position in the complex appears to be preserved in BPV by creating a three-amino acid hydrogen bonding network (Asn:Gln: Arg) in place of the HPV ion pair (Glu:Arg).
|
7), do not have an effect on formation of the complex.
E1 and the SV40 large T-antigen
E1 and SV-40 T-antigen are thought to be highly similar proteins in both structure and function. Both proteins bind site specifically to and assemble on their respective viral origins and distort the DNA in a manner that leads to origin melting. Following the melting event, both proteins function as doubly hexameric helicases (Bullock 1997
; Simmons 2000
). Unexpectedly, structural studies have revealed that the site-specific DNA-binding domains of BPV E1 and T-antigen possess the same fold, despite sharing <10% amino acid sequence identity (Luo et al. 1996
; Enemark et al. 2000
).
Li et al. (2003
) have recently determined the crystal structure of the hexameric T-antigen helicase domain. To compare the C-terminal helicase modules of E1 and T-antigen, we aligned the primary and secondary structural features of these domains (Fig. 5). Sequence analysis of the fully functional helicase region, which extends beyond the residues included in our structure, reveals that the two proteins exhibit
20% pairwise identity (Fig. 5D). A set of helices N-terminal to the ATPase domain of T-antigen appear to have equivalent counterparts in E1 (Fig. 5A; Li et al. 2003
). As in T-antigen, this N-terminal region of E1 is important for oligomerization of the protein; the E1 fragment crystallized here (residues 428-631) is monomeric in the absence of E2, as judged by gel filtration chromatography (data not shown), whereas a fragment that includes the N-terminal domain (amino acids 358-631) forms a stable hexamer (see following).
|
E1 hexamer organization
Given the close structural similarity between the helicase domains of T-antigen and E1, we used the hexameric structure of T-antigen as a guide to model a hexameric E1 state. Only those C
positions in the core ATPase domain of E1 and T-antigen were included in the superposition (Fig. 6A). Many loop regions, as well as side chains in the Walker-A and -B regions overlapped, despite the fact that we were comparing the structure of E1 in a complex with E2 to that of a hexameric T-antigen structure. In the intact E1 molecule, the domain solved here is part of a multitiered structure, wherein the N-terminal extension of the helicase domain sits on top of the large ring depicted in our model. This upper tier is expected to contribute significantly to the total E1 · E1 interaction surface, which we expect to be similar to the 2500 Å2 reported for T-antigen (Li et al. 2003
).
|
15 Å in diameter, a value that agrees with the reported size of the central channel of the analogous lower tier of T-antigen (Li et al. 2003
A second feature consistent with our E1 hexamer model arises at interfaces between the protomers. One characteristic of many multimeric ATPases is the presence of a conserved "arginine finger", a motif that stimulates ATPase activity in an oligomerized complex by extending from one subunit into the catalytic site of an adjacent protomer and interacting with the
-phosphate of bound ATP. In AAA+ proteins, mutation of the arginine finger, which generally resides in the "Box VII" or "SRC" motifs (Neuwald et al. 1999
; Davey et al. 2002
) results in a protein significantly impaired for ATPase activity (Song et al. 2000
; Putnam et al. 2001
). Accordingly, on modeling the E1 hexamer, we noted that a conserved arginine, Arg 589 (Fig. 2), at the N terminus of
7 became positioned adjacent to the ATP-binding site of an adjacent monomer (Fig. 6A).
To test whether Arg 589 might serve as an arginine finger in E1, we replaced this residue with alanine and assayed the mutant for ATPase activity. The R589A mutation was made in a slightly longer E1 (358-631) construct than that used for crystallization because the shorter fragment (428-631) fails to stably oligomerize. Coupled ATPase assays showed that mutation of Arg 589 greatly impaired nucleotide hydrolysis activity compared with a wild-type E1 fragment of the same length (Fig. 6C). Indeed, the kinetic parameters for ATP hydrolysis for this mutant were equivalent to a double aspartate substitution of the E1 Walker-B motif (D529A/D530A). As controls, two other mutants were also made and tested for function: K557A, which maps to the predicted DNA-binding loop in the central cavity of E1, and R454A, which cripples E2 interaction. As expected, neither had any appreciable effect on E1 ATPase activity. Importantly, mutation of Arg 589 had no effect on the ability of the protein to oligomerize, as judged by gel filtration chromatography (data not shown) and was not defective for E2 interaction (Fig. 4B), indicating the R589A protein was properly folded. These data provide the first evidence that E1, like other AAA+ proteins, uses an arginine finger motif to construct a catalytically competent, bipartite ATP-binding site. Our data also indicate that a hexamer organization for E1 subunits predicated on the T-antigen model is likely to capture the salient features of this quaternary E1 state.
Regulation of E1 hexamerization by E2 and ATP
The E1 hexamer model also reveals the mechanism by which E2 blocks E1 oligomerization. As can be seen in Figure 6D, superposition of the E1 · E2 complex on the E1 hexamer shows that the C-terminal
-sheet domain of E2 shields the E1 oligomerization surface. This observation predicts that E2 can directly regulate hexamer assembly by physically preventing additional E1 subunits from associating with E1 protomers tethered at the origin. Consequently, disruption of the intimate E1:E2 complex on the DNA must occur before monomer E1:E1 interactions commence. Therefore, our results suggest that previous reports of HPV-11 E2 remaining bound to origin DNA during the assembly of E1 (Lin et al. 2002
) is a consequence of E2 being tethered at a distance from the E1 sites following disruption of the E1 · E2 association. In contrast, the close proximity of the E2-binding sites to the location of E1 assembly in BPV likely necessitates physical displacement of E2 from the origin.
If E2 bound to E1 sterically occludes hexamerization, what serves as the switch to disengage E2? The gel filtration profiles of these proteins in the presence and absence of ATP provide a clue (Fig. 7A). Without ATP, the E1 · E2 complex elutes as a single peak (top trace), with an elution volume corresponding to a 1:1 complex of E1 and E2. When the ratio of E1 · E2 in each fraction eluted from the column is measured by SDS-PAGE, the complex appears to have a 1:1 stoichiometry (Fig. 7A, bottom panels). However, in the presence of ATP, a high-molecular-weight peak and a low-molecular-weight peak now appear. The elution volume of the high-molecular-weight peak is consistent with the size of an E1 hexamer and contains predominantly free E1. The later-eluting peak contains free E2 and, as judged by elution volume, is monomeric.
|
S, indicating that ATP binding, rather than hydrolysis, was important for the release. Although ATP shifts the equilibrium toward dissociation, association of the complex is still favored. However, we speculate that in the context of the full-length proteins bound and positioned on origin DNA, there will be a hysteresis that irreversibly drives the formation of a hexameric E1 helicase locked onto melted DNA (see model following).
The E1 · E2 structure provides a molecular explanation for the ability of ATP to shift the equilibrium of the complex toward dissociation. Juxtaposed to the P-loop of the ATP-binding site is a long extension that provides one of the major interaction surfaces in contact with E2 (Fig. 7B, loop-2). Loop-2 is unusual in that it substitutes for the helical "lid" domains typically found in AAA+ proteins (Fig. 7C); in this regard both E1 (and its cousin T-antigen) lack the sensor-II regions of AAA+ proteins and actually represent a distinct subclass of AAA+ ATPases. Fitting nucleotide into the E1 ATP-binding site by superposition with the structure of the RuvB AAA+ protein (Putnam et al. 2001
) shows how this PV-specific reconfiguration of the AAA+ fold influences E1 · E2 association/dissociation (Fig. 7B). There is good steric and chemical complementarity of the sugar and phosphate groups of the modeled nucleotide with the surface of E1 in the complex. The fit of the adenine moiety, however, is another matter. The position of the base of the modeled nucleotide is such that it points at a number of solvent-exposed aromatic and aliphatic residues contained in loop-2. The conformation of loop-2, however, is maintained by its contacts with E2, such that these residues sterically block the space expected to be occupied by the adenine moiety (Fig. 7B, bottom). This observation indicates that a conformational change must occur in loop-2 in order to accommodate binding of the base of ATP.
Together, these findings suggest a mechanism by which ATP might help remodel the E1 · E2 complex and promote assembly of the replication complex during initiation. In one model, loop-2 would undergo an induced fit on ATP binding that would reconfigure its structure. This, in turn, would disrupt numerous interactions that bond loop-2 to the N-terminal domain of E2, thereby decreasing the stability of the complex. Alternatively, it is possible that E2 might be poised to respond to the presence of an exogenous E1 subunit that is appropriately positioned with respect to the E1 protein of the E1 · E2 complex to correctly oligomerize. In this scheme, the
-domain of E2 might act as a trigger to disengage E2 from E1, thereby allowing ATP to bind to E1. Once bound, ATP would reconfigure the loop-2 region, preventing E2 from reassociating with E1.
| Discussion |
|---|
|
|
|---|
-sheet domain, respectively. Furthermore, our binding studies reveal no appreciable difference in the affinity between various extended E1 fragments and E2.
Taken together, the results reported here have a number of implications for constructing a model that accounts for the progression of PV replication initiation. The pathway begins when a complex consisting of E2 and a dimer of E1 binds origin DNA sites through their respective recognition domains (Fig. 8, step 1). Here, the N-terminal activation domains of E2 would be complexed with the C-terminal helicase domains of E1, preventing inadvertent E1 oligomerization. The E1 · E2 interaction would also increase the overall specificity of the complex for the PV origin by combining the DNA-binding affinities of both of the E1 and E2 DNA-binding domains and could serve to suppress the nonspecific DNA-binding activity of the E1 helicase domain (Stenlund 2003a
). We suspect that the basic residues of loop-1 (labeled in Fig. 8 as the "DNA-binding loop") contribute to the nonspecific DNA-binding surface, because they appear to form the positively charged inner surface of the hexamer (Fig. 6C) that is expected to interact with nucleic acid during helicase progression.
|
100° about the central DNA axis when bound to the PV origin (Fig. 8, inset). When four molecules of E1 bind to the origin in vivo, we expect that the E2 activation domains would initially block interactions between their partner E1 subunits and the newly bound monomers (Fig. 8, step 2). Although this 4:2 E1:E2 intermediate has not been detected, it may have a short half-life, because the second pair of E1 monomers could dissociate, reversing the complex back to step 1.
Because the
-sheet domain of E2 shields the oligomerization surface on its associated E1 partner from the newly bound E1 subunits, E2 must be disengaged to progress to the next stage of the initiation reaction. ATP could directly drive this process by binding to E1, thereby reconfiguring the loop-2 region and disrupting the E2 interaction interface. It is also possible that helicase domains of the E2-free E1 monomers, which are positioned approximately a quarter of a turn about the DNA helix away from the E1 · E2 complex, might lie close enough to engage the E2
-sheet "shield" domain and help dislodge E2. Thus, the switch that leads to E2 release might rely on a combination of ATP binding and an appropriately docked E1 dimer, which would serve as the nucleus for assembly of the hexameric helicase.
In this regard, as first proposed by Enemark et al. (2002
), we suggest that the oligomerization of E1 helicase domains may commence not between two E1 proteins that reside within a dimer, but instead between two E1 proteins that are part of different dimers (Fig. 8, step 3). The primary determinant for this organization derives from the fact that E1-binding sites exist in the PV origin as inverted repeats. As a consequence, E1 proteins first bind their target regions in a twofold rotationally symmetric, or head-to-head, manner (Fig. 8, step 1; Enemark et al. 2002
). This configuration presents a problem for the formation of a hexameric helicase, which forms a cyclically symmetric (head-to-tail) assembly of oligomers. Thus, for E1 monomers within a dimer to assemble properly into a hexamer, one protomer would need to rotationally reorient its position with respect to the other by as much as 180°. In contrast, a pair of E1 monomers from different dimers that are offset from each other about the DNA helix, as is seen in structures of the E1 DBD tetramer, are pre-positioned such that the "head" of one partner points in the general direction of the "tail" of the other (Fig. 8).
There are several lines of reasoning that support this oligomerization model. In the Enemark et al. (2002
) co-crystal structure of the tetrameric E1 DNA-binding domain · DNA complex, it was noted that the two DNA-binding domains within a dimer each interacted predominately with one of the two DNA strands, and the strand contacted by one protomer was the complement of the one contacted by its dimer mate. Curiously, this organization was reflected in both dimers, such that two domains in different dimers, but related to one another about the DNA axis, actually contact the same DNA strand (Fig. 8, step 2; Enemark et al. 2002
). This configuration is significant, because it provides a means for E1 helicase domains within a hexamer to jointly coordinate the same DNA strand as they begin to associate. Moreover, each of the two hexamers would assemble around a different DNA, potentially providing the energy needed to distort and melt the DNA duplex.
The interior translocation loop of E1 (loop-1) might be expected to be the element of the helicase that interacts with the phosphate backbone of DNA during this process. In support of this, BPV E1 does form an extended footprint at the viral origin in the absence of the E2 activation domain, presumably through contacts between the helicase domain of E1 and DNA (Stenlund 2003a
). Moreover, motifs in the region of T-antigen equivalent to the E1 loop-1 segment make contact with flanking DNA sequences at the SV-40 origin that are important for viral DNA replication (Reese et al. 2004
). The interaction between the helicase domain of E1 and flanking DNA sequences may be important for restricting flexibility between the helicase domain and the DNA-binding domain and could help the protein manifest an unwinding torque on the DNA during E1 oligomerization.
A salient feature of this model is that assembly of the hexamer of E1 at the origin is coupled both to origin melting and creation of a functional helicase (step 4). Once assembled, the helicase is poised to processively unwind the melted strands. There is evidence that T-antigen and E1 (Stahl and Knippers 1983
; Fouts et al. 1999
; Sanders and Stenlund 2000
; Alexandrov et al. 2002
) actually function as double hexamers during this process. It is interesting to note that in the assembly scheme proposed earlier (Fig. 8), the dimer interfaces formed initially between the site-specific DNA-binding domains might serve as the contact points for the two hexamers, a model that has been similarly proposed for the N-terminal domain of the MCM helicase (Fletcher et al. 2003
). In this regard, the combination of activities observed for E1 and T-antigen, which act both as replication initiators and as DNA melting and helicase proteins, may presage the coordinated action of a set of related AAA+ proteins that initiate DNA replication: ORC, Cdc6, and MCM.
| Materials and methods |
|---|
|
|
|---|
The activation domain of the HPV18 E2 ORF was cloned into pGEX2T (Amersham) for expression as an N-terminal GST fusion. E1 fragments were cloned into either a modified N-terminal maltose-binding protein (MBP) fusion vector, pMAL-c2X (NEB), which replaced the factor Xa cleavage site with a thrombin cleavage site, or into the same vector with an added N-terminal hexa-histidine tag, for expression as a His-MBP fusion. Mutants were made using the QuikChange (Stratagene) method, according to the manufacturer's protocol. Vectors were transformed into the Escherichia coli strain XA90, and proteins were overexpressed in 2XYT media by induction at an O.D.600 of
1.2 with 0.5 mM IPTG for 16 h at 20°C.
For purification of the E1(maltose-binding protein) E2 complex, cells expressing MBP-E1 and GST-E2 were lysed separately by sonication in 50 mM Tris-HCl (pH 7.5), 10% glycerol, 1 M NaCl, 10 mM EDTA, 1 mM PMSF, and 1 mM Tris(2-carboxyethyl)phosphine hydrochloride (TCEP). The E2 extract was incubated with glutathione Sepharose (Amersham) with mixing for 2 h at 4°C. The E2 extract was then removed and the resin-bound E2 was incubated with the E1 extract for
16 h at 4°C with mixing. The resin was washed with lysis buffer, followed by extensive washing with 10% Q buffer (25 mM Tris-HCl at pH 8.2), 10% glycerol, 100 mM NaCl, 1 mM EDTA, 1 mM TCEP). Proteins were then eluted with 10% Q buffer containing 20 mM glutathione. GST and MPB were cleaved by incubation with thrombin and separated from the complex by passage over a Mono-Q column (Amersham). Further purification on a Superdex75 (Amersham) sizing column (10% Q buffer) separated the E1 · E2 complex from any remaining free E2. The purified complex was >95% pure as judged by SDS-PAGE and Coomassie staining. Proteins used for biochemical studies were used directly from gel filtration fractions or were concentrated by ultrafiltration (Centriprep10, Amicon) to
10 mg/mL, for crystallization.
For pull-down experiments and ATPase assays, GST-tagged E2 and a His-MBP tagged E1 were expressed as described earlier. Purification of E2 was the same as for the complex, except no incubation with E1 was performed. Following elution from the glutathione resin, E2 was bound to a Q-Sepharose (Amersham) column and washed with 10% Q buffer to remove free glutathione. GST-E2 was then eluted with 10% Q buffer containing 400 mM NaCl. His-MBP-E1 extract was made as described earlier, except that all buffers lacked EDTA and TCEP and contained 5 mM
-mercaptoethanol. The extract was incubated with Ni-NTA agarose (QIAGEN) for 2 h at 4°C. The resin was washed with lysis buffer followed by 10% Q buffer (plus 5 mM imidazole), and eluted with 10% Q buffer containing 250 mM imidazole. The eluate was bound to a Q-Sepharose column and washed with 10% Q buffer to remove free imidazole and then eluted with 10% Q buffer containing 400 mM NaCl.
Crystallization
The purified and concentrated E1(428-631) · E2(1-215) complex was dialyzed against 10 mM Tris-HCl (pH 8.0), 50 mM NaCl, and 1 mM TCEP for crystallization. Crystals were grown in microbatch, under oil, by mixing the protein solution with crystallization buffer (50 mM MES at pH 5.9-6.1, 7%-10% PEG 3,350, 20-40 mM MgCl2) in a 1:1 ratio at 4°C. Crystals appeared overnight and grew to average dimensions of 75 x 75 x 125 µm in 3 d. Cryoprotectant was introduced slowly by a 20-step, sequential transfer at 4°C into the final cryo solution (1:2 dilution of the crystallization buffer plus 30% MPD), with 10-min incubations between steps. Crystals were then flash frozen in liquid nitrogen. Heavy atom derivatives were obtained by soaking crystals in a solution containing a 1:2 dilution of crystallization buffer (without MgCl2) and 1 mM erbium (III) chloride overnight. Cryoprotectant was added in a similar manner as with the native crystals, except MgCl2 was replaced by 1 mM ErCl3, which was included in all steps and the final cryoprotectant.
Data collection, structure determination, and refinement
The native data set was collected on Beamline 8.2.2 and derivative data sets on Beamline 8.3.1 at the Advanced Light Source of Lawrence Berkeley National Laboratory, using ADSC Quantum-Q315 CCD detectors. Diffraction data were processed using DENZO and SCALEPAK (Otwinowski and Minor 1997
). The crystals are in the orthorhombic space group P212121 with cell parameters a = 82.297 Å, b = 88.745 Å, c = 375.027 Å. There are six E1 · E2 heterodimers in the asymmetric unit. Heavy atom (erbium) sites were identified with SOLVE (Terwilliger and Berendzen 1999
). Density modification of experimental maps was performed with RESOLVE (Terwilliger and Berendzen 1999
) and DM (Cowtan 1994
). Three of the six E2 activation domains were located in the density maps by the program FFFEAR (Cowtan 1998
) using the solved HPV18 E2 structure. The remaining three E2 activation domains were manually positioned and E1 and the first two helices of E2 were manually built using O (Jones et al. 1991
). Refinement was carried out with REFMAC5 (Murshudov et al. 1999
) using ARP (Lamzin and Wilson 1997
) for placement of water. The model was refined to a final Rwork of 21.9% and an Rfree of 26.3%. Amino acids 3-198 of E2 and amino acids 428-629 are accounted for in the best E1 · E2 dimer in the asymmetric unit. Geometric analysis was carried out with Procheck (Laskowski et al. 1993
).
GST pull-down assays
GST-E2(1-215) and His-MBP-E1(358-631) were used to perform GST pull-down experiments. E2 (100 nM) was incubated with various concentrations of E1 (25 nM, 100 nM, 200 nM) in pull-down buffer (25 mM Tris-HCl at pH 8.2, 10% glycerol, 150 mM NaCl, 1 mM EDTA, 1 mM DTT) containing 100 µg/mL BSA for 1 h at 4°C with mixing. This mixture was then transferred to a GST MicroSpin Purification Module (Amersham) and mixed with the resin for 1 h at 4°C. The resin was washed three times with pull-down buffer and bound proteins were eluted with pull-down buffer containing 20 mM glutathione. Samples were resolved by SDS-PAGE and gels were stained with the fluorescent dye SYPRO Orange (Amersham) and scanned (Typhoon 9400, Amersham) for quantitation.
Sizing column with ATP
The E1(358-631) · E2(1-215) complex was incubated with MgCl2 (5 mM) or with ATP (1 mM) and MgCl2 (5 mM) in 10% Q buffer (see earlier) lacking EDTA for 30 min at room temp. Following incubation, samples were run on a Superdex75 (Amersham) sizing column. Fractions were resolved by SDS-PAGE and stained as described earlier for quantitation.
ATPase assays
ATPase activity was measured by coupling the hydrolysis of ATP to the oxidation of NADH through the enzymes pyruvate kinase and lactate dehydrogenase. Activity was monitored spectrophotometrically as a decrease in absorbance at 340 nm. The assay was performed essentially as described (Rocque et al. 2000
). The reaction was initiated with E1 at a final concentration of 1.23 µM.
| Acknowledgments |
|---|
|
|
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| Footnotes |
|---|
Article published online ahead of print. Article and publication date are at http://www.genesdev.org/cgi/doi/10.1101/gad.1220104.
1 E-MAIL jmberger{at}uclink.berkeley.edu; FAX (510) 643-1729. ![]()
2 E-MAIL mbotchan{at}uclink.berkeley.edu; FAX (510) 643-1729. ![]()
| References |
|---|
|
|
|---|
Antson, A.A., Burns, J.E., Moroz, O.V., Scott, D.J., Sanders, C.M., Bronstein, I.B., Dodson, G.G., Wilson, K.S., and Maitland, N.J. 2000. Structure of the intact transactivation domain of the human papillomavirus E2 protein. Nature 403: 805-809.[CrossRef][Medline]
Bosch, F.X., Manos, M.M., Munoz, N., Sherman, M., Jansen, A.M., Peto, J., Schiffman, M.H., Moreno, V., Kurman, R., and Shah, K.V. 1995. Prevalence of human papillomavirus in cervical cancer: A worldwide perspective. International biological study on cervical cancer (IBSCC) Study Group. J. Natl. Cancer Inst. 87: 796-802.
Bullock, P.A. 1997. The initiation of simian virus 40 DNA replication in vitro. Crit. Rev. Biochem. Mol. Biol. 32: 503-568.[Medline]
Chen, G. and Stenlund, A. 1998. Characterization of the DNA-binding domain of the bovine papillomavirus replication initiator E1. J. Virol. 72: 2567-2576.
Chiang, C.M., Ustav, M., Stenlund, A., Ho, T.F., Broker, T.R., and Chow, L.T. 1992. Viral E1 and E2 proteins support replication of homologous and heterologous papillomaviral origins. Proc. Natl. Acad. Sci. 89: 5799-5803.
Cowtan, K. 1994. A CCP4 density modification package. Joint CCP4 ESF-EACBM. Newslett. Prot. Crystallogr. 31: 34-38.
____. 1998. Modified phased translation functions and their application to molecular-fragment location. Acta Crystallogr. D Biol. Crystallogr. 54 (Pt. 5): 750-756.[CrossRef][Medline]
Davey, M.J., Jeruzalmi, D., Kuriyan, J., and O'Donnell, M. 2002. Motors and switches: AAA+ machines within the replisome. Nat. Rev. Mol. Cell Biol. 3: 826-835.[CrossRef][Medline]
Enemark, E.J., Chen, G., Vaughn, D.E., Stenlund, A., and Joshua-Tor, L. 2000. Crystal structure of the DNA binding domain of the replication initiation protein E1 from papillomavirus. Mol. Cell 6: 149-158.[CrossRef][Medline]
Enemark, E.J., Stenlund, A., and Joshua-Tor, L. 2002. Crystal structures of two intermediates in the assembly of the papillomavirus replication initiation complex. EMBO J. 21: 1487-1496.[CrossRef][Medline]
Ferguson, M.K. and Botchan, M.R. 1996. Genetic analysis of the activation domain of bovine papillomavirus protein E2: Its role in transcription and replication. J. Virol. 70: 4193-4199.[Abstract]
Fletcher, R.J., Bishop, B.E., Leon, R.P., Sclafani, R.A., Ogata, C.M., and Chen, X.S. 2003. The structure and function of MCM from archaeal M. thermoautotrophicum. Nat. Struct. Biol. 10: 160-167.
Fouts, E.T., Yu, X., Egelman, E.H., and Botchan, M.R. 1999. Biochemical and electron microscopic image analysis of the hexameric E1 helicase. J. Biol. Chem. 274: 4447-4458.
Gillitzer, E., Chen, G., and Stenlund, A. 2000. Separate domains in E1 and E2 proteins serve architectural and productive roles for cooperative DNA binding. EMBO J. 19: 3069-3079.[CrossRef][Medline]
Harris, S.F. and Botchan, M.R. 1999. Crystal structure of the human papillomavirus type 18 E2 activation domain. Science 284: 1673-1677.
Howley, P.M. 1996. Papillomavirinae: The viruses and their replication. In Fields virology (eds. B.N. Fields et al.), pp. 2045-2076. Lippincott-Raven, Philadelphia, PA.
James, J.A., Escalante, C.R., Yoon-Robarts, M., Edwards, T.A., Linden, R.M., and Aggarwal, A.K. 2003. Crystal structure of the SF3 helicase from adeno-associated virus type 2. Structure (Camb.) 11: 1025-1035.
Jones, T.A., Zou, J.Y., Cowan, S.W., and Kjeldgaard, M. 1991. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A 47 (Pt. 2): 110-119.
Koonin, E.V. 1993. A common set of conserved motifs in a vast variety of putative nucleic acid-dependent ATPases including MCM proteins involved in the initiation of eukaryotic DNA replication. Nucleic Acids Res. 21: 2541-2547.
Lamzin, V.S. and Wilson, K.S. 1997. Automated refinement for protein crystallography. In Methods enzymology (eds. C.W. Carter Jr. et al.), pp. 269-305. Academic Press, San Diego, CA.
Laskowski, A.R., MacArthur, W.M., Moss, S.D., and Thornton, M.J. 1993. PROCHECK: A program to check the stereo-chemical quality of protein structures. J. Appl. Cryst. 26: 283-291.
Li, R., Knight, J.D., Jackson, S.P., Tjian, R., and Botchan, M.R. 1991. Direct interaction between Sp1 and the BPV enhancer E2 protein mediates synergistic activation of transcription. Cell 65: 493-505.[CrossRef][Medline]
Li, R., Yang, L., Fouts, E., and Botchan, M.R. 1993. Site-specific DNA-binding proteins important for replication and transcription have multiple activities. Cold Spring Harb. Symp. Quant. Biol. 58: 403-413.[Medline]
Li, D., Zhao, R., Lilyestrom, W., Gai, D., Zhang, R., DeCaprio, J.A., Fanning, E., Jochimiak, A., Szakonyi, G., and Chen, X.S. 2003. Structure of the replicative helicase of the oncoprotein SV40 large tumour antigen. Nature 423: 512-518.[CrossRef][Medline]
Lim, D.A., Gossen, M., Lehman, C.W., and Botchan, M.R. 1998. Competition for DNA binding sites between the short and long forms of E2 dimers underlies repression in bovine papillomavirus type 1 DNA replication control. J. Virol. 72: 1931-1940.