Genes and Development

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


GENES & DEVELOPMENT 19:1401-1415, 2005
©2005 by Cold Spring Harbor Laboratory Press; ISSN 0890-9369/ $5.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Meinhart, A.
Right arrow Articles by Cramer, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Meinhart, A.
Right arrow Articles by Cramer, P.
Related Content
Right arrow Chromatin and Gene Expression
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

REVIEW

A structural perspective of CTD function

Anton Meinhart1, Tomislav Kamenski, Sabine Hoeppner, Sonja Baumli and Patrick Cramer2

Gene Center, University of Munich (LMU), Department of Chemistry and Biochemistry, 81377 Munich, Germany


    Abstract
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
The C-terminal domain (CTD) of RNA polymerase II (Pol II) integrates nuclear events by binding proteins involved in mRNA biogenesis. CTD-binding proteins recognize a specific CTD phosphorylation pattern, which changes during the transcription cycle, due to the action of CTD-modifying enzymes. Structural and functional studies of CTD-binding and -modifying proteins now reveal some of the mechanisms underlying CTD function. Proteins recognize CTD phosphorylation patterns either directly, by contacting phosphorylated residues, or indirectly, without contact to the phosphate. The catalytic mechanisms of CTD kinases and phosphatases are known, but the basis for CTD specificity of these enzymes remains to be understood.

[Keywords: RNA polymerase II; gene transcription; nuclear coupling; C-terminal repeat domain; kinase; phosphatase]


Over the last decade, a wealth of genetic and biochemical data revealed a physical and functional coupling between mRNA transcription by RNA polymerase II (Pol II), mRNA processing, and other nuclear events. Those coupling events are generally mediated by the C-terminal repeat domain (CTD) of the largest subunit of Pol II. The CTD is required for efficient capping, splicing, cleavage, and polyadenylation of mRNAs in vivo, and binds to RNA processing factors in vitro (McCracken et al. 1997bGo; Hirose and Manley 1998Go, 2000Go; Proudfoot et al. 2002Go). The binding of specific processing factors depends on the phosphorylation pattern of the CTD, which changes during the transcription cycle, and coordinates events of nuclear mRNA biogenesis (Dahmus 1995Go; Maniatis and Reed 2002Go; Palancade and Bensaude 2003Go; Proudfoot 2004Go; Sims et al. 2004Go; Zorio and Bentley 2004Go). Also coupled to mRNA transcription are chromatin remodeling and modification, DNA repair, mRNA packaging, RNA editing, and nuclear mRNA export (Reed 2003Go; Sims et al. 2004Go; Ares and Proudfoot 2005Go). A role of the CTD in these processes is just emerging. Here we review the structure, modification, and recognition of the CTD, from which insights into the molecular mechanisms of CTD-mediated coupling were obtained.


    Free CTD structure
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
The CTD forms a tail-like extension from the catalytic core of Pol II, and is flexibly linked to a region near the RNA exit pore of the enzyme (Fig. 1). In the crystal structures of yeast Pol II, the CTD is not visible due to mobility (Cramer et al. 2001Go; Armache et al. 2005Go). Also flexible is an 80-residue linker between the polymerase core and the CTD. The beginning of the linker forms an {alpha}-helix that binds the polymerase subunit Rpb7, which together with Rpb4 constitutes a subcomplex that protrudes from the enzyme (Armache et al. 2005Go). The CTD consists of heptapeptide repeats of the consensus sequence Y1-S2-P3-T4-S5-P6-S7. The number of repeats depends on the species, and is 26 in yeast and 52 in human. At least eight repeats are required for yeast viability (Nonet et al. 1987Go; West and Corden 1995Go).

Current evidence suggests that the free CTD is largely flexible, although it shows some residual structure and a tendency to form {beta}-turns. The CTD contains two SPXX motifs (S2-P3-T4-S5, S5-P6-S7-Y1), which were proposed to form {beta}-turn structures stabilized by two hydrogen bonds (Suzuki 1989Go). NMR studies of a single CTD consensus repeat peptide revealed such {beta}-turns for the S2-P3-T4-S5 motif, but also unfolded forms (Harding 1992Go). NMR and circular dichroism of a CTD peptide with eight repeats showed a small population of {beta}-turns at both SPXX motifs in water, which increased in trifluoroethanol, a hydrogen-bond-promoting solvent (Cagas and Corden 1995Go). Another study also detected a low population of turn structures, which strongly increased with single amino acid mutations (Dobbins et al. 1996Go). A further study of an eight-repeat CTD peptide detected a content of 15% polyproline helix, and below 10% {beta}-turn structure in water, which increases to 75% in 90% trifluoroethanol (Bienkiewicz et al. 2000Go). Cyclic model peptides contain a larger content of {beta}-turns than linear CTD peptides, with the turn at S2-P3-T4-S5 being more stable than at S5-P6-S7-Y1 (Kumaki et al. 2001Go). Recent studies of a two-repeat CTD peptide with a central phosphorylated S2 (pS2) residue, however, revealed a dynamic disordered ensemble (Noble et al. 2005Go). The largely disordered nature of the free CTD apparently allows for many different interactions with target proteins via an induced fit mechanism.

In an extended {beta}-strand conformation, the length of the yeast CTD and linker would be ~650 Å and 250 Å, respectively (Fig. 1; Cramer et al. 2001Go). Thus, the CTD could in principle reach anywhere on the surface of Pol II, which is ~150 Å in diameter. The CTD is, however, most likely compact, at least in its unphosphorylated state. Electron micrographs of Pol II revealed a weak density that was attributed to the CTD and measured only ~100 Å (Meredith et al. 1996Go). Pol II crystals contain a limited space adjacent to the linker that could harbor a compact CTD (Cramer et al. 2001Go). Electrophoretic analysis, gel filtration, and sucrose gradients revealed that phosphorylation of the CTD results in a far more extended and more protease-sensitive structure (Laybourn and Dahmus 1989Go; Zhang and Corden 1991Go). One possible form of a compact CTD is a random coil (Cramer et al. 2001Go). Alternatively, unusual "{beta}-spiral" models were proposed that account for the equivalence of CTD repeats in NMR studies (Matsushima et al. 1990Go; Suzuki 1990Go; Cagas and Corden 1995Go). One type of {beta}-spiral consists of a series of staggered overlapping {beta}-turns, two per CTD repeat, and an early model of this type had a length of 280 Å for the yeast CTD ("loose spiral") (Fig. 1; Cagas and Corden 1995Go). A more compact {beta}-spiral, only 100 Å long, and assuming only one {beta}-turn per repeat, was suggested based on a crystal structure of a CTD peptide bound to a CTD-binding domain ("compact spiral") (Fig. 1; Meinhart and Cramer 2004Go). Although it is unlikely that the CTD adopts a uniform repetitive conformation, portions of the CTD may adopt a spiral form, leading to an overall compaction. Upon extensive phosphorylation, compact forms of the CTD would become more extended due to charge repulsions.



View larger version (11K):
[in this window]
[in a new window]
 
Figure 1. Elongating RNA polymerase II and size of the CTD. The complete 12-subunit yeast Pol II elongation complex structure (Kettenberger et al. 2004Go; Armache et al. 2005Go) is shown as a ribbon diagram with the subunits in different colors. DNA is in blue and RNA is in red. DNA and RNA outside the polymerase have been extrapolated or modeled. The relative sizes of the yeast Pol II largest subunit linker region and the CTD are indicated in green and black, respectively. The CTD is depicted in four putative states, fully extended, as a loose or compact {beta}-spiral, and as a random coil. For size comparison, the Pcf11 CID is depicted in gold (Meinhart and Cramer 2004Go).

 

    CTD modification
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
There are five potential phosphorylation sites in a CTD consensus repeat (Y1, S2, T4, S5, and S7) (Fig. 2A). CTD phosphorylation, however, occurs mainly at residues S2 and S5 (Corden et al. 1985Go; Zhang and Corden 1991Go), and these two serine positions are not equivalent in function (West and Corden 1995Go; Yuryev and Corden 1996Go). Phosphorylation at S2 and S5 gives rise to four different phosphorylation states of a CTD repeat (unphosphorylated, phosphorylated at S2, phosphorylated at S5, phosphorylated at S2 and S5). The phosphorylated form of Pol II carries, on average, one phosphate per repeat (Payne and Dahmus 1993Go). The phosphorylation state of the CTD is generally believed to be the result of the balanced action of site-specific CTD kinases and phosphatases. The phosphorylation pattern changes during the transcription cycle, resulting in recruitment of specific RNA-processing factors. S5 phosphorylation occurs in promoter-proximal regions, and leads to recruitment of the capping enzyme (Cho et al. 1997Go; McCracken et al. 1997aGo; Ho et al. 1998Go; Komarnitsky et al. 2000Go). S2 phosphorylation predominates in regions that are more distal from the promoter, and triggers binding of the 3'-RNA processing machinery (Komarnitsky et al. 2000Go; E.J. Cho et al. 2001Go). Thus there may be a "CTD code" that specifies the position of Pol II within the transcription cycle (Buratowski 2003Go).

A CTD code could consist of 16 different states of the CTD repeat, which result from the combination of four different phosphorylation states with four possible proline configurations (residues P3 and P6 can each be in cis or trans conformation) (Buratowski 2003Go). The situation, however, becomes much more complicated when one considers how a CTD code may be read, in other words, how the different states of the CTD are recognized by proteins. First, proteins can bind more or less than one CTD repeat, and thus a single repeat is generally not the functional unit of the CTD. Indeed, genetics indicates that heptapeptide pairs are the functional units of the CTD (Stiller and Cook 2004Go). Second, it is unclear how the action of prolyl isomerases on the CTD can change specificity for CTD-binding proteins, since these enzymes only accelerate cis/trans isomerization of proline residues, in contrast to kinases, which set a phosphate mark in the CTD, or phosphatases, which remove the phosphate mark. However, the prolyl isomerase Pin1 can influence the phosphorylation pattern of the CTD in vitro, and the hypophosphorylated Pol II accumulates in pin1-/- cells (Xu et al. 2003Go). Third, the CTD code may be extended by phosphorylation at residue Y1 (Baskaran et al. 1993Go, 1997Go), and by possible CTD glycosylation (Kelly et al. 1993Go). Fourth, the 52nd repeat in the human CTD contains a phosphorylated casein kinase II site (Chapman et al. 2004Go). Finally, a short motif at the C terminus of the CTD, outside of the repeats, is also important for the stability and the function of the CTD (Fong et al. 2003Go; Chapman et al. 2004Go).


    Architecture of CTD-binding domains
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
The guanylyltransferase domain of the capping enzyme subunit Cgt1 binds the phosphorylated CTD (Cho et al. 1997Go; McCracken et al. 1997aGo). The Cgt1 domain structure reveals a mixed {alpha}/{beta}-fold (Fig. 2; Fabrega et al. 2003Go). Other RNA-processing factors bind to the phosphorylated CTD via compact domains devoted only to CTD binding, including WW domains, FF domains, and CTD-interacting domains (CIDs) (Fig. 2). These domain families differ in their structure and CTD-binding properties (Fig. 2). WW domains are 40-residue domains with two signature tryptophan residues spaced by 20-22 amino acids (Bork and Sudol 1994Go). Based on their binding specificity, WW domains can be divided into five subgroups. Subgroup IV WW domains are specific for phosphoserine-proline (pSP) motifs, which occur twice in the consensus CTD repeat. CTD-binding WW domains are found in the prolyl isomerases Pin1/Ess1 (Morris et al. 1999Go), and in the splicing factor Prp40 (Morris and Greenleaf 2000Go). The N-terminal WW domain of Pin1/Ess1 forms a three-stranded antiparallel twisted {beta}-sheet (Ranganathan et al. 1997Go; Verdecia et al. 2000Go; Li et al. 2005Go). FF domains comprise two conserved phenylalanine residues. Multiple copies of FF domains that bind the phosphorylated CTD are found in the elongation factor CA150 (Carty et al. 2000Go) and the splicing factor Prp40 (Morris and Greenleaf 2000Go). The FF domains in Prp40 are homologous to an FF domain in FBP 11, which interacts with phosphorylated CTD peptides in vitro (Allen et al. 2002Go). The structure of the FF domain of FBP11 consists of a three-helix bundle with a 310-helix inserted into the loop between the second and third helix (Allen et al. 2002Go). CID domains are found in the poly(A)-dependent 3'-RNA processing factor Pcf11 (Barilla et al. 2001Go), in the poly(A)-independent 3'-processing factor Nrd1 (Steinmetz et al. 2001Go), in the serine/arginine-rich-like factors SCAF4 and SCAF8 (Yuryev et al. 1996Go; Patturajan et al. 1998bGo), and in Rtt103, a factor involved in transcription termination (Kim et al. 2004bGo). The CID domain of Pcf11 consists of eight {alpha}-helices in a right-handed superhelical arrangement (Fig. 2; Meinhart and Cramer 2004Go).



View larger version (52K):
[in this window]
[in a new window]
 
Figure 2. CTD structure and recognition. (A) CTD primary structure and protein contacts. On the top, three CTD consensus repeats are shown with the phosphorylation sites S2 and S5 highlighted in light red. The sequences of the CTD phospho-peptides in the known CTD structures in complex with Cgt1 (Fabrega et al. 2003Go), Pin1 WW (Verdecia et al. 2000Go), and Pcf11 CID (Meinhart and Cramer 2004Go) are depicted. Residues in an extended and turn conformation are in blue and magenta, respectively. Phosphate groups are shown as a circled red P. Contacts within 4 Å distance of CTD residues to side-chain and main-chain atoms of the protein are indicated with stars and crosses, respectively (Collaborative Computational Project, Number 4, 1994Go). Circling indicates hydrogen bonds (within 3.5 Å distance). (B) Ramachandran plot for CTD peptides in the three complex structures (Laskowski et al. 1993Go). Regions for {beta}-strand (extended) and for {alpha}-helical (turn) conformations are highlighted in blue and magenta, respectively. CTD residues in the structures with Cgt1 (Fabrega et al. 2003Go), Pin1 WW (Verdecia et al. 2000Go), and Pcf11 CID (Meinhart and Cramer 2004Go) are indicated by squares, circles, and triangles, respectively, and are labeled as in the original papers. (C) Structures of CTD complexes with Cgt1 (Fabrega et al. 2003Go), Pin1 WW (Verdecia et al. 2000Go), and Pcf11 CID (Meinhart and Cramer 2004Go). The CTD peptides are shown as atomic models and are colored according to A and B. Phosphate groups are in red. The proteins are shown as ribbon models in gold.

 
Several other CTD-binding folds exist. BRCT domains can bind the phosphorylated CTD (see CTD phosphatases). The 3'-RNA processing factor Rna14 is predicted to contain HEAT repeats, and its fold might resemble that of CID domains. In contrast, the CTD-binding cleavage and polyadenylation factor Yhh1 is predicted to contain {beta}-propeller repeats (Dichtl et al. 2002bGo). Another small CID was discovered in the histone methyltransferase Set2 (Phatnani et al. 2004Go; Kizer et al. 2005Go), and is predicted to be {alpha}-helical.


    CTD recognition
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
Our current understanding of the stereochemical basis of CTD recognition by CTD-binding proteins is based on three-dimensional structures of differently phosphorylated CTD peptides bound to phospho-CTD-binding domains (Fig. 2): a S5-phosphorylated CTD peptide bound to the guanylyltransferase domain of Cgt1 (Fabrega et al. 2003Go), a doubly S2/S5-phoshorylated CTD peptide bound to the WW domain of Pin1 (Verdecia et al. 2000Go), and a S2-phosphorylated CTD peptide bound to the CID domain of Pcf11 (Meinhart and Cramer 2004Go). In all structures, the CTD peptide binds to a conserved groove of the domain, suggesting that the same interactions occur with other family members.

The structure of the S5-phosphorylated CTD peptide bound to Cgt1 revealed a largely extended CTD conformation (Fabrega et al. 2003Go). Cgt1 can bind almost three CTD repeats. The central CTD repeat is partially looped out, forms a turn-like structure, and makes few interactions with the protein. The two flanking repeats are in an extended {beta}-strand-like conformation (Fig. 2). Hydrophobic interactions involve the proline residues of the CTD, and the tyrosine residue Y1, which also forms a hydrogen bond with a Cgt1 residue via its phenylic hydroxyl group. This interaction is incompatible with Y1 phosphorylation. Cgt1 binds the phosphate groups of phosphoserine-5 (pS5) residues in the two extended flanking repeats, but not the one in the central repeat. An important implication of this structure is that these domains may bind two remote stretches of the CTD ("bivalent" or "bipartite" recognition), thereby looping out the intervening sequence. Such looping could facilitate formation of turned structures within the CTD, as observed for free cyclic CTD peptides (Kumaki et al. 2001Go). Looped structures induced by CTD binding of one factor could be recognized by another CTD-binding factor, possibly contributing to sequential factor binding.

The structure of a doubly S2/S5-phosphorylated CTD peptide bound to the Pin1 WW domain also revealed an extended CTD conformation (Verdecia et al. 2000Go). The P3-T4-pS5-P6 motif of the CTD peptide interacts with the protein. The two prolines are involved in van der Waals interactions with hydrophobic groups of Pin1, and two hydrogen bonds are formed with the peptide back-bone. The WW domain of Pin1 binds the pS5 phosphate group with several hydrogen bonds, and one hydrogen bond is formed with the pS2 phosphate (Verdecia et al. 2000Go). In binding assays, the protein is not specific for S2 or S5 phosphorylation (Myers et al. 2001Go).

In the structure of a S2-phosphorylated CTD peptide bound to the CID domain of Pcf11, the CTD motif S2-P3-T4-S5 forms a {beta}-turn, whereas the flanking residues are in an extended conformation (Fig. 2; Meinhart and Cramer 2004Go). Hydrogen bonds are formed between the CTD and the CID domain, and CTD residues Y1 and P3 bind to hydrophobic patches. The phenylic hydroxyl of Y1 forms a hydrogen bond with a conserved aspartate that is required for normal yeast growth (Sadowski et al. 2003Go). The side chain of CTD residue S5 is exposed, consistent with the observation that S5 phosphorylation does not influence binding of the CID domain in SCAF8 (Patturajan et al. 1998bGo). The pS2 phosphate group does also not contact the CID domain, consistent with the ability of Pcf11 to bind the unphosphorylated CTD. However, S2 phosphorylation strongly enhances the affinity of the CTD for the CID domain (Licatalosi et al. 2002Go), suggesting that the pS2 phosphate group is recognized indirectly (Meinhart and Cramer 2004Go). The pS2 phosphate forms an intramolecular hydrogen bond with the T4 side chain, which was proposed to stabilize the {beta}-turn conformation (Meinhart and Cramer 2004Go).

A recent NMR study of the CTD-Pcf11 interaction (Noble et al. 2005Go) generally confirmed the crystallographic results (Meinhart and Cramer 2004Go), and found additionally that the pS2-T4 hydrogen bond in the CTD-CID complex is not present in the free CTD, suggesting that the CTD turn conformation observed in the complex results from induced fit. Although the apparent indirect recognition of the CTD pS2 phosphate is not fully understood, an advantage of such indirect recognition was suggested (Meinhart and Cramer 2004Go). Exposure of a CTD phosphate group could be important for its accessibility to a phosphatase, which could remove the phosphate, lower the binding affinity, and trigger CTD dissociation from CTD-binding domains (Meinhart and Cramer 2004Go). Although sequence conservation suggests that all CIDs share the same fold, there are apparently differences in the details of CTD recognition by CID domains. The CID domain in the recently characterized factor Rtt103, which is involved in transcription termination (Kim et al. 2004aGo), contains a CTD-binding pocket that differs in several amino acid positions from that of Pcf11.

Taken together, three different CTD conformations are observed in the three known CTD peptide-protein complex structures, reflecting the structurally versatile nature of the CTD and indicating that an induced fit to the target surface plays an important role in CTD recognition. Comparison of the structures and CTD conformations suggests that there is no simple structural basis for a possible CTD code. However, a few underlying principles of CTD recognition can be extracted. CTD peptides adopt extended {beta}-strand-like, or {beta}-turn conformations (Fig. 2B). In all three structures, the P3 side chain docks into a hydrophobic pocket of the target protein. In two structures (CID, Cgt1), the Y1 side chain is bound through hydrophobic interactions and via a hydrogen bond to its phenylic hydroxyl, incompatible with Y1 phosphorylation. Phosphorylated serines can be recognized directly, by interactions with the phosphate group, or indirectly, by a mechanism that is not clear yet. In all complex structures, prolines P3 and P6 are in trans conformation, and their isomerization to cis would likely impair binding. Since a CTD peptide exists as a mixture of cis and trans populations in solution (Noble et al. 2005Go), the target domains apparently select the trans isomer.


    CTD-Mediator interaction
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
Transcription initiation involves the interaction of the unphosphorylated CTD with the Mediator complex, the central multiprotein coactivator that transmits signals from activators and repressors to Pol II (Malik and Roeder 2000Go; Myers and Kornberg 2000Go; Bjorklund and Gustafsson 2004Go). In yeast, the CTD is required for the formation of a stable yeast Pol II-Mediator complex (Myers et al. 1998Go; Asturias et al. 1999Go). Antibodies against the unphosphorylated CTD displace Mediator from Pol II (Svejstrup et al. 1997Go). The CTD-Mediator interaction is required for Mediator function, since yeast Mediator cannot stimulate transcription by a CTD-less Pol II (Myers et al. 1998Go). The phosphorylated CTD does not associate with Mediator (Svejstrup et al. 1997Go), and CTD phosphorylation during transcription initiation apparently breaks the Pol II-Mediator interaction, resulting in an elongating polymerase and a scaffold complex that remains at the promoter (Liu et al. 2004Go). Consistent with the conservation of Mediator, mouse and Drosophila Mediator also bind the CTD (Jiang et al. 1998Go; Park et al. 2001Go). The human Mediator-like complex CRSP also interacts with the CTD and adopts a specific CTD-bound conformation (Naar et al. 2002Go). Addition of recombinant CTD inhibits CRSP-dependent transcription activation.

The yeast Mediator comprises up to 25 subunits; 11 are essential and 22 are at least partially conserved in sequence among eukaryotes (Boube et al. 2002Go; Bourbon et al. 2004Go). Nine of the Mediator subunits were identified in a genetic screen for suppressors of CTD truncation mutants (Thompson et al. 1993Go; Hengartner et al. 1995Go). According to biochemical and genetic studies, Mediator consists of three distinct submodules (Kang et al. 2001Go), which may correspond to density lobes observed by electron microscopy, termed head, middle, and tail, respectively (Dotson et al. 2000Go). The CTD may bind between the head and middle modules, since recombinant head and middle modules independently bind to the CTD (Kang et al. 2001Go). The middle module is the most conserved module and includes the MED7/MED21 heterodimer. The recent structure of the MED7/MED21 heterodimer revealed a novel, very extended helical fold, and a flexible hinge (Baumli et al. 2005Go) that may partially account for changes in the overall Mediator structure upon binding to Pol II (Asturias et al. 1999Go; Davis et al. 2002Go; Naar et al. 2002Go) or to activators (Taatjes et al. 2002Go).

Larger isoforms of Mediator include a module that contains a CTD kinase, and can act independently of the CTD. For example, the Mediator-like complex SMCC from human cells does not require the CTD for activation (Gu et al. 1999Go). The repressive function of the Mediator-like human complex NAT is also independent of the CTD (Sun et al. 1998Go). The large inactive human Mediator ARC-L does not interact with the CTD (Naar et al. 2002Go). Consistent with these observations, electron microscopic images of the yeast Pol II-Mediator complex suggest that the CTD is not the sole point of contact between Mediator and the polymerase, but that there are multiple interaction sites (Asturias et al. 1999Go). In addition, Mediator binds to several general transcription factors involved in initiation (Kang et al. 2001Go; Park et al. 2001Go).


    CTD kinases
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
The CTD is phosphorylated by the regulated action of cyclin-dependent kinases (CDKs). CDKs associate with specific cyclins and play central roles in transcription and cell cycle regulation (Dynlacht 1997Go; Bregman et al. 2000Go; Murray 2004Go). The CTD is targeted by CDK7, CDK8, and CDK9, which generally associate with cyclins H, C, and T, respectively. The abundance of these cyclins does not fluctuate during the cell cycle (Tassan et al. 1994Go; Rickert et al. 1996Go). In addition to CDK7, CDK8, and CDK9, CDK11 was recently implicated in the coordination of transcription with RNA processing (Hu et al. 2003Go). Whereas all CDKs show a high degree of conservation (Russo et al. 1996Go; Tarricone et al. 2001Go; Lolli et al. 2004Go), the cyclins are less conserved, and cyclins involved in transcription are only weakly related to classical cyclins involved in cell cycle regulation (Leopold and O'Farrell 1991Go; Lew et al. 1991Go).

The CDK7/cyclin H pair associates with the RING finger protein MAT1 to form the CDK-activating kinase (CAK), which phosphorylates and activates other CDKs involved in cell cycle regulation (Harper and Elledge 1998Go; Kaldis 1999Go). CAK also forms a subcomplex of the 10-subunit general transcription factor TFIIH, which phosphorylates the CTD at S5 during transcription initiation (Coin and Egly 1998Go). Electron microscopy showed that the CAK complex protrudes from the ring-like structure of TFIIH (Schultz et al. 2000Go). The yeast CDK7 homolog Kin28 is essential for viability, required for normal transcript levels in vivo, and is the primary kinase responsible for CTD phosphorylation during transcription initiation (Valay et al. 1995Go; Holstege et al. 1998Go; Komarnitsky et al. 2000Go; Schroeder et al. 2000Go; Liu et al. 2004Go). TFIIH kinase activity is enhanced by Mediator during initiation, driving the transition to elongation (Guidi et al. 2004Go), and facilitating recruitment of RNA processing factors (Rodriguez et al. 2000Go).

The CDK8/cyclin C pair (Srb10/Srb11 in yeast) associates with MED12 (Srb8) and MED13 (Srb9), to form a fourth module of the Mediator that is present in a sub-population of Mediator complexes. This Mediator module phosphorylates the CTD, is conserved among eukaryotes, and is a target of signal transduction pathways (Liu et al. 2001Go; Borggrefe et al. 2002Go; Boube et al. 2002Go; Samuelsen et al. 2003Go). The CDK8/cyclin C pair is thought to be mainly implicated in transcriptional repression (Hengartner et al. 1998Go). One model for repression is that CDK8 phosphorylates the CTD prematurely, thereby preventing formation of a transcription initiation complex (Hengartner et al. 1998Go). Human CDK8/cyclin C can also repress CDK7 activity by phosphorylating cyclin H (Akoulitchev et al. 2000Go). CDK8 further phosphorylates some gene-specific transcription factors, thereby decreasing their stability (Chi et al. 2001Go; Nelson et al. 2003Go). On the other hand, CDK8 can also have a positive effect on transcription. CDK8-dependent phosphorylation of the transcription factor Sip4 can stimulate transcription (Vincent et al. 2001Go). A positive effect also results from ATP-dependent dissociation of preinitiation complexes, triggered by CDK8 (Liu et al. 2004Go). CDK8-dependent phosphorylation of the Mediator subunit MED2 also has a positive effect on transcription (Hallberg et al. 2004Go).

The CDK9/cyclin T pair forms the core of the positive transcription elongation factor P-TEFb (Price 2000Go). The originally identified P-TEFb consists of CDK9 and one of the cyclin T isoforms T1, T2, or K (Peng et al. 1998Go). A larger P-TEFb complex with reduced activity contains additionally the small nuclear RNA 7SK and the HEXIM protein (Nguyen et al. 2001Go; Yang et al. 2001Go; Michels et al. 2003Go; Yik et al. 2003Go). P-TEFb was isolated by its ability to overcome arrest of Pol II complexes during early elongation, a function that requires the CTD (Marshall and Price 1995Go; Marshall et al. 1996Go). P-TEFb phosphorylates the elongation factor DSIF on its Spt5 subunit and counteracts the negative effect of DSIF and its cofactor NELF during early elongation (Wada et al. 1998Go; Yamaguchi et al. 1998Go). There are two putative homologs of CDK9 in Saccharomyces cerevisiae, Ctk1 and Bur1 (Prelich and Winston 1993Go; Murray et al. 2001Go; Prelich 2002Go; Guo and Stiller 2004Go). Ctk1 associates with its cyclin partner Ctk2 and a third subunit, Ctk3, to form the CTDK1 complex. Bur1 associates with the cyclin Bur2. Chromatin immunoprecipitation and genetic experiments suggest that Ctk1 and Bur1 play nonoverlapping roles in transcription elongation (Yao et al. 2000Go; E.J. Cho et al. 2001Go; Yao and Prelich 2002Go; Keogh et al. 2003Go). It is possible that in yeast Ctk1 and Bur1 phosphorylate the CTD and Spt5, respectively, which are both substrates of P-TEFb (Keogh et al. 2003Go). During stress response the CTD can also be phosphorylated at S5 by ERK kinases (Bonnet et al. 1999Go).


    Kinase structure and specificity
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
The CDKs involved in CTD phosphorylation differ in their specificity toward the target serine residue. CDK7 and CDK8 phosphorylate S5, whereas CDK9 phosphorylates S2, although some conflicting reports exist (Table 1). CDK7 activity predominates during initiation, whereas CDK9 activity is important during elongation (Komarnitsky et al. 2000Go; E.J. Cho et al. 2001Go; Kim et al. 2002Go). CDK7 and CDK8, but not Bur1 or Ctk1, are stably associated with transcription initiation complexes (Liu et al. 2004Go). CDKs also differ in their processivity. CDK7 generates hyperphosphorylated CTD peptides, whereas CDK8 and CDK9 generate CTD peptides with fewer phosphorylated residues (Pinhero et al. 2004Go).


View this table:
[in this window]
[in a new window]
 
Table 1. Specificity of CTD kinases and phosphatases

 
Crystal structures of free CDK7 (Lolli et al. 2004Go), cyclin H (Andersen et al. 1997Go), and cyclin C (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.) are known (Fig. 3), but structures of CDK/cyclin complexes involved in transcription have not been determined. The structure of CDK7 is highly similar to that of other CDKs (Lolli et al. 2004Go). The active site is located between the two lobes of the CDK fold, and is partially covered by a loop, the so-called activation segment (Fig. 3). In contrast to the highly conserved CDK structure, cyclin structures show remarkable differences (Fig. 3). Two five-helix repeats form a conserved cyclin fold, but the length and exact orientation of the helices differ. In addition, cyclins differ strongly in the N- and C-terminal regions flanking the cyclin fold. Whereas an N-terminal helix in cell cycle cyclins is involved in CDK recognition, the corresponding helix in transcription cyclins may interact with other factors (Andersen et al. 1997Go; S. Hoeppner, S. Baumli, and P. Cramer, unpubl.). The N- and C-terminal helices of cyclin H adopt a different orientation compared to other known cyclin structures, and are required for structural integrity (Andersen et al. 1997Go). Cyclin C does not contain a C-terminal helix, and its N-terminal helix has a different length and orientation as in other cyclins (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.).



View larger version (54K):
[in this window]
[in a new window]
 
Figure 3. Structural information on CTD kinase/cyclin pairs. (A) Structures of CDK7 (Lolli et al. 2004Go), the CDK2/cyclin A pair (Russo et al. 1996Go), cyclin H (Andersen et al. 1997Go), and cyclin C (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.) (from left to right). Kinases are in blue, with the activation segment in green and the phosphorylated residue highlighted with a red sphere. The active site is indicated with a star. Cyclin repeats 1 and 2 are colored in olive and brown, respectively. The N- and C-terminal helices of the cyclins are in red and magenta, respectively. (B) Surface representations of the structures in A. Surface residues are colored from dark green to yellow according to decreasing degree of conservation.

 
Models of CDK-cyclin pairs may be obtained by homology modeling with known CDK/cyclin complex structures, such as the human CDK2-cyclin A structure (Jeffrey et al. 1995Go; Russo et al. 1996Go). Such modeling identified highly conserved residues that are generally involved in CDK-cyclin interactions (Jeffrey et al. 1995Go; Andersen et al. 1997Go), and residues in the CDK-cyclin interface that are conserved only within a certain CDK or cyclin family (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.). Theses "specific" residues may govern preferential interaction of a certain cyclin with a certain CDK. However, it remains very difficult to rationalize the basis for specific CDK-cyclin complex formation based on the available structures, because the detailed structure of the interface regions may change upon complex formation, and since additional protein subunits can contribute to specific kinase-cyclin assembly.

Despite the recent structural studies, the basis for specificity of a kinase for the CTD and for recognition of a particular CTD residue remains enigmatic, because no structures of CDK/CTD complexes are known. Compared with other CDK structures, the activation segment of CDK7 is in a different conformation, which may help in determining substrate specificity (Russo et al. 1996Go; Lolli et al. 2004Go). Compared with CDK7, CDK8 has three additional residues in the activation segment, and a nine-residue insertion near the activation segment, which could play a role in defining substrate specificity (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.).

Kinase specificity for the CTD may not only be achieved by CTD recognition at the kinase active site, but also by CTD binding to kinase-associated factors. CDK7 specificity for the CTD is influenced by its binding to MAT1 (Yankulov and Bentley 1997Go; Larochelle et al. 1998Go), but it is unclear how MAT1 accomplishes this function, although the NMR structure of its RING finger domain is known (Gervais et al. 2001Go). The CDK7-containing CAK complex targets other CDKs, but TFIIH, which includes the CAK complex, has a strong preference for the CTD as a substrate (Rossignol et al. 1997Go; Yankulov and Bentley 1997Go). CDK7 CTD specificity is highest in the context of a transcription initiation complex (Lu et al. 1992Go; Watanabe et al. 2000Go), and the preference of TFIIH for S5 phosphorylation is enforced by TFIIE (Yamamoto et al. 2001Go). Cyclin C has a highly conserved surface depression that may bind substrates near the active site of CDK8 (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.). A similar mechanism is established for cyclin A, which has a conserved surface patch that binds kinase substrates (Schulman et al. 1998Go; Kontopidis et al. 2003Go). Cyclin T binds the CTD via a histidine-rich stretch in its C-terminal domain (Taube et al. 2002Go; Kurosu et al. 2004Go). A recent study suggests that the cyclins generally act as adaptors to render a CDK specific for a substrate (Loog and Morgan 2005Go). The HIV Tat protein shifts CDK9 phosphorylation preference from S2 to both S2 and S5 (Zhou et al. 2000Go). Noncanonical phosphorylation of the CTD at Y1 by the Abl kinase involves CTD binding to an Abl domain distinct from the kinase domain (Baskaran et al. 1997Go).

An open question is the activation mechanism of the CTD-targeting CDKs. CDKs involved in cell cycle regulation are generally activated in two steps, cyclin binding, and phosphorylation of a conserved threonine in the CDK activation segment (T160 in human CDK2) (Pavletich 1999Go). Interaction of the phosphothreonine side chain with three conserved arginines triggers a conformational change that results in full kinase activation (Russo et al. 1996Go). CDK7 and CDK9 carry a threonine or a serine at the phosphorylated position. In the free CDK7 structure, the phosphorylated threonine, however, is found at a different location than in CDK2 (Lolli et al. 2004Go), and does not contact the three conserved arginines, pointing to a different mechanism of CDK activation. Also, CDK8 does not have a threonine or serine residue at the position phosphorylated in other CDKs (Tassan et al. 1995Go). A conserved aspartate in CDK8 or a glutamate in cyclin C could, however, mimic a phosphothreonine (S. Hoeppner, S. Baumli, and P. Cramer, unpubl.).



View larger version (38K):
[in this window]
[in a new window]
 
Figure 4. Structure and mechanism of CTD phosphatases. (A) Scp1 structure and active site. The structure of Scp1 (Kamenski et al. 2004Go) is shown on the left as a ribbon model with two aspartate residues in the active site and a metal ion highlighted. The detailed view on the right depicts the active-site region (marked by a rectangle in the structure on the left). Depicted is a trifluoroberyllate ion, which binds the catalytic side chain of residue D96, resulting in a mimicry of a phosphoaspartyl intermediate. (B) Fcp1 model. The FCPH domain (left) was modeled based on the Scp1 structure (Kamenski et al. 2004Go). The BRCT domain was modeled based on PDB-ID1cdz (X. Zhang et al. 1998Go). The C-terminal Fcp1 helix in complex with the TFIIF domain is taken from PDB-ID 1j2x [PDB] (Kamada et al. 2003Go). (C) Ssu72 model. Ssu72 has been modeled by homology based on the structure of the bovine low-molecular-weight phosphotyrosine phosphatase (PDB-ID 1phr [PDB] ) (Su et al. 1994Go). Loop regions that are uncertain are shown as dashed lines. The detailed view on the right depicts the active-site region (marked by a rectangle in the structure on the left). A sulfate ion trapped in the structure is depicted in orange/red. The phosphorylated side chain of cysteine residue C12 is shown. (D) Proposed mechanisms of CTD phosphatases Scp1/Fcp1 (top) and Ssu72 (bottom). The reaction mechanisms involve the formation of a phosphoaspartyl (Scp1, Fcp1) or a phosphocysteinyl intermediate (Ssu72), followed by recycling of the catalytic side chain with the use of a nucleophilic water molecule.

 

    CTD phosphatases of the Fcp1 family
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
Recycling of Pol II requires CTD dephosphorylation by Fcp1, a conserved and essential phosphatase that is globally required for gene expression (Chambers and Dahmus 1994Go; Chambers and Kane 1996Go; Archambault et al. 1997Go, 1998Go; Cho et al. 1999Go; Kobor et al. 1999Go; Lin et al. 2002bGo). Fcp1 dephosphorylates both the free CTD and transcription elongation complexes (Chambers et al. 1995Go; Cho et al. 1999Go; Lehman and Dahmus 2000Go; Kong et al. 2005Go). Fcp1 contains an N-terminal catalytic region (FCPH) that includes the DXDX(T/V) signature motif of a superfamily of phosphotransferases and -hydrolases (Collet et al. 1998Go), and a C-terminal breast cancer protein related C-terminal (BRCT) domain, which binds to the phosphorylated CTD (Yu et al. 2003Go). In higher eukaryotes, small CTD phosphatases (SCPs) were described, which contain a catalytic FCPH domain, but lack a BRCT domain (Yeo et al. 2003Go). SCPs are transcriptional regulators that silence neuronal genes in non-neuronal tissue (Yeo et al. 2005Go). Plants contain CTD phosphatase-like proteins (CPLs), which also comprise a catalytic domain with a DXDXT motif, and also lack a BRCT domain (Koiwa et al. 2004Go).

The high-resolution structure of the catalytic FCPH domain of Scp1 (Kamenski et al. 2004Go) revealed a core fold with a central parallel {beta}-sheet (Fig. 4; Kamenski et al. 2004Go). The fold is similar to that of other enzymes of the DXDX(T/V) superfamily (Wang et al. 2001Go; Lahiri et al. 2003Go), although they share no sequence similarity outside the signature motif. The signature motif is part of a central depression that forms the active site and binds a metal ion. Catalysis involves the metal-assisted phosphorylation of the first aspartate in the DXDX(T/V) motif. Magnesium ions are essential for Fcp1 and Scp1 activity, and the trifluoroberyllate anion inhibits activity by forming a stable tetrahedral adduct with the catalytic aspartate side chain, mimicking a labile phosphoaspartyl intermediate (Fig. 4; Kamenski et al. 2004Go). This mechanism is consistent with biochemical data (Hausmann and Shuman 2003Go), and corresponds to that of other DXDX(T/V) superfamily enzymes, which also use the N-terminal aspartate in the signature motif as a phosphoryl acceptor (H. Cho et al. 2001Go; Lahiri et al. 2003Go). Consistently, mutation of this aspartate in Scp1 or Fcp1 to alanine abolished activity (Kobor et al. 1999Go; Hausmann and Shuman 2002Go; Kamenski et al. 2004Go). The second asparate in the DXDX(T/V) motif contributes to metal ion binding, and may act as a general acid/base.

Whereas the catalytic mechanism of Fcp1/Scp1 phosphatases is well understood, the basis for their CTD specificity remains to be fully established. Specificity for the CTD may to some extent be explained by recruitment of the enzymes to the CTD. Fcp1 binds to a docking site on Pol II outside the CTD (Chambers et al. 1995Go) that includes the Pol II subcomplex Rpb4/7 (Kimura et al. 2002Go; Kamenski et al. 2004Go). Rpb4/7 is located directly adjacent to the polymerase linker to the CTD (Armache et al. 2003Go; Bushnell and Kornberg 2003Go; Armache et al. 2005Go). In addition, Fcp1 binds the phosphorylated CTD via its BRCT domain (Yu et al. 2003Go), and binds the polymerase-associated general transcription factor TFIIF, which stimulates Fcp1 activity (Chambers et al. 1995Go; Archambault et al. 1998Go; Kamada et al. 2003Go; Nguyen et al. 2003Go).

Specificity of the CTD phosphatases toward CTD phosphorylation sites requires recognition of CTD residues around the phosphorylated target side chain. Indeed, Fcp1 activity requires several CTD residues flanking the phosphoserine, and single alanine mutations of the flanking Y1 and P3 decrease activity (Hausmann et al. 2004Go). Fcp1 and Scp1 were reported to dephosphorylate S5 and S2 (Hausmann and Shuman 2002Go; Lin et al. 2002aGo; Yeo et al. 2003Go). Highly purified Fcp1 was recently shown to dephosphorylate S5, but not S2 (Kong et al. 2005Go). Since both serines are flanked on the C-terminal side by a proline residue, CTD phosphatases may bind the adjacent prolines P3 or P6, and preferential phosphoserine dephosphorylation may be achieved by binding to other nearby residues. Indeed, the P3 side chain binds to a hydrophobic pocket in the known CTD peptide complex structures (Verdecia et al. 2000Go; Fabrega et al. 2003Go; Meinhart and Cramer 2004Go), and specific recognition of a flanking proline is consistent with Fcp1 inhibition by the prolyl isomerase Pin1 (Xu et al. 2003Go). Plant CPLs were also shown to specifically dephosphorylate S5 (Koiwa et al. 2004Go), and therefore most likely also recognize flanking residues.


    CTD phosphatase Ssu72
 Top
 Abstract
 Free CTD structure
 CTD modification
 Architecture of CTD-binding...
 CTD recognition
 CTD-Mediator interaction
 CTD kinases
 Kinase structure and specificity
 CTD phosphatases of the...
 CTD phosphatase Ssu72
 Outlook
 Acknowledgments
 References
 
The highly conserved and essential protein Ssu72 plays roles in all three phases of the transcription cycle. Ssu72 was originally identified in a yeast genetic screen as enhancer of a defect caused by a mutation in the initiation factor TFIIB (Sun and Hampsey 1996Go), and binds to TFIIB (Wu and Hampsey 1999Go; Pappas and Hampsey 2000Go; Dichtl et al. 2002aGo). Ssu72 is also involved in mRNA elongation, since its mutation can increase Pol II pausing and can counteract toxicity of the elongation inhibitor 6-azauracil (Dichtl et al. 2002aGo). Ssu72 is further involved in both poly(A)-dependent and -independent termination (Steinmetz and Brow 2003Go). It is a subunit of the yeast cleavage and polyadenylation factor (CPF), which is part of the machinery for mRNA 3'-end formation. Finally, Ssu72 is also essential for transcription termination of snRNAs (Ganem et al. 2003Go).

Two groups independently reported that Ssu72 has phosphatase activity, and speculated that it may target the CTD (Ganem et al. 2003Go; Meinhart et al. 2003Go). Ssu72 cleaves a nonspecific phosphatase substrate, and its sequence contains the CX5R signature motif of protein tyrosine phosphatases (PTPases) (Ganem et al. 2003Go; Meinhart et al. 2003Go). Mutation of the cysteine in this signature motif abolishes Ssu72 activity in vitro (Meinhart et al. 2003Go), and confers lethality in vivo (Sun and Hampsey 1996Go). In PTPases, the conserved cysteine and arginine residues of the signature motif form part of the active site (Ramponi and Stefani 1997Go). Whereas the cysteine attacks the substrate phosphorus atom, leading to formation of a phosphocysteinyl intermediate, the arginine stabilizes the transition state (Burke and Zhang 1998Go). Although there is no apparent sequence homology between Ssu72 and PTPases outside the signature motif, secondary structure prediction suggested that Ssu72 adopts the fold of