|
|
|
REVIEW
1 Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA; 2 UCT/National Institutes of Health, Bethesda, Maryland 20892, USA
| Abstract |
|---|
|
|
|---|
[Keywords: Chromatin; histones; variants; structure; transcription; segregation]
1000 µm3. This compaction is achieved by protein-mediated folding of DNA. Chromatin, the nucleoprotein complex found in the nucleus, has approximately twice the protein mass as DNA (Butler 1983
At the first level of packaging, the DNA is wrapped around histones to form a beaded chain. Each bead is referred to as a core nucleosome and contains an octamer of two molecules of each of the core histones H2A, H2B, H3, and H4 with two turns of DNA wrapped around the proteins (for review, see Luger 2003
; Khorasanizadeh 2004
). These core histones all contain a conserved C-terminal histone fold domain and unique N-terminal tails. The four core histones interact in pairs via a "hand-shake motif" with two H3/H4 dimers interacting together to form a tetramer, while the two H2A/H2B dimers associate with the H3/H4 tetramer in the presence of DNA. Multiple electrostatic, hydrophobic, and hydrogen bonds at the interface of these subcomplexes are required for nucleosome formation. The N-terminal tails of the histones do not significantly participate in the nucleosome structure and instead are involved in interactions with other proteins and nucleosomes. One molecule of histone H1 associates at the position where the DNA enters and exits the nucleosome core, thus sealing the two turns of DNA. The nucleosome filament is then folded into a 30-nm fiber mediated in part by nucleosome-nucleosome interactions, and this fiber is probably the template for most nuclear processes. Additional levels of compaction enable these fibers to be packaged into the small volume of the nucleus.
The packaging of DNA into nucleosomes and chromatin positively or negatively affects all nuclear processes in the cell. While nucleosomes have long been viewed as stable entities, there is a large body of evidence indicating that they are highly dynamic (for review, see Kamakaka 2003
), capable of being altered in their composition, structure, and location along the DNA. Enzyme complexes that either post-translationally modify the histones or alter the position and structure of the nucleosomes carry out these functions. There are a wide variety of post-translational modifications that occur on histones, such as phosphorylation, methylation, acetylation, and ubiquitylation (Iizuka and Smith 2003
), and these modifications affect the properties of the histones. Moreover, chromatin-remodeling complexes contain ATPase subunits and are known to slide nucleosomes, replace histones, or alter the histone-DNA interactions (Tsukiyama 2002
; Langst and Becker 2004
). A third way to modulate chromatin is via incorporation of histone variants. Here we provide an overview of the best-characterized histone variant functions and the ways that they can alter chromatin to facilitate various cellular processes.
| An introduction to the variants |
|---|
|
|
|---|
| Who are the variants? |
|---|
|
|
|---|
Histone H1
Histone H1 has numerous sequence variants such as H10, H5, and the spermand testis-specific variants. Most of the sequence differences between the major histone subtypes and the variants occur in the nonglobular N- and C-terminal tail domains of these proteins. The abundance of these variants fluctuates in different cell types as well as during the cell cycle, differentiation, and development (for review, see Cole 1987
; Brown 2001
; Parseghian and Hamkalo 2001
; Brown 2003
). Furthermore, the major histones and variants have distinct biophysical properties (Cole 1987
; Ramakrishnan 1997
) and different distribution patterns in the genome (Roche et al. 1985
; Parseghian and Hamkalo 2001
). Based on these observations, it has been suggested that the H1 variants have specific functions, although tests of this prediction have uncovered only subtle functional differences (Brown et al. 1996
; Shen and Gorovsky 1996
; Steinbach et al. 1997
; Lin et al. 2000
; Alami et al. 2003
; Folco et al. 2003
).
Histone H2A
Among the core histones, H2A has the largest number of variants, including H2A.Z, MacroH2A, H2A-Bbd, H2AvD, and H2A.X (Table 1; Fig. 1; (Ausio and Abbott 2002
; Redon et al. 2002
; Fernandez-Capetillo et al. 2004
). Some H2A variants, like H2A.Z, are conserved through evolution (Jackson et al. 1996
), while others such as MacroH2A (Pehrson and Fuji 1998
) and H2A-Bbd (Chadwick and Willard 2001
) are restricted to vertebrates or mammals. The H2A variants are distinguished from the major H2A histones by their C-terminal tails that diverge in both length and sequence, as well as in their genome distribution (Table 2). MacroH2A localizes predominantly to the inactive X-chromosome (Costanzi and Pehrson 1998
), while H2A-Bbd localizes to the active X-chromosome and autosomes (Chadwick and Willard 2001
). H2A.X and H2A.Z are constitutively expressed and localize throughout the genome, although H2A.Z shows some enrichment in intergenic regions. Interestingly, the major H2A proteins in Saccharomyces cerevisiae and Schizosaccharomyces pombe are more similar to the mammalian H2A.X variant than the mammalian major H2A subtypes (Supplementary Fig. 1; Malik and Henikoff 2003
). In Drosophila, a single variant called H2AvD has sequence characteristics of both H2A.X and H2A.Z (Redon et al. 2002
). Because the Drosophila protein likely encompasses the separate functions ascribed to both H2A.Z and H2A.X in mammals, care needs to be taken in comparing the functions of variants between species.
|
|
|
Histone H2B is markedly deficient in variants. The few that have been documented completely replace the major H2B subtypes and appear to have very specialized functions in chromatin compaction and transcription repression, particularly during gametogenesis (for review, see Poccia and Green 1992
; Green et al. 1995
). Unlike the major H2B subtypes, the sperm-specific H2B in sea urchins has a long N-terminal tail that is highly charged. This tail assists in the condensation of chromatin fibers, suggesting that this variant may play a role in packaging the chromatin in the sperm. There are additional H2B variants that are developmental stage-specific, but their specific role is unclear.
Histone H3
There are two major histone H3 variants called H3.3 and centromeric H3 (CenH3) (Ahmad and Henikoff 2002a
; Malik and Henikoff 2003
), as well as a mammalian testis tissue-specific histone H3 variant called H3.4 (Table 1; Fig. 1; Albig et al. 1996
; Witt et al. 1996
). Because the centromeric H3 variant has many names (see Table 1) such as CENP-A in mammalian cells, we will use the standardized name CenH3 throughout this review. CenH3 is a conserved essential protein that binds to centromeres, the DNA locus that directs formation of the kinetochore protein structure that mediates chromosome segregation in eukaryotes. Despite similarity in the histone fold domain, all CenH3 proteins have highly divergent N-terminal tails. H3.3 and H3.4 are the least divergent variants, containing only four amino acid differences compared to H3 in Drosophila (Supplementary Fig. 2). However, unlike the major H3 histones, H3.3 is expressed throughout the cell cycle and often localizes to transcriptionally active regions of the chromosome (Ahmad and Henikoff 2002b
). Similar to H2A.X, the major H3 proteins in S. cerevisiae (Ahmad and Henikoff 2002b
) are more similar to mammalian H3.3 than H3.
Histone H4
Histone H4 is one of the slowest evolving proteins, and there appear to be no known sequence variants of histone H4. However, there are H4 genes that are constitutively expressed throughout the cell cycle that encode for proteins that are identical in sequence to the major H4 (Akhmanova et al. 1996
). The reason for a lack of sequence variants is not clear.
| Are variants deviant in behavior? Functions of the variants |
|---|
|
|
|---|
Transcriptional activation and repression
Several histone H1 variants appear to have roles in transcription, particularly in repression (Table 2) during differentiation (Poccia and Green 1992
; Doenecke et al. 1994
; Buttinelli et al. 1999
). One example is histone H5 in chicken erythrocytes. This variant is deposited into chromatin during the terminal stages of erythrocyte differentiation, and its deposition coincides with global transcriptional repression (Wagner et al. 1977
). The H5 variant is depleted from active genes in vivo, and the presence of this variant represses transcription initiation in vitro (for review, see Paranjape et al. 1994
). While some of the H1 variants may be general repressors, others may be more selective in their regulation of genes (Roche et al. 1985
; Shen and Gorovsky 1996
; Steinbach et al. 1997
; Folco et al. 2003
).
The MacroH2A variant is also thought to be involved in transcriptional repression. This variant localizes to the inactive X-chromosome (Costanzi and Pehrson 1998
), and while binding does not initiate X inactivation (Mermoud et al. 1999
), some models suggest that the C-terminal tail of MacroH2A represses transcription enzymatically (Ladurner 2003
), while others suggest that it sterically blocks access to transcription factors and coactivators (Perche et al. 2000
; Angelov et al. 2003
; Abbott et al. 2004
).
In contrast, H2A-Bbd lacks a significant C-terminal tail, and it has been postulated that the lack of such a tail may destabilize the nucleosome, thus aiding in ease of nucleosome displacement during transcription (Angelov et al. 2004
; Bao et al. 2004
; Gautier et al. 2004
). This role is consistent with the localization of H2A-Bbd to the active X-chromosome and autosomes (Chadwick and Willard 2001
).
H2A.Z has been linked to both transcriptional repression and activation. Recent results indicate that H2A.Z may be involved in heterochromatin organization. In Drosophila, H2A.Z is present at heterochromatic loci in addition to euchromatin (van Daal et al. 1988
; Leach et al. 2000
). Similarly, immunofluorescence analyses in mammalian cells indicate that H2A.Z localizes to foci containing the heterochromatic protein HP1
and un-acetylated histones H3 and H4 present on chromosomal arms, though not the centromeric heterochromatin or facultative heterochromatin (Rangasamy et al. 2003
, 2004
). The heterochromatic protein HP1 has a modest preference for binding to H2A.Z-containing nucleosomes in vitro (Fan et al. 2004
), and depletion of H2A.Z is accompanied by the loss of HP1
from the arms. In addition, mutations in H2A.Z affect repression mediated by HP1 and the homeotic repressor protein, polycomb, leading to the mislocalization of both proteins in the nucleus (Swaminathan et al. 2004). Taken together, these data suggest that H2A.Z plays a role in transcription repression (Fig. 2).
|
Given that the major histone subtypes affect transcription activation and silencing, the observed differences in the function of H2A.Z could simply reflect the different assays used and loci analyzed or could be due to fundamental differences between organisms. The vast number of phenotypes associated with H2A.Z also raises the possibility that some of its effects may be mediated through global changes in chromatin architecture, rather than direct effects at specific loci, given that many chromatin factors act globally as well as locally.
The H3.3 histone variant also plays a role in transcription. One of the distinguishing features of this variant is that it is constitutively expressed during the cell cycle (though there is increased expression during S phase) and can be deposited into chromatin outside of S phase. In dividing cells, H3.3 is present at genes that are either poised for transcription, or are actively transcribed. It is widely accepted that nucleosome disruption occurs during nuclear processes such as transcription and DNA repair, and the consequent loss of histones need to be replaced. Because the Drosophila H3.3 variant is deposited at transcriptionally active loci like the rDNA, outside of S phase (Ahmad and Henikoff 2002b
), H3.3 may serve to replace H3 at active genes as nucleosomes reform behind the transcribing polymerase (Fig. 3).
|
Heterochromatic barriers
Some regions of the chromatin are transcriptionally inactive, or "silenced." In yeast, silencing is achieved by the binding of a complex of repressor proteins that spreads along the chromatin that is silenced. The silenced chromatin domains are restricted from spreading along the DNA by the presence of barrier elements (Donze and Kamakaka 2002
). H2A.Z, which was initially isolated as a weak suppressor of a silencing defect in budding yeast (Dhillon and Kamakaka 2000
), was subsequently shown to be enriched in regions adjacent to the silenced domains and function in parallel with barrier elements to block the spread of silencing (Meneghini et al. 2003
). Consistent with its role in transcription activation, in S. cerevisiae, current models suggest that H2A.Z-containing chromatin is an unfavorable substrate for the binding of silencing proteins (Kimura et al. 2002
; Suka et al. 2002
; Krogan et al. 2003a
; Meneghini et al. 2003
; Kobor et al. 2004
; Zhang et al. 2004
).
Genome stability
Some histone variants contribute to genome stability by regulating the fidelity of chromosome segregation or the efficiency of DNA replication and repair. The CenH3 variant is required for accurate chromosome segregation in every organism examined (Stoler et al. 1995
; Figueroa et al. 1998
; Howman et al. 2000
; Takahashi et al. 2000
; Blower and Karpen 2001
; Oegema et al. 2001
; Goshima et al. 2003
). There are two major functions that CenH3 is likely to fulfill at centromeres: First, it has been proposed that CenH3 is the epigenetic mark that specifies the site of kinetochore formation. This is supported by the observation that all active centromeres contain CenH3, whereas inactive centromeres do not (Warburton et al. 1997
; Ouspenski et al. 2003
). However, CenH3 does not appear to be sufficient for centromere identity, because mistargeting of CenH3 to euchromatin causes some, but not all kinetochore proteins to mislocalize with it (Van Hooser et al. 2001
). Therefore, additional mechanisms must assist CenH3 in specifying the site of kinetochore formation. An idea that was recently proposed is that histone modifications specific for centromeric chromatin could also aid in propagating centromere identity (Sullivan and Karpen 2004
). Alternatively, the mistargeting of CenH3 to euchromatin may not result in a chromatin structure that is permissive for kinetochore assembly.
The other major function for CenH3 is in directing assembly of the proteinaceous kinetochore structure. In worms, CenH3 depletion leads to a kinetochore null phenotype where most kinetochore proteins examined were mislocalized (Oegema et al. 2001
). Consistent with this, CenH3 depletion experiments and CenH3 null mice exhibited altered localization of many kinetochore proteins (Howman et al. 2000
; Blower and Karpen 2001
). CenH3 is essential for the specialized centromeric chromatin structure in budding and fission yeast (Meluh et al. 1998
; Takahashi et al. 2000
), and directly or indirectly interacts with many kinetochore proteins (Fig. 4; Van Hooser et al. 2001
). Taken together, these data suggest that the kinetochore protein-binding sites in CenH3 combined with the underlying chromatin structure created by CenH3 nucleosomes create an environment favorable for kinetochore assembly. In addition, a centromeric nucleosome may also assist in specifying the geometry of kinetochores to aid in microtubule binding, or in resisting the strong pulling forces that microtubules exert during mitosis. CenH3 may also have additional functions that have not yet been fully explored, such as recruitment of the cohesion complex that holds sister chromatids together, positioning of the mitotic spindle, and a role in cytokinesis (Tanaka et al. 1999
; Glowczewski et al. 2000
; Zeitlin et al. 2001b
).
|
Genome stability also requires the H2A.X variant. Double-strand breaks that occur during replication, recombination, or DNA rearrangement must be repaired. While H2A.X is expressed throughout the cell cycle and deposited all over the chromosomes, it is preferentially phosphorylated by the ATM and ATR kinases at sites that flank double-stranded breaks (Rogakou et al. 1999
), and this phosphorylation is essential to recruit many components of the DNA damage response to these sites (Paull et al. 2000
). Although H2A.X has not been shown to directly mediate DNA repair, it is important for suppressing oncogenic translocations and tumor formation (Celeste et al. 2003
). It is possible that H2A.X phosphorylation helps retain repair proteins at the site of damage, or facilitates interactions between chromosomes that are important for DNA repair.
| How do variants find their way home? Assembly of variant nucleosomes |
|---|
|
|
|---|
H2A.Z assembly
H2A.Z is deposited into chromatin during and outside of S phase and has been identified in two complexes: One contains the H2A/H2B histone chaperone/assembly protein Nap1, and the other contains a Swi/Snf-like ATPase called Swr1 (Krogan et al. 2003a
; Kobor et al. 2004
; Mizuguchi et al. 2004
). Whether these two complexes function together or in separate pathways needs to be determined, but the incorporation of H2A.Z is dramatically reduced in the absence of Swr1 in vivo (Krogan et al. 2003a
; Kobor et al. 2004
; Mizuguchi et al. 2004
). Swr1 can mediate the exchange of H2A with H2A.Z in nucleosomes in an ATP-dependent manner in vitro (Mizuguchi et al. 2004
), and the Swr1-containing complex might be involved in transcription-dependent deposition of this variant, since there is considerable overlap in the genes that are misregulated in cells lacking either H2A.Z or Swr1 (Kobor et al. 2004
; Mizuguchi et al. 2004
; Zhang et al. 2004
). H2A.Z is also deposited into regions of chromatin that are transcriptionally inactive (Leach et al. 2000
; Rangasamy et al. 2003
; Swaminathan et al. 2004), but it is not clear how this variant is deposited in these regions. It is also not clear whether the Swr1-mediated deposition of this variant is a cause or a consequence of transcription. It is interesting to note that H2A.Z deposition also depends on Yaf9, a component of both the Swr1 and the NuA4 histone acetyltransferase complexes (Zhang et al. 2004
). Since acetylation is correlated with active transcription, it raises the question of whether H2A.Z is targeted to chromatin that is acetylated or vice versa.
H3.3 assembly
H3.3 is constitutively expressed during the cell cycle (though there is increased expression during S phase) and can be deposited into chromatin outside of S phase. As mentioned previously, there are only minor differences between major H3 and H3.3. Three out of four changes reside in the histone fold domain, and these residues are important for deposition of the variant outside of S phase. While converting any one of these three residues in Drosophila H3.3 to that in major H3 does not affect the replication-independent deposition of H3.3, changing any one of the residues in major H3 to its counterpart in H3.3 allows major H3 to be incorporated into chromatin outside of S phase (Ahmad and Henikoff 2002b
).
This sequence specificity in deposition likely reflects interactions with different assembly factors, since CAF-1 is present in a complex with the major histone H3 subtype and HIRA is in a complex with H3.3. It is thought that the CAF-1 replication-coupled chromatin assembly complex deposits H3 histones during S phase, while H3.3 incorporation outside of S phase utilizes the HIRA complex (Tagami et al. 2004
), since this complex can mediate deposition in a replication-independent manner (Ray-Gallet et al. 2002
). While it is not known whether mammalian H3.3 is deposited into chromatin during S phase by HIRA or CAF-1, in S. cerevisiae both of these assembly complexes can deposit histone H3.3 in S phase. Furthermore, additional assembly complexes must exist that can deposit this variant, since yeast cells lacking CAF-1 and HIRA are viable (Kaufman et al. 1998
).
In dividing Drosophila cells, the sites of H3.3 deposition outside of S phase are transcriptionally active loci, suggesting that the HIRA complex may use a transcription-coupled deposition mechanism to replace major H3 with H3.3 (Ahmad and Henikoff 2002b
). Despite the identification of the H3.3 assembly complex, it is still unclear how HIRA targets H3.3 to transcriptionally active genes. Furthermore, transcription-coupled deposition may not be the only mechanism by which this variant is deposited into chromatin, since in mature cortical neurons, the levels of H3.3 increase to 90% of the total H3, and this protein is deposited in transcriptionally active euchromatin as well as inactive heterochromatin (Pina and Suau 1987
).
Histone H3.3 is bound with the assembly complexes as a heteromeric dimer, but whether and how this helps in the epigenetic inheritance of the chromatin state is not fully understood (see Tagami et al. [2004
] and Henikoff et al. [2004
] for a thorough discussion on this topic).
CenH3 assembly
Because other histones and variants are deposited by chaperones, it has been assumed that a specific loading complex will exist for CenH3. However, a CenH3 loading factor has not yet been identified, possibly because the low levels of soluble CenH3 make it difficult to identify interacting factors. The only chromatin-related proteins that are known to have a role in CenH3 centromere localization are the RbAp46/48 proteins that are components of a number of complexes including CAF-1 (for review, see Loyola and Almouzni 2004
). In fission yeast, the RpAp46/48 homolog Mis16 and the conserved Mis18 protein form a complex that regulates kinetochore function, and defects in RbAp46/48 in both S. pombe and mammalian cells result in CenH3 mislocalization from the centromere (Hayashi et al. 2004
). Because a direct interaction with CenH3 has not yet been reported, it is still unclear whether these proteins directly mediate CenH3 loading or instead alter centromeric chromatin structure to allow CenH3 deposition. Mutants in the fission yeast Mis16 and Mis18 proteins affect the acetylation state of the centromere, suggesting a potential link between CenH3 localization and acetylation.
Several other chromatin-assembly complexes have roles in S. cerevisiae kinetochore function and cause CenH3 to mislocalize to euchromatin. The CAF-1 and HIRA complexes have overlapping roles in kinetochore function (Sharp et al. 2002
), and mutants in the Spt4 transcription factor and the RSC chromatin-remodeling complex have kinetochore defects (Tsuchiya et al. 1998
; Hsu et al. 2003
; Crotti and Basrai 2004
). Proteins from each of these subcomplexes localize to kinetochores, and mutants in all complexes alter the centromeric chromatin structure and lead to chromosome missegregation phenotypes (Sharp et al. 2002
; Hsu et al. 2003
; Crotti and Basrai 2004
). However, it is not yet clear how defects in centromeric chromatin structure relate to kinetochore function. In addition, although CenH3 mislocalizes to euchromatin in some of these mutants, the centromeric localization of CenH3 is unaffected. One possibility is that these proteins are involved in setting up CenH3 boundaries to prevent it from spreading into euchromatin. Alternatively, these proteins may have subtle effects on CenH3, such as a shift in CenH3 nucleosomal positioning, that have not yet been assayed in the mutant strains.
Recent data also suggest a potential link between CenH3 localization, kinetochore function, and transcription. In fission yeast, a putative GATA transcriptional factor, ams2, binds to the central centromere region where CenH3 localizes (Chen et al. 2003
). Cells defective in ams2 have an altered centromeric chromatin structure with reduced levels of CenH3 at centromeres and defects in kinetochore function. Whether this protein is modulating these effects through recruitment of kinetochore components or via transcription is not clear. In maize, transcription has been detected at the core centromeres (Topp et al. 2004
). In addition, some of the genes in rice centromeres and an active human neocentromere are expressed, suggesting that transcription may facilitate kinetochore function (Saffery et al. 2003
; Nagaki et al. 2004
). While it is not clear whether centromeric transcription is conserved among eukaryotes, one could imagine that transcription creates an open chromatin environment that is more permissive for CenH3 assembly, or facilitates the removal of H3 nucleosomes to allow replacement by CenH3 nucleosomes. The presence of Spt4 at centromeres could help in this process (Crotti and Basrai 2004
), since this complex is involved in chromatin assembly and transcription elongation (Winston 2001
). Although intriguing, the isolation of transcription factors that affect kinetochore function may also be indirect due to transcription defects else-where in the genome, or due to additional functions for the proteins that are not related to transcription.
CenH3 is normally deposited only during S phase, though it is not known whether its deposition is replication-coupled or not (Shelby et al. 2000
; Pearson et al. 2004
). Because ectopically expressed CenH3 fusion proteins can localize to the centromere at all cell-cycle stages, CenH3 localization does not strictly depend on DNA replication (Sullivan et al. 1994
; Shelby et al. 2000
; Ahmad and Henikoff 2002a
). However, it is not known whether the exogenously expressed CenH3 uses the same mechanism of deposition as endogenous CenH3.
It has been suggested that the timing of centromere replication and spatial restriction within heterochromatin may aid in the localization of the variant (Ahmad and Henikoff 2001
). However, recent evidence showed that the centromere is replicated at the same time as euchromatic regions that contain the canonical H3 (Shelby et al. 2000
; Sullivan and Karpen 2001
; Blower et al. 2002
), so CenH3 deposition cannot be controlled solely by a restricted time of DNA replication.
The inheritance of CenH3 during centromere duplication was recently investigated, and it was found that budding yeast CenH3 is completely replaced during S phase (Pearson et al. 2004
). When synthesis of a fluorescently tagged CenH3 protein was repressed, the centromere-bound tagged protein was completely replaced by the endogenous protein in S phase, suggesting that yeast use a dispersive mode of CenH3 replication. Similar experiments need to be performed in multicellular eukaryotes to determine whether this is a conserved mode of CenH3 duplication.
The kinetochore structure is also important for CenH3 localization. Mutants in the budding yeast Ndc10 kinetochore protein completely abolish the localization of all kinetochore proteins including CenH3 (He et al. 2001
). In addition, the fission yeast proteins Mis6, Mis15, Mis16, Mis17, Mis18, and Sim4 that bind to the central centromere are all required for CenH3 localization (Takahashi et al. 2000
; Pidoux et al. 2003
; Hayashi et al. 2004
). While it is possible that these proteins play a direct role in CenH3 localization, it is just as likely that they help assemble a proper chromatin structure for CenH3 binding or help stabilize its binding following deposition.
| How deviant are variants? Structure of variant nucleosomes |
|---|
|
|
|---|
The canonical core histones bind tightly to the DNA via arginine side chains, and there are also numerous hydrogen bonds and water-mediated protein-DNA interactions between the canonical histones and DNA (for review, see Luger 2003
; Khorasanizadeh 2004
). Most of these residues and the basic histone-DNA contacts are conserved in the variants. While there are no sequence-specific interactions between the core histone side chains and the DNA bases for the major histone subtypes, it will be interesting to see whether a variant such as CenH3, which has some DNA targeting specificity, also lacks interactions between the histones and the bases. The two biggest changes due to the presence of variants appear to be in the stability of the nucleosome and the residues of the nucleosome that are exposed.
Variant nucleosome surface residues
One key finding of the structural studies is that variant nucleosomes have changes on the exposed surface. MacroH2A has an extensive C-terminal tail that likely extends away from the nucleosome and imparts an asymmetrical structure to the variant nucleosome that may be important for transcriptional repression (Allen et al. 2003
; Abbott et al. 2004
). The exposed macro domain may also be functioning enzymatically by affecting the modification status of chromatin proteins (Ladurner 2003
).
While the overall structure of H2A.Z nucleosomes is similar to the major H2A structure (Suto et al. 2000
), two of the most striking differences are the presence of an extended acidic patch on the nucleosome surface and a novel divalent cation-binding pocket. These changes on the surface of the nucleosome alter protein-nucleosome and nucleosome-nucleosome interactions, as well as the higher-order folding of the chromatin (Fan et al. 2004
) and are important for H2A.Z function during development (Ridgway et al. 2004
).
Variant nucleosome stability
Crystallography and various biophysical studies also indicate that there are changes in the stability of variant nucleosomes. FRET experiments with fluorescence donor and acceptor pairs attached at different locations in a nucleosome suggest that the overall binding of the H2A.Z/H2B dimer to the H3/H4 tetramer is slightly stabilized (Park et al. 2004
). Recent data suggest that the CenH3/H4 tetramer (Black et al. 2004
) is more compact and rigid than an H3/H4 tetramer and may also be more stable. It is possible that the additional rigidity helps to resist the microtubule pulling forces at the centromere during mitosis, or aids in the assembly of kinetochore proteins. Similarly, the MacroH2A nucleosomes may also be more stable (Changolkar and Pehrson 2002
; Abbott et al. 2004
), though additional biophysical studies will be required to fully understand the differences between the variant and major histone subtypes. Also, the in vivo consequences of a more stable variant nucleosome scattered among the canonical nucleosomes are hard to predict.
In contrast to the other variant nucleosomes, the H2A-Bbd nucleosome structure may be weaker. In the absence of DNA it is unable to form a stable histone octamer, and the H2A-Bbd nucleosome organizes only 118 bp of DNA rather than the 147 bp around the histone core (Bao et al. 2004
). While these nucleosomes are not very mobile, they are less stable and more accessible to transcription factors (Angelov et al. 2004
; Bao et al. 2004
; Gautier et al. 2004
). Therefore, it is likely that the structural alterations in the H2A-Bbd nucleosome lead to a weaker nucleosome structure that facilitates gene activation.
Variant nucleosome composition
A final observation is that certain variants are unable to coexist with the major histone subtypes. For example, the structure of the H2A.Z-containing nucleosome suggests that H2A and H2A.Z may not coexist in the same nucleosome (Suto et al. 2000
). Similarly, the CenH3, MacroH2A, and H2A-Bbd nucleosomes may also be homotypic, as CenH3 interacts with H2A, H2B, and H4, but not H3 in vivo (Shelby et al. 1997
; Blower and Karpen 2001
; Westermann et al. 2003
), and nucleosome reconstitution experiments with either MacroH2A or H2A-Bbd showed they completely replace H2A (Angelov et al. 2003
, 2004
; Gautier et al. 2004
). Although these variants likely exist only in homotypic nucleosomes, it is possible that certain variants (like H3.3) will exist in heterotypic nucleosomes along with the major histone subtypes. If heterotypic nucleosomes exist, then it will be interesting to determine how the different assembly complexes cooperate physically or temporally to form these nucleosomes.
An interesting observation that arises from the crystallographic analysis of the S. cerevisiae nucleosome is that different variant histones may coexist in the same nucleosome. The major H2A and H3 histones in S. cerevisiae are most similar to the mammalian histone variants H2A.X and H3.3, respectively. The yeast nucleosome structure has revealed that H3.3 coexists in the same nucleosome as H2A.X (White et al. 2001
). Therefore, nucleosome alterations could potentially come from combinations of variants in addition to specific changes associated with a single variant.
Higher-order structures
The details about variant nucleosome structures lead to the question of how the changes in structure affect the higher-order chromatin structure. This is especially important because it has long been believed that histone variants may exert their effects via changes in the higher-order packaging of chromatin. While earlier studies convincingly showed that histone H1 variants condensed chromatin to a greater extent compared to the major H1 subtypes (for review, see Thomas 1984
, 1999
), only recently have such analyses been extended to core histone variants.
H2A.Z chromatin: Two independent studies used positioned arrays of 12 nucleosomes and recombinant histones (H2A and H2A.Z) to analyze the folding of chromatin fibers. In one study, the presence of H2A.Z helped facilitate the folding of the nucleosomal filament into the 30-nm fiber as a function of divalent cations (Fan et al. 2002
) but reduced fiber-fiber interactions and aggregation, analogous to fibers containing acetylated histones. In a second independent study, H2A.Z-containing arrays were consistently less folded as a function of monovalent salt (Abbott et al. 2001
), but it is not known whether the reduced folding is due to the absence of divalent cations, since H2A.Z has a divalent cation-binding pocket that is important for H2A.Z function. Furthermore, the effects of having a few H2A.Z variant nucleosomes located among nucleosomes containing the major histones have not yet been explored.
CenH3 chromatin: While it was originally thought that CenH3 forms a linear array at centromeres, recent microscopy on extended chromatin fibers showed that the linear relationship between CenH3 and the major H3 is not exclusionary (Blower and Karpen 2001
). Instead, arrays of the major H3 nucleosomes are dispersed throughout the CenH3-rich chromatin in flies and humans (Fig. 4). This suggests that the higher-order structure assembles in a manner that causes the CenH3 nucleosomes to orient together to form the base of the kinetochore and exclude the H3 nucleosomes. This is consistent with the holocentric chromosomes of Caenorhabditis elegans that contain both CenH3 and H3 nucleosomes in a linear array and also orient CenH3 nucleosomes to form the kinetochore along the length of the chromosome (Buchwitz et al. 1999
). Recent data on H2A.Z suggest that this variant may also be present in linear arrays in the nucleus (Fan et al. 2004
), and it will be interesting to determine whether these arrays help orient H2A.Z chromatin in specific ways in the nucleus.
| What makes a variant a variant? Specificity within the variants |
|---|
|
|
|---|
Domains in H2A variants
For the histone H2A variants, the C-terminal tail appears to distinguish their specific functions. The invariant SQE motif in the tail of H2A.X is crucial for function, because it is the site of reversible phosphorylation that occurs in response to double-strand breaks (For review, see Redon et al. 2002
).
Like H2A.X, the C-terminal of H2A.Z is important for its function. Domain swaps in Drosophila between specific H2A.Z sequences and the major H2A identified the C-terminal docking domain that interacts with histone H4 as required for viability (Clarkson et al. 1999
). The C-terminal tail interacts with proteins, because in vitro binding studies indicate that this domain interacts with the general transcription machinery (Adam et al. 2001
; Larochelle and Gaudreau 2003
) and plays a role in recruiting various factors to the regulatory regions of genes, while in mammals the C-terminal domain is important for binding to HP1
as well as INCENP (Rangasamy et al. 2003
; Fan et al. 2004
).
In contrast to these studies, in Tetrahymena the lysines in the N terminus of H2A.Z are important for function (Ren and Gorovsky 2003
), since mutating these to arginine results in lethality (Ren and Gorovsky 2001
). While the differences between Tetrahymena (Ren and Gorovsky 2001
), Drosophila (Clarkson et al. 1999
), and S. cerevisiae (Adam et al. 2001
) appear striking at first glance, it should be noted that these studies are not directly comparable. It is not clear whether converting all the N-terminal lysine residues to arginine in Drosophila or S. cerevisiae would have any phenotype and conversely, the phenotype of deleting the C-terminal tail in Tetrahymena is also not known.
For the MacroH2A variant, both the histone fold and long C-terminal tail have roles in transcriptional repression. The histone fold domain prevents nucleosome sliding and is responsible for assembly into the nucleosome. The structure of the C-terminal macro domain indicates that it has similarity to the enzymatic domain of nucleotide triphosphate hydrolases as well as the DNA-binding domain of certain aminopeptidases (Allen et al. 2003
). The C-terminal tail interferes with transcription factor binding (Perche et al. 2000
; Angelov et al. 2003
) and might help in interactions with linker DNA or adjacent chromatin fibers. It has also been proposed that it may possess an enzymatic function, modulating the ADP-ribosylation of chromatin proteins (Ladurner 2003
). Therefore, both domains in MacroH2A appear to inhibit transcription using different mechanisms.
Domains in H3 variants
Numerous studies have dissected the CenH3 residues required for targeting to centromeres and for proper kinetochore function. While both the CenH3 N-terminal tail and histone fold domain are essential for function, only the histone fold domain contains the centromere targeting information (Sullivan et al. 1994
; Shelby et al. 1997
; Keith et al. 1999
). Although there are few distinguishing features among the various CenH3 histone fold domains, studies have led to the conclusion that loop I and helix II are critical for CenH3 centromere targeting in flies and mammals (Shelby et al. 1997
; Keith et al. 1999
; Vermaak et al. 2002
; Black et al. 2004
). These same regions also confer the more rigid structure to the CenH3/H4 tetramer (Black et al. 2004
), suggesting that this feature may be important for targeting CenH3. One possibility is that these regions of CenH3 are recognized by an assembly factor that ensures that it incorporates the right histone at the centromere.
CenH3 might have some DNA-binding specificity, because the residues necessary for targeting are predicted to map to H3/DNA contact sites. This may seem surprising since centromeres are one of the most rapidly evolving sequences in the genome (Schueler et al. 2001
). However, the observation that the Drosophila CenH3 loop I undergoes adaptive evolution is consistent with the idea that CenH3 has DNA-binding specificity that is evolving along with the centromeric sequence (Henikoff et al. 2001
). Although
-satellite DNA is a hallmark of mammalian centromeric DNA, CenH3 is also found at neocentromeres that completely lack
-satellite DNA (Lo et al. 2001a
,b
). Therefore, any targeting specificity must arise from a unique secondary structure instead of primary sequence requirements.
The essential N-terminal tails of all the CenH3 variants diverge in sequence and in length, ranging from 27 to >400 residues. The only organism where the essential residues have been mapped is in budding yeast, where a 33-amino acid N-terminal domain (END) is sufficient to provide the essential function (Keith et al. 1999
; Chen et al. 2000
). Although the N-terminal domains are not necessary for centromere targeting, they are required for the binding of other kinetochore proteins. In budding yeast, the END domain binds to the Ctf19 kinetochore protein, while in mammalian cells, this domain appears to recruit CENP-C (Chen et al. 2000
; Van Hooser et al. 2001
). The END domain does not have known homology to other proteins or other CenH3 N-terminal tails, and the divergence in CenH3 N-terminal sequence and length is likely due to the differences in the kinetochore proteins that are recruited by each CenH3 (see below).
Unlike the other variants, which possess domains that are distinct from the major histones, in Drosophila H3.3, only four amino acids distinguish this variant from the major histones, and three of these residues reside in the histone fold domain (Ahmad and Henikoff 2002b
). These residues likely specify the mode of deposition of this variant, as discussed above.
| Can you alter a variant? Modifications of the variants |
|---|
|
|
|---|