Genes and Development

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


GENES & DEVELOPMENT 19:2783-2810, 2005
©2005 by Cold Spring Harbor Laboratory Press; ISSN 0890-9369/ $5.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Massagué, J.
Right arrow Articles by Wotton, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Massagué, J.
Right arrow Articles by Wotton, D.
Related Content
Right arrow Chromatin and Gene Expression
Right arrow Signal Transduction
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

REVIEW

Smad transcription factors

Joan Massagué1,4, Joan Seoane2 and David Wotton3

1 Cancer Biology and Genetics Program, Howard Hughes Medical Institute, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA; 2 ICREA, Medical Oncology Research Program, Vall d'Hebron University Hospital Research Institute, 08035 Barcelona, Spain; 3 Department of Biochemistry and Molecular Genetics, and Center for Cell Signaling, University of Virginia, Charlottesville, Virginia 22908, USA


    Abstract
 Top
 Abstract
 Basic features of the...
 Smad activation and deactivation
 Smad transcriptional complexes
 Target gene selection by...
 Roles of Smads in...
 Transcriptional deactivation and...
 Self-enabled gene responses
 Smad synexpression groups
 Constraining Smad...
 Roles of the inhibitory...
 Regulation of Smads by...
 Regulation of Smads by...
 Regulation of Smads by...
 Smads as nodes for...
 Perspectives
 Acknowledgments
 References
 
Smad transcription factors lie at the core of one of the most versatile cytokine signaling pathways in metazoan biology—the transforming growth factor-{beta} (TGF{beta}) pathway. Recent progress has shed light into the processes of Smad activation and deactivation, nucleocytoplasmic dynamics, and assembly of transcriptional complexes. A rich repertoire of regulatory devices exerts control over each step of the Smad pathway. This knowledge is enabling work on more complex questions about the organization, integration, and modulation of Smad-dependent transcriptional programs. We are beginning to uncover self-enabled gene response cascades, graded Smad response mechanisms, and Smad-dependent synexpression groups. Our growing understanding of TGF{beta} signaling through the Smad pathway provides general principles for how animal cells translate complex inputs into concrete behavior.

[Keywords: Smad; TGF-{beta}; signaling; transcription]


As evolution unfolded and multicellular life forms emerged, so did the need for tight control over the ability of individual cells to move, divide, differentiate, and organize. Intricate intercellular communication systems evolved to ensure the proper behavior of individual cells in the context of the whole organism. Among these forms of communication, one of the most prevalent involves secretory polypeptides that are recognized by membrane receptors coupled to transcriptional regulatory factors. With its 42 members in the human genome, seven in Drosophila, and four in Caenorhabditis elegans, the transforming growth factor-{beta} (TGF{beta}) family is one of the most prominent representatives of this class of molecules. TGF{beta} and its family members—the nodals, activins, bone morphogenetic proteins (BMPs), myostatins, anti-Muellerian hormone (AMH), and others—exert profound effects on cell division, differentiation, migration, adhesion, organization, and death. These factors may be produced by many cell types, as in the case of TGF{beta}, or very few, as in the case of myostatin, and they may be active from the earliest stages of embryo development through adulthood, as in the case of the BMPs, or for very limited periods during development, as in the case of AMH. On the whole, the TGF{beta} family provides a paradigm of functional versatility among hormonally active polypeptides.

The quest to understand how cells read these signals started as soon as these factors were isolated in the early 1980s and has continued unabated ever since. By now, we have obtained a fairly robust understanding of the biochemical backbone of the TGF{beta} signaling pathway, and a growing sense for how cells translate these signals into responses. At the core of this pathway lie the Smad transcription factors. TGF{beta} induces its membrane receptors to directly activate Smad proteins that then form transcriptional complexes to control target genes (Fig. 1). What is not so simple is how these complexes activate or repress hundreds of target genes at the same time, in the same cell, and under tightly controlled conditions. This complexity has been the subject of much research in recent years and is the focus of the present review. For more information on the biology of the TGF{beta} family, we would recommend several classical reviews (Letterio and Roberts 1998Go; Whitman 1998Go; Massagué and Chen 2000Go; Massagué et al. 2000Go; Derynck et al. 2001Go; Attisano and Wrana 2002Go; Schier 2003Go). Other recent reviews (Derynck and Zhang 2003Go; Siegel and Massagué 2003Go) and research articles (Foletta et al. 2003Go; Ozdamar et al. 2005Go) cover the emerging topic of Smad-independent pathways in TGF{beta} signaling.


    Basic features of the Smad proteins
 Top
 Abstract
 Basic features of the...
 Smad activation and deactivation
 Smad transcriptional complexes
 Target gene selection by...
 Roles of Smads in...
 Transcriptional deactivation and...
 Self-enabled gene responses
 Smad synexpression groups
 Constraining Smad...
 Roles of the inhibitory...
 Regulation of Smads by...
 Regulation of Smads by...
 Regulation of Smads by...
 Smads as nodes for...
 Perspectives
 Acknowledgments
 References
 
Eight Smad proteins are encoded in the human and mouse genomes, four in Drosophila, and three in C. elegans (Massagué 1998Go). Only five of the mammalian Smads—Smad1, Smad2, Smad3, Smad5, and Smad8—act as substrates for the TGF{beta} family of receptors; these are commonly referred to as receptor-regulated Smads, or RSmads (Fig. 2A). Smads 1, 5, and 8 serve principally as substrates for the BMP and anti-Muellerian receptors, and Smads 2 and 3 for the TGF{beta}, activin, and Nodal receptors. Smad4, also referred to as Co-Smad, serves as a common partner for all RSmads (Fig. 2B). Smad6 and Smad7 are inhibitory Smads that serve as decoys interfering with Smad–receptor or Smad–Smad interactions. The Mad protein in Drosophila, which was the first identified member of this family (Raftery and Sutherland 1999Go), is the ortholog of mammalian Smad1/5, whereas dSmad2, Medea, and Dad are the orthologs of Smad2/3, Smad4, and Smad6/7, respectively. The Smad family in C. elegans includes Sma1, Sma2, and Sma3 (Das et al. 1999Go). The name "Smad" was coined with the identification of human Smad1 in reference to its sequence similarity to the Sma and Mad proteins (Liu et al. 1996Go).



View larger version (67K):
[in this window]
[in a new window]
 
Figure 1. Functional and structural features of the TGF{beta}–Smad pathway. TGF{beta} family members are dimeric ligands that remain in latent form by binding to their propeptide, or in trapped form by binding to occluding factors. On release from these inactive states, the ligand binds to pairs of membrane receptor serine/threonine kinases (receptor types I and II), promoting the formation of a hetero-tetrameric receptor complex. The cytoplasmic region of the type I receptor contains a canonical protein kinase domain (purple) preceded by a regulatory region or GS domain (green) to which the inhibitor FKBP12 (red) binds to enforce the inactive basal state. Phosphorylation of the GS domain by the type II receptor creates a repeated pS–x–pS motif that serves as a docking site for receptor-regulated Smad proteins (RSmad). In the basal state, RSmads and Smad4 undergo a constant nucleocytoplasmic shuttling. Anchor proteins capture RSmads for presentation to the activated type I receptor. Receptor-mediated phosphorylation at the C terminus of RSmads creates a pS–x–pS motif (red P), allowing the accumulation of RSmad in the nucleus and the recognition of this motif by a basic pocket in Smad4. The resulting Smad complex incorporates different DNA-binding cofactors that confer target gene selectivity and influence the recruitment of either transcriptional coactivators or corepressors. Several hundred genes are regulated by TGF{beta} in this fashion, of which two-thirds are activated by the signal and the rest are repressed. RSmad dephosphorylation terminates the signaling cycle. (Crystal structures adapted from Shi et al. 1997Go; Huse et al. 1999Go, 2001Go; Wu et al. 2000Go, 2001bGo).

 
Smad proteins are ~500 amino acids in length and consist of two globular domains coupled by a linker region (Fig. 2A; Shi and Massagué 2003Go). The N-terminal domain, or "Mad-homology 1" (MH1) domain, is highly conserved in all RSmads and Smad4 but not in Smads 6 and 7. The linker region is quite divergent between the various subgroups, whereas the C-terminal or MH2 domain is conserved in all Smad proteins. Functional studies, together with the X-ray crystal structure analysis of the MH1 and MH2 domains, have provided critical insights into the principal interactions of Smad proteins with other proteins and with DNA. The MH1 domain is a DNA-binding module stabilized by a tightly bound zinc atom. The contact with DNA is primarily established by a {beta}-hairpin structure, which is conserved in all the RSmads and Smad4. Interestingly, the most abundant splice form of Smad2 contains an insert (encoded by exon 3) that blocks DNA binding. The MH1 domain is followed by the linker region, a flexible segment with binding sites for Smurf (Smad ubiquitination-related factor) ubiquitin ligases, phosphorylation sites for several classes of protein kinases, and, in Smad4, a nuclear export signal (NES).

The Smad MH2 domain is highly conserved and is one of the most versatile protein-interacting modules in signal transduction. RSmads have a conserved C-terminal motif, Ser–X–Ser, that is phosphorylated by the activated receptor. A pocket lined with basic residues interacts with the phosphorylated region of the activated receptor in the case of RSmads and with the phosphorylated tail of RSmads in the case of Smad4. A set of contiguous hydrophobic patches, referred to as the "hydrophobic corridor", on the surface of the MH2 domain mediates interactions with cytoplasmic retention proteins, with components of the nuclear pore complex (nucleoporins), and with DNA-binding cofactors. A region overlapping the linker and MH2 regions ("Smad4 activation domain", SAD) mediates interactions with transcriptional activators and repressors (Fig. 2A).


    Smad activation and deactivation
 Top
 Abstract
 Basic features of the...
 Smad activation and deactivation
 Smad transcriptional complexes
 Target gene selection by...
 Roles of Smads in...
 Transcriptional deactivation and...
 Self-enabled gene responses
 Smad synexpression groups
 Constraining Smad...
 Roles of the inhibitory...
 Regulation of Smads by...
 Regulation of Smads by...
 Regulation of Smads by...
 Smads as nodes for...
 Perspectives
 Acknowledgments
 References
 
The striking ability of Smad proteins to accumulate in the nucleus in response to TGF{beta} or BMP was one of the early key observations placing Smads downstream of the TGF{beta} receptors (Hoodless et al. 1996Go; Liu et al. 1996Go). Smad proteins undergo a constant process of nucleocytoplasmic shuttling, and their nuclear accumulation results from receptor-mediated phosphorylation events that decrease the affinity of RSmads for cytoplasmic anchors and increase their affinity for nuclear factors (Shi and Massagué 2003Go; Xu and Massagué 2004Go). Subsequent dephosphorylation of RSmads causes their return to the cytoplasm for another round of receptor-mediated phosphorylation and nuclear translocation (Inman et al. 2002Go). This ensures a constant sensing of the receptor activation state by the Smad pathway.



View larger version (39K):
[in this window]
[in a new window]
 
Figure 2. Smad proteins and their structural elements. (A) Smad proteins consist of two conserved globular domains—the MH1 and MH2 domains—and a variable linker region. The seven mammalian Smad proteins are listed under their corresponding classes. In RSmads and Smad4, the MH1 domain contains a {beta}-hairpin structure for binding to DNA. I-Smads lack MH1 domain. The linker region of RSmads contains multiple phosphorylation sites for MAPKs, CDKs, and other protein kinases. In RSmads and I-Smads, the linker region contains a PY motif for recognition by the WW domains in Smurf ubiquitin ligases. The linker region of Smad4 contains an NES. The MH2 domain contains a basic pocket for interaction with activated type I receptors in the case of the RSmads, and in both the RSmads and Smad4 for interaction with the pS–x–pS motif (red ball) of RSmads. On the surface of the MH2 domain in RSmads, a contiguous set of hydrophobic patches or hydrophobic corridor serves as a site for multiple interactions (see Fig. 3). (B) Some examples of mammalian TGF{beta} family members and their relationships with the five type II receptors, the seven type I receptors, the five RSmads, and the single Co-Smad. The schematic depicts a hetero-trimeric Smad complex consisting of two RSmad molecules and one Smad4 (only the MH2 domains are shown). RSmad–Smad4 heterodimers have also been described.

 

Smad nucleocytoplasmic dynamics

The mechanisms of Smad nuclear import and export have been extensively studied over the last few years, particularly in Smads 2, 3, and 4. Interestingly, the nuclear import of these proteins can occur without the intervention of nuclear transport factors (Xu et al. 2000Go, 2002Go). Many proteins undergoing nuclear translocation in response to regulatory signals rely on importins, a set of factors that mediate nuclear import by recognizing a motif called the nuclear localization signal (NLS) in the cargo proteins (Jans et al. 2000Go). The classic NLS motif consists of a cluster of basic residues and is recognized by importin-{alpha}. Importin-{alpha} binds to importin-{beta}, which directly interacts with the nuclear pore components—nucleoporins—to negotiate the passage of the importin-{beta}-importin-{alpha}-cargo complex into the nucleus (Mattaj and Englmeier 1998Go; Gorlich and Kutay 1999Go). However, the nuclear translocation of Smad proteins can occur independently of importins because Smad proteins can directly interact with nucleoporins (Xu et al. 2000Go, 2002Go). This interaction maps to the hydrophobic corridor in the Smad MH2 domain and to the FG repeat region on the nucleoporins Nup153 and Nup214 (Fig. 3; Xu et al. 2002Go). The FG repeat region is also the region for nucleoporin interaction with importins (Bayliss et al. 2000Go). Importin-independent transport has also been described for other signaling factors, including {beta}-catenin in the Wnt signaling pathway (Yokoya et al. 1999Go) and ERK2 MAP kinase in response to Ras-mediated mitogenic signals (Whitehurst et al. 2002Go).

Some Smad proteins may additionally undergo nuclear import via importins. A conserved lysine-rich sequence in the MH1 domain of Smads that resembles classical NLS motifs was hypothesized, and shown, to interact with importins (Xiao et al. 2000Go; Kurisaki et al. 2001Go). This conserved lysine-rich sequence forms an {alpha}-helix right next to the DNA-binding motif in the MH1 domain, not an extended loop as would be structurally required for recognition by importin-{alpha} (Shi et al. 1998Go; Chai et al. 2003Go). Indeed, Smad3 interacts with importin-{beta}, not importin-{alpha} (Xiao et al. 2000Go; Kurisaki et al. 2001Go). Importin binding has been detected with overexpressed Smad3 and Smad4 but not Smad2. This inability of Smad2 has been attributed to the presence of the exon 3-encoded insert in the MH1 domain. One study comparing the efficiency of Smad3 translocation by the importin-dependent and -independent mechanism indicated that the importin-dependent process is considerably weaker (Xu et al. 2003Go). On balance, it appears that Smads 2, 3, and 4 undergo nuclear import by means of direct interactions with nucleoporins, and this process may be aided by importin-{beta} in the case of Smads 3 and 4.

The intrinsic asymmetry of the nuclear pore complex, and the distribution of different nucleoporins along the span of the pore, are thought to allow proteins docking on one side of the pore to move unidirectionally to the other side and vice versa (Rout et al. 2000Go). Indeed, the direct interaction of Smads 2 and 3 with nucleoporins has been shown to enable nuclear export as well as import (Xu et al. 2002Go). The presence of an NES and the recognition of this sequence by the general nuclear export factor CRM1, which are necessary for nuclear export of other proteins (Fornerod et al. 1997Go), are not required for the export of Smads 2 and 3. Interestingly, these properties are also shared by {beta}-catenin (Yokoya et al. 1999Go), ERK2 (Whitehurst et al. 2002Go) and importin-{beta} (Fahrenkrog and Aebi 2003Go).



View larger version (53K):
[in this window]
[in a new window]
 
Figure 3. Smad interaction sites. Summary of representative interactions of RSmads and Smad4 with other proteins in the cytoplasm, nuclear pore, and nucleus. On a type I receptor kinase domain, the L45 loop specifies the receptor–RSmad interaction, the phosphorylated GS domain enables this interaction, and the catalytic domain phosphorylates the RSmad tail. On RSmad, the L3 loop (not shown) and the basic pocket (blue oval) match the receptor L45 loop and the GS domain. In the basal state, the MH1 and MH2 domains may interact in cis, and the MH2 domains of Smad4 in trans (dashed arrow), forming homotrimers. The phosphorylated tail of RSmads (red dot) is recognized by the basic pocket in Smad4. SARA, acting as a cytoplasmic anchor, and the nucleoporins Nup214 and Nup153, acting as mediators of nuclear import and export, interact with the hydrophobic corridor of RSmads. Smad4 import involved direct contacts with these nucleoporins, whereas export involves binding of CRM1 to the NES motif (orange mark) on the linker region of Smad4. In the cytoplasm or the nucleus, the linker region of RSmads is recognized by MAPKs, CDKs, and other kinases for phosphorylation and by Smurf ubiquitin ligases for polyubiquinitation. In the nucleus, different DNA-binding cofactors bind to RSmads in the hydrophobic corridor and others bind to the MH1 domain. Transcriptional coactivators such as p300 bind to the MH2 domain, in some cases with the assistance of adaptors such as MSG1. Corepressors may bind in competition with p300 (case of TGIF) or in competition with Smad4 (case of SnoN).

 
In the basal state, Smad4 is distributed throughout the cell and may also undergo a continuous shuttling between the cytoplasm and the nucleus (Fig. 3; Pierreux et al. 2000Go; Watanabe et al. 2000Go; Inman et al. 2002Go). Receptor-mediated phosphorylation at the C-terminal region of RSmad proteins creates a binding site for Smad4 (Wu et al. 2001bGo; Chacko et al. 2004Go). In principle, receptor-phosphorylated RSmads could associate with Smad4 in the cytoplasm and enter the nucleus as a RSmad–Smad4 complex, or this complex could form after RSmads enter the nucleus. The site of Smad complex formation in the cell has not been systematically investigated, and it may depend on the cell type and conditions. Regardless, Smad4 is concentrated in the nucleus as a result of this interaction.

Unlike the nuclear export of Smads 2 and 3, the export of Smad4 is dependent on, or at least enhanced by, CRM1. Smad4 contains a leucine-rich NES in the linker region that is recognized by CRM1 (Figs. 2A, 3; Pierreux et al. 2000Go). The CRM1 inhibitor leptomycin B blocks the nuclear export of Smad4 without interfering with nuclear export of Smads 2 or 3 (Inman et al. 2002Go; Xu et al. 2002Go). The interaction with receptor-phosphorylated Smads 2 and 3 is thought to mask the NES in Smad4, protecting the RSmad–Smad4 complexes from recognition by CRM1 and nuclear export (Watanabe et al. 2000Go; Inman et al. 2002Go). When RSmads are dephosphorylated and the complex dissociates, the Smad4 NES becomes exposed again and nuclear export of Smad4 by CRM1 can proceed. Little is known about other inputs that may regulate the exposure of the Smad4 NES. However, an alternatively spliced mammalian Smad4 isoform (Pierreux et al. 2000Go) and a separate gene product, Smad4{beta}, in Xenopus (Watanabe et al. 2000Go) lack the NES and are constitutively located in the nucleus.


Smad subcellular retention mechanisms

Despite the intrinsic ability of RSmads to shuttle in and out the nucleus, many immunocytochemical studies have shown that in the basal steady state, RSmads are predominantly concentrated in the cytoplasm (Pierreux et al. 2000Go; Watanabe et al. 2000Go; Xu et al. 2002Go). This is thought to result from the action of cytoplasmic Smad-binding factors. Indeed, endogenous Smad proteins in the basal state are found in high-molecular mass oligomeric complexes, as determined by molecular-sizing chromatography (Jayaraman and Massagué 2000Go).

Different cytosolic proteins may function as Smad anchors and adaptors. The best characterized of the cytosolic retention factors for Smad2 and Smad 3 is the protein SARA (smad anchor for receptor activation) (Tsukazaki et al. 1998Go). SARA is a multidomain protein that contains an 80-amino-acid Smad-binding domain (SBD) and a FYVE phosholipid-binding domain that avidly binds to phosphatidyl inositol 3' phosphate on endosomal membranes. FYVE targets SARA preferentially to early endosomes (Di Guglielmo et al. 2003Go). As revealed by the X-ray crystal structure of the Smad2–SBD complex, the SBD of SARA makes contact with the three consecutive hydrophobic patches on the MH2 domain surface that constitute the hydrophobic corridor (Figs. 2, 3; Wu et al. 2000Go). This region is also involved in the interaction with the FG-repeat-containing domain nucleoporins and the binding of DNA-binding cofactors (Wu et al. 2000Go; Randall et al. 2002Go). Therefore, the interaction with SARA is incompatible with translocation of Smad2/3 into the nucleus and formation of transcriptional complexes.

Receptor-mediated phosphorylation causes a decrease in the affinity of Smad2 for SARA (Wu et al. 2001bGo). This occurs without a concomitant increase in Smad affinity for nucleoporins. In vivo, however, this drop in affinity for SARA is sufficient for Smad2/3 dissociation and movement into the nucleus, where Smad2/3 are retained by interactions with Smad4 and additional proteins, as well as with the DNA. Dephosphorylation and dissociation of Smad transcriptional complexes are thought to end this retention, allowing the export of RSmad out of the nucleus (Inman et al. 2002Go).

The hydrophobic corridor and adjacent regions are involved in binding contacts not only with SARA and nucleoporins, but also with DNA-binding partners in the nucleus (Fig. 3; Randall et al. 2002Go). Two such partners of Smad2/3, the forkhead family member FoxH1 and the homeodomain transcription factor Mixer, share a conserved Smad-interacting motif (SIM) that has sequence similarity with the SBD of SARA (Germain et al. 2000Go; Wu et al. 2000Go). Interestingly, the SIM is shorter than the SBD and, contrary to the SBD, it binds to Smad without occluding the Smad–Smad interface (Wu et al. 2001bGo). Thus SIM-containing factors can bind to Smad2/3–Smad4 oligomers, whereas SARA SBD interacts with monomeric Smad2/3.

As the same Smad protein surface interacts with cytoplasmic retention factors, nuclear pore components, and nuclear-interacting factors, this surface creates mutually exclusive interactions—and hence, competition—between Smad-binding components (Fig. 3). A soluble SBD peptide has been used as an effective inhibitor of Smad2/3 nuclear translocation (Xu et al. 2000Go, 2003Go). Nup214 and Nup153 compete with SARA for Smad2 in binding assays (Xu et al. 2002Go). Overexpression of FoxH1 causes the concentration of Smad2/3 in the nucleus even in the absence of TGF{beta}, providing evidence for the constant nucleocytoplasmic traffic of Smads (Xu et al. 2002Go). This FoxH1–Smad2/3 interaction is transcriptionally inert because, without receptor mediated phosphorylation, Smad2/3 in this complex cannot recruit Smad4.


Smad adaptors for receptor interaction

Several adaptor proteins that facilitate the Smad–receptor interaction have been described, of which SARA is the most extensively characterized. By regionally restricting Smad2/3 proteins to the plasma membrane and early endosomes, SARA facilitates the interaction of Smads 2 and 3 with the activated TGF{beta} receptor (Tsukazaki et al. 1998Go). The activated receptor complex is internalized via clathrin-coated pits from the plasma membrane to the early endosomes, where SARA-bound Smads are thought to be most readily accessible (Di Guglielmo et al. 2003Go). However, this interaction also appears to occur at the plasma membrane prior to receptor internalization (Di Guglielmo et al. 2003Go). Another FYVE domain protein, Hgs, has been found to cooperate with SARA on Smad phosphorylation (Miura et al. 2000Go). Activated TGF{beta} receptor complexes are also internalized via lipid rafts and caveolae, but this route is thought to lead to receptor interaction with the E3 ubiquitin ligase Smurf2 that targets the receptor for inactivation (Di Guglielmo et al. 2003Go). However, genetic evidence for the necessity for SARA in Smad2/3 signaling is lacking, and so it remains to be established whether SARA is in fact a major player in this pathway.

Other proteins that have been proposed to participate in the interaction of Smad2/3 with the TGF{beta} receptor, including Disabled-2 (Hocevar et al. 2001Go), Dok-1 (Yamakawa et al. 2002Go), Axin (Furuhashi et al. 2001Go), the ELF {beta}-spectrin (Tang et al. 2003Go), and a cytoplasmic isoform (cPML) of the promyelocytic leukemia protein (Lin et al. 2004Go), are interesting members of this group. cPML has been proposed to interact with SARA, Smad2/3, and the TGF{beta} receptor and to be critical for TGF{beta} phosphorylation of Smad2/3 and TGF{beta} signaling (Lin et al. 2004Go). However, PML-deficient mice develop fairly normally (Wang et al. 1998Go), whereas embryos deficient in TGF{beta} receptors or Smad2 do not (Oshima et al. 1996Go; Waldrip et al. 1998Go; Heyer et al. 1999Go). The proteins TRAP-1 (TGF{beta}-receptor-associated protein) (Wurthner et al. 2001Go) and TLP (TRAP-1-like protein) (Felici et al. 2003Go) have been described as adaptor proteins that interact with the receptor complex and facilitate the formation of Smad2/3–Smad4 complexes. Many of these interactions require a more thorough examination before their significance as Smad adaptors can be established. Furthermore, little is known about factors that facilitate RSmad–receptor interactions in the BMP pathway. Smad1 does not interact with SARA, but given the overall similarity between these pathways, the existence of a SARA-like factor for Smads 1, 5, and 8 remains possible.


Smad phosphorylation by TGF{beta} receptor kinases

Smad phosphorylation by the activated TGF{beta} receptor complex is a pivotal event in the initiation of TGF{beta} signal transduction. The mechanism of TGF{beta} receptor activation was established by combining biochemical and genetic approaches (Wrana et al. 1994Go). This process and its structural basis have been reviewed in detail elsewhere (Shi and Massagué 2003Go). Briefly, TGF{beta} binds to pairs of receptor serine/threonine kinases, known as the TGF{beta} type I (T{beta}R-I) and type II (T{beta}R-II) receptors, forming a hetero-tetrameric receptor complex. In this complex, T{beta}R-II phosphorylates a serine/threonine-rich region, called the GS region, that is located N-terminal to the canonical kinase domain of T{beta}R-I (Fig. 1). In the absence of ligand, the small proteins FKBP12 and FKBP12.6 bind to the GS region and occlude the phosphorylation sites in this region (Y.G. Chen et al. 1997Go; Datta et al. 1998Go). The X-ray crystal structure of the FKBP12–T{beta}R-I cytoplasmic domain complex revealed that the bound FKBP12 additionally enforces a catalytically inactive conformation of the T{beta}R-I kinase domain by pressing against the active center and causing a mis-alignment of the critical catalytic amino acid residues (Huse et al. 1999Go). Thus, in the basal state, the GS region acts as a docking site for FKBP12 and an auto-inhibitory element of the receptor kinase (Fig. 1).

T{beta}R-II appears to be a constitutively active kinase (Wrana et al. 1994Go). Ligand access to the receptors is a tightly regulated process, with numerous proteins acting as ligand traps that bind various members of the TGF{beta} family and prevent their contact with cell surface receptors (Fig. 1; for reviews, see Massagué and Chen 2000Go; Shi and Massagué 2003Go). In the ligand-induced receptor complex, T{beta}R-II gains access to the GS region of T{beta}R-I, catalyzing the phosphorylation of alternating serine (or threonine) residues in the sequence Thr–Thr–Ser–Gly–Ser–Gly–Ser (Wrana et al. 1994Go; Massagué 1998Go). This interaction is negatively regulated by the pseudoreceptor BAMBI, which intercalates in the receptor complex (Onichtchouk et al. 1999Go). Phosphorylation of T{beta}R-I turns the GS region from a FKBP12-binding site to a binding site for Smad2/3 (Huse et al. 2001Go), providing a case of remarkable economy of function within a kinase structural element. Smad2/3 are then phosphorylated by T{beta}R-I and released to propagate the signal.

This general mechanism applies to all TGF{beta} and BMP family receptors characterized to date, from human through Drosophila (Shi and Massagué 2003Go). The RSmad substrate specificity of a given receptor complex is determined by a particular region, the L45 loop, in the kinase domain of the type I receptor and a complementary region, the L3 loop, on the MH2 domain of RSmads (Fig. 3). T{beta}R-I (also known as ALK5) and the nodal/activin type I receptors ALK2 and ALK7 recognize Smads 2 and 3, whereas ALK1, ALK3, and ALK6 recognize Smads 1, 5, and 8. Thus, the phosphorylated GS region drives receptor interaction with RSmads, whereas the L45 loop determines the specificity of this interaction.

Receptor-mediated phosphorylation occurs at two serine residues in the extreme C-terminal sequence Ser–Val–Ser (Ser–Met–Ser in Smad2) of RSmads. pSer–X–pSer is the paradigmatic activation motif of the TGF-{beta}/Smad pathway (Fig. 1). This motif occurs both in the GS region of type I receptors upon phosphorylation by type II receptors and in the C-terminal tail of RSmads upon phosphorylation by type I receptors. Together with the C-terminal carboxyl group, this di-phosphoserine moiety constitutes an acidic tail that binds to a basic pocket in the Smad4 MH2 domain (Fig. 1; Wu et al. 2001bGo; Chacko et al. 2004Go). As a result, RSmad–Smad4 oligomers are formed that nucleate a large number of transcriptional regulatory complexes.

In solution, the unphosphorylated Smad2 MH2 domain is a monomer but the phosphorylated form is a homotrimer (Wu et al. 2001bGo), and Smad4 also forms homotrimers (Shi et al. 1997Go). Smad2/3 mixed oligomers (presumably trimers) have been observed in cells on TGF-{beta} stimulation (Wu et al. 2001bGo; Chacko et al. 2004Go), but their function is unknown. More significant from the standpoint of signal transduction are the RSmad–Smad4 oligomers, both heterodimers (RSmad–Smad4) or heterotrimers (two RSmads molecules plus one Smad4 molecule) (Figs. 1, 2B). As revealed by X-ray crystallographic studies, these oligomers are stabilized by interactions within an extensive protein–protein interface between MH2 domains plus the binding of the pSer–X–pSer motif of one MH2 domain into the di-phosphoserine-binding pocket on the adjacent MH2 domain (Wu et al. 2001bGo). Mutation of these serine residues into aspartic acid mimics this interaction. Several Smad-inactivating mutations found in tumors map to the MH2 domain interface and inhibit Smad oligomer formation (Wu et al. 2001bGo).


Smad dephosphorylation

A steady level of Smad2/3 phosphorylation is achieved in cells within 15–30 min of exposure to of TGF{beta} addition and can last for several hours. Eventually, a drop in TGF{beta} levels in the extracellular space, receptor inactivation by internalization and degradation, or the action of negative feedback mechanisms leads to a loss of Smad phosphorylation.

Recent evidence suggests that sustained receptor activity is required for the maintenance of the steady-state phospho-Smad levels over the duration of the TGF{beta} stimulation. The existence of a rapid cycle of dephosphorylation and return to the cytoplasm has been inferred from results using a TGF{beta} receptor kinase inhibitor (Inman et al. 2002Go). Interrupting signaling by addition of this inhibitor causes a precipitous loss of phospho-Smad2, with a half-life of <30 min (Inman et al. 2002Go). This loss is primarily caused by dephosphorylation (Fig. 1; Inman et al. 2002Go), although ubiquitination and proteasome-mediated degradation of Smad2 in the nucleus may also eventually occur (Lo and Massagué 1999Go). RSmad dephosphorylation seems to occur in the nucleus, as suggested by the underphosphorylated state of Smad2 protein exported from the nuclei of permeabilized TGF{beta}-treated cells (Xu et al. 2002Go). Dephosphorylation is accompanied with dissociation of the RSmad–Smad4 complex and export of its components to the cytoplasm. Thus, RSmads undergo repeated cycles of receptor-mediated phosphorylation and re-entry into the nucleus, as long as TGF{beta} receptors remain active (Fig. 1). Smad signaling activity becomes thereby tied to receptor activation. The identity of the RSmad phosphatase(s) has not been revealed, but their identification will be of great interest because the action of this enzyme(s) causes the termination of Smad signal transduction.


    Smad transcriptional complexes
 Top
 Abstract
 Basic features of the...
 Smad activation and deactivation
 Smad transcriptional complexes
 Target gene selection by...
 Roles of Smads in...
 Transcriptional deactivation and...
 Self-enabled gene responses
 Smad synexpression groups
 Constraining Smad...
 Roles of the inhibitory...
 Regulation of Smads by...
 Regulation of Smads by...
 Regulation of Smads by...
 Smads as nodes for...
 Perspectives
 Acknowledgments
 References
 
The RSmad–Smad4 complex formed upon receptor-mediated phosphorylation of RSmad acts as the core of many different multiprotein assemblies that target different genes for either activation or repression. The Smad proteins in this complex, via their MH1 domain, can bind to DNA. Almost universally, however, Smads must interact with other DNA-binding cofactors to achieve high affinity and selectivity for specific subsets of target genes. Understanding how the MH1 domain recognizes DNA and how a RSmad–Smad4 complex engages different DNA-binding partners is essential for delineating Smad transcriptional programs.


DNA-binding determinants in the MH1 domain: {beta}-hairpin and exon 3 insert

Smad4 and all RSmads (except the long form of Smad2, see below) can directly bind to DNA. The X-ray crystal structure of the Smad3 MH1 domain bound to the SBE shows that a {beta}-hairpin (i.e., two anti-parallel short {beta}-strands separated by a linker loop) in the MH1 domain mediates this binding interaction. The {beta}-hairpin is inserted in the major groove of the DNA and establishes hydrogen bonds with nucleotides in three base pairs of the Smad-binding element (Shi et al. 1998Go). The {beta}-hairpin sequence is conserved in all RSmads and Smad4 (Shi et al. 1998Go), arguing that this Smad–DNA contact cannot provide much selectivity in the interaction of different Smads with their corresponding target genes.

Interestingly, the most abundant splice variant of Smad2 in vertebrates contains an insert in the MH1 domain, in the vicinity of the {beta}-hairpin, that prevents Smad2 binding to DNA (Shi et al. 1998Go). This insert is codified by exon 3 of Smad2. The predicted sequence of the Smad2 ortholog in Drosophila, dSmad2, also contains an insert in this position (Brummel et al. 1999Go). A splice variant of Smad2 lacking the exon 3 insert is able to bind DNA (Yagi et al. 1999Go). Because the splice form containing the insert is the most abundant and was cloned first, this variant is called Smad2, whereas the version lacking the insert, and thus most closely resembling the other RSmads, is stuck with the name Smad2{Delta}E3.

The endogenous Smad2 and Smad2{Delta}E3 proteins are coexpressed in various cell types at a ratio ranging from 3:1 to 10:1 (Dunn et al. 2005Go). Smad2-null mice show an early embryonic lethal phenotype because Smad2 plays an essential role in patterning the embryonic axis and specifying the endoderm (Waldrip et al. 1998Go; Heyer et al. 1999Go). Mice engineered to exclusively express Smad2{Delta}E3 from the Smad2 locus are viable and fertile, indicating that full-length Smad2 with the insert is not required for viability (Dunn et al. 2005Go).

The role of the insert and its inhibitory effect on DNA binding remain a mystery. Smad2 might act as a competitive inhibitor of Smad2{Delta}E3 and Smad3 in transcriptional complexes. However, this scenario has not been borne out by the evidence. Smad2 was originally identified for its ability to act as a mediator and not an inhibitor of activin/TGF{beta}-like transcriptional responses. Smad2 can associate with Smad4 and an additional DNA-binding cofactor to form transcriptional complexes. This was shown by the ability of the complex FoxH1–Smad2–Smad4 to mediate activation of the Mix2 promoter (X. Chen et al. 1997Go; Liu et al. 1997Go). Still, it is possible that Smad2 may act as an effector of some gene responses and as an inhibitor of others. Furthermore, the exon 3-encoded insert may have functions not related to transcriptional regulation. Of interest in this regard, Smad2{Delta}E3, like Smad3, forms high-molecular mass oligomeric complexes in the basal state, whereas Smad2 is essentially monomeric under these conditions (Jayaraman and Massagué 2000Go).


Smad-binding elements

Using a bound-oligonucleotide selection strategy, the binding specificity of recombinant Smad proteins was originally defined as 5'-GTCTAGAC-3' (Zawel et al. 1998Go) and later shown to be 5'-GTCT-3', or its complement 5'-AGAC-3', called the Smad-binding element (SBE). Many Smad-responsive promoter regions contain one or more SBEs, which in many instances contain an extra base, as 5'-CAGAC-3'. The crystal structure of the MH1–SBE complex shows that Smads recognize the 5'-GTCT-3' sequence through the {beta}-hairpin in the MH1 domain (Shi et al. 1998Go).

The affinity of Smad proteins for the SBE is too low to support binding of a Smad complex to a single SBE in vivo (Shi et al. 1998Go). Sufficient binding affinity for transcriptional activation can be artificially achieved with concatemers of multiple SBEs (Zawel et al. 1998Go). As activated Smad complexes consist of Smad oligomers, the presence of multiple SBEs likely enables tight binding through cooperative interactions between multiple MH1 domain–SBE contacts by the same Smad complex. However, natural Smad target promoters seldom contain SBE concatemers, and those that contain up to four SBEs still require cooperating factors for effective DNA binding (Seoane et al. 2004Go).

High-affinity binding of the Smad complex is thought to occur through the incorporation of a different DNA-binding cofactor into the RSmad–Smad4 complex (Fig. 1). This allows the recognition of promoter regions that present one SBE in the vicinity of the cognate sequence for that particular cofactor. This mode of interaction provides a basis for high affinity and selectivity in the recognition of target genes and a venue for the differential action of TGF{beta} in different cell types (Massagué and Wotton 2000Go). As discussed below, the interaction of Smads with other transcription factors to generate target gene-specific transcriptional complexes is crucial for the pleiotropic nature of TGF{beta} signaling. However, no crystal structure of a natural Smad complex bound to its cognate DNA region has been described to date, leaving us to wonder how such interactions may actually take place.


SBE variants and GC-rich elements

Some TGF{beta} responsive regions lack a canonical SBE but contain sequences that may function as such because of a certain degree of tolerance in the MH1–SBE interaction. Based on the crystal structure of this complex, the second nucleotide in the 5'-GTCT-3' sequence does not support interactions with the MH1 domain, thus allowing substitutions that do not impair Smad binding (Shi et al. 1998Go). Such "degenerate" SBEs have been proposed in the TGF{beta} inhibitory element (TIE) in the c-Myc promoter (Chen et al. 2002Go). The TIE does not contain a canonical SBE but binds a Smad3–Smad4 complex, as verified by chromatin immunoprecipitation assays (Chen et al. 2002Go). Scrutiny of this region in vitro and in cells has demonstrated that the sequence 5'-GGCTT-3' is contacted by Smad3 or Smad4 (Chen et al. 2002Go; Frederick et al. 2004Go). Either way, the Smad–DNA interaction in the c-Myc TIE is buttressed by E2F4/5 and DP1 as DNA-binding cofactors in the c-Myc inhibitory complex (Chen et al. 2002Go; Frederick et al. 2004Go).

In certain promoters, Smad complexes recognize GC-rich regions. Mad (Drosophila Smad1) and Medea (Drosophila Smad4) were shown to interact with a GCCGnCGC sequence in the promoters of Vestigial (Kim et al. 1997Go) and Tinman (Xu et al. 1998Go). Interestingly, the murine counterpart of Tinman, Nkx2.5, contains a canonical SBE that interacts with the Smad complex (Lien et al. 2002Go; Brown et al. 2004Go). The BMP-responsive element (BRE) in the promoter of Smad6 contains four overlapping copies of a GC-rich motif. Direct binding of Smads to GC-rich sequences has been demonstrated in oligonucleotide binding assays using extracts from cells that overexpress Smad5 and Smad4 (Ishida et al. 2000Go). Based on these observations, GC-rich motifs are sometimes referred to as "Smad1-binding elements". However, it would be erroneous to imply that Smad1/5/8 bind GC-rich motifs, whereas Smad2/3 bind to SBEs. The first Smad1-binding element defined in vertebrates, in the BMP-responsive region of the Xenopus Vent-2 promoter, is a canonical SBE (Hata et al. 2000Go). Furthermore, binding of Smad1–Smad4 complexes and Smad3–Smad4 complexes to the Id1 promoter requires both SBE and GC-rich elements. Both elements are required for BMP-mediated induction of Id1 (Korchynskyi and ten Dijke 2002Go; Lopez-Rovira et al. 2002Go; Kang et al. 2003Go) and TGF{beta}-mediated repression of Id1, the difference being determined by the recruitment of an additional factor, ATF3 (Kang et al. 2003Go). No crystal structure of a Smad molecule bound to a GC region has been reported, but the dual interaction with SBE and GC-rich motifs invites the speculation that the MH1 domain might actually contain separate DNA-binding sites for these two different sequences.


Core Smad complexes: dimers and trimers

Transcriptionally active Smad complexes are mediated by RSmad–Smad4 oligomers. The stoichiometry of such oligomers has been the subject of debate. The analysis of complexes formed with Smads containing acidic residue substitutions for the C-terminal serine residues suggested that they form heterotrimers of two phospho-RSmad molecules and one Smad4 molecule (Chacko et al. 2001Go; Qin et al. 2001Go). Moreover, phospho-Smad2 and Smad4 each in isolation form homotrimers in solution (Shi et al. 1997Go; Wu et al. 2001bGo). These results indicated that functional RSmad–Smad4 complexes are probably heterotrimers (Chacko et al. 2001Go). However, the crystal structure of isolated MH2 domains of phospho-Smad2 supported as well a heterodimer model between Smad4 and the RSmad (Wu et al. 2001aGo,bGo). Recently, the X-ray crystal structure analysis of the complexes formed by the MH2 domains of phospho-Smad2–Smad4 and phospho-Smad3–Smad4, digested with chymotrypsin, shows that both complexes are heterotrimers comprising two phosphorylated RSmad subunits and one Smad4 (Chacko et al. 2004Go).

Studies on endogenous Smad transcriptional complexes in vivo have suggested a more complex scenario. Both heterodimers and heterotrimers may be formed depending on the target gene and other factors present in the complex (Inman and Hill 2002Go). For example, the Mix2 promoter may be targeted by a heterotrimer Smad2–Smad2–Smad4 bound to FoxH1, whereas a Smad3–Smad4 heterodimer and an as yet unknown cofactor may be involved in targeting the JunB promoter (Inman and Hill 2002Go). The physical interaction of the Smad cofactors with the RSmad–Smad4 complexes might facilitate the formation of a heterodimer or a heterotrimer.

The regulation of c-Myc, Id1, p21Cip1, and other genes by TGF{beta} involves Smad3 preferentially over Smad2. Smad3 has a higher affinity than Smad2 for the cofactors—E2F4/5, ATF3, and FoxO, respectively—that target Smads to these gene promoters (Chen et al. 2002Go; Kang et al. 2003Go; Seoane et al. 2004Go). The role of Smad2{Delta}E3 in these complexes has not been characterized. Therefore, the choice between Smad heterodimers and heterotrimers might depend on several variables, including the balance between Smad2 and Smad2{Delta}E3, the other DNA-binding partners present in the complex, and the number of SBEs present in the target promoter.


Smad4 requirement

Whether Smad4 is always required in Smad transcriptional complexes remains an important yet elusive question. All endogenous Smad complexes described to date have been shown to contain Smad4. All Smad target genes characterized by chromatin immunoprecipitation showed Smad4 binding along with RSmads. Nevertheless, Smad4-deficient tumor cells and fibroblasts from Smad4-deficient mice still display some gene responses to TGF{beta} (Sirard et al. 1998Go; Wisotzkey et al. 1998Go; Subramanian et al. 2004Go). Certain pancreatic carcinoma cells that lack Smad4 contain high levels of phosphorylated RSmads and respond to TGF{beta} receptor signaling with increased motility (Subramanian et al. 2004Go). TGF{beta} receptors could signal some of these responses via MAP kinases, phosphatidyl inositol 3'-kinase, PP2A protein phosphatases, or Rho family members (for review, see Derynck and Zhang 2003Go; Siegel and Massagué 2003Go), or as a result of effects on tight junction membrane components (Ozdamar et al. 2005Go). However, TGF{beta} signaling via RSmads independently of Smad4 remains a possibility, as suggested by the more severe phenotype of mad mutants compared with medea mutants in Drosophila (Wisotzkey et al. 1998Go) and Smad2-/- mice compared with Smad4-/- mice (Sirard et al. 1998Go; Chu et al. 2004Go). Phospho-RSmads might yet be shown to interact with factors distinct from, and in competition with, Smad4.


    Target gene selection by association with DNA-binding partners
 Top
 Abstract
 Basic features of the...
 Smad activation and deactivation
 Smad transcriptional complexes
 Target gene selection by...
 Roles of Smads in...
 Transcriptional deactivation and...
 Self-enabled gene responses
 Smad synexpression groups
 Constraining Smad...
 Roles of the inhibitory...
 Regulation of Smads by...
 Regulation of Smads by...
 Regulation of Smads by...
 Smads as nodes for...
 Perspectives
 Acknowledgments
 References
 
Activated RSmad–Smad4 complexes achieve high affinity and selectivity for target gene promoters by associating with diverse DNA-binding factors (Fig. 4A,B). The DNA-binding activities of the various components in the resulting complex cooperatively generate a high-affinity binding interaction with target promoters that contain the cognate sequences. This interaction demands that the suitable DNA elements be present at the correct distance and orientation on the target promoter, so the interaction is highly selective. Among several hundred TGF{beta} target genes in a given cell, only a few may be targeted by a particular RSmad–Smad4–partner combination. A given cell type that does not express a particular Smad partner will be unable to mount the TGF{beta} gene response(s) that depend on that particular RSmad complex combination. This would make a gene response to TGF{beta} cell type dependent. Some DNA-binding partners can pair with Smad2/3 only and others with Smad1/5/8 only, thus establishing pathway specificity for TGF{beta} and BMP gene responses. Furthermore, in some cases Smad partners also determine the sign of the effect— activation or repression—that is exerted on a target gene. Smad partners therefore provide four levels of specificity: (1) target gene specificity, (2) pathway specificity, (3) cell type specificity, and (4) specific transcriptional effect.



View larger version (43K):
[in this window]
[in a new window]
 
Figure 4. Models of transcriptional regulation by Smads. (A) RSmad–Smad4 complexes, in combination with different DNA-binding cofactors, form assemblies that target specific genes. FoxH, FoxO, and Mixer are examples of DNA-binding cofactors for Smad2/3–Smad4, and OAZ for Smad1/5–Smad4. These complexes may recruit coactivators (e.g., p300, CBP, P/CAF) with the assistance of adaptor proteins (e.g., MSG1, SMIF). (B) RSmad–Smad4 complexes may also recruit corepressors as a function of the associated DNA-binding cofactor. Smad1–Smad4 with Nkx3.2 recruits mSin3/HDAC1. Smad3–Smad4 with E2F4/5 recruits p107 and with Runx2 recruits HDAC4. The color coding indicates functionally linked components. (C) In a growing number of cases, Smad-mediated transcriptional regulation appears to involve a self-enabling mechanism whereby Smads induces a gene response that then enables another Smad-dependent gene response. (1) In the Xenopus embryo, Mixer with Smad2–Smad4 induces expression of Mix2, which then mediates the induction of other genes with Smad2–Smad4. Likewise, OAZ with Smad1–Smad4 induces expression of Vent2, which then apparently mediates the induction of other genes with Smad1–Smad4. (2) In human keratinocytes, Smad2/3–Smad4 with unknown cofactors induces expression of ATF3, which in complex with Smad3–Smad4 then mediates the repression of ID1. (3) By down-regulating the expression of c-MYC in epithelial cells, an E2F4/5–Smad3–Smad4 complex removes Myc-mediated repression from p21CIP1, thus enabling the activation of this gene by FoxO–Smad3–Smad4 complexes. (4) In the Drosophila embryo, repression of Brinker by a Schnurri–Mad–Medea complex facilitates the displacement of Brinker from the Ubx promoter by a competing Mad–Medea (with an unknown cofactor) complex.

 
Many Smad cofactors have been identified that fulfil this paradigm for different Smad target genes. Remarkably, these cofactors belong to different families of DNA-binding proteins, providing a basis for the breadth of TGF{beta} transcriptional responses. It should be noted that only a few of these interactions have been validated to date by analysis of endogenous (as opposed to overexpressed) proteins, chromatin immunoprecipitation analysis of protein binding to DNA in vivo, and genetic depletion of Smad partners. Whenever possible, we will focus here on examples that meet these stringent criteria.

The first identified Smad-interacting transcription factor was the forkhead family member, FoxH1 (previously FAST1) (Chen et al. 1996Go). The FoxH1–Smad2–Smad4 complex binds an activin-responsive element on the promoter region of Mix2 in response to activin/nodal-like signals during mesoderm specification in Xenopus. FoxH1 is essential for the binding of this complex to the Mix2 promoter (X. Chen et al. 1997Go; Liu et al. 1997Go). FoxH1 specifically interacts with the MH2 domain of Smads 2 and 3. This interaction is mediated by two separate sequences on FoxH1: a proline-rich sequence called the SIM and a FoxH1-specific motif (Germain et al. 2000Go). Three other forkhead family members, FoxO1, FoxO3, and FoxO4, serve as Smad3 transcriptional partners in the activation of the cyclin-dependent kinase inhibitor p21Cip1 or CDKN1A (Seoane et al. 2004Go). In this case, however, the interaction is mediated by the DNA-binding domain ("forkhead" or "Fox-box" domain) of the FoxO proteins and the MH1 domain of Smad3.

A region structurally and functionally similar to the FoxH1 SIM is also present in Mixer and Milk. However, these proteins are members of the Mix family of homeodomain transcription factors. Mixer and Milk serve as Smad partners in the regulation of Xenopus goosecoid (Germain et al. 2000Go). Interestingly, it has been suggested that the contact with the goosecoid promoter is established by Mixer or Milk with Smads tethered to these proteins without directly contacting the DNA (Germain et al. 2000Go).

Activation of Vent2 during ventral mesoderm differentiation in Xenopus is mediated by an OAZ–Smad1–Smad4 complex in response to BMP (Hata et al. 2000Go). OAZ belongs to yet another family of DNA-binding proteins, the zinc-finger protein family. OAZ contains 30 zinc-finger motifs, some of which mediate binding to Smad1 and others mediate binding to a Vent2 promoter sequence (Hata et al. 2000Go). Another BMP target gene activated by the Smad–OAZ complex is Xretpos, a novel family of long terminal repeat (LTR)-retrotransposons in Xenopus, whose transcript is restricted to ventro–posterior-specific regions (Shim et al. 2002Go). However, the activation of the homeobox gene, Tlx2, by BMP in the mouse embryo during gastrulation and in the embryonic carcinoma p19 cell line (Macias-Silva et al. 1998Go; Tang et al. 1998Go) is not mediated by Smad–OAZ (Hata et al. 2000Go). In this case, the TRAF6-interacting protein Ecsit, which is also implicated in the Toll signaling pathway, helps the Smad1–Smad4 complex recognize Tlx2 for activation (Xiao et al. 2003Go). This provides yet another example of the level of selectivity that has evolved to ensure proper control of target gene selection in Smad transcriptional programs.

The list of DNA-binding partners is rapidly growing. The RUNX family of DNA-binding factors contributes three members to this list: Runx1 (also known as AML1/CBFA2/PEBP2{alpha}B, Runx2 (AML3/CBFA1/PEBP2{alpha}A), and Runx3 (AML2/CBFA3/PEBP2{alpha}C) (Hanai et al. 1999Go; E. Pardali et al. 2000Go; Zhang and Derynck 2000Go; Alliston et al. 2001Go). TGF{beta}-activated Runx–Smad2/3–Smad4 complexes target the IgC{alpha} promoter in B cells (Hanai et al. 1999Go; E. Pardali et al. 2000Go; Zhang and Derynck 2000Go) and the osteocalcin promoter in osteoblasts (Alliston et al. 2001Go). AP1 transcription factors interact with Smads in the regulation of c-jun (Zhang et al. 1998Go), collagenase-I/MMP1 (Wong et al. 1999Go), and interleukin-11 (Y. Kang et al. 2005Go). While AP1 and Smad complexes have been shown to bind to adjacent sites on the collagenase-I/MMP1 and interleukin-11 promoters, they can act on the c-jun promoter by a long-range synergy from distant promoter sites (Wong et al. 1999Go). Thus, in the c-jun response, AP1 and Smads may not establish the kind of physical partnership that is characteristic of other Smad–partner interactions. The E2F family members E2F4 and E2F5 act as partners of Smad3–Smad4 in repression of c-Myc by TGF{beta} in epithelial cells (Chen et al. 2002Go) and the ATF/CREB family member ATF3 in the repression of Id1 (Kang et al. 2003Go).

p53 has been proposed to interact with Smads and to be required for many Smad gene responses, although no direct physical interaction has been demonstrated (Cordenonsi et al. 2003Go). Among general transcription factors, Sp1 has been implicated in the induction of p15Ink4b (Feng et al. 2000Go) and p21Cip1 (K. Pardali et al. 2000Go), but no evidence has been provided yet that TGF{beta} controls the interaction of endogenous Sp1 with the p15Ink4b and p21Cip1 promoters in chromatin immunoprecipitation assays or that conditional depletion of Sp1 specifically interferes with the induction of these genes by TGF{beta}.

It remains possible that genes containing enough clustered copies of the SBE in the promoter region might be activated by Smad-only complexes. One candidate in this class is Smad7, whose promoter contains two pal-indromic SBEs (Denissova et al. 2000Go). Smad7 is activated by both TGF{beta} and BMP pathways in many cell types for negative feedback. So, it would make sense if its induction could be achieved by activated Smads regardless of the activating input. Nevertheless, definitive proof for this notion is still lacking. Indeed, several reports show that full activation of the Smad7 promoter requires cooperation of the Smad complex with AP1, Sp1, or TFE3 transcription factors (Brodin et al. 2000Go; Hua et al. 2000Go). Of note,