A glimpse into the molecular entrails of endoderm formation
During organogenesis, the endoderm forms the epithelial lining of the primitive gut tube from which the alimentary canal and associated organs, such as the liver and pancreas, develop. Despite the physiological importance of these organs, our knowledge of the genes regulating endoderm development has been limited. In the past few years, we have witnessed a rapid pace of discoveries regarding the initial formation of this germ layer. Because the insights have come from studies in several model systems, I have chosen to discuss endoderm formation not only in vertebrate model systems but also inCaenorhabditis elegans, Drosophila, sea urchins, and ascidians. These studies reveal a high degree of conservation in some of the transcriptional regulators of endoderm differentiation, but seemingly more divergence in the intercellular signaling events leading to formation of this tissue. For example, transcription factors of the Gata and Forkhead families are implicated in endoderm development across the phyla. On the other hand, a role for TGF-β, and more specifically Nodal, signaling, which is critical for endoderm formation in zebrafish, mouse, and Xenopus, appears to be vertebrate specific. The studies on vertebrate endoderm formation also suggest that we have now gathered sufficient information to attempt to coax mammalian stem cells towards the endodermal lineage.
Definition of the endoderm
The endoderm is classically defined as the innermost layer of the three Metazoan “germ layers” and as such it gives rise to the inner lining of the gut and its associated organs. However, the term “endoderm” has also been coopted for other developing structures such as some of the extraembryonic tissues in mammalian embryos. Until the time when the nomenclature is adequately revised, the classically defined embryonic endoderm is often referred to as “definitive endoderm” and these are the cells whose formation, or initial differentiation, I will cover in this review. In addition and to establish some temporal distinction, I refer to cells as progenitors prior to the onset of gastrulation, and precursors during gastrulation stages. Finally, because the endoderm and mesoderm often originate from the same, or adjacent, regions of the embryo, a recurring theme of this review will be the analysis of how the embryo segregates these two germ layers.
Endoderm formation in C. elegans
The entire endoderm in C. elegans, that is, the 20 cells that constitute the intestine, originates from the E blastomere at the 8-cell stage (Fig. 1A) (Sulston et al. 1983). E itself is a daughter of EMS and the sister of MS, which gives rise to much of the mesoderm. Endoderm formation in C. elegansis under the control of a number of maternal proteins, including SKN-1, a bZIP/homeodomain transcription factor that is required for both E and MS development (Bowerman et al. 1992). SKN-1 directly regulates the expression of two redundant GATA-type transcription factor genes,med-1 and med-2, which like skn-1 are necessary for EMS differentiation (Maduro et al. 2001). In addition, ectopic expression of either MED-1 or MED-2 can induce mesendodermal differentiation in non-EMS cells (Maduro et al. 2001).
Early cell lineage of the Caenorhabditis elegans embryo and fate maps of the Drosophila, sea urchin, Xenopus, zebrafish and mouse embryos showing the position of the mesodermal (red) and endodermal (yellow) progenitors. (A) In C. elegans, the mesendodermal progenitor EMS divides to give rise to MS, the mesodermal progenitor, and E, the endodermal progenitor. Prior to this division, P2 sends EMS a Wnt signal (encoded by mom-2) which leads to a decrease in POP-1 function in E, allowing endoderm formation. (B) InDrosophila, endodermal tissues originate from the poles of the blastoderm embryo. Note that both the foregut and hindgut are thought to be ectodermal (blue). (C) In sea urchin, the mesoderm lies vegetal to the endoderm so that during gastrulation, the mesoderm invaginates first followed by the endoderm. By the end of gastrulation, the endoderm forms the archenteron or primitive gut. (D) InXenopus, as in ascidians, most of the endoderm derives from the most vegetal aspect of the late blastula stage embryo. The suprablastoporal endoderm is not shown in this simplified drawing. (E) In zebrafish, the endoderm lies close to the margin at the late blastula stage. Mesodermal and endodermal progenitors are interspersed in this region while further away from the margin only mesodermal progenitors are present. (F) In mouse figure adapted from Wells and Melton (1999), gastrulation begins when mesodermal and endodermal progenitors start migrating through the primitive streak. After passing through the streak, the endodermal precursors and some mesodermal precursors, including those that will give rise to the notochord, migrate along the midline in an anterior direction. Other populations of mesodermal cells migrate more laterally. (PGCs) Primordial germ cells; (An) animal pole; (Veg) vegetal pole; (A) anterior; (P) posterior.
Downstream of MED-1 and MED-2, lie END-1 and END-3, another pair of redundant GATA factors. Losing both END-1 and END-3 results in the loss of endoderm (Zhu et al. 1997). In fact, the ‘endodermal’ phenotype of these mutants is very similar to that seen in skn-1 andmed-1,2 mutants in that the E blastomere is converted to a C-like mesoectodermal progenitor. (The C blastomere, which is a daughter of P2, gives rise primarily to epidermis and muscle.) Shortly after the time E is born and end-1 and end-3 become activated, two additional GATA factors, ELT-2 and ELT-7 become expressed (Fukushige et al. 1998). Loss of ELT-2 alone leads to degeneration of the gut, whereas loss of ELT-7 alone does not appear to have any phenotype. However, ELT-2 and ELT-7 again appear to function redundantly in gut development as removal of both has a more profound effect on gut differentiation than removal of ELT-2 alone and abrogates transcription of many gut-specific genes. Ectopic expression of any of these endoderm specific GATA factors (END-1, END-3, ELT-2, or ELT-7) is sufficient to promote widespread gut formation throughout the embryo (Zhu et al. 1998; J. Rothman, pers. comm.). Yet another GATA factor, ELT-4, is expressed later in gut development, though its function is unknown (J. McGhee, pers. comm.). Thus a network of GATA factors that act in redundant pairs functions to regulate endoderm formation and gut development in C. elegans. In fact, of the 11 GATA factors inC. elegans, seven are involved in endoderm development.
In addition to these GATA factors, a large number of other transcription factors are specifically expressed in the endoderm. These include a Forkhead-domain transcription factor, PHA-4, which is expressed throughout the intestine (Azzaria et al. 1996; Horner et al. 1998; Kalb et al. 1998) and at higher levels throughout the pharynx, the anterior organ of the digestive tract. Although PHA-4 is essential for differentiation of the five different cell types of the pharynx (Mango et al. 1994; Horner et al. 1998), its role in gut differentiation is not known. The presence of GATA sites within its promoter suggests that at least part of pha-4 expression is regulated by member(s) of the GATA network (Kalb et al. 1998).
The segregation of mesoderm and endoderm, which happens when EMS divides, is regulated by a Wnt signal that originates from P2, a cell adjacent to EMS (Fig. 1A). Mutations that lead to the transformation of E to MS (More of MS, MOM), or the transformation of MS to E (Posterior Pharynx Deleted, POP) have been identified and found to encode components of convergent Wnt and MAP kinase signaling cascades (for review, seeThorpe et al. 2000). These signals allow endoderm development to occur. For example, mom-2 encodes the Wnt protein necessary for endoderm formation (Thorpe et al. 1997), whereas pop-1 encodes a Tcf homolog that inhibits endoderm formation (Lin et al. 1995). Additional components of this pathway include WRM-1, a β-catenin homolog necessary for endoderm formation (Rocheleau et al. 1999), and MOM-4, a MAP3K also necessary for endoderm formation (Meneghini et al. 1999; Shin et al. 1999). A model of how this Wnt–MAPK signaling pathway integrates with the MED transcription factors to positively regulate the expression of the endoderm-specifying end genes has been proposed by Maduro et al. (2001). According to this model, which has been partially tested, the MEDs activate the endgenes in E, while in MS, POP-1 repression of the end genes (Calvo et al. 2001) allows the MEDs to activate MS-specific target genes. In summary, many of the regulators of early endoderm development have been identified in C. elegans and assembled into a molecular pathway (Fig. 2A). It will be particularly interesting to further explore issues such as the sufficiency of Wnt signaling in endoderm formation, especially in light of data obtained in sea urchins and ascidians (see below).
Molecular pathways of endoderm formation inCaenorhabditis elegans, Xenopus and zebrafish. (A) In C. elegans, maternal SKN-1 directly activatesmed expression in the mesendodermal progenitor EMS (Maduro et al. 2001). Subsequently, Wnt/MAPK signaling in E permits the MEDs to activate end-1 and end-3 expression. The ENDs then activate a hypothetical Gata cascade (shown below the dashed line) that has been implicated in endoderm formation. (B) InXenopus, maternal VegT directly activates Xnrexpression (Kofron et al. 1999). Nodal signaling contributes toMixer and Gata5 expression, which in turn regulateXsox17 expression (Xanthos et al. 2001). VegT also directly initiates Bix1 and Bix4 expression (Tada et al. 1998;Casey et al. 1999), and has been implicated in initiatingXsox17 expression at the midblastula transition (Engleka et al. 2001). (C) In zebrafish, Nodal signaling regulates the expression of bon, fau/gata5, and cas, and Cas functions downstream of (left), or in parallel to (right), Bon and Fau/Gata5 to regulate sox17expression. bon encodes a Mix-type transcription factor andcas encodes a Sox-related transcription factor.
Endoderm formation in Drosophila
In Drosophila, the endodermal tissues originate from the most terminal aspects, or poles, of the blastoderm embryo (Fig. 1B). The anterior pole gives rise to the anterior midgut, the posterior pole to the posterior midgut, and the midgut proper is derived from cells that migrate from both anterior and posterior primordia. Several mutations that affect the formation of endodermal tissues have been identified and analyzed in detail, including huckebein,serpent, and fork head. huckebein encodes an Sp1/Egr-like Zn-finger protein and is expressed in the terminal regions of the embryo (Bronner et al. 1994). It interacts with snailand twist to separate the endoderm from the mesoderm prior to gastrulation. In huckebein mutants, the mesodermal progenitors expand at the expense of the endoderm and both midgut primordia are absent. At the molecular level, Huckebein has been proposed to repress the function of Snail and Twist in the endodermal progenitors (Reuter and Leptin 1994). Huckebein also positively regulates the expression of the GATA gene serpent in the midgut primordia andserpent itself is required for midgut development (Reuter 1994). In serpent mutants, the midgut primordia invaginate, but the cells do not appear to undergo the epithelial-to-mensenchymal transition that normally precedes migration (Reuter 1994). In other settings, serpent has also been implicated in cell fate decisions (Rehorn et al. 1996; Hayes et al. 2001), indicating thatGATA genes function in both morphogenesis and cell differentiation. fork head mutants also lack a gut andfork head encodes the founding member of the winged-helix transcription factors now known as Fox (Forkhead box) factors (Weigel et al. 1989; Kaestner et al. 2000). (Chordate homologs of fork head are also involved in endoderm development and are covered below.) The expression pattern of fork head in Drosophila is associated with the invagination of the gut primordia suggesting that it regulates this process. In fact, analysis of fork head function in the secretory cells of theDrosophila salivary gland shows that it is required for constriction of the apical surface membrane (Myat and Andrew 2000), a process necessary for cell invagination. Another gene expressed in theDrosophila midgut is hepatocyte nuclear factor 4 (HNF4), which encodes a member of the steroid hormone receptor superfamily and, as its name indicates, has been associated with endoderm development in vertebrates (Zhong et al. 1993). At this point however, its function or position within the developmental hierarchy leading to endoderm formation in Drosophila are not known. It is interesting to note that in C. elegans, another putative member of this steroid hormone receptor superfamily, END-2, has also been implicated in endoderm formation (Zhu et al. 1997) although its exact role remains unclear.
Another area of active investigation in Drosophila is the anterioposterior patterning of the gut endoderm by the overlying mesoderm. This fascinating question, which falls outside the scope of this review, has been recently synthesized elsewhere (Bienz 1997;Clements et al. 2001). In addition, the hindgut, although thought to arise from the ectoderm in insects, requires for its formation an evolutionarily conserved “cassette” that includes Fork head and a Wnt signal (for review, see Lengyel and Iwaki 2002).
Endoderm formation in sea urchin
The sea urchin endoderm originates from the vegetal plate of the blastula stage embryo (Fig. 1C). Endoderm morphogenesis begins by an inward bending of the vegetal plate to form a pocket of ∼100 cells. This process, called primary invagination, results in a short, squat stump of tissue (the archenteron or primitive gut), which contains most of the endodermal precursors as well as precursors of the secondary mesenchyme (SMC). (The SMC precursors become mesenchymal during gastrulation and go on to form a diverse array of cell types including muscle and pigment cells.) Subsequently, the archenteron elongates as the cells in this invaginated epithelium rearrange and more endodermal cells are recruited around the blastopore (the opening of the archenteron at the vegetal region) in a process referred to as secondary invagination. Finally, in the animal pole region of the embryo, the archenteron contacts the basal surface of the presumptive ectoderm, which leads to the formation of the mouth opening.
Several genes expressed specifically in endodermal precursors during gastrulation have been identified. One of them, endo16, which encodes an extracellular matrix glycoprotein, has been examined in detail in an attempt to elucidate the regulatory circuits controlling gene expression in the developing endoderm. Specifically, extensive analysis of the endo16 promoter has led to the identification of the binding sites critical for endodermal expression as well as the purification and molecular isolation of many of the binding factors (Yuh et al. 1998, 2001). From these and other studies a detailed picture of the regulatory network involved in endoderm formation in sea urchin is emerging.
Responding to a signal that remains to be identified, β-catenin accumulates in the nuclei of the mesendodermal progenitors between the 16- and 32-cell stage. There, it forms a complex with TCF and positively regulates the expression of several genes. These genes encode the signaling molecule Wnt8, and several transcription factors including Krox and Krl (a Krüppel-like factor). Krl behaves as a repressor and suppresses animal cell fates (Howard et al. 2001). Wnt8 has been postulated to positively regulate TCF, thereby creating a positive feedback loop (Angerer and Angerer 2000). The components of this feedback loop initially contribute to the activation of mesendodermal genes, and later on to the activation of endodermal genes such as gataE, foxA, and foxB. The functional importance of β-catenin in this early step was demonstrated by overexpressing the cytoplasmic fragment of the transmembrane protein Cadherin which leads to a sequestration of β-catenin in the cytoplasm (Logan et al. 1999). The roles of Krox and Otx, another early acting mesendodermal transcription factor, were shown through the use of Engrailed repressor (EnR) fusions (Li et al. 1999), and that of Krl through the use of morpholino antisense oligos (Howard et al. 2001).
In a second phase of regulation starting in the late cleavage/early blastula stage, GataE becomes the major endodermal regulator with inputs to many other endodermal genes. The inhibition of gataEtranslation through the use of morpholino antisense oligos blocks endoderm formation (Davidson et al. 2002). Another endodermal regulator, the Fox/Forkhead family member FoxA, is thought to act as a repressor of itself and its expression is under GataE control. These and other data as well as the emerging regulatory gene network controlling endoderm formation can be found at a frequently updated web site maintained by the Davidson lab at Caltechhttp://www.its.caltech.edu/∼mirsky/endomeso.htm.
The question of how the endoderm segregates from the mesoderm is also being studied in sea urchin and recent data have led to the following model. In the late cleavage/early blastula stage, a subset of cells in the center of the presumptive mesendoderm start to synthesize the Notch ligand Delta as a response to β-catenin accumulation in their nuclei. Because all mesendodermal progenitors express Notch on their surface, those that are adjacent to the Delta-expressing cells activate their Notch signaling pathway and consequently become mesodermal (Sherwood and McClay 1999; McClay et al. 2000). The mesendodermal progenitors that are not in contact with Delta-expressing cells become endodermal. In addition, there is evidence for a nonautonomous effect of Notch signaling on endoderm formation, possibly through the release of a Wnt signal from the mesodermal cells (Sherwood and McClay 2001). In summary, the convergence of several approaches including systematic promoter dissections as well as various gene inactivation techniques is likely to make the sea urchin an invaluable model system in deciphering the regulatory gene network involved in endoderm formation. The strengths of the sea urchin as an embryological model system should also lead to valuable information regarding the role of these genes in the morphogenetic events associated with endoderm formation.
Endoderm formation in ascidians
The endodermal tissues of the Ciona larva consist of ∼500 cells whose embryonic lineage is well documented (Conklin 1905; Nishida 1987). At the 32-cell stage, they derive in part from two pairs of vegetal blastomeres whose progeny gives rise exclusively to endodermal tissues. Consistent with this early fate restriction, endodermal progenitors show a high potential for autonomous differentiation when isolated from early embryos (Whittaker 1990; Nishida 1992). This autonomy is dependent on maternal factors that are prelocalized in the egg (Nishida 1993). Endoderm formation in Ciona, as in sea urchin, appears to involve the nuclear accumulation of β-catenin. Misexpression of β-catenin in Ciona embryos induces the expression of endodermal genes in cells fated to become notochord and epidermis. Conversely the down-regulation of nuclear β-catenin by Cadherin overexpression results in the loss of endoderm (Imai et al. 2000). Thus, whereas in sea urchin β-catenin appears to be involved in the formation of both mesoderm and endoderm, in Ciona, as in C. elegans, it appears to be more specific for endoderm formation. It is possible, however, that in both sea urchin and Ciona initial β-catenin activity is required for both mesoderm and endoderm formation and that continued β-catenin activity in certain cells leads to endoderm formation.
Subtractive hybridization screens of mRNAs from embryos overexpressing either β-catenin or Cadherin has led to the identification of several interesting endodermal genes including an otx gene, an NK-2 class gene, a LIM-homeobox gene, two forkhead genes and alefty/antivin gene among others (Satou et al. 2001). Theotx gene is expressed in the mesendodermal progenitors of the 32-cell stage embryo. Inhibition of its function by morpholino antisense oligos appears to have no effect on these cells, possibly because of redundant inputs on otx target genes. The NK-2 class gene, ttf1, is expressed in all endodermal progenitors starting at the 64-cell stage and misexpression of ttf1 leads to ectopic endodermal gene expression suggesting that it plays an important role in endoderm formation (Ristoratore et al. 1999; Satou et al. 2001). Loss-of-function analyses are now required to clarify its role in this process. Notably, the LIM-homeobox gene lhx3, which is expressed specifically in the endodermal progenitors starting at the 32-cell stage, is both necessary and sufficient for endoderm formation (Satou et al. 2001). In addition, lhx3 is sufficient to restore endodermal gene expression in the absence of β-catenin function. These data suggest a model whereby β-catenin accumulates in the nuclei of the vegetal blastomeres by the 32-cell stage and turns on genes such as otx and lhx3. These genes in turn, especially lhx3, initiate a cascade of events that leads to endoderm formation. Of course, because β-catenin is active in both mesodermal and endodermal progenitors, it will be important to identify the factor(s) responsible for restricting lhx3 expression to the endodermal progenitors.
The compactness of the ascidian genome and relative ease of generating transgenic embryos by electroporation also makes it an attractive organism in which to study cis-regulatory elements (for review, seeCorbo et al. 2001). Such work has recently started to explore endoderm formation through the analysis of the 5′-regulatory region of aforkhead/HNF3β gene (DiGregorio et al. 2001).
In summary, several elements such as β-catenin, Otx, and Forkhead are present in the regulatory gene networks driving endoderm formation in both sea urchins and ascidians and it is likely that additional ones will emerge. It is also clear from these studies that the combined approach of analyzing both gene function and the control of gene expression in these organisms will continue to generate critical information on the molecular mechanisms regulating endoderm formation.
Endoderm formation in vertebrates
Our understanding of endoderm formation in vertebrates stems in large part from studies in Xenopus, zebrafish, and mouse. These studies have converged to reveal the importance of the Nodal signaling pathway and a conserved set of transcription factors in this process. Because many of these studies were done concurrently and profoundly influenced one another, some of the following sections are structured around individual factors (or sets of related factors) rather than around individual model systems. In addition, the following sections are ordered temporally, starting with Nodal signaling and ending with members of the Gata and Forkhead families of transcription factors which, as noted above, are implicated in endoderm formation across the phyla.
Nodal signaling
Nodals are TGF-β family members, and at last count include one protein in mouse (Nodal), two in zebrafish (Cyclops [Cyc] and Squint [Sqt]), and five in Xenopus (Xnr1, Xnr2, Xnr4, Xnr5, and Xnr6) (for review, see Whitman 2001). Experiments in Xenopusprovided the initial indications that TGF-β family members are important for endoderm formation. Subsequent experiments in zebrafish, mouse, and Xenopus showed that Nodal is the TGF-β family member critical for this process.
Nodal signaling and zebrafish endoderm formation
Genetic studies in zebrafish have shown that cyc and sqtfunction at least partially redundantly in endoderm formation. Whereas single mutants appear essentially normal in their ability to form endoderm, cyc;sqt double mutants lack all endoderm (Feldman et al. 1998). In addition, overexpression of Antivin, a competitive inhibitor of the Activin/Nodal signaling pathway related to mammalian Lefty (Meno et al. 1999; Thisse and Thisse 1999; Thisse et al. 2000), leads to a complete loss of endoderm (Alexander and Stainier 1999;Thisse and Thisse 1999). Conversely, expression of a constitutively active form of a type-I TGF-β receptor is sufficient to induce endodermal gene expression throughout the zebrafish embryo (Alexander and Stainier 1999), and can cell-autonomously divert embryonic cells to an endodermal fate (Peyrieras et al. 1998). Another factor necessary for endoderm formation in zebrafish is Oep, a member of the EGF–CFC family of membrane-associated proteins. Embryos lacking both maternal and zygotic Oep fail to form endoderm and most mesoderm (Zhang et al. 1998a), a phenotype highly reminiscent of the one seen in cyc;sqt double mutants (Feldman et al. 1998). Several lines of evidence, including recent biochemical data, have led to the suggestion that EGF–CFC proteins function as necessary coreceptors for Nodal molecules (Gritsman et al. 1999; Kumar et al. 2001; Yeo and Whitman 2001), and as such would be essential for endoderm formation. However, EGF–CFC proteins such as Oep also appear to have signaling activities of their own (Salomon et al. 1999). For example, the XenopusEGF–CFC gene FRL-1 was isolated on the basis of its ability to induce FGF receptor autophosphorylation in a heterologous yeast expression system (Kinoshita et al. 1995). And the mouse EGF–CFC protein Cripto can bind to murine epithelial cells and lead to increased MAPK phosphorylation (Kannan et al. 1997), though the identity of the receptor complex through which it signals is unknown. Thus, the role of Oep in endoderm formation may extend beyond its function as a Nodal coreceptor. In addition, Nodals were found to be able to heterodimerize with Bmps in the absence of Cripto (Yeo and Whitman 2001), suggesting an EGF–CFC independent mode of action for Nodal molecules.
By the mid-blastula stage, cyc and sqt are expressed in the margin, the region where the blastoderm meets the yolk cell and where the cells fated to become endoderm reside (for review, see Warga and Stainier 2002). The model emerging from these studies is that Nodal molecules trigger a series of events that lead to endoderm formation in cells closest to the source of Nodal secretion. Thus, it has been hypothesized that high levels of Nodal signaling lead to the formation of endoderm and lower levels to the formation of mesoderm. In support of this hypothesis, various genetic and biochemical manipulations that lead to the progressive lowering of Nodal signaling preferentially affect endoderm formation (Schier et al. 1997; Thisse and Thisse 1999). Furthermore, the recent demonstration that Sqt can function as a morphogen during mesoderm formation (Chen and Schier 2001) provides a candidate molecule for this model. Cyc does not appear to share the morphogenetic properties of Sqt, so it will be interesting to further investigate the respective role of these two molecules in endoderm formation. For example, the expression of bonnie and clyde, a transcription factor gene necessary for endoderm formation (see below) is unaffected in cyc mutants but perturbed in sqtmutants (Alexander and Stainier 1999).
The model that the cells closest to the source of Nodal secretion become endodermal is not entirely correct however, as some cells in the marginal zone become mesodermal (Fig. 1E) (Warga and Nüsslein-Volhard 1999). Thus, it is likely that additional signaling pathway(s) influence the endodermal versus mesodermal cell fate decision.
Nodal signaling and mouse endoderm formation
The importance of Nodal signaling in mouse endoderm formation comes from several lines of experimental evidence. The first Nodal gene was in fact identified in mouse through the study of a retroviral insertion that when homozygosed leads to the lack of a primitive streak and most mesoderm as well as an early arrest of embryonic development (Iannaccone et al. 1992; Conlon et al. 1994). Recent analysis of a hypomorphic Nodal allele further shows that Nodal signaling is required for endoderm development, and that higher levels of Nodal signaling appear to be required for endoderm formation than for mesoderm formation (Lowe et al. 2001). In addition, foregut formation appears to require higher levels of Nodal signaling than hindgut formation, although the basis for this difference is not clear. Another line of evidence that Nodal signaling is involved in endoderm formation in mouse comes from the analyses of Cripto mutants which also lack definitive endoderm (Ding et al. 1998).
Nodal signaling and Xenopus endoderm formation
InXenopus, various members of the TGF-β family, including Activin, the Nodal family members Xnr-1 and Xnr-2, and Derrière, can induce animal cap cells to express the full range of known embryonic endodermal genes (for review, see Clements et al. 2001). Interfering with TGF-β signaling in the embryo's vegetal region from which the endoderm arises (Fig. 1D), using dominant negative forms of Nodal (Osada and Wright 1999), Derrière (Sun et al. 1999), or Activin receptors (Clements et al. 1999; Yasuo and Lemaire 1999; Chang and Hemmati-Brivanlou 2000), reduces endodermal gene expression. These data indicated that TGF-β signaling plays an important role in endoderm formation in Xenopus. A critical elaboration of this model came from experiments designed to test various TGF-β family members for their ability to restore endodermal gene expression in VegT-depleted embryos. (VegT, which will be covered at greater length in the next section, is a maternally deposited T-box transcription factor required for endoderm formation.) These experiments showed that VegT-regulated TGF-β family members such as Xnr1,Xnr2, and Xnr4 as well as derrière are able to restore endodermal gene expression in VegT-depleted embryos (Xanthos et al. 2001). Furthermore, inhibitory forms of Xnr2 and Derrière block the ability of VegT mRNA injections to rescue endodermal gene expression in VegT-depleted embryos. These experiments clearly establish a central role for TGF-β signaling in endoderm formation in Xenopus, although determining the exact role of individual TGF-β family members will require more specific loss-of-function approaches. In addition, examining the relative amount of phosphorylated Smad2 in various regions of the embryo indicated that higher levels of Nodal signaling are present vegetally (Faure et al. 2000). These data are consistent with the earlier stated proposal that higher levels of Nodal signaling lead to endoderm formation while lower levels lead to mesoderm formation. Admittedly, this hypothesis needs to be tested further not only in Xenopus but also in mouse and zebrafish.
In summary, Nodal signaling appears to be central to endoderm formation in vertebrates and it will be interesting to determine whether TGF-β signaling plays any role in endoderm formation in more primitive organisms such as sea urchins and ascidians.
Regulators of Nodal expression
A turning point in the molecular analysis of Xenopusendoderm formation was the demonstration that VegT is a vegetally localized endodermal determinant (Zhang et al. 1998b). In fact, VegT is both necessary and sufficient for endoderm formation (Lustig et al. 1996; Stennard et al. 1996; Zhang and King 1996; Horb and Thomsen 1997;Zhang et al. 1998b; Xanthos et al. 2001). VegT regulates the expression of TGF-β family members such as Xnr1, Xnr2, and Xnr4 as well as Derrière (Kofron et al. 1999;Hyde and Old 2000). It also appears to regulate directly the expression of Mix-type homeobox genes such as Bix1 and Bix4(Tada et al. 1998; Casey et al. 1999), and possibly the activation of the endodermal determinantSox17 (Clements et al. 1999; Yasuo and Lemaire 1999;Engleka et al. 2001). Thus, VegT appears to regulate endoderm formation in several ways, principally by controlling Nodal gene expression, and also by directly activating transcriptional regulators of endoderm formation (Fig. 2B). In turn, Nodal signals activate the zygotic expression of VegT, as well as maintain their own expression, thereby establishing a positive feedback loop for the initiation and maintenance of endodermal gene expression in vegetal cells (for review, see Clements et al. 2001).
Several T-box transcription factor genes have been identified and analyzed genetically in zebrafish and mouse. In mouse, the T-box geneEomesodermin is expressed from an early stage in extraembryonic ectoderm, the tissue that initially induces Nodal expression in the adjacent epiblast cells (Russ et al. 2000; Brennan et al. 2001), and Eomesodermin mutants, like Nodal mutants, fail to form mesoderm and definitive endoderm. These data suggest that Eomesodermin could be part of the regulatory network controllingNodal expression in the early mouse embryo, either directly or indirectly.
In zebrafish, Spadetail, a homolog of VegT (Griffin et al. 1998), appears not to regulate nodal or sox17 expression, while an Eomesodermin homolog is first expressed during somitogenesis in the presumptive telencephalon (Mione et al. 2001). Thus, zebrafish Nodal expression could be under the control of an as yet unidentified T-box factor, or even another type of transcription factor. Alternatively, endoderm formation could be induced by maternal Nodal molecules (sqt is expressed maternally; Feldman et al. 1998), thereby bypassing the need for spatially restricted transcriptional regulators. In the context of discussing the role of T-box transcription factors in endoderm formation, it is also interesting to mention that in ascidians, endodermal expression of a forkhead/HNF3β gene is under the control of the T-box protein VegTR (DiGregorio et al. 2001).
Effectors of Nodal signaling
Smads
Nodal binding to its receptor complex leads to the phosphorylation of Smad2 (and possibly also Smad3), which translocates to the nucleus and together with other proteins, activates transcription. Smad2 mutant mice lack all derivatives of the embryonic germ layers consistent with Smad2 playing an essential role in Nodal signaling (Nomura and Li 1998; Waldrip et al. 1998; Weinstein et al. 1998; Heyer et al. 1999). However, chimeric studies have been key to understanding the cell-autonomous role of Smad2 in embryonic development: They have shown that Smad2 mutant cells can extensively colonize ectodermal and mesodermal lineages but are excluded from the definitive endodermal lineage during gastrulation (Tremblay et al. 2000). These experiments indicate that Smad2 is a key Nodal effector that is essential for endoderm formation. Smad3 may compensate for the loss of Smad2 during mesoderm formation (Tremblay et al. 2000).
FoxH1
Smad2 partners include the Forkhead protein FoxH1/Fast. FoxH1 was initially identified in Xenopus through its ability to bind an activin response element in the promoter region of the mesendodermal gene Mix.2 (Chen et al. 1996). Mutations in the mouse FoxH1 gene block endoderm formation (Hoodless et al. 2001; Yamamoto et al. 2001), although in chimeric embryos a fewFoxH1 mutant cells contributed to the most anterior region of the foregut (Hoodless et al. 2001). Smad2 mutant cells can similarly populate the most anterior region of the foregut (Tremblay et al. 2000). These data are in apparent contradiction with the report that foregut formation requires higher levels of Nodal signaling than hindgut formation (Lowe et al. 2001). However, because of the convergence of many different cell types to the anterior foregut region, more detailed studies are needed to determine the exact role Nodal signaling plays in their development.
In Xenopus and zebrafish, FoxH1 appears to be required for formation of some of the mesoderm but not the endoderm (Watanabe and Whitman 1999; Pogoda et al. 2000; Sirotkin et al. 2000), suggesting that additional Smad2 partner(s) essential for endoderm formation remain to be identified. Alternatively, Smad2 and/or Smad3 might in some cell contexts be able to function without a partner.
Mix-type factors
Certain members of the Mix family of paired-class homeodomain transcription factors can also bind phosphorylated Smad2. This family comprises at least one member in mouse, Mml (Pearce and Evans 1999; Robb et al. 2000), three in zebrafish, Bonnie and clyde (Bon) (Alexander et al. 1999; Kikuchi et al. 2000) and Mtx1 and 2 (Hirata et al. 2000), and seven inXenopus (Rosa 1989; Mead et al. 1996; Vize 1996;Ecochard et al. 1998; Henry and Melton 1998; Lemaire et al. 1998; Tada et al. 1998). The binding of these factors to activated Smad2 occurs through a short motif that is also present in the Forkhead protein FoxH1 (Germain et al. 2000). The founding member of the Mix family,Mix.1, is an endodermally expressed gene initially identified because of its responsiveness to mesodermal inducers (Rosa 1989), and subsequently implicated in endoderm formation (Lemaire et al. 1998). One of the most comprehensive studies on a Xenopus Mix genes and endoderm formation was carried out onMixer (Henry and Melton 1998). Mixer is expressed exclusively in endodermal precursors starting at the beginning of gastrulation and can activate endodermal gene expression in animal cap cells. In addition, dominant interference studies with a Mixer–EnR chimera indicate that it plays a role in the regulation of endodermal gene expression, and that it acts upstream of the endodermal determinant Sox17. However, because Mixer expression initiates after the onset of Sox17 expression, it has been suggested that Mixer functions to maintain endodermal Sox17 expression (Fig. 2B).
Molecular and genetic analyses in zebrafish have clearly established that the Mix family member Bon functions upstream of Sox17:bon expression precedes the onset of sox17 expression (Alexander and Stainier 1999), and bon mutants lack 90% of their endodermal cells, as assessed by sox17expression (Kikuchi et al. 2000).Furthermore, Bon requires the function of the novel Sox protein Cas (see below) to inducesox17 expression and endoderm formation (Fig. 2C) (Alexander and Stainier 1999; Dickmeis et al. 2001; Kikuchi et al. 2001).
In mouse, the Mix gene, Mml, which interestingly lacks a Smad interaction motif, is expressed in the nascent primitive streak (Pearce and Evans 1999; Robb et al. 2000) where the endoderm and mesoderm originate (Fig. 1F) and thus might play a role in the formation of these germ layers. However, and at first glance,Mml does not appear to be the mouse bon ortholog based on synteny considerations and the observations that Mml lacks a Smad interaction motif and can not rescue the endodermal defect inbon mutants (L. Trinh and D.Y.R. Stainier, unpubl.). Thus, it is possible that the mouse genome contains additional Mix-typegenes.
As noted earlier, certain members of the Mix family, including Mixer (Germain et al. 2000) and Bon (Randall et al. 2002), can interact with phosphorylated Smad2 and could help transduce the Nodal signal. However, in Xenopus and zebrafish, Mixer andbon expression are initially induced by Nodal signaling suggesting that another Nodal effector is initially required to partner with phosphorylated Smad2 to activate Mixer and bonexpression. In Xenopus, FoxH1 is a likely regulator ofMixer expression as the Mixer promoter contains a FoxH1 binding site (L. Henry and M. Whitman, pers. comm.), and it will be interesting to determine the relationship between FoxH1 andbon in zebrafish.
Other signaling pathways involved in endoderm formation
Although Nodal signaling appears to play a central role in vertebrate endoderm formation, other signaling pathways have also been implicated.
Bmp signaling
Overexpression of the Bmp antagonists Chordin and Noggin has been reported to promote endodermal gene expression in Xenopusanimal cap explants (Sasai et al. 1996), suggesting that high levels of Bmp signaling may inhibit endoderm formation. Because Nodal and Bmp can form inactive heterodimers (Yeo and Whitman 2001), overexpressing Chordin, which itself binds Bmp with relatively high affinity, might possibly lead to the release of Nodal molecules and the activation of Nodal signaling, thereby inducing endoderm. Zebrafish embryos lacking Bmp2b or Bmp7 function exhibit a greater number and expanded distribution of endodermal precursors at the end of gastrulation, whereas embryos lacking Chordin function have fewer endodermal precursors (Warga and Stainier 2002). These observations are in agreement with lineage tracing data that show that more endoderm is derived from dorsal than ventral regions of the late blastula (Warga and Nüsslein-Volhard 1999). Together, these data suggest that Bmp signaling may also participate in endoderm formation and patterning, although clearly more work is needed to understand whether this potential Bmp function occurs via inhibition of Nodal signaling or through some other mechanism.
Wnt signaling
In C. elegans, as well as in sea urchins and ascidians, Wnt signaling and/or its transcriptional effectors such as β-catenin appear to play important roles in endoderm formation. In vertebrates such as Xenopus and zebrafish, β-catenin has been shown to play a key role in patterning the early embryo (Heasman et al. 2000;Kelly et al. 2000) and it will be important to determine whether it is also involved in endoderm formation.
Other transcription factors involved in vertebrate endoderm formation
Gata factors
The zinc finger transcriptional activators of the Gata family have been implicated in endoderm formation in C. elegans,Drosophila, and sea urchins, as mentioned earlier in this review. Some of their vertebrate counterparts have also been implicated in this process. Vertebrate genomes contain at least six evolutionarily conserved gata genes. Although gata1, gata2, and gata3 function predominantly in hematopoietic development (for review, see Orkin and Zon 1997), gata4, gata5, and gata6 are expressed in extraembryonic tissues, heart, and endoderm (for review, see Charron and Nemer 1999). Studies of endodermal gene regulation have suggested the involvement of Gata factors: transfection of nonendodermal cells with gata genes induces the transcription of endodermal genes such as IFABP,gastric H+/K+-ATPase and HNF4(Maeda et al. 1996; Gao et al. 1998; Morrisey et al. 1998). Direct regulation of endodermal genes by Gata factors has been further suggested by a study demonstrating occupation of a gata motif in the albumin promoter in mouse embryonic endodermal cells (Bossard and Zaret 1998). In addition, the C. elegans Gata factor END-1 was shown to be a potent activator of endoderm formation in Xenopusanimal caps (Shoichet et al. 2000).
Genetic demonstration of the role of a vertebrate gata gene in endoderm formation has come from the analysis of the zebrafishfaust (fau) mutation. fau encodes zebrafish Gata5 and is required for endoderm formation upstream of sox17(Reiter et al. 1999, 2001). In addition, fau/gata5 is sufficient to induce sox17 expression and requires the function of the novel Sox protein Cas to do so (Kikuchi et al. 2001;Reiter et al. 2001). Initial expression of fau/gata5, like that of bon, is regulated by Nodal signaling (Alexander and Stainier 1999; Rodaway et al. 1999; Reiter et al. 2001). Similarly, inXenopus, gata5 is induced by Nodal signaling and is sufficient, as is gata4, to induce endoderm (Weber et al. 2000). Genetic analyses in mouse have so far failed to reveal a clear role for gata4, gata5, or gata6 in endoderm formation, despite their widespread and early expression in this tissue (for review, see Molkentin 2000). It is expected that tissue specific gene inactivation and/or the simultaneous inactivation of several of these genes will reveal their likely role in this process. As mentioned earlier, the Drosophila gata gene serpent has been implicated in both morphogenesis and cell differentiation. Similarly, in zebrafish, fau/gata5 appears to play multiple roles in endoderm morphogenesis and differentiation (Reiter et al. 1999, 2001), and it will be interesting to investigate who the downstream effectors of Gata function are and how they regulate these highly intertwined processes.
Sox factors
Sox factors are transcriptional regulators that contain a high mobility group (HMG) DNA-binding domain, and they were first implicated in endoderm formation from work in Xenopus: Hudson et al. (1997) identified Sox17 in a search for genes enriched in endodermal progenitors. Overexpression of Sox17 promotes endodermal gene expression in animal cap explants, whereas expression of a Sox17–EnR chimera inhibits endoderm differentiation in both isolated vegetal pole explants and the intact embryo. ExpressingSox17 in cells fated to become ectoderm causes their descendants either to relocate into the embryonic gut or to die at a late developmental stage, whereas disrupting Sox17 activity in cells fated to become endoderm causes them to enter mesodermal or ectodermal lineages (Clements and Woodland 2000). These data indicate that Sox17 plays a critical role in endoderm formation in Xenopus.
Endodermally-expressed sox genes have also been isolated in zebrafish. sox17 is expressed specifically in endodermal precursors at the onset of gastrulation, although its function is not known. A novel member of the same SoxF subfamily has recently been determined to be encoded by the casanova (cas) gene (Dickmeis et al. 2001; Kikuchi et al. 2001). cas mutants completely lack sox17 expression indicating that casplays a critical role early in the formation of the endodermal lineage (Alexander et al. 1999; Alexander and Stainier 1999). Furthermore, and as noted earlier, Cas function is necessary for the ability of Bon and Fau/Gata5 to induce sox17 expression and endoderm formation (Alexander and Stainier 1999; Reiter et al. 2001). casexpression, like that of bon and fau/gata5, is regulated by Nodal signaling, and cas is sufficient to restore endoderm in bon and fau/gata5 mutants. These data have led to a model whereby, in zebrafish, Nodal signaling regulates the expression of bon, fau/gata5, and cas, and Cas functions downstream of, or in parallel to, Bon and Fau/Gata5 to regulate sox17 expression and endoderm formation (Fig. 2C) (Alexander and Stainier 1999; Kikuchi et al. 2001). Cas was also shown to be sufficient to drive cells fated to become mesoderm to populate the gut. However, this transfating ability appears to be restricted to the mesoderm and does not extend to the ectoderm (unlike what is seen with ectopic expression of Sox17 in Xenopus; Clements and Woodland 2000). Conversely, in the absence of Cas function, cells fated to the endodermal lineage become mesodermal (Dickmeis et al. 2001).
Mouse Sox17 was originally identified as a stage-specific transcriptional regulator during mouse spermatogenesis (Kanai et al. 1996). A detailed analysis of its expression pattern has more recently revealed an earlier phase of expression in the visceral and definitive endoderm of the postimplantation embryo (Kanai-Azuma et al. 2002). Although the SoxF genes Sox17 and Sox7 are both expressed in the visceral endoderm, Sox17 but notSox7 is expressed in the endoderm of the embryonic gut.Sox17 mutant embryos show profound deficits in gut endoderm formation and chimeric analysis shows that Sox17 mutant cells contribute extensively to ectodermal and mesodermal tissues, but are excluded from mid- and hind-gut endoderm (Kanai-Azuma et al. 2002). Interestingly, Sox17 mutant cells, like FoxH1 andSmad2 mutant cells, can contribute to some extent to the foregut endoderm. Thus, it appears that in all vertebrates thus far examined, a SoxF transcription factor plays a key role in endoderm formation.
Forkhead factors
As noted earlier, Forkhead genes have been implicated in endoderm formation in all model systems examined to date. At least three vertebrate Forkhead genes are expressed during endoderm formation, FoxA1/HNF3α/fkd7, FoxA2/HNF3β/axial, andFoxA3/HNF3γ/fkd2 (Sasaki and Hogan 1993; Odenthal and Nüsslein-Volhard 1998; Warga and Nüsslein-Volhard 1999). The function of these genes has been analyzed genetically in mouse.FoxA1 and FoxA3 mutants do not show an early endodermal phenotype (Kaestner et al. 1998, 1999), probably because of overlap in expression pattern and functional compensation. However,FoxA2, which comes on in endodermal precursors at the onset of gastrulation in both mouse and zebrafish, is essential for mouse foregut and midgut but not hindgut development (Ang et al. 1993;Weinstein et al. 1994; Dufort et al. 1998). Interestingly, in late gastrula stage zebrafish embryos, FoxA2 expression appears to be missing from the cells that give rise to the hindgut (Alexander and Stainier 1999) providing a possible explanation for the dispensability of its function in mouse hindgut development. This expression pattern also indicates that anterioposterior patterning of the endoderm is already manifest during gastrulation stages.
Smad2 binding elements are present in the Xenopus FoxA2 promoter (Howell and Hill 1997), suggesting that its expression is controlled by Nodal signaling. However, in zebrafish,foxA2 expression, unlike that of cas, bon, and fau/gata5, is not an immediate early response to Nodal signaling (Alexander and Stainier 1999) suggesting that other transcriptional regulators play critical roles in the timing of its activation. Alternatively, as is the case with PHA-4 and its target genes in C. elegans (Gaudet and Mango 2002), the relative affinity of Nodal effectors for their various binding sites could determine the onset of expression of the corresponding target genes. According to this model, the cas, bon, andfau/gata5 promoters would have high affinity sites for Nodal effectors whereas the later expressed foxA2 (andsox17) gene(s) would have low affinity sites.
Summary
The formation of an ancient tissue like the endoderm is expected to utilize common molecular pathways across the phyla, and the studies so far point in that very direction (Table 1). However, this comparative analysis also points to clear gaps in our knowledge, such as the potential role of TGF-β signaling in endoderm formation in more primitive organisms and that of Wnt signaling in vertebrates. In the next several years, we can expect a rapid elucidation of these questions, as well the continuing investigation of more complex issues, such as how the various endodermal organs form. Equally exciting will be attempts to coax stem cells to form endodermal tissue, a likely prerequisite to forming endodermal organs such as the pancreas and liver de novo.
Signaling pathways and transcriptional effectors involved in endoderm formation
Acknowledgments
I thank Malcolm Whitman, Joel Rothman, Rachel Warga, Jon Alexander, Janet Heasman, Jeremy Reiter, Le Trinh, Pia Aanstadt, Xin Sun, Eric Davidson, Dave McClay, Andy Ransick, Susan Mango, Lynne Angerer, and Steve Beckendorf for discussions and/or critical reading of the manuscript. Rachel Warga also provided the inspiration for the title, Felix Aburto helped with the references, and Kathy Jee with Figure 1. Our work on endoderm formation has been supported by grants from the NIH (NIDDK) as well as the Packard Foundation.
Footnotes
-
E-MAIL didier_stainier{at}biochem.ucsf.edu; FAX (415) 476-3892.
-
Article and publication are at http://www.genesdev.org/cgi/doi/10.1101/gad.974902.
- Cold Spring Harbor Laboratory Press













