Genes and Development

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


GENES & DEVELOPMENT 18:2845-2860, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 0890-9369/ $5.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hieronymus, H.
Right arrow Articles by Silver, P. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hieronymus, H.
Right arrow Articles by Silver, P. A.
Related Content
Right arrow Chromatin and Gene Expression
Right arrow Post-transcriptional Control
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

REVIEW

A systems view of mRNP biology

Haley Hieronymus and Pamela A. Silver1

Department of Systems Biology, Harvard Medical School and the Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA


    Abstract
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 
Gene expression occurs through a complex mRNA-protein (mRNP) system that stretches from transcription to translation. Gene expression processes are increasingly studied from global perspectives in order to understand their pathways, properties, and behaviors as a system. Here we review these beginnings of mRNP systems biology, as they have emerged from recent large-scale investigation of mRNP components, interactions, and dynamics. Such work has begun to lay the foundation for a broader, integrated view of mRNP organization in gene expression.

[Keywords: Systems biology; messenger ribonucleoprotein complexes; mRNP; genomics; mRNA decay; mRNA transport; translation; splicing]


The central processes of gene expression, from mRNA biogenesis to translation and degradation, revolve around messenger ribonucleoprotein (mRNP) complexes. These mRNP complexes consist of individual transcripts bound by a changing repertoire of proteins that mediate and couple the post-transcriptional events of gene expression. The mRNP lifecycle begins with transcription, when a host of proteins cotranscriptionally bind the nascent mRNA to form a growing mRNP (Fig. 1). Transcription itself involves mRNA-binding proteins, such as those that prevent cotranscriptional formation of RNA:DNA hybrids and promote transcriptional elongation (Huertas and Aguilera 2003Go). Other mRNA-binding proteins (RBPs) mediate cotranscriptional 5' end capping, splicing, and editing; 3' cleavage and polyadenylation; and quality control of mRNAs within nascent mRNP complexes (Gott and Emeson 2000Go; Neugebauer 2002Go; Reed 2003Go). Many of these processes can take place post-transcriptionally as well (Gott and Emeson 2000Go). A range of mRNA export factors are also cotranscriptionally recruited to transcripts and subsequently mediate mRNP translocation through the nuclear pores (Vinciguerra and Stutz 2004Go). Once in the cytoplasm, some mRNPs are localized to specific cellular regions by the signal recognition particle or by various motor proteins that bind mRNAs via mRNP adaptor proteins (Singer 2003Go; Van de Bor and Davis 2004Go). Transcripts are often translationally repressed during localization (Huang and Richter 2004Go), and many mRNA-binding proteins involved in translation and translational regulation also affect mRNA transport and stability (Kuersten and Goodwin 2003Go). Ultimately, mRNAs undergo exonuclease-mediated degradation by normal, nonsense-mediated (NMD), and nonstop decay (NSD) pathways (Parker and Song 2004Go). This straight-forward mRNP lifecycle gains complexity through the multiplicity and regulation of different mRNP pathways in gene expression processes (Vinciguerra and Stutz 2004Go). In this sense, mRNPs make up a complex, dynamic gene expression system rather than a single linear pathway. The study of this system constitutes the emerging field of mRNP systems biology.



View larger version (107K):
[in this window]
[in a new window]
 
Figure 1. The mRNP lifecycle. mRNAs traverse the gene expression process from synthesis to translation and degradation in the form of dynamic protein-bound messenger ribonucleoprotein complexes (mRNPs). Within an mRNP complex, a transcript is bound by a changing set of proteins that mediate the cotranscriptional and post-transcriptional events that make up a transcript's lifecycle. Transcripts first undergo 5' end capping, splicing in many cases, 3' cleavage and polyadenylation, mRNA quality control by the nuclear exosome, and export factor recruitment. They are then exported to the cytoplasm, where some undergo specific subcellular localization. Transcripts are eventually translated, often in a regulated manner, and degraded. Degradation may occur through normal decay or through nonsense-mediated and nonstop decay pathways upon detection of these errors.

 
Systems biology, at its most fundamental level, is the study of how biological systems function and give rise to emergent biological properties (Levesque and Benfey 2004Go). Such systems are typically defined as sets of elements that specifically interact to produce coherent emergent behaviors. Systems biology therefore has several defining steps (Fig. 2; Kitano 2002Go; Aitchison and Galitski 2003Go). The first involves discovery and integration of information about the system's component elements, processes, and dynamics under normal and perturbed states. This is often achieved using high-throughput techniques that yield large-scale or comprehensive information about the system features (Ge et al. 2003Go). The system can then be modeled to provide a framework for hypothesis testing. Finally, hypotheses from this model are iteratively generated and tested. This process allows testing and refinement of the model's underlying assumptions and subsequent biological discovery. Through this cycle, systems biology aims to determine how biological processes act as whole systems, rather than collections of a few model components.



View larger version (30K):
[in this window]
[in a new window]
 
Figure 2. A systems biology approach. Systems biology begins with a foundational understanding of a system's structural and dynamic elements. It often uses utilizes high-throughput or massively parallel approaches to identify and characterize these elements, the inputs that affect them, and the effects to which they give rise. Basic genomic and proteomic approaches have begun to quantify the mRNA and protein components of cells and subcellular mRNP systems. Large-scale identification of mRNA or DNA that associate with RNA-binding proteins, as well as proteomic approaches, have begun to define the interactions within and among mRNPs in almost all events in gene expression. Integration of data from different approaches provides a more robust understanding of the system. This understanding of system components and dynamics then allows modeling of the system, and the model can in turn be tested computationally against existing data for refinement. For example, computational modeling of nonsense-mediated and general mRNA degradation pathways has been carried out through this iterative process; it has subsequently been used to test hypotheses about recognition of nonsense-containing transcripts in nonsense-mediated decay. Thus, the model can then be used to generate and formulate hypotheses that, when tested experimentally, give rise to further understanding of the system.

 
Systems-level investigation of mRNP biology has thus far focused primarily on the first and, to a lesser extent, second aims of systems biology: system component characterization and modeling of mRNP processes. First, the physical and functional interactions among mRNP proteins, transcripts, and their parent genes are increasingly being examined on a system-wide scale at most points in gene expression. Progress has also been made in measuring the kinetics of mRNP processes such as transport and mRNA degradation. Additionally, rudimentary systems-level modeling of transcriptional regulation and mRNA degradation has been undertaken. Here we will review these beginnings of mRNP system biology and the insights they provide into the mRNP lifecycle.


    Systems approaches in mRNP biology
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 
Systems-level mRNP biology has begun to emerge from quantitative genomic, proteomic, and microscopy-based investigation. These approaches have yielded information about mRNA and protein state, protein-protein and protein-nucleic acid interactions, protein localization, and dynamics of mRNP systems and processes. Since large-scale proteomic studies have generally focused on whole proteome interactions rather than those of mRNP systems in particular, this review primarily focuses on genomic and kinetic investigation of mRNPs in gene expression.

Global mRNA state has been extensively quantified at the level of the transcriptome, transcription, translation, and mRNA decay by high-throughput approaches. Transcriptomes, or whole-genome steady-state mRNA populations, are now routinely characterized by microarray and other high-throughput techniques (e.g., Velculescu et al. 1997Go; Holstege et al. 1998Go; Shoemaker et al. 2001Go; Kapranov et al. 2002Go; Kampa et al. 2004Go). Less commonly, large-scale transcriptional state has been established directly by microarray analysis of nuclear run-on products and indirectly from microarray-determined mRNA levels and degradation rates (Holstege et al. 1998Go; Fan et al. 2002Go). Genome-wide translational state has increasingly been characterized by microarray analysis of transcripts associated with actively transcribing polyribosomes, as compared with total or nonpolyribosomal transcripts (e.g., Johannes et al. 1999Go; Kuhn et al. 2001Go; Kash et al. 2002Go; Arava et al. 2003Go; Preiss et al. 2003Go; Rajasekhar et al. 2003Go). Global mRNA degradation has also been profiled by monitoring changes in transcript abundance after transcriptional inhibition (e.g., Holstege et al. 1998Go; Bernstein et al. 2002Go; Fan et al. 2002Go; Wang et al. 2002Go; Grigull et al. 2004Go).

The RNA-protein interactions of mRNPs are commonly identified by high-throughput analysis of mRNAs that coimmunoprecipitate with particular mRNA-binding proteins. In this approach, an mRNA-binding protein of interest is immunoprecipitated, and the associated mRNA is isolated (Penalva et al. 2004Go). RNA-protein cross-linking is not routinely used with genome-wide RNA coimmunoprecipitation because of concerns about the effects of cross-linking reversal conditions on RNA quality; however, ultraviolet and chemical cross-linking approaches have been recently used for stringent identification of some mRNP target transcripts (Ule et al. 2003Go; Gilbert et al. 2004Go). Following coimmunoprecipitation, interacting transcripts are quantified by microarray or sequencing. For comparative microarray analysis, coimmunoprecipitated mRNAs and total mRNAs are often competitively hybridized to the same microarray. The resulting ratio of immunoprecipitated to total mRNA indicates the relative levels of protein binding across the transcriptome. These ratios are generally further normalized to the mean signal level, to the levels of internal standards (e.g., housekeeping genes), or to the levels of external standards added to the samples, yielding binding levels relative to these standards. Another complementary approach is to hybridize the immunoprecipitated sample to a mock or negative control immunoprecipitation. This method does not yield the protein's relative binding levels across the transcriptome, but instead establishes the set of transcripts that are bound by the protein above background binding levels. One caveat to these approaches is that they may not comprehensively identify all in vivo mRNA interactions of mRNP proteins, since RNA coimmunoprecipitation without cross-linking may not capture the full or specific repertoire of mRNAs bound throughout the cell and the mRNP lifecycle.

Since many nuclear mRNA-binding proteins are recruited to transcripts during transcription and can be cross-linked to DNA at the corresponding transcriptional loci, the protein-DNA interactions of mRNPs are increasingly being identified by coimmunoprecipitation of chromatin with specific mRNP proteins (ChIP) and subsequent microarray analysis. This approach is analogous to the RNA immunoprecipitation approach used to identify large-scale mRNA-protein interactions, except that the DNA and proteins are chemically cross-linked prior to immunoprecipitation. Such cross-linking captures both direct interactions and indirect mRNA-mediated interactions (Abruzzi et al. 2004Go). This method therefore provides a snapshot of a protein's chromatin interactions, or "genomic location", in living cells (e.g., Lieb et al. 2001Go; Yu et al. 2004Go). A disadvantage of this approach is that the chromatin targets of an mRNP protein may not precisely map to its mRNA targets due to post-transcriptional events.

The functional effects of mRNP proteins on their mRNA targets are typically identified by microarray-based profiling of cells lacking the wild-type proteins. While total mRNA levels are most commonly examined, global translational state in translation factor mutants has also been examined by polysomal transcript analysis (e.g., Johannes et al. 1999Go; Baron-Benhamou et al. 2003Go). The transcripts exported by specific nuclear export factors have also been identified by genome-wide microarray comparison of cytoplasmic to total transcript levels in knockdown cells (e.g., Herold et al. 2003Go; Rehwinkel et al. 2004Go); the transcripts with significantly decreased cytoplasmic levels require the particular export factor for normal export. A significant drawback of these approaches is that functional targets of mRNPs may be direct or indirect. Most notably, it is difficult to rule out secondary effects of removing or mutating proteins central to gene expression on mRNA populations.

One fundamental variable in large-scale mRNP analyses is the method used to define the nucleic acid population that is bound or affected by an RNA-binding protein from profiling data. This issue poses particular difficulties in analysis of nucleic acid-protein binding interactions. Since comparisons of bound chromatin to total chromatin are generally normalized to the mean binding level, only the binding level values relative to this mean binding level are determined. These relative data must be used to somehow define bound populations. One approach is to determine whether the binding levels show a bimodal distribution, a distribution with two modes that represent a subpopulation of enriched targets with one binding level mode and a second unbound population with another mode (e.g., Lieb et al. 2001Go). Cutoffs can then be used to isolate the enriched subpopulation to various degrees of confidence. However, many binding profiles do not have a bimodal distribution (e.g., Lee et al. 2002Go; Yu et al. 2004Go). Therefore, another common approach is to define the bound genes as those that show enrichment of the immunoprecipitated protein relative to the mean binding level at various confidence levels. High confidence levels are often used to avoid false positives at the risk of increasing false negatives. The bound population therefore depends on the method used to define it and the confidence levels chosen. To begin to address this issue, some analyses have used expression profiling data to help identify bound genes (Lee et al. 2002Go; Bar-Joseph et al. 2003Go). One recent study has used expression profiling data to define correlated expression patterns found in genes bound at high confidence and then used those patterns to identify genes with the same patterns in the binding genes of one or more factors at lower confidence levels (Bar-Joseph et al. 2003Go). This method substantially decreased the false negative rate without increasing the false positive rate. Nonetheless, false positives and negatives remain a thorny problem for systems-level investigations in mRNP biology.


    Transcription
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 
The first step in the mRNP lifecycle is transcription and concurrent cotranscriptional mRNA processing and packaging. Systems-level investigation of transcription has taken several main avenues. First, transcriptomes and their properties have been established by mRNA expression profiling of many systems under hundreds of conditions (e.g., DeRisi et al. 1997Go; Velculescu et al. 1997Go; Wodicka et al. 1997Go; Holstege et al. 1998Go; Shoemaker et al. 2001Go; Kapranov et al. 2002Go; Kampa et al. 2004Go). Genome-wide transcriptome characterization not only supplies fundamental data about biological state but also provides information that is useful to understanding the mechanisms of mRNP biology. For example, genome-wide mRNA transcription and degradation rates (Holstege et al. 1998Go; Wang et al. 2002Go; Grigull et al. 2004Go) have been widely used to analyze the properties of genes and transcripts bound by specific mRNP proteins in yeast (e.g., Hieronymus and Silver 2003Go; Casolari et al. 2004Go; Yu et al. 2004Go). Second, the transcriptional roles and interactions of many transcription factors have been determined by chIP profiling and mutant expression profiling (e.g., Holstege et al. 1998Go; Lee et al. 2000Go; Ren et al. 2000Go, 2002Go; Iyer et al. 2001Go; Shen et al. 2003Go; Zeitlinger et al. 2003Go; Cawley et al. 2004Go). Last, networks of transcriptional regulatory and binding interactions have been modeled to reveal the connectivity of the transcriptional machinery (Thieffry et al. 1998Go; Guelzim et al. 2002Go; Lee et al. 2002Go; Milo et al. 2002Go; Shen-Orr et al. 2002Go; Bar-Joseph et al. 2003Go; Covert et al. 2004Go; Harbison et al. 2004Go). Yeast transcriptional regulatory networks contain patterns, or motifs, shared by neuronal networks and electronic circuits, suggesting that these motifs may reflect a common information processing function (Milo et al. 2002Go). In all, these types of investigation into genome-wide transcriptional interactions and networks provide a framework for future study of post-transcriptional processes. While extensive review of such work is beyond the scope of this article, large-scale transcriptional studies are nonetheless central to the current analyses and future aims of mRNP systems biology.


    Splicing
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 
Splicing is a RNP-mediated processing step central to the expression of many genes. mRNA sequence information is transformed through this process to generate functional and regulatory diversity not directly encoded by the genome. Our understanding of the spliced transcriptome is far from complete, since global mRNA analysis approaches generally do not provide a comprehensive picture of splicing or mRNA splice variants. However, recent high-throughput studies have begun to expand our understanding of splicing and the spliced transcriptome. These large-scale splicing studies have three broad aims: (1) identifying the splicing patterns and spliced isoforms of all mRNAs under various conditions, including different tissue and cell types, developmental stages, and disease states; (2) understanding regulated alterations in the spliced transcriptome upon these changing conditions; and (3) understanding and characterizing the roles of different splicing factors in generating the spliced transcriptome. A long-term goal is to model splicing networks and their regulation.

The genome-wide splicing patterns of human transcripts have recently been established using several new types of microarrays (Shoemaker et al. 2001Go; Johnson et al. 2003Go). Splicing arrays that probe every consecutive exon junction have been used to predict the splicing patterns of nearly a third of the human transcriptome in 50 cell types (Johnson et al. 2003Go). Exon arrays that probe all predicted human exons (Shoemaker et al. 2001Go) and tiling arrays that probe regular, closely spaced intervals across chromosomes 21 and 22 (Shoemaker et al. 2001Go; Kapranov et al. 2002Go) have also been used to predict human alternative splicing patterns by various analytical approaches (e.g., Wang et al. 2003Go). These approaches predict that at least three-fourths of multiexon genes are alternatively spliced in humans. Revealingly, similar tissues have been found to have similar large-scale splicing patterns, while splicing in cell lines clusters separately (Johnson et al. 2003Go). The transcripts with the highest level of alternative splicing were found in cell lines and were enriched for transcripts involved in cell communication and enzyme regulation. These results indicate that alternative splicing patterns show specificity in different cell types and across different functional categories of proteins for the first time. However, exon junction, exon, and tiling arrays provide only composite pictures of each gene's alternative splicing because they cannot distinguish whether the various splicing events they predict occur in the same or different transcripts. Several other promising approaches have recently been developed to profile such alternative splicing, including splicing patterns of single mRNA molecules, but they have not yet been scaled to examine large numbers of genes (Yeakley et al. 2002Go; Zhu et al. 2003Go).

Recent work has begun to establish the effects and interactions of the conserved splicing machinery on a genome-wide scale. Splicing arrays representing each yeast intron, exon junction, and 3' exon were used to examine the splicing effects of 18 nonessential splicing factors in Saccharomyces cerevisiae (Clark et al. 2002Go). Because both intron and exon sequences were probed, different types of splicing phenotypes could be observed in S. cerevisiae mutants of these splicing factors. For example, yeast Dbr1 is required for lariat debranching but not splicing itself, and loss of Dbr1 resulted in genome-wide increases in intron levels but had no effect on exon-junction levels (Clark et al. 2002Go). Interestingly, no fixed relationship was found between splicing factors' effects on unspliced and spliced mRNA levels (Clark et al. 2002Go). Clustering of these effects uncovered both similar and differential specificities of the profiled splicing factors. The splicing patterns of different transcript groups showed different dependencies on the yeast mRNA splicing factors studied, suggesting some degree of splicing specificity by these splicing factors. The relative similarities between these splicing factors' effects were also revealed by clustering (Clark et al. 2002Go), though few of these similarities are recapitulated by other hierarchical clustering approaches (Barrass and Beggs 2003Go). This work has nonetheless uncovered a previously unknown diversity of splicing factor effects in yeast.

Splicing factor targets have also been comprehensively identified by chromatin and mRNA coimmunoprecipitation, followed by high-throughput analysis (Kotovic et al. 2003Go; Ule et al. 2003Go; Blanchette et al. 2004Go). The Drosophila mRNA splicing and export factor dU2AF50 appears to have both intron-containing and intron-less targets, as determined by microarray analysis of mRNAs affected by dU2AF50 knockdown and of coimmunoprecipitating mRNA (Blanchette et al. 2004Go). However, the most highly bound 3% of mRNAs were greatly enriched for those of intron-containing genes. Similarly, the genomic targets of the S. cerevisiae U1 snRNP protein Prp42 were overwhelmingly intron-containing genes (Kotovic et al. 2003Go). Last, the mRNA targets and mRNA-binding sites of the neuronal splicing regulator Nova-1 were identified by a new sequencing-based UV-cross-linking and immunoprecipitation (CLIP) approach (Ule et al. 2003Go). The Nova-bound mRNA included novel and known target transcripts (Buckanovich and Darnell 1997Go) that were functionally enriched for inhibitory synapse components, generating new hypotheses about the role of Nova proteins as autoantigens in neuronal ataxia (Ule et al. 2003Go). Since up to 15% of disease-causing mutations are thought to affect splicing (Krawczak et al. 1992Go), these large-scale approaches have the potential to not only identify the targets of the splicing machinery, but also to help uncover mechanisms of disease.


    mRNP transport and localization
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 

Nuclear mRNP export networks

After transcription and processing, mRNAs are transported from the nucleus to the cytoplasm by proteins that bind and export mRNAs as mRNP complexes (Stutz and Izaurralde 2003Go). Until recently, little was known about the mRNA composition of these export-competent mRNPs or the specificity of their bound mRNA export factors. To begin to address these issues, the genes cotranscriptionally bound and mRNAs bound and exported by a range of mRNA export factors, including Hpr1, Tho2, Sub2/UAP56, Yra1, Nab2, Npl3, Mex67/Nxf1, Nxf2, Nxf3, and p15, have been globally identified (Herold et al. 2003Go; Hieronymus and Silver 2003Go; Blanchette et al. 2004Go; Rehwinkel et al. 2004Go; Rodriguez-Navarro et al. 2004Go; Yu et al. 2004Go). These approaches have begun to illuminate the specificity of mRNA export factor interactions with DNA and RNA and the specificity of the mRNP export process in general.

mRNA export factors at all stages of export, from transcriptional coupling to translocation, show specificity for transcript and gene subpopulations (Herold et al. 2003Go; Hieronymus and Silver 2003Go; Rehwinkel et al. 2004Go; Yu et al. 2004Go). This export factor specificity often manifests as preferential association of an export factor to some transcripts or genes relative to others. Drosophila THO complex members Hpr1 and Tho2 are essential for nuclear export of a fifth and a seventh of all transcripts respectively, as shown by a drop in the transcripts' cytoplasmic levels upon Tho2 and Hpr1 knockdown (Rehwinkel et al. 2004Go). Consistent with this export specificity, yeast Tho2 is preferentially recruited to approximately a sixth of the genome (Yu et al. 2004Go). Yra1, an mRNA export factor that binds Tho2, preferentially associates with a sixth to an eighth of the genome (Yu et al. 2004Go) and an eighth of the transcriptome in yeast (Hieronymus and Silver 2003Go). Mex67/NXF1, an export factor thought to facilitate mRNP translocation through the nuclear pore, binds almost a fifth of transcripts in yeast. Drosophila NXF1 exports a fifth to a half of the transcriptome, as determined by cytoplasmic mRNA analysis after knockdown for 2 d, and a third to three-fourths of the transcriptome by analysis after knockdown for 4 d (Herold et al. 2003Go). Nearly identical populations of transcripts show mRNA export dependence on NXF1, the NXF1 coreceptor p15, and the Sub2 homolog UAP56 (Herold et al. 2003Go). Interestingly, NXF2, NXF3, and the transport factor Crm1 do not appear to be essential for mRNA export in Drosophila cells. Together these studies provide growing evidence for specificity in mRNA export and suggest that bulk mRNA export mutant phenotypes do not provide information about the number of mRNAs exported by the mutated protein (Rehwinkel et al. 2004Go).

The specificity of mRNA export factors often reflects functional specificity for transcripts that encode proteins with related functions. The export factors Tho2, Yra1, Npl3, and Mex67 all preferentially associate with mRNA and/or DNA targets enriched for targets of specific functional classes. For example, Npl3-bound genes represent many functions, including ribosomal biogenesis, reproduction and development, and stimulus perception (Yu et al. 2004Go). Yra1's DNA and RNA targets are enriched for targets encoding cell wall proteins, carbohydrate metabolism genes, and translation factors (Hieronymus and Silver 2003Go; Yu et al. 2004Go). Genes up-regulated upon Drosophila Tho2 knockdown are significantly enriched for DNA repair proteins and mRNA export factors, likely reflecting compensation for the hyper-recombination and mRNA export phenotypes of Tho2 depletion (Rehwinkel et al. 2004Go). System-wide investigation into mRNA export factor targets has therefore revealed functional organization in mRNA export.

While the basis of mRNA export factor specificity is not known, this specificity is linked to transcription in several ways. mRNA binding of the cotranscriptionally recruited export factor Yra1 correlates with transcriptional frequency on a whole-genome scale, while binding of Mex67 does not (Hieronymus and Silver 2003Go). Similarly, the DNA association of export factors Yra1, Tho2, Npl3, and Nab2 shows varied levels of correlation with transcriptional frequency (Yu et al. 2004Go). Systematic analysis of mRNA export factor binding therefore suggests that transcriptional recruitment may play a role in establishing mRNP specificity for whole transcript populations.


RNA-protein interactions in mRNA localization

After nuclear export, many mRNAs are localized to subcellular regions by various mechanisms as an efficient method of localizing protein expression (Tekotte and Davis 2002Go). Recent studies have begun to characterize the extent and mechanisms of mRNA localization using genomic approaches (Diehn et al. 2000Go; Takizawa et al. 2000Go; Tenenbaum et al. 2000Go; Lerner et al. 2003Go; Shepard et al. 2003Go; Lopez de Silanes et al. 2004Go). A starting point for many of these studies has been the systematic identification of the mRNAs bound and localized by specific localization factors, such as the yeast She proteins that asymmetrically localize transcripts to the bud-tip of daughter cells (Takizawa et al. 2000Go; Shepard et al. 2003Go). More than 20 transcripts were found to be bud-tip localized by She1, She2, and She3 using coimmunoprecipitation and microarray analysis (Takizawa et al. 2000Go; Shepard et al. 2003Go). Subsequent characterization of one such transcript, IST2, found that its She-dependent localization is maintained by a septin membrane diffusion barrier at the bud neck; this work thereby demonstrated multiple mechanisms of localization establishment and retention (Takizawa et al. 2000Go). Proteins involved in translational regulation and stability as well as mRNA localization, such as the Hu/ELAV family, have also been studied by these and other genomic approaches as discussed below (Diehn et al. 2000Go; Tenenbaum et al. 2000Go; Lerner et al. 2003Go; Lopez de Silanes et al. 2004Go; see Translation and Its Regulation). Thus, genome-wide analysis of mRNA localization factors' targets and interactions has begun to expand our understanding of mRNA localization mechanisms.


Quantitative kinetics and dynamics of mRNP transport

In addition to mapping mRNP component interactions, systems-level understanding of mRNA export and localization requires quantitative knowledge of transport kinetics. The kinetics of mRNP movement has been studied at several levels: intranuclear movement, nuclear-cytoplasmic export, and cytoplasmic movement and localization. A number of approaches have been used to observe and quantify these forms of mRNP movement in living cells (Singer 2003Go). To observe endogenous mRNPs in live cells, fluorescent proteins such as GFP have been used to tag mRNA transport proteins directly and mRNAs indirectly through insertion of RNA hairpin sequences that bind proteins such as GFP-U1A or GFPMS2 (e.g., Bertrand et al. 1998Go; Brodsky and Silver 2000Go; Rook et al. 2000Go; Calapez et al. 2002Go; Carrero et al. 2003Go; Forrest and Gavis 2003Go; Fusco et al. 2003Go).

Intranuclear mRNA kinetics have recently been quantified in real time from the point of transcription onward (Janicki et al. 2004Go; Shav-Tal et al. 2004Go). These kinetics were established in a mammalian cell system that enables in vivo visualization of transcription and transcript movement; the transcription locus was visualized via a YFP-lac repressor bound to hundreds of upstream lac operator sequences, and the transcribed mRNA was tagged with MS2 hairpins and visualized by bound YFP-MS2 (Janicki et al. 2004Go). Using this system, mRNA synthesis is detected immediately upon transcriptional induction, though the gene locus remains condensed for several minutes post-induction. mRNA levels show an initial linear increase that slows over time, possibly due to inhibition of further transcription by accumulated transcripts. By 150 min, the mRNA localizes to the transcriptional locus as well as to a particulate pattern throughout the nucleoplasm (Janicki et al. 2004Go). The mRNA particles travel through the nucleoplasm at 0.3-0.8 µm/sec in a pattern that fits a simple diffusion model (Shav-Tal et al. 2004Go). Their diffusion coefficient decreases linearly with temperature (Shav-Tal et al. 2004Go), consistent with simple diffusion and known nuclear kinetics of mRNP proteins (Calapez et al. 2002Go). Interestingly, this temperature relationship depends on the mRNA binding ability of the tested mRNP protein PABP2 (Calapez et al. 2002Go). Two types of nuclear mRNA diffusion were observed: simple diffusion where mRNP movement does not deviate from the diffusion model, and partially corralled diffusion where the mRNP movement appears directionally hindered (Shav-Tal et al. 2004Go). Since these studies are carried out with a single artificial transcript, it remains to be seen whether other mRNAs exhibit different or diverse nuclear kinetics.

In contrast to work on intranuclear transport, elucidation of nuclear export kinetics has focused on protein movement rather than mRNA movement (Ribbeck and Gorlich 2001Go; Siebrasse and Peters 2002Go; Gorlich et al. 2003Go). Investigation of protein export has led to fruitful computational modeling of nuclear protein transport (Smith et al. 2002Go; Gorlich et al. 2003Go; Kustanovich and Rabin 2004Go) and generated several theoretical models for protein translocation mechanisms (Rout et al. 2000Go; Ribbeck and Gorlich 2001Go; Kustanovich and Rabin 2004Go). However, little is known about the kinetics of endogenous mRNA translocation. mRNA export is assumed to not be rate limiting in the overall mRNA localization process, since mRNA does not accumulate at the nuclear periphery under normal conditions. Nuclear export of two transcripts, LT-{alpha} and {beta}-globin, has also been shown to occur within 2-5 min with first-order reaction kinetics (Audibert et al. 2002Go). In the future, it will be interesting to see whether mRNP translocation fits current models of receptor-bound protein translocation, since mRNA translocation has been proposed to occur through general mRNA export receptors that associate with mRNA-binding adaptors (Stutz and Izaurralde 2003Go).

The kinetics of cytoplasmic mRNA localization have been established for specific transcripts in yeast, Drosophila, and mammalian cells by in vivo visualization (Bertrand et al. 1998Go; Rook et al. 2000Go; Fusco et al. 2003Go). Early work developed systems for visualizing cytoplasmic movements of MS2 stem loop-containing mRNA constructs using GFP-MS2 (e.g., Beach et al. 1999Go; Takizawa and Vale 2000Go). A yeast mRNA construct containing the ASH1 3'-UTR and MS2 stem loops was subsequently found to move to the bud-tip at speeds consistent with known myosin V motor speeds (Bertrand et al. 1998Go). In mammalian hippocampal neurons, a dendritically localized mRNA construct with the calcium/calmodulin-dependent kinase II{alpha} 3'-UTR was found to move at roughly one-tenth this speed (Rook et al. 2000Go). While the bud-tip-localized mRNA's movement was largely directional (Bertrand et al. 1998Go), the calcium/calmodulin-dependent kinase II{alpha} construct had three distinct classes of movement: oscillatory movement and directional movement toward and away from the cell body (Rook et al. 2000Go). The mRNA speed did not change upon neuronal depolarization, but movement down the dendrites doubled and oscillatory motion decreased. Kinetic studies have therefore provided a window into dynamic mRNA movements and their biological regulation in vivo.

In addition to the kinetics of localized mRNAs, the movements of single nonlocalized mRNA molecules have also been recently characterized (Fusco et al. 2003Go). Nonlocalized mRNAs have been hypothesized to diffuse through the cytoplasm; however, a lacZ-MS2 mRNA construct containing the SV40 or human growth hormone reporter 3'-UTR sequences exhibited static behavior, diffusion, and directed movement at speeds an order of magnitude slower than seen in neurons. Interestingly, microtubules appeared to be required for the "corralling" of diffusive movement. These studies are the first to establish the kinetics of mRNP movement and provide data for future modeling of mRNPs in the nucleus, cytoplasm, and between.


    Translation and its regulation
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 

Global translational state and regulation

Translational state and its regulation have recently begun to be defined on a genomic scale (Diehn et al. 2000Go; Kuhn et al. 2001Go; Kash et al. 2002Go; Arava et al. 2003Go; Lerner et al. 2003Go; Toyoda et al. 2003Go; Qin and Sarnow 2004Go). Global translational state has increasingly been assayed through microarray analysis of polyribosome-associated mRNAs relative to total or nonribosome-associated mRNA levels. This approach has been used to determine the genome-wide translation levels of yeast mRNAs (Arava et al. 2003Go). For almost all translated genes, the majority of mRNA molecules were actively translated. Transcripts were bound by ribosomes at one-fifth the maximal ribosome density on average, consistent with translational initiation being rate limiting.

Global translational responses to changing metabolic and cellular state have also recently been established for the first time. Rapid transfer of yeast from fermentable glucose to nonfermentable glycerol carbon sources results in a global decrease in translation (Kuhn et al. 2001Go). However, much of this decrease is accompanied by a decrease in overall mRNA abundance, making it difficult to distinguish between transcriptional, translational, and mRNA stability alterations. Nonetheless, 2% of the transcripts showed a more than twofold decrease in translational level without a corresponding decrease in overall abundance (Kuhn et al. 2001Go). More than two-thirds of these transcripts encoded ribosomal proteins, and conversely all ribosomal protein transcripts showed translational repression upon carbon shift to some degree. Interestingly, this ribosomal regulation resembles 5'-terminal oligopyrimidine (5' TOP)-mediated translational repression of mammalian ribosomal protein transcripts upon serum starvation (Keene 2001Go).

In addition to translational state, fundamental mechanisms of translation have been investigated through genome-wide studies (Johannes et al. 1999Go; Diehn et al. 2000Go; Tenenbaum et al. 2000Go; Grolleau et al. 2002Go; Baron-Benhamou et al. 2003Go; Lerner et al. 2003Go; Preiss et al. 2003Go; Rajasekhar et al. 2003Go; Toyoda et al. 2003Go; Gera et al. 2004Go; Qin and Sarnow 2004Go). Translation initiation and localization have been major focuses of such work. Some of the earliest whole-genome translation studies examined the genome-wide requirements for initiation factors eIF4GI and eIF4GII in translation (Johannes et al. 1999Go; Baron-Benhamou et al. 2003Go). eIF4G, as part of the eIF4F complex, associates with cap binding eIF4E and increases its affinity for the 5' m7GpppN cap (von der Haar et al. 2004Go). This cap-eIF4E-eIF4G chain of interactions is essential for full cap-dependent translation and consequently has the potential to regulate translation. Transcripts that do not require full eIF4G levels for translation have been identified by microarray profiling of polysome-associated mRNAs in poliovirus-infected cells that have reduced eIF4G concentrations (Johannes et al. 1999Go). A small percentage of transcripts show heightened polysome association at these lower eIF4G levels, representing transcripts that are potentially translated through cap-independent initiation at internal ribosome entry sites (IRESs). The identified transcripts included mRNAs encoding immediate early factors and mitogen-activated regulators (Johannes et al. 1999Go), consistent with subsequent work demonstrating preferential polysome association of IRES-containing transcripts during mitosis (Qin and Sarnow 2004Go).

Besides providing a broad window into translational initiation, genomic approaches have established patterns of translational localization (Diehn et al. 2000Go; Lerner et al. 2003Go; Toyoda et al. 2003Go). Localized translation occurs predominantly at the ER, where proteins are cotranslationally threaded into and through the membrane (Nicchitta 2002Go). Transcripts of such membrane and secretory proteins have been identified by microarray analysis of membrane-associated mRNAs and polyribosomes (Diehn et al. 2000Go; Lerner et al. 2003Go; Toyoda et al. 2003Go). Significantly, transcripts of some cytoplasmic proteins have been found to associate with membrane-bound polysomes, suggesting that membrane-associated translation may not confer a secretory or membrane fate in all cases (Diehn et al. 2000Go; Lerner et al. 2003Go).


mRNP interactions in translation and translational regulation

Translational regulation is mediated by a host of mRNA-binding proteins and microRNAs (miRNAs) that make up mRNP and miRNP complexes. These regulatory RNPs often affect transcript stability and localization as well as translation. Until recently, the mRNA targets and composition of such RNPs were identified and studied in small numbers. A more comprehensive understanding of RNP interactions and targets in translation has subsequently emerged from microarray-based analysis of polysomal and coimmunoprecipitating transcripts (e.g., Johannes et al. 1999Go; Tenenbaum et al. 2000Go; Brown et al. 2001Go; Miyashiro et al. 2003Go; Gerber et al. 2004Go; Inada and Guthrie 2004Go).

Translational RNA-binding proteins (RBPs) have widely been found to bind specific transcript subpopulations, shown for the first time through large-scale identification. These RBPs include HuR (Lopez de Silanes et al. 2004Go), HuB (Tenenbaum et al. 2000Go), Scp160p (Li et al. 2003Go), the pumilio-fem-3-binding factor (Puf) family (Gerber et al. 2004Go), {alpha}CP2 (Waggoner and Liebhaber 2003Go), and Lhp1 (Inada and Guthrie 2004Go). The mRNA populations associated with these proteins vary in size from ~1% to 15% of the mRNAs assayed. The bound mRNAs comprise unique but often overlapping sets. Mammalian translational and stability regulator HuB and eIF4E bind a tenth of assayed transcripts relative to a negative control immunoprecipitation in embryonal carcinoma stem cells, whereas the poly(A)-binding protein PABP1 has a more ubiquitous binding pattern consistent with its broad role in mRNA processing and translation (Tenenbaum et al. 2000Go). Similarly, HuR binds ~15% of transcripts (Lopez de Silanes et al. 2004Go). The yeast Puf translational repressor proteins also each bind distinct transcript populations with 12% bound by more than one Puf protein (Gerber et al. 2004Go). The partial redundancy of these translational regulators suggests that they bind transcripts in a combinatorial manner (Tenenbaum et al. 2000Go; Keene 2001Go).

Large-scale characterization of translational RBP targets has also identified possible sequence determinants of translational mRNP specificity. The Hu proteins bind individual mRNAs containing 3' AU-rich elements (AREs), and correspondingly the HuB target transcripts were found to be enriched in U-rich motifs (Tenenbaum et al. 2000Go). Similar sequence motifs were identified from the HuR-bound transcript population and were then successfully used to predict additional HuR target transcripts (Lopez de Silanes et al. 2004Go). Puf3-, Puf4-, and Puf5-bound mRNA targets were also shown to contain distinct sequence motifs that selectively bind to the corresponding protein in vitro (Gerber et al. 2004Go). Nonetheless, these individual sequences are neither universally nor exclusively found in the target mRNA populations. Future work will be needed to help resolve the combinatorial contributions of mRNA sequence, structure, and secondary protein interactions in establishing mRNP specificity.

mRNP binding specificity is hypothesized to generate specificity in translational regulation. This translational specificity might mediate coregulation of mRNAs encoding functional protein classes and thereby functionally organize gene expression (Keene and Tenenbaum 2002Go). Indeed, translational regulatory proteins specifically bind transcripts encoding functional and colocalized protein classes (Brown et al. 2001Go; Miyashiro et al. 2003Go; Waggoner and Liebhaber 2003Go; Gerber et al. 2004Go). For example, the Puf proteins each bind transcript groups enriched in mRNAs encoding proteins with common localization and functions: Puf1 and Puf2 targets encode membrane-associated transport and vesicular trafficking proteins, Puf3-bound mRNAs encode mitochondrial translation factors and regulators, and Puf4 and Puf5 targets represent nuclear structure and processing factors (Gerber et al. 2004Go). In addition to this steady-state functional specificity, dynamic changes in mRNP specificity are induced by cellular changes such as differentiation (Tenenbaum et al. 2000Go). mRNP specificity and its dynamics in translation therefore act to post-transcriptionally organize gene expression.

The specific mRNA interactions of translational regulators are also central to disease. Fragile X syndrome, the most common form of inherited mental retardation, is caused by lack of the mRNA-binding fragile X mental retardation protein (FMRP). FMRP acts as a neuronal translational repressor that is essential for proper synaptic development (Antar and Bassell 2003Go). Several hundred mRNA targets bound by rat FMRP have been identified by coimmunoprecipitation from brain extracts and microarray analysis (Brown et al. 2001Go). Half of these binding targets were found to be translationally regulated by human FMRP, as demonstrated by increases or decreases in their polysomal binding in Fragile X patient cells. Two-thirds of the regulated transcripts contain a FMRP-binding G-quartet structure identified in vitro, and many encode proteins involved in synaptic development and cognition (Darnell et al. 2001Go). To further understand FMRP binding specificity in vivo, FMRP targets were identified in situ in hippocampal neurons by antibody-positioned RNA amplification (Miyashiro et al. 2003Go). In this new technique, FMRP-associated transcripts are reverse transcribed in situ using anti-FMRP antibody-targeted probes, and the resulting antibody-cDNA complexes are immunoprecipitated and identified by microarray. The identified FMRP targets require FMRP for normal transcript distribution in the brain and for normal protein expression levels, demonstrating the functional effects of FMRP on its target transcripts. These and other results have led to the model that FMRP suppresses translation of specific mRNAs during and after mRNA localization to the synapse in the absence of proper stimulus. Drosophila FMRP has been shown to translationally repress individual homologs of murine FMRP1 target mRNAs. Deletion of one such homolog, futsch, suppressed the neuronal defects of Drosophila lacking the FMR1 homolog, while overexpression led to neuromuscular junction phenotypes (Zhang et al. 2001Go). Together these studies provide an unbiased view of FMRP's molecular function at mRNA and generate further hypotheses as to how FMRP abrogation gives rise to Fragile X phenotypes.

In contrast to the growing body of identified translational mRNP targets, miRNA targets and miRNP composition have proven difficult to identify. While computational and genomic approaches have been applied to miRNA prediction (Bartel 2004Go) and quantification (Liu et al. 2004Go; Schmittgen et al. 2004Go), the identification of miRNP targets has largely remained in the realm of low-throughput investigation. Computational approaches to miRNA target prediction have begun to address this problem and have generated hypotheses about miRNA functions based on the target mRNA functions (Bartel 2004Go). Extension of RNP analysis approaches, such as large-scale polysomal and coimmunoprecipitating mRNA analysis discussed here, to the problem of miRNP target analysis has great potential to expand global understanding of miRNP biology in the future.


    mRNA degradation
 Top
 Abstract
 Systems approaches in mRNP...
 Transcription
 Splicing
 mRNP transport and localization
 Translation and its regulation
 mRNA degradation
 An emerging mRNP systems...
 Acknowledgments
 References
 
mRNA degradation plays an essential role in the mRNP lifecycle as a determinant of steady-state mRNA levels and a source of mRNA regulation and quality control. General mRNA decay can occur by two pathways: 5' -> 3' exonucleolytic degradation by the exonuclease Xrn1 and 3' -> 5' degradation by the exosome complex (Maquat 2004Go; Parker and Song 2004Go). Degradation of aberrant transcripts occurs by nuclear degradation, NMD, and NSD. These pathways and the proteins that mediate them have been extensively delineated through studies of individual transcripts. Global studies have now begun to examine and model these pathways.

Genome-wide mRNA degradation patterns have been established in a number of systems, including Escherichia coli (Bernstein et al. 2002Go; Selinger et al. 2003Go), S. cerevisiae (Holstege et al. 1998Go; Wang et al. 2002Go; Grigull et al. 2004Go), Arabidopsis (Gutierrez et al. 2002Go), and human cells (Fan et al. 2002Go; Raghavan et al. 2002Go; Yang et al. 2003Go). These studies most commonly define the mRNA decay rates of thousands of genes by profiling changes in mRNA levels upon transcriptional inhibition (e.g., Grigull et al. 2004Go). To avoid potential effects of transcriptional inhibition on mRNA stability, microarray profiling of nuclear run-on products has also recently been used (Fan et al. 2002Go). mRNA stability has not been found to correlate with ORF or operon length, mRNA abundance, or ribosome density (Bernstein et al. 2002Go; Wang et al. 2002Go). Intriguingly, mRNA stability is instead often correlated among transcripts encoding constituents of a common macromolecular complex or functional class. Decay rates are highly similar among mRNAs encoding components within stoichiometric complexes such as the nucleosome, proteasome, and ribosome (Wang et al. 2002Go). mRNAs encoding central metabolism proteins have lower degradation rates than average (Bernstein et al. 2002Go; Wang et al. 2002Go; Yang et al. 2003Go; McCarroll et al. 2004Go). Transcripts encoding transcriptional machinery and factors (Wang et al. 2002Go; Yang et al. 2003Go; McCarroll et al. 2004Go) as well as translation and ribosome biogenesis machinery (Wang et al. 2002Go; Grigull et al. 2004Go) are rapidly turned over. The high turnover of transcriptional and translational machinery suggests that mRNA degradation may facilitate rapid regulation of these central gene expression processes.

Large-scale studies of mRNA decay have lent support to hypotheses that differential mRNA degradation plays a general role in regulation of gene expression. Individual gene expression has been shown to be affected by mRNA degradation induced by a variety of cellular signals and conditions (Thomson et al. 1999Go). On a large scale, cellular signaling and stresses such as heat shock and UV irradiation have been shown to induce alterations in genome-wide mRNA degradation patterns (Fan et al. 2002Go; Raghavan et al. 2002Go). Approximately one-tenth of transcripts are found to be destabilized and 3% are stabilized under stress conditions, for example (Fan et al. 2002Go). Thus, while transcription is widely accepted as a major global regulator of gene expression, mRNA degradation also seems to have a significant regulatory role.

The molecular foundation of global mRNA degradation specificity is not well understood. While human mRNA decay rates correlate with the presence of AREs (Raghavan et al. 2002Go; Yang et al. 2003Go), AREs are not found comprehensively nor exclusively in rapidly decaying mRNAs (Bernstein et al. 2002Go; Yang et al. 2003Go). New sequence motifs have been found in rapidly decaying transcript populations but cannot account for the stability differences of all transcripts (Yang et al. 2003Go). It is currently unknown how sequence elements establish the global mRNA stability patterns or how the interplay between mRNA stability, translation, and localization affects these patterns. In the future, systems-level investigation of mRNA degradation pathways may identify target populations that define sequence and other determinants of mRNA decay.

Mechanistic insights into mRNA degradation are beginning to emerge from global expression and stability analysis of degradation machinery mutants (Lelivelt and Culbertson 1999Go; He et al. 2003Go; Grigull et al. 2004Go). Specific stability factors, such as the yeast Hu homolog Pub1, deadenylation factors Ccr4 and Pan2, and translation/stability factor Puf4, regulate the stability of transcripts enriched in translational machinery mRNAs (Grigull et al. 2004Go). The general yeast decapping enzyme Dcp1 and the 5' -> 3' exonuclease Xrn1 surprisingly affect the levels of only 15% and 20% of transcripts upon deletion, respectively (Lelivelt and Culbertson 1999Go; He et al. 2003Go). This relatively small target transcript population suggests that 3' -> 5' degradation, rather than 5' -> 3' Xrn1-mediated degradation, may be the central mRNA decay pathway in yeast. Alternatively, expression profiling of knockout mutants might not reveal the full set of 5' -> 3' degradation targets if transcription or other degradation pathways compensate for loss of 5' -> 3' degradation in the mutants.

The global effects of NMD on the yeast transcriptome have also been established using mutant profiling. Deletion of NMD factors Upf1, Nmd2, and Upf3 each alter the levels of a tenth of the yeast transcriptome, with almost complete overlap between these factors (Lelivelt and Culbertson 1999Go; He et al. 2003Go). Their targets are enriched for transcripts of cellular defense proteins and underenriched in transcripts encoding protein synthesis machinery (Lelivelt and Culbertson 1999Go; He et al. 2003Go), in contrast to Ccr4 and Pan2 deadenylase targets (Grigull et al. 2004Go). A third of the yeast Upf targets represent structural classes of transcripts, such as transposable element, pseudogene, and bicistronic transcripts. These Upf target classes include intron-containing genes (He et al. 2003Go), despite the observation that the yeast Upf3 deletion mutant shows little accumulation of unspliced transcripts by splicing array (Clark et al. 2002Go). Since codeletion of Xrn1 with any of the Upf proteins has a similar effect on mRNA levels as deletion of Xrn1 alone, the general Xrn1 pathway may mediate 5' -> 3' degradation of the majority of NMD targets (He et al. 2003Go). In all, steady-state global studies have begun to define the scope and mechanisms of NMD. Similar studies will undoubtedly establish the contributions of different mRNA degradation pathways to total cellular mRNA decay in the future.

An ultimate goal of investigations into global mRNA degradation is the reconstruction of decay regulatory networks. In the past several years, both general and nonsense-mediated degradation have been modeled computationally (Cao and Parker 2001Go, 2003Go). General mRNA degradation has been modeled to include transcription, deadenylation, deadenylation-independent and -dependent decapping, and 3' -> 5' and 5' -> 3' cytoplasmic degradation (Cao and Parker 2001Go). This model assumes zero-order transcription, first-order decay steps, and lack of coupling between degradation steps. Nonetheless, this framework fairly accurately models the degradation of PGK1 and MFA2 transcripts using a combination of estimated and experimentally determined rate constants. The computational model was subsequently extended to NMD by introducing degradation initiated by aberrant mRNA recognition and followed by rapid decapping, accelerated deadenylation, and degradation upon varying le