Genes and Development

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


GENES & DEVELOPMENT 18:1513-1526, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 0890-9369/ $5.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kielkopf, C. L.
Right arrow Articles by Green, M. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kielkopf, C. L.
Right arrow Articles by Green, M. R.
Related Content
Right arrow Post-transcriptional Control
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

REVIEW

U2AF homology motifs: protein recognition in the RRM world

Clara L. Kielkopf1,3, Stephan Lücke2 and Michael R. Green2,4

1 Department of Biochemistry and Molecular Biology, Johns Hopkins University Bloomberg School of Public Health, Baltimore, Maryland 21205, USA; 2 Howard Hughes Medical Institute, Programs in Gene Function and Expression and Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA


    Abstract
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
Recent structures of the heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) have revealed two unexpected examples of RNA recognition motif (RRM)-like domains with specialized features for protein recognition. These unusual RRMs, called U2AF homology motifs (UHMs), represent a novel class of protein recognition motifs. Defining a set of rules to distinguish traditional RRMs from UHMs is key to identifying novel UHM family members. Here we review the critical sequence features necessary to mediate protein–UHM interactions, and perform comprehensive database searches to identify new members of the UHM family. The resulting implications for the functional and evolutionary relationships among candidate UHM family members are discussed.

[Keywords: U2AF; RNA recognition motif; protein–protein interaction; RNA-binding domain; PUMP; splicing factor]


The processes of RNA splicing, transport, capping, editing, and polyadenylation are heavily dependent on protein factors that recognize the pre-mRNA and assemble the appropriate pre-mRNA processing complexes. Surprisingly, the many different protein factors that guide pre-mRNA modification pathways are composed of a limited number of conserved, modular RNA-binding domains (Burd and Dreyfuss 1994Go). Of these, the RNA recognition motif (RRM) domain is by far the most abundant type of eukaryotic RNA-binding motif. In addition to associations between protein and RNA, protein–protein interactions are essential to recruit catalytic components to sites of RNA modification and to coordinate pre-mRNA processing with other cellular pathways. Interestingly, traditional protein interaction domains, such as SH2, SH3, and WW motifs, are rarely observed in pre-mRNA processing factors (e.g., see Shatkin and Manley 2000Go; Zhou et al. 2002Go), implying that the ability to interact with other proteins may reside in the sequences previously thought to be involved in RNA binding. Consistent with this idea, recent structures of the heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) have revealed two unexpected examples of RRM-like domains with specialized features for protein recognition (Kielkopf et al. 2001Go; Selenko et al. 2003Go). In light of this structural information, we call these unusual RRMs U2AF homology motifs (UHMs) to reflect their distinct role in protein recognition. Here, the critical sequence features necessary to mediate protein–UHM interactions are reviewed and formulated in a manner that has permitted a comprehensive database search designed to identify members of the UHM family. The resulting implications for the functional and evolutionary relationships among candidate members of the UHM family are discussed. This review represents a first step toward distinguishing canonical RRMs from UHMs, and thereby contributes toward a major goal of the postgenomic era (Thornton et al. 2000Go): to convert genomic sequences into testable functional hypotheses.


    Structural features of RNA recognition by canonical RRMs
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
The RNA-binding function of the canonical RRM domain has been extensively investigated over the last two decades. The most conserved RRM signature sequence is an eight-residue motif called ribonucleoprotein 1 (RNP1; Adam et al. 1986Go; Sachs et al. 1986Go), which has the consensus [RK]-G-[FY]-[GA]-[FY]-[ILV]-X-[FY] (where X is any amino acid). A second six-residue region of homology, called RNP2, is typically located ~30 residues N-terminal to RNP1 (Lahiri and Thomas 1986Go; Dreyfuss et al. 1988Go), and has the consensus [ILV]-[FY]-[ILV]-X-N-L. Additional conserved amino acids define an ~80-residue domain that encompasses the RNA-binding function (Query et al. 1989Go; Scherly et al. 1989Go; Birney et al. 1993Go).

The three-dimensional structure of the canonical RRM domain was first determined for the RRM of U1A (Nagai et al. 1990Go; Hoffman et al. 1991Go). The RRM fold is composed of two {alpha}-helices packed against four antiparallel {beta}-strands with topology {beta}{alpha}{beta}{beta}{alpha}{beta}, which form an {alpha}/{beta} sandwich (Fig. 1). The RNP consensus motifs form two central {beta}-strands, with RNP1 in {beta}3 and RNP2 in {beta}1. Because of the alternating side-chain conformations of the pleated {beta}-sheet, some of the consensus residues maintain the core fold, whereas others are displayed on the surface for nucleic acid recognition. Structures of single RRMs complexed with RNA have been determined for the U1A-RRM bound to a hairpin loop of U1 snRNA (Oubridge et al. 1994Go; Price et al. 1998Go; Deo et al. 1999Go; Handa et al. 1999Go; Allain et al. 2000Go; Wang and Tanaka Hall 2001Go), and for a ternary complex of the U2 snRNP proteins U2B''-RRM/U2A' with a U2 snRNA hairpin loop (Price et al. 1998Go). In contrast to the isolated RRM of U1A, in most cases multiple RRMs are observed within a single polypeptide, with an average of two RRMs per protein (Letunic et al. 2004Go). The structures of several proteins composed of two tandem RRMs complexed with single-stranded RNA oligonucleotides have been determined, including the alternative splicing factor Sxl (Handa et al. 1999Go), PAB (Deo et al. 1999Go), pre-rRNA packaging protein nucleolin (Allain et al. 2000Go), and translation regulatory protein HuD (Wang and Tanaka Hall 2001Go). A comparison of these six different structures has revealed some common themes, as well as differences, in the mode of canonical RRM/RNA recognition. When all the structures are superimposed, structural equivalent hydrogen-bonds or stacking interactions are observed between single-stranded RNA and residues in the RNP1 and RNP2 motifs. A variety of sequences and RNA conformations are recognized by a variety of complementary hydrogen bonds with specific bases and differing arrangements of single or multiple RRMs.



View larger version (63K):
[in this window]
[in a new window]
 
Figure 1. Representative canonical RRM fold, from the structure of the U1A/RNA complex (PDB code 1URN [PDB] ). The N- and C-terminal ends of the RRM are indicated. The position and orientation of the RNA are represented with a ribbon diagram.

 

    U2AF, the UHM prototype
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
During pre-mRNA splicing, U2AF and other essential factors facilitate sequential association of small nuclear RNP particles (snRNPs), including U1, U2, U4, U5, and U6 snRNPs, with the borders of intervening pre-mRNA sequences (for review, see Brow 2002Go). Following assembly of the functional spliceosome, the intron is excised as a branched lariat by two catalytic steps, and adjacent exons are joined together to form the spliced mRNA. U2AF was identified as a factor that binds to pre-mRNA consensus sequences at the 3' splice site (3'SS), and is required for stable association of the U2 snRNP core spliceosome particle with the pre-mRNA branch point sequence (BPS) during the first ATP-dependent step of the splicing process (Complex A; Ruskin et al. 1988Go; Zamore and Green 1989Go). The importance of U2AF in vitro was soon corroborated by the discovery that both subunits are essential in Drosophila melanogaster (Kanaar et al. 1993Go; Rudner et al. 1996Go, 1998bGo) and Caenorhabditis elegans (Zorio and Blumenthal 1999bGo). Moreover, U2AF65 is an essential protein in Schizosaccharomyces pombe (Potashkin et al. 1993Go), and U2AF35 is necessary for vertebrate development (Golling et al. 2002Go). Because U2AF commits the pre-mRNA to the first critical ATP-dependent step of splicing, its binding is often regulated during alternative splicing (Smith and Valcarcel 2000Go). In humans, the products of five U2AF35-like open reading frames and the single U2AF65 subunit may form distinct heterodimers with different functional activities (Tupler et al. 2001Go; Shepard et al. 2002Go). In addition to U2AF, other non-snRNP protein factors are required for formation of Complex A, including Splicing Factor 1 (SF1) and Splicing Factor 3b (SF3b), a multisubunit component of the U2 snRNP (Kramer and Utans 1991Go).

To perform its role in RNA splicing, two central canonical RRM domains of U2AF65 recognize the polypyrimidine tract (Py-tract) in the pre-mRNA (Fig. 2). Binding of U2AF65 to the Py-tract is strengthened by cooperative protein–protein interactions with SF1 at the upstream BPS (Berglund et al. 1998Go; Rain et al. 1998Go) and with U2AF35, which contacts the downstream 3'SS consensus (Merendino et al. 1999Go; Wu et al. 1999Go; Zorio and Blumenthal 1999aGo). The C-terminal UHM domain of U2AF65 interacts with the N-terminal domain of SF1 (U2AF65-UHM/SF1-ligand; Rain et al. 1998Go). At the opposite end of the large subunit, the N-terminal domain of U2AF65 provides a ligand that interacts with the central UHM domain of U2AF35 (U2AF35-UHM/U2AF65-ligand; Zhang et al. 1992Go; Rudner et al. 1998bGo). Subsequently, entry of the U2 snRNP displaces SF1 by interacting with the BPS via the U2 snRNA (Nelson and Green 1989Go; Wu and Manley 1989Go; Zhuang and Weiner 1989Go; Query et al. 1994Go), and with the U2AF65 C-terminal domain via the SF3b subunit, SAP155 (Gozani et al. 1998Go; Habara et al. 1998Go). Once the U2 snRNP has contacted the pre-mRNA, U2AF is dissociated by conformational rear-rangements of the spliceosome components (Bennett et al. 1992Go; Chiara et al. 1997Go). In summary, key protein–protein interactions are mediated by the U2AF65-UHM, which interacts with SF1 and subsequently SAP155, and by the U2AF35-UHM, which interacts with the U2AF65 N terminus.



View larger version (52K):
[in this window]
[in a new window]
 
Figure 2. Diagram of protein–protein interactions mediated by the U2AF heterodimer during the initial stages of pre-mRNA splicing. The U2AF heterodimer (mediated by the U2AF35-UHM/U2AF65-ligand interaction) binding to the poly-pyrimidine tract (Py-tract) and 3'-splice site (3'SS) is facilitated by cooperative interactions between the U2AF65-UHM and SF1 at the branchpoint sequence (BPS). Subsequently, SF1 interactions with the U2AF65-UHM are replaced by SAP155. The N- and C-terminal ends of the proteins are indicated; the conserved ligand Trp residue (W) is also shown (discussed further in the text).

 

    Structural features of protein–protein interactions by UHMs
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
Based on primary sequence analysis, both the U2AF65 C-terminal domain and the central domain of U2AF35 were suspected to contain unusual variations of the RRM fold (Birney et al. 1993Go). However, the borders of the U2AF-UHM domains could not be assigned accurately because of sequence insertions in the first helix of the fold (Helix A) and the absence of aromatic amino acids in the RNP-like motifs that are normally critical for RNA recognition. The independent determination of the X-ray structure of the U2AF35-UHM/U2AF65-ligand complex (Kielkopf et al. 2001Go) and NMR structure of the U2AF65-UHM/SF1-ligand complex (Selenko et al. 2003Go) confirmed that both the C-terminal U2AF65 and central U2AF35 protein interaction domains adopt the {beta}{alpha}{beta}{beta}{alpha}{beta} RRM-fold topology. Within the RRM-like fold, the sequence insertions separating the RNP-like motifs increase the length of Helix A from three turns observed among canonical RRMs to five or eight turns for U2AF65 and U2AF35, respectively; the functional role of these sequence insertions, if any, is unclear. The parallel use of an RRM-like fold to recognize similar peptide ligands implies that the U2AF35-UHM and U2AF65-UHM domains represent a new type of protein–protein interaction motif, hitherto undetected amid the many canonical RRMs of pre-mRNA processing factors.

Three-dimensional structural information revealed unanticipated sequence features of U2AF35-UHM and U2AF65-UHM domains that enable interaction with short protein ligands. Despite low primary sequence identity (23%), ligand recognition by the different UHM domains is very similar (Fig. 3). In both the U2AF35-UHM/U2AF65-ligand and U2AF65-UHM/SF1-ligand structures, a critical Trp residue in the ligand sequence inserts into a tight hydrophobic pocket between the {alpha}-helices and the RNP1- and RNP2-like motifs (Kielkopf et al. 2001Go; Selenko et al. 2003Go). In addition to aliphatic residues, a conserved Arg–X–Phe motif (where X is any amino acid; see below) on the loop connecting the last {alpha}-helix (Helix B) and {beta}-strand of the UHM fold contributes to the Trp-binding pocket. The Arg residue in the loop (U2AF35-Arg 133 or U2AF65-Arg 452) forms an intramolecular salt bridge with the last Glu residue of Helix A (U2AF35-Glu 88 or U2AF65-Glu 405) that shields one face of the ligand-Trp, whereas the Phe residue (U2AF35-Phe 135 or U2AF65-Phe 454) encloses the opposite Trp face. In addition to the extensive interface with the ligand-Trp, a series of acidic residues in Helix A of the UHM interacts with basic residues at the N terminus of the protein ligand. Specifically, electrostatic interactions between U2AF35-Glu 84 and U2AF65-Lys 90 as well as U2AF65-Asp 401 and SF1-Arg 21 are observed at similar positions for both structures. The essential nature of acidic residues within Helix A, Phe 454, and the Trp-binding pocket was confirmed for the U2AF65-UHM/SF1-ligand complex by site-directed mutagenesis of the U2AF65-UHM or SF1-ligand followed by pulldown assays (Selenko et al. 2003Go). Likewise, the U2AF65-ligand-Trp 92 was found to contribute two orders of magnitude to the affinity of the U2AF35-UHM/U2AF65-ligand complex by isothermal titration calorimetry.



View larger version (46K):
[in this window]
[in a new window]
 
Figure 3. Structures of U2AF-UHM/ligand complexes. The UHM is shown in blue, the protein ligand is shown in yellow. Key interacting side chains are drawn in ball-and-stick representation. (A) The U2AF35-UHM bound to the U2AF65-ligand. (B) The U2AF65-UHM bound to the SF1-ligand. (C) Schematic representation of signature UHM protein–protein interactions shared by the two complexes.

 
In the U2AF35-UHM, a distinctive Trp residue (U2AF35-Trp 134) is observed at the X position of the Arg–X–Phe motif on the last loop of the UHM domain. Bulky aromatic residues such as Trp at this solvent-exposed position are especially rare among canonical RRM domains (1% of 676 annotated RRM domains in the SWISS-PROT database). The most frequently observed residues at the corresponding position of canonical RRMs are highly charged, including Glu (16%) and Lys (15%), as is also observed for the U2AF65-UHM (Arg–Lys–Phe). The unusual U2AF35-Trp 134 inserts between a series of unique Pro residues at the C terminus of the U2AF65-ligand, which are completely absent from the SF1-ligand of the U2AF65-UHM. The additional Trp/Pro interaction significantly contributes to the high affinity of the U2AF heterodimer (1.7 nM Kd; Kielkopf et al. 2001Go). Because the U2AF65-UHM/SF1-ligand complex lacks the corresponding Trp/Pro interaction, the affinity is relatively weak (~100 nM Kd; Selenko et al. 2003Go). The sequence differences in the ligands recognized by the U2AF35-UHM and U2AF65-UHM domains reflect the different functional roles of the complexes, which, respectively, maintain the constitutive U2AF35-UHM/U2AF65 heterodimer (Zhang et al. 1992Go; Rudner et al. 1998bGo) or form a transient U2AF65/SF1 intermediate during spliceosome assembly (Rutz and Seraphin 1999Go).

The structures of the U2AF35-UHM/U2AF65-ligand and U2AF65-UHM/SF1-ligand complexes revealed several sequence features that distinguish UHMs from canonical RRM domains. One striking feature of UHM domains is their atypical RNP-like motifs. The first residue of the RNP1-like motif and the second residue of the RNP2-like motif are unusual in that they are exposed on the {beta}-sheet surface rather than directly involved in RNA binding. Residues in these positions consist of aliphatic amino acids (U2AF35-Ala 47, Val 110, or U2AF65-Cys 379, Cys 429) as opposed to the basic and aromatic residues used for RNA recognition by canonical RRM domains. Other prominent distinguishing sequences include the Arg–X–Phe motif and acidic residues in Helix A (especially U2AF35-Glu 84/Glu 88 and U2AF65-Asp 401/Glu 405). As a consequence of the acidic nature of Helix A and lack of a basic RNP1 residue that usually contacts the RNA, the isoelectric points of UHM domains are remarkably low (pI 4.1 for the U2AF35-UHM and pI 4.3 for the U2AF65-UHM) compared with the typically basic character of canonical RRMs (pI >9) that function to bind anionic RNA ligands. The majority of the aliphatic residues lining the Trp-binding pocket (including U2AF35-UHM Leu 48, Val 85, Leu 130, and Ile 140 and their U2AF65-UHM counterparts Leu 380, Val 402, Leu 449, and Val 459), however, cannot be used to distinguish UHM from canonical RRM domains, because they also serve to preserve the RRM fold (Birney et al. 1993Go). One exception is the last aliphatic residue of the RNP2 motif (U2AF35-Ile 51 or U2AF65-Met 383), which contributes to the Trp-binding pocket and consequently differs from the conserved RNP2-Leu residue within the hydrophobic core of canonical RRM domains. Thus, at least three major sequence differences required for UHM–protein interactions distinguish UHMs from canonical RRM domains: (1) atypical RNP-like motifs, (2) an Arg–X–Phe motif in the last loop, and (3) an acidic character of Helix A.


    Identifying novel UHM family members
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
The discovery of two examples of RRM-like domains with specialized sequence characteristics for protein–protein interactions raised the question of whether the U2AF35-UHM and U2AF65-UHM domains represented a larger family of modular protein interaction domains. Examples of proteins with domains similar to the U2AF65 or U2AF35-UHM had been previously noted (Kielkopf et al. 2001Go; Selenko et al. 2003Go), including the C-terminal homodimerization domain of PUF60, which has previously been referred to as the PUF60, U2AF65, MUD2 protein–protein interaction (PUMP) domain (Page-McCaw et al. 1999Go). Several search strategies were used to further extend the UHM family. An initial consensus pattern for the U2AF65, U2AF35, PUF60, and Tat-SF1 UHM domains, defined automatically using the program PRATT (Brazma et al. 1996Go), proved too stringent as it only matched homologs of these proteins in a Scan-ProSite search of the SWISS-PROT/TrEMBL databases (Gattiker et al. 2002Go). Therefore, a target pattern ([ILM-VFC]-X-[LIFV]-X-[NSHT]-[ILMVC]-X(6,40)-[VLIT]-X(2)-[ED]-X(4,5)-G-X-[IVA]-X(4)-[VIL]-X(4,25)-[GV]-X-[VIAL]-[FY]-[VIL]-X-[FYC]-X(6,12)-[AC]-[LVMIC]-X-X-[LMIF]-X-[NG]-R-[WYKM]-[FY]-X-G-X(4,8)-[IVL]) was defined manually based upon conserved residues that either maintain the RRM-like fold (Birney et al. 1993Go) or mediate protein–protein interactions in the structures of U2AF35-UHM/U2AF65-ligand and U2AF65-UHM/SF1-ligand (Kielkopf et al. 2001Go; Selenko et al. 2003Go). A search of the SWISS-PROT/TrEMBL databases with this target pattern identified several novel UHM candidates. The UHM family was further extended by manually inspecting RRM family alignments (Prosite PS50102) and the results of iterative PHI-PSI BLAST searches (Altschul et al. 1997Go) for similarities to the signature Arg–X–Phe motif observed in the last loop of the prototype U2AF-UHM domains.

Sequence comparisons revealed that the principal features that distinguish UHM candidates from canonical RRMs are conserved among 12 novel UHM candidates (Table 1; Fig. 4A), including (1) poor conservation of amino acids in the RNP1- and RNP2-like consensus motifs that would normally bind RNA (first/third and second positions, respectively); (2) an Arg–X–Phe motif in the last loop of the RRM-like fold; and (3) conserved acidic residues in the predicted Helix A and a low isoelectric point (average pI ~4.5). Seven additional UHM candidates displayed a subset of the UHM characteristics. To further investigate the evolutionary relationship among members of the UHM and RRM families, a phylogenetic tree of the candidates was constructed using neighbor joining with correction for multiple substitutions (Fig. 4B; Thompson et al. 1997Go). A comparison with canonical RRMs whose role in RNA recognition has been established by structure determination (including U1A, SXL, PAB, HuD, and nucleolin) revealed that the 12 convincing UHM candidates occupy a phylogenetic branch distinct from canonical RRMs that diverged from a common ancestral domain. The dendrogram also confirms that several of the putative UHM candidates (i.e., those that displayed only a subset of the UHM characteristics) are more closely related to canonical RRM domains than to U2AF or other UHM candidates, indicating that these proteins may have independently evolved UHM-like sequence motifs. Additional proteins from diverse eukaryotes displayed UHM signature sequences, but were considered homologs of other UHM candidates based on high sequence identity (Table 2). Given the difficulty of distinguishing UHMs from canonical RRM domains based on primary sequence comparisons alone, additional UHM protein interaction domains may be hidden within the RRM superfamily.


View this table:
[in this window]
[in a new window]
 
Table 1. Signature sequences of UHM candidates compared with representative canonical RRMs

 




View larger version (120K):
[in this window]
[in a new window]
 
Figure 4. Identification of candidate UHM-containing proteins. (A) Structure-based alignment of candidate UHM sequences, U2AF35-UHM, U2AF65-UHM, and representative canonical RRMs whose structures in complex with RNA are known. Structures were aligned using the program TOPP from the CCP4 suite (Collaborative Computational Project, no. 4, 1994Go). Secondary structure elements are indicated above the sequences. The positions of key residues that distinguish UHM from RRM domains are highlighted yellow. Consensus UHM residues including acidic residues and the Arg–X–Phe loop are red. Residues in the RNP-like motifs that match the canonical RRM consensus are blue. (B) Phylogenetic tree of candidate UHM domains.

 


View this table:
[in this window]
[in a new window]
 
Table 2. Representative homologs of UHM candidates

 
In a few cases, UHM candidates that share a similar domain organization may represent homologs despite low sequence identities and/or a lack of consistent functional data, including SPF45 and DRT111; HCC1 and PAD1; TAT-SF1, UAP2, and CUS2; and MUD2 and U2AF65. In a well-studied example, MUD2 is the S. cerevisiae homolog of U2AF65 based on similar functional interactions with the Py-tract, U2 snRNP, and SF1 (Abovich et al. 1994Go; Rain et al. 1998Go). Despite low sequence identity (16%), heterologous complexes between MUD2 and human SF1, or between S. cerevisiae SF1 and human U2AF65 have not been observed (Rain et al. 1998Go) indicating that the ligand specificity of the human U2AF65-UHM has diverged from the MUD2-UHM. These differences in protein–protein interaction specificity are consistent with functional divergence of MUD2 from other U2AF large subunits. For example, MUD2 is dispensable for viability in S. cerevisiae (Abovich et al. 1994Go), whereas the UHM domain of the S. pombe U2AF65 homolog (which shares 31% sequence identity with human U2AF65) is required in vivo (Banerjee et al. 2004Go). The U2AF65-UHM interacts with an N-terminal domain of the SAP155 subunit of the U2 snRNP that is absent from the S. cerevisiae homolog of SAP155 (Gozani et al. 1998Go). Moreover, S. cerevisiae lacks an ortholog of the U2AF small subunit, indicating that MUD2 functions in the absence of the heterodimeric partner. These differences between S. cerevisiae and other U2AF homologs, coupled with the identification of eight human UHM candidates and 35 more homologs in a variety of higher eukaryotes compared with only three convincing yeast UHM candidates, suggests that the UHM diverged from the canonical RRM late in the evolutionary timeframe to serve the complicated pre-mRNA processing requirements of multicellular organisms.

The 12 candidate UHM domains are found in the context of a variety of domain arrangements within their protein sequences; a subset is detailed in Table 3. With the exception of the central URP-UHM, the UHM domains often occur near the C terminus of the candidate proteins, providing an exposed position to facilitate molecular recognition. Many of the UHM candidates also contain motifs frequently observed in splicing factors, such as canonical RRMs, arginine–serine (RS) domains, zinc fingers, and Gly-rich regions. Additional unexpected domains are also observed, including the LAP2-Emerin-Man1 (LEM) protein–protein interaction domain of MAN1 and kinase domain of KIS.


View this table:
[in this window]
[in a new window]
 
Table 3. Function of human UHM candidates and their potential disease relevance

 
The diverse functional domains of the UHM candidates are accompanied by an array of different biological functions (Table 3). Like U2AF65 and U2AF35, many of the UHM candidates play important roles during RNA splicing. Interestingly, a few of the candidates (PUF60, TAT-SF1, and HCC1) play a dual role in regulating transcription. Because RNA splicing factors can influence the efficiency of transcription, and conversely, transcription often influences the efficiency and products of alternative RNA splicing (Auboeuf et al. 2002Go; Rosonina and Blencowe 2002Go), these UHM candidates may couple RNA splicing with transcription to coordinate gene expression. Other candidate UHM proteins, namely, the membrane protein MAN1 and the regulatory kinase KIS, are important for signal transduction, but have not yet been shown to affect RNA processing. It remains to be determined whether KIS and MAN1 couple pre-mRNA splicing with other cellular pathways via their established roles in signal transduction. Demonstrating the significance of their functions, misregulated UHM-containing proteins are associated with several human diseases, including human immunodeficiency virus type 1 (HIV-1; Zhou and Sharp 1996Go), and certain cancers (Liu et al. 2001Go; Bieche et al. 2003Go; Sampath et al. 2003Go).


    Specificity of protein–protein interaction by UHM domains
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
As stated above, the structures of the U2AF35-UHM and U2AF65-UHM complexed with their ligands revealed remarkably similar modes of protein interaction. Notably, both the U2AF65 and SF1 ligands bind their respective UHM domains via a similar arrangement of basic and Trp residues. A search for the ligand consensus pattern [RK]-X-[RK]-W, shared by both the SF1 and U2AF65 ligands, found >1000 matches within the SWISS-PROT database, indicating that predicting protein ligands of candidate UHMs is impractical without further experimental information to identify their functional binding partners. Given that all 12 compelling UHM candidates possess the signature sequences predicted to recognize ligands containing the [RK]-X-[RK]-W motif, it remains an open question whether each UHM domain specifically recognizes a single target, or could promiscuously interact with the intended ligand of a different UHM in the absence of temporal or spatial regulation. Toward answering this question, the U2AF65-UHM has been found not to interact with the N terminus of the U2AF65-ligand in two-hybrid assays (Tronchere et al. 1997Go) and pull-down experiments (C. Kielkopf, unpubl.), suggesting that individual UHMs may, indeed, specifically recognize a unique target. By analogy with other modular peptide binding domains (Pawson and Nash 2003Go), UHM sequences flanking the Trp-binding pocket may ensure specific and directional interactions (N-to-C orientation) with the ligand.

In support of this analogy, structure-based modeling suggests that variation of the central X residue in the UHM Arg–X–Phe loop may provide one mechanism for UHM recognition of diverse C-terminal ligand sequences. Several of the UHM candidates (hURP, PUF60, Tat-SF1, and HCC1) share a Trp residue within the Arg–X–Phe loop that is essential in the U2AF35-UHM for specific recognition of C-terminal U2AF65-ligand-Pro residues (Kielkopf et al. 2001Go). Other UHM sequences vary from similar Arg–Tyr–Phe motifs (SPF45, DRT111, UAP2, CUS2, and PAD1) to divergent Lys (U2AF65) and Met (KIS) residues. Besides recognizing the ligand C terminus via the Arg–X–Phe loop, distinct U2AF65-UHM or U2AF35-UHM residues make specific contacts with N-terminal ligand residues. In particular, the bulky U2AF65-ligand Tyr 91 stacks against unique U2AF35 aromatic residues (Tyr 52 and Phe 81), and forms a specific hydrogen-bond with His 77 of the U2AF35-UHM (Kielkopf et al. 2001Go). Similar or identical residues in the URP-UHM (Phe 206, Phe 239, and Gln 235) suggest that a bulky, hydrophobic residue preceding the consensus ligand-Trp would be recognized in an analogous manner, consistent with an interaction between hURP and U2AF65 in pull-down and yeast two-hybrid assays (Tronchere et al. 1997Go). The smaller size of the corresponding U2AF65-UHM residues (Ile 398 and Val 384) would leave the hydrophobic side chain of a ligand-Tyr in an unfavorable, solvent-exposed environment (Selenko et al. 2003Go). Considering the variety of cellular roles played by UHM candidates and the consequent requirement to recognize diverse protein ligands, it will be important to determine whether variation in the positions corresponding to U2AF35 Tyr 52, Phe 81, and His 77, and the central position of the Arg–X–Phe loop enables recognition of distinct ligand sequences by UHM domains.

In addition to recognizing short peptide ligands, UHM domains can self-associate to form protein homodimers. For example, the PUF60-UHM domain interacts with itself in two-hybrid assays (Poleev et al. 2000Go) and forms SDS-resistant homodimers during electrophoresis (Page-McCaw et al. 1999Go). The U2AF35-UHM has been shown to form weak homodimers by gel filtration, analytical ultracentrifugation, dynamic light scattering (Kielkopf et al. 2001Go), and two-hybrid assays (Wentz-Hunter and Potashkin 1996Go), whereas homodimers of the U2AF65-UHM have not been observed (Tronchere et al. 1997Go). Homo- or heterotypic oligomerizations also have been observed for classical protein–protein interaction domains, with several different effects on ligand recognition. For example, the nNOS-PDZ/syntrophin heterodimer prohibits peptide recognition (Hillier et al. 1999Go), whereas GRIP or Shank PDZ homodimers leave the peptide-binding pockets free (Im et al. 2003aGo,bGo) and the Eps8-SH3 homodimer alters the ligand specificity (Kishan et al. 1997Go). Although a U2AF35-UHM homodimer can be modeled with the solvent exposed Arg–Trp–Phe loop binding to the Trp-binding site on a second UHM domain, alternative interfaces are possible that would allow the oligomer to simultaneously recognize peptide ligands, as observed for established protein–protein interaction domains.


    Do UHM domains recognize RNA?
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
Modeling of the U2AF-UHM/ligand structures with RNA has revealed that peptide binding to the helical surface of the RRM-like fold is not predicted to physically interfere with putative RNA interactions on the opposite {beta}-sheet face (Kielkopf et al. 2001Go; Selenko et al. 2003Go). Although the U2AF35-UHM/U2AF65-ligand complex binds RNA weakly (Kd >6 µM), accessory protein factors and adjacent domains in the full-length U2AF35 sequence (e.g., flanking zinc fingers and an RS domain) are required to assist the weak interaction (Rudner et al. 1998aGo; J. Valcarcel, pers. comm.). Likewise, the U2AF65-UHM domain is not required for Py-tract recognition (Banerjee et al. 2003Go), and does not appear to interact with RNA (Selenko et al. 2003Go). These results indicate that UHM domains are not likely to be involved in RNA interactions.

Instead, the UHM family has evolved sequence characteristics that have no benefit for RNA binding, while optimizing the interaction with peptide ligands. In most canonical RRM-RNA structures, conserved aromatic Phe/Tyr residues at the third RNP1 position or second RNP2 position stack with RNA bases or sugars, and a basic Arg/Lys residue at the first position of the RNP1 motif frequently forms a salt bridge with the phosphate backbone (Fig. 5A; Oubridge et al. 1994Go; Price et al. 1998Go; Deo et al. 1999Go; Handa et al. 1999Go; Allain et al. 2000Go; Wang and Tanaka Hall 2001Go). In contrast, the corresponding U2AF35-UHM (Fig. 5B) and U2AF65-UHM (Fig. 5C) residues are replaced with aliphatic substitutions that are not predicted to interact favorably with RNA. Moreover, UHMs display unexpectedly low isoelectric points for optimal binding of basic peptides.



View larger version (61K):
[in this window]
[in a new window]
 
Figure 5. RNA recognition by an RRM domain compared with the U2AF-UHM domains. The UHM or RRM domains are shown in blue, protein ligands are yellow, and RNA ligands are purple. (A) The U1A-RRM recognizing an RNA oligonucleotide. The RNP1-Arg recognizes the RNA phosphates, a nucleotide base stacks on RNP2-Tyr, and RNP2-Leu is folded in the hydrophobic core. (B) The U2AF35-UHM/U2AF65-ligand complex (PDB code 1JMT [PDB] ) rotated 180° about the Y-axis with respect to Figure 3 to show the putative RNA interaction surface. (C) Similar view of the U2AF65-UHM/SF1-ligand structure (PDB code 1O0P [PDB] ). The RNP2-Ile packs with the ligand-Trp. The small aliphatic residues in the RNP-like motifs are predicted to be unable to form favorable RNA interactions. Schematic representation of RNP residues that differ between UHMs compared with canonical RRMs is shown to the right.

 
In addition to poor conservation of RNP-like motifs and overall negative charge, RNA binding by the U2AF65-UHM structure is further inhibited by a C-terminal {alpha}-helix that forms a tight hydrophobic interface with the putative RNA-binding surface of the RRM-like fold (Selenko et al. 2003Go). In contrast, the C-terminal extensions of canonical RRMs more often strengthen rather than inhibit RNA binding. For example, the C-terminal helical extension of the N-terminal U1A-RRM not only contributes to RNA binding (Oubridge et al. 1994Go; Zeng and Hall 1997Go) but also mediates dimer formation for recognition of tandem RNA elements (Klein Gunnewiek et al. 2000Go; Varani et al. 2000Go). Based on the U2AF65-UHM structure, Phe 433 in the RNP1-like motif and Tyr 463 within the preceding turn interact with Tyr 469, Phe 474, and Trp 475 in the C-terminal {alpha}-helix (Selenko et al. 2003Go). Although counterparts of Phe 474 and Trp 475 are absent among the UHM candidates, aromatic residues at positions corresponding to Phe 433, Tyr 463, and Tyr 469 are observed for the UHM domains of KIS, PUF60, SPF45, and HCC1. This raises the possibility that some of the UHM candidates may have a hydrophobic C-terminal extension that may either interfere with RNA binding as for the U2AF65-UHM, or contribute to homodimer formation in a manner similar to U1A.


    Conclusions
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
The canonical RRM domain was a relatively late evolutionary addition to the array of RNA-binding folds that emerged in response to the needs of complex pre-mRNA processing pathways (Anantharaman et al. 2002Go). As processes that were originally based on the RNA world became progressively more regulated and reliant on protein interactions, the RRM fold further developed specialized sequence characteristics for protein recognition to form the UHM subfamily. These UHM signature sequences included divergent residues in the RNP-like motifs, an Arg–X–Phe loop sequence, and key acidic residues that collectively recognized the Trp residue and positive charge of the protein ligand. Convincing UHM candidates have been discovered in association with a variety of fundamental cellular processes, ranging from pre-mRNA splicing to transcription, DNA repair, and signal transduction. The large number of proteins that share the signature protein–protein interaction residues of UHM domains supports the proposal that the U2AF-UHMs represent a novel family of modular protein interaction domains.

Because protein interaction domains are attractive modules for communication among a network of pathways, the UHM domain may be an evolutionary extension of RRMs that couples pre-mRNA processing with other nuclear processes. Protein recognition by so-called RNA-binding domains is an emerging theme in molecular recognition. An early example of RRM–protein interactions was observed in the structure of U2B''/U2A', in which the {alpha}-helical surface of the U2B''-RRM interacts with the U2A' leucine-rich repeat motif (Price et al. 1998Go). Several recent structures of the {beta}-sheet surfaces of heterodimeric RRM domains interacting with {alpha}-helical protein ligands have revealed a second mode of RRM–protein recognition distinct from that of UHM/protein complexes (Fribourg et al. 2003Go; Lau et al. 2003Go; Shi and Xu 2003Go; Kadlec et al. 2004Go). In addition to distinguishing protein recognition domains within the RRM family, a growing list of fold families such as the Sterile {alpha}-Motif (SAM; Kim and Bowie 2003Go), LEM (Cai et al. 2001Go; Laguri et al. 2001Go), Pumilio/HEAT-repeat domains (Wang et al. 2002Go), and zinc fingers (Morgan et al. 1997Go) have been found to bind either nucleic acids or protein ligands through slight variations of a common scaffold. Furthermore, RS domains have been shown to contact the pre-mRNA during splicing (Valcarcel et al. 1996Go; Shen et al. 2004Go), and have also been reported to mediate protein–protein interactions (Wu and Maniatis 1993Go). Because a major goal of the "postgenomic" era is the ability to predict protein functions even in the absence of corroborating experimental results (Thornton et al. 2000Go), it will be essential to compile a lexicon of signature sequences, such as those that distinguish UHMs from canonical RRMs, for other fold families whose members play diverse functional roles.


    Acknowledgments
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
We thank S. Evans for editorial assistance, and J. Bender, M. Matunis, M. Swenson, and J. Wedekind for careful reading of the manuscript. Funding for C.L.K. is provided by the Johns Hopkins University Center for AIDS Research grant #P30 AI42855.


    Footnotes
 
Article and publication are at http://www.genesdev.org/cgi/doi/10.1101/gad.1206204.

Corresponding authors.

3 E-MAIL ckielkop{at}jhsph.edu; FAX (410) 955-2926. Back

4 E-MAIL michael.green{at}umassmed.edu; FAX (508) 856-5473. Back


    References
 Top
 Abstract
 Structural features of RNA...
 U2AF, the UHM prototype
 Structural features of protein...
 Identifying novel UHM family...
 Specificity of protein-protein...
 Do UHM domains recognize...
 Conclusions
 Acknowledgments
 References
 
Abovich, N., Liao, X.C., and Rosbash, M. 1994. The yeast MUD2 protein: An interaction with PRP11 defines a bridge between commitment complexes and U2 snRNP addition. Genes & Dev. 8: 843-854.[Abstract/Free Full Text]

Adam, S.A., Nakagawa, T., Swanson, M.S., Woodruff, T.K., and Dreyfuss, G. 1986. mRNA polyadenylate-binding protein: Gene isolation and sequencing and identification of a ribonucleoprotein consensus sequence. Mol. Cell. Biol. 6: 2932-2943.[Abstract/Free Full Text]

Allain, F.H., Bouvet, P., Dieckmann, T., and Feigon, J. 2000. Molecular basis of sequence-specific recognition of pre-ribosomal RNA by nucleolin. EMBO J. 19: 6870-6881.[CrossRef][Medline]

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402.[Abstract/Free Full Text]

Anantharaman, V., Koonin, E.V., and Aravind, L. 2002. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic Acids Res. 30: 1427-1464.[Abstract/Free Full Text]

Auboeuf, D., Honig, A., Berget, S.M., and O'Malley, B.W. 2002. Coordinate regulation of transcription and splicing by steroid receptor coregulators. Science 298: 416-419.[Abstract/Free Full Text]

Auboeuf, D., Dowhan, D.H., Kang, Y.K., Larkin, K., Lee, J.W., Berget, S.M., and O'Malley, B.W. 2004. Differential recruitment of nuclear receptor coactivators may determine alternative RNA splice site choice in target genes. Proc. Natl. Acad. Sci. 101: 2270-2274.[Abstract/Free Full Text]

Banerjee, H., Rahn, A., Davis, W., and Singh, R. 2003. Sex lethal and U2 small nuclear ribonucleoprotein auxiliary factor (U2AF65) recognize polypyrimidine tracts using multiple modes of binding. RNA 9: 88-99.[Abstract/Free Full Text]

Banerjee, H., Rahn, A., Gawande, B., Guth, S., Valcarcel, J., and Singh, R. 2004. The conserved RNA recognition motif 3 of U2 snRNA auxiliary factor (U2AF65) is essential in vivo but dispensable for activity in vitro. RNA 10: 240-253.[Abstract/Free Full Text]

Bennett, M., Michaud, S., Kingston, J., and Reed, R. 1992. Protein components specifically associated with prespliceosome and spliceosome complexes. Genes & Dev. 6: 1986-2000.[Abstract/Free Full Text]

Berglund, J.A., Abovich, N., and Rosbash, M. 1998. A cooperative interaction between U2AF65 and mBBP/SF1 facilitates branchpoint region recognition. Genes & Dev. 12: 858-867.[Abstract/Free Full Text]

Bieche, I., Manceau, V., Curmi, P.A., Laurendeau, I., Lachkar, S., Leroy, K., Vidaud, D., Sobel, A., and Maucuer, A. 2003. Quantitative RT–PCR reveals a ubiquitous but preferentially neural expression of the KIS gene in rat and human. Brain Res. Mol. Brain Res. 114: 55-64.[Medline]

Birney, E., Kumar, S., and Krainer, A.R. 1993. Analysis of the RNA-recognition motif and RS and RGG domains: Conservation in metazoan pre-mRNA splicing factors. Nucleic Acids Res. 21: 5803-5816.[Abstract/Free Full Text]

Boehm, M., Yoshimoto, T., Crook, M.F., Nallamshetty, S., True, A., Nabel, G.J., and Nabel, E.G. 2002. A growth factor-dependent nuclear kinase phosphorylates p27Kip1 and regulates cell cycle progression. EMBO J. 21: 3390-3401.[CrossRef][Medline]

Brazma, A., Jonassen, I., Ukkonen, E., and Vilo, J. 1996. Discovering patterns and subfamilies in biosequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 4: 34-43.[Medline]

Brow, D.A. 2002. Allosteric cascade of spliceosome activation. Annu. Rev. Genet. 36: 333-360.[CrossRef][Medline]

Burd, C.G. and Dreyfuss, G. 1994. Conserved structures and diversity of functions of RNA-binding proteins. Science 265: 615-621.[Abstract/Free Full Text]

Cai, M., Huang, Y., Ghirlando, R., Wilson, K.L., Craigie, R., and Clore, G.M. 2001. Solution structure of the constant region of nuclear envelope protein LAP2 reveals two LEM-domain structures: One binds BAF and the other binds DNA. EMBO J. 20: 4399-4407.[CrossRef][Medline]

Chiara, M.D., Palandjian, L., Feld Kramer, R., and Reed, R. 1997. Evidence that U5 snRNP recognizes the 3' splice site for catalytic step II in mammals. EMBO J. 16: 4746-4759.[CrossRef][Medline]

Collaborative Computational Project, number 4. 1994. The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. D50: 760-763.

Dean, W., Bowden, L., Aitchison, A., Klose, J., Moore, T., Meneses, J.J., Reik, W., and Feil, R. 1998. Altered imprinted gene methylation and expression in completely ES cell-derived mouse fetuses: Association with aberrant phenotypes. Development 125: 2273-2282.[Abstract]

Dendouga, N., Callebaut, I., and Tomavo, S. 2002. A novel DNA repair enzyme containing RNA recognition, G-patch and specific splicing factor 45-like motifs in the protozoan parasite Toxoplasma gondii. Eur. J. Biochem. 269: 3393-3401.[Medline]

Deo, R.C., Bonanno, J.B., Sonenberg, N., and Burley, S.K. 1999. Recognition of polyadenylate RNA by the poly(A)-binding protein. Cell 98: 835-845.[CrossRef][Medline]

Dreyfuss, G., Swanson, M.S., and Pinol-Roma, S. 1988. Heterogeneous nuclear ribonucleoprotein particles and the pathway of mRNA formation. Trends Biochem. Sci. 13: 86-91.[CrossRef][Medline]

Fong, Y.W. and Zhou, Q. 2001. Stimulatory effect of splicing factors on transcriptional elongation. Nature 414: 929-933.[CrossRef][Medline]

Fribourg, S., Gatfield, D., Izaurralde, E., and Conti, E. 2003. A novel mode of RBD–protein recognition in the Y14–Mago complex. Nat. Struct. Biol. 10: 433-439.[CrossRef][Medline]

Gattiker, A., Gasteiger, E., and Bairoch, A. 2002. ScanProsite: A reference implementation of a PROSITE scanning tool. Appl. Bioinformatics 1: 107-108.[Medline]

Golling, G., Amsterdam, A., Sun, Z., Antonelli, M., Maldonado, E., Chen, W., Burgess, S., Haldi, M., Artzt, K., Farrington, S., et al. 2002. Insertional mutagenesis in zebrafish rapidly identifies genes essential for early vertebrate development. Nat. Genet. 31: 135-140.[CrossRef][Medline]

Gozani, O., Potashkin, J., and Reed, R. 1998. A potential role for U2AF–SAP155 interactions in recruiting U2 snRNP to the branch site. Mol. Cell. Biol. 18: 4752-4760.[Abstract/Free Full Text]

Habara, Y., Urushiyama, S., Tani, T., and Ohshima, Y. 1998. The fission yeast prp10+ gene involved in pre-mRNA splicing encodes a homologue of highly conserved splicing factor, SAP155. Nucleic Acids Res. 26: 5662-5669.[Abstract/Free Full Text]

Handa, N., Nureki, O., Kurimoto, K., Kim, I., Sakamoto, H., Shimura, Y., Muto, Y., and Yokoyama, S. 1999. Structural basis for recognition of the tra mRNA precursor by the Sex-lethal protein. Nature 398: 579-585.[CrossRef][Medline]

Hillier, B.J., Christopherson, K.S., Prehoda, K.E., Bredt, D.S., and Lim, W.A. 1999. Unexpected modes of PDZ domain scaffolding revealed by structure of nNOS–syntrophin complex. Science 284: 812-815.[Abstract/Free Full Text]

Hoffman, D.W., Query, C.C., Golden, B.L., White, S.W., and Keene, J.D. 1991. RNA-binding domain of the A protein component of the U1 small nuclear ribonucleoprotein analyzed by NMR spectroscopy is structurally similar to ribosomal proteins. Proc. Natl. Acad. Sci. 88: 2495-2499.[Abstract/Free Full Text]

Im, Y.J., Lee, J.H., Park, S.H., Park, S.J., Rho, S.H., Kang, G.B., Kim, E., and Eom, S.H. 2003a. Crystal structure of the Shank PDZ–ligand complex reveals a class I PDZ interaction and a novel PDZ–PDZ dimerization. J. Biol. Chem. 278: 48099-48104.[Abstract/Free Full Text]

Im, Y.J., Park, S.H., Rho, S.H., Lee, J.H., Kang, G.B., Sheng, M., Kim, E., and Eom, S.H. 2003b. Crystal structure of GRIP1 PDZ6-peptide complex reveals the structural basis for class II PDZ target recognition and PDZ domain-mediated multimerization. J. Biol. Chem. 278: 8501-8507.[Abstract/Free Full Text]

Imai, H., Chan, E.K., Kiyosawa, K., Fu, X.D., and Tan, E.M. 1993. Novel nuclear autoantigen with splicing factor motifs identified with antibody from hepatocellular carcinoma. J. Clin. Invest. 92: 2419-2426.

Jung, D.J., Na, S.Y., Na, D.S., and Lee, J.W. 2002. Molecular cloning and characterization of CAPER, a novel coactivator of activating protein-1 and estrogen receptors. J. Biol. Chem. 277: 1229-1234.[Abstract/Free Full Text]

Kadlec, J., Izaurralde, E., and Cusack, S. 2004. The structural basis for the interaction between nonsense-mediated mRNA decay factors UPF2 and UPF3. Nat. Struct. Mol. Biol. 11: 330-337.[CrossRef][Medline]

Kanaar, R., Roche, S.E., Beall, E.L., Green, M.R., and Rio, D.C. 1993. The conserved pre-mRNA splicing factor U2AF from Drosophila: Requirement for viability. Science 262: 569-573.[Abstract/Free Full Text]

Kielkopf, C.L., Rodionova, N.A., Green, M.R., and Burley, S.K. 2001. A novel peptide recognition mode revealed by the X-ray structure of a core U2AF35/U2AF65 heterodimer. Cell 106: 595-605.[CrossRef][Medline]

Kim, C.A. and Bowie, J.U. 2003. SAM domains: Uniform structure, diversity of function. Trends Biochem. Sci. 28: 625-628.[CrossRef][Medline]

Kishan, K.V., Scita, G., Wong, W.T., Di Fiore, P.P., and Newcomer, M.E. 1997. The SH3 domain of Eps8 exists as a novel intertwined dimer. Nat. Struct. Biol. 4: 739-743.