1 Department of Cancer Immunology and AIDS, Dana-Farber Cancer Institute, Division of AIDS, Harvard Medical School, Boston, Massachusetts 02115, USA; 2 Division of Medicine, Imperial College London, St. Mary's Campus, London W2 1PG, United Kingdom; 3 Department of Infection, University College London, London W1T 4JF, United Kingdom
| Abstract |
|---|
|
|
|---|
[Keywords: Integrase; LEDGF/p75; HIV-1; AIDS; transcription; integration]
Received April 25, 2007; revised version accepted June 6, 2007.
Purified IN proteins display 3' processing and DNA strand transfer activities in vitro (Craigie et al. 1990
; Katz et al. 1990
; Engelman et al. 1991
), yet results of numerous studies indicate cell proteins play important roles during virus infection. More than a dozen cell factors have been shown to directly bind human immunodeficiency virus type 1 (HIV-1) IN, but genetic evidence indicating important roles for the majority of these remains scant (for review, see Vandegraaff and Engelman 2007
). Recent results have highlighted a crucial role for the IN-interacting protein lens epithelium-derived growth factor (LEDGF)/p75 (Cherepanov et al. 2003
; Turlure et al. 2004
; Emiliani et al. 2005
) in HIV-1 replication and integration (Llano et al. 2006a
; Vandekerckhove et al. 2006
; Zielske and Stevenson 2006
). The precise function of LEDGF/p75 in viral replication, however, is unknown.
LEDGF/p75 is a hepatoma-derived growth factor-related protein (HRP) that interacts specifically with lentiviral INs (Llano et al. 2004b
; Busschots et al. 2005
; Cherepanov 2007
) and significantly stimulates their enzymatic function in vitro (Cherepanov et al. 2003
, 2004
; Turlure et al. 2006
; Cherepanov 2007
). Binding occurs through a conserved IN-binding domain (IBD) found within the C-terminal portion of the larger p75 LEDGF splice variant (Maertens et al. 2003
; Cherepanov et al. 2004
; Vanegas et al. 2005
). The IBD is essential for stimulation of IN activity in vitro (Cherepanov et al. 2004
, 2005b
; Cherepanov 2007
) and for LEDGF/p75 function during HIV-1 infection (Llano et al. 2006a
). LEDGF/p75 might therefore act as a critical costimulator of IN activity (for reviews, see Goff 2007
; Vandegraaff and Engelman 2007
). Ectopically expressed HIV-1 IN is degraded by the proteasome in human cells (Mulder and Muesing 2000
; Llano et al. 2004a
), and LEDGF/p75 significantly increases its stability (Maertens et al. 2003
; Llano et al. 2004a
). The HIV-1 PIC can be degraded by the proteasome (Schwartz et al. 1998
), so the IN-LEDGF/p75 interaction might help maintain PIC integrity during infection. Consistent with these models, functional HIV-1 and feline immunodeficiency virus PICs were recovered from cytoplasmic extracts of infected cells using anti-LEDGF antibodies (Llano et al. 2004b
).
Alternatively, LEDGF/p75 might function as an obligate chromatin acceptor for the PIC. Consistent with this hypothesis, LEDGF/p75 intimately associates with chromatin (Nishizawa et al. 2001
; Cherepanov et al. 2003
), and its N-terminal PWWP domain and AT-hook (ATh) DNA-binding motifs, which mediate chromatin binding (Llano et al. 2006b
; Turlure et al. 2006
), are required for HIV-1 infection (Llano et al. 2006a
). Yeast retrotransposon Ty5 represents a paradigm whereby a direct interaction between IN and a chromatin-associated protein, Sir4p in this case, targets a significant fraction of overall transposition events (Xie et al. 2001
). Mapping large numbers of retroviral integrations has revealed biases toward or against various host genomic DNA features, though retroviral targeting appears less prominent than highly selective elements such as Ty5 (for review, see Bushman et al. 2005
). Lentiviruses in large part favor actively expressed genes, integrating into transcription units (TUs) nearly equally along their lengths (Schroder et al. 2002
; Mitchell et al. 2004
; Crise et al. 2005
). The γ-retrovirus Moloney murine leukemia virus (MLV) modestly favors TUs, but in stark contrast to HIV-1, preferentially integrates near promoter regions and CpG islands (Wu et al. 2003
; Mitchell et al. 2004
). Simian foamy virus (SFV), a Spumaretrovirus, is slightly biased against insertion into TUs, while significantly favoring promoter regions and CpG islands, albeit to lesser extents than MLV (Nowrouzi et al. 2006
; Trobridge et al. 2006
). Avian sarcoma-leukosis virus (ASLV), an α-retrovirus, only modestly favors TUs, promoter regions, and CpG islands, and thus displays the least selectivity toward genomic DNA features among studied retroviruses (Mitchell et al. 2004
; Narezkina et al. 2004
). Collectively, these results suggest that most if not all retroviruses differentially interact with chromatin and/or transcription complexes to affect a significant fraction of their overall integration events (Bushman et al. 2005
). Analyses of MLV/HIV chimerae have revealed that IN is the primary viral determinant governing integration site selection (Lewinski et al. 2006
).
Intriguingly, LEDGF/p75 was shown to play a modest role in HIV-1 targeting under initial knockdown conditions (Ciuffi et al. 2005
) that paradoxically failed to reveal a viral infectivity defect (Llano et al. 2004b
). Subsequent RNA interference (RNAi)-based studies indicated modest roles for LEDGF/p75 in HIV-1 replication and integration (Vandekerckhove et al. 2006
; Zielske and Stevenson 2006
), while the most recent study utilizing lentiviral-based short hairpin RNA (shRNA) vectors highlighted an essential role for the host factor (Llano et al. 2006a
). As varying efficiencies of RNAi-mediated knockdown have yielded contrasting conclusions, we reasoned that a cell system completely devoid of LEDGF/p75 protein would be an invaluable tool for analyzing its role(s) in HIV-1 biology (Vandegraaff et al. 2006
). A highly significant though nonessential role for LEDGF/p75 in HIV-1 infection and integration is described here using mouse embryo fibroblasts (MEFs) disrupted for Psip1, the gene that encodes for the highly conserved murine Ledgf/p75 ortholog of human LEDGF/p75. Using the genetic knockout, we show that HIV-1 in large part loses its ability to target TUs while maintaining its preferred local target DNA sequence at the site of insertion. Accordingly, biochemical fractionation reveals normal processing of HIV-1 cDNA 3' ends in Ledgf-null cells and wild-type levels of knockout cell PIC integration activity in vitro. Our results pinpoint LEDGF/p75 function to a step downstream from functional PICs in the nucleus, as a component of chromatin and/or transcription complexes to affect the lentiviral-specific pattern of genomic DNA targeting.
| Results |
|---|
|
|
|---|
Murine and human LEDGF/p75 are highly homologous, sharing 92.3% identity and 97.7% homology considering conservative amino acid substitutions. Importantly, the known functional regions (PWWP domain, AThs, and IBD) are 100% conserved between orthologs. As murine fibroblasts are readily infected with pseudotyped HIV vectors (Siva and Bushman 2002
; Shun et al. 2007
), MEFs were derived from Psip1 knockout mice to analyze the mechanism of LEDGF/p75 function during HIV-1 integration. Ledgf and LEDGF are used throughout to denote mouse and human orthologs, respectively.
A conditional knockout strategy was utilized (Sauer 1998
): DNA recombination sites for the Cre recombinase were engineered to flank exon 3, which is the second coding exon of Psip1 (Fig. 1A). Chimeric animals created following implantation of transfected embryonic stem (ES) cells were mated to C57BL/6 mice, and resultant MEFs were analyzed for flox (f) sites by Southern blotting and/or PCR (Fig. 1A–C). F1 (f/+) crosses yielded expected wild-type (+/+), heterozygous, and homozygous (f/f) progeny (Fig. 1B, lanes 1–3). For most experiments, MEFs were transformed with simian virus (SV) 40 large T antigen to effect long-term tissue culture passage. Cre was introduced into f/f cells in vitro or through selective breeding; a self-inactivating retroviral vector deleted Psip1 exon 3 (Fig. 1B, lanes 4,5) and/or its own gene (Silver and Livingston 2001), yielding ~97% exon 3 loss at the mRNA level (Fig. 1D). Due to the residual level of intact Psip1, two additional rounds of transduction were performed, which reduced the level of exon 2/3 amplification to below the detection limit of the quantitative RT–PCR (qRT–PCR) assay (<0.02%). Exon 3 deletion shifts the open reading frame, yielding a potential N-terminal 24-residue product fused to an unrelated 11-mer tail. Accordingly, knockout (also referred to as −/−) cells failed to express detectable levels of Ledgf/p75 protein (Fig. 1E, lane 2). Importantly, f/f cells thrice transduced with a vector expressing R173K active site mutant Cre (Silver and Livingston 2001
) failed to delete exon 3 (Fig. 1B, lane 6) and maintained wild-type levels of Psip1 expression (Fig. 1D,E). Most virological experiments were conducted using transformed −/− and f/f cells. Importantly, intermittent qRT–PCR analyses revealed that exon 2/3 amplification remained undetectable over relatively long periods of −/− cell culturing (>35 passages). Alternatively, f/f animals were mated to Sox2Cre transgenic mice (Hayashi et al. 2002) to effect recombination in vivo. Subsequent +/− Sox2Cre, f/f crosses yielded f/+, f/−, +/−, and −/− offspring (Fig. 1C). Similar intermediate levels of Ledgf/p75 protein were detected in f/− and +/− cells (Fig. 1F, lanes 4,5), confirming that the nonrearranged f allele expressed Ledgf/p75 at normal levels. Cells derived from knockout animals predictably failed to yield detectable levels of Ledgf/p75 protein (Fig. 1F, lanes 2,6) or exon 2/3 amplification by qRT–PCR (data not shown). E1f/+ and E2−/− MEFs (Fig. 1C,F, lanes 1,2) were used in a subset of experiments; f/+ and −/− will denote cells transformed by SV40 large T antigen, whereas E1f/+ and E2−/− will refer to primary cells. We note that exon 3 loss disrupts the expression of the Ledgf/p52 splice variant (Ge et al. 1998
) that does not harbor the IBD and hence does not bind IN (Maertens et al. 2003
).
Viral vectors that express the luciferase (Luc) reporter gene (HIV-Luc, HIV-SIN-Luc, or MLV-Luc) (Shun et al. 2007
) were pseudotyped with the pan-tropic vesicular stomatitis virus G (VSV-G) envelope glycoprotein, and infectivity was quantified as the level of Luc activity per microgram of total protein in cell extracts. An IN active site mutant virus (NN-Luc, containing D64N and D116N changes) yielded similar low levels of IN-independent Luc activity in control and −/− cells, indicating the knockout did not significantly affect HIV promoter function under these conditions (Fig. 2A,B). Side-by-side comparison of f/f, −/− and f/+, −/− cell pairs revealed ~5% of integration-dependent HIV expression in Ledgf knockout as compared with control cells (Fig. 2A,B). Some variation in the level of residual HIV-Luc infectivity in −/− as compared with f/f cells was noted upon multiple experimental replicates (0.3%–9.4%; x̄ = 1.2 ± 1.3% for n = 14). As NN-Luc was invariably less infectious (12 ± 6%, n = 14) than HIV-Luc in −/− cells, we conclude that Ledgf accounts for the vast majority, but not all, of the integration-dependent reporter gene expression under these conditions. Similar results were obtained using fluorescence-activated cell sorting (FACS) in conjunction with green fluorescent protein (GFP) reporter viruses (data not shown). Importantly, knockout and f/f cells supported indistinguishable levels of MLV-Luc infectivity, revealing that the knockout does not perturb general susceptibility to retrovirus infection (Fig. 2C).
Stable expression of LEDGF/p75 in −/− cells fully restored their susceptibility to HIV-1 infection under conditions wherein murine p52 expression failed to reveal an effect (Fig. 2D; data not shown). A transient expression system that utilized FACS to select for transfectants was developed to probe the requirement for LEDGF/p75 effector domains in HIV-1 infection. As expected from previous analyses (Cherepanov et al. 2004
; Llano et al. 2006a
), an IBD deletion mutant failed to support infection (Fig. 2E). Asp-366 within the IBD plays a critical role in mediating the viral–host interaction (Cherepanov et al. 2005a
, b
). The D366N missense mutant likewise failed to support infection, highlighting the importance of the specific binding interaction for HIV-1 infection (Fig. 2E). As recently reported (Llano et al. 2006a
), the combined deletion of the PWWP domain and ATh DNA-binding motifs (ΔPWWP/ΔATh) rendered LEDGF/p75 inoperative (Fig. 2E). Separate mutations were analyzed to assess the relative contributions of these conserved elements. The ΔPWWP mutant functioned at ~17% (n = 5) of wild-type LEDGF/p75, whereas ΔATh function was indistinguishable from wild-type (Fig. 2E). Altering the invariant Arg–Gly–Arg sequences within the AThs to Ala–Ala–Ala reduced recombinant LEDGF/p75 DNA-binding activity to ~7% of wild-type (Turlure et al. 2006). When combined with ΔPWWP, the six Ala mutations rendered the same additive effect as the ΔATh deletion (Fig. 2E).
HIV-1 integration is defective in Ledgf knockout cells
Levels of HIV-1 cDNA synthesis, nuclear migration, and integration were assessed by qPCR to characterize the infectivity block. A primer pair and TaqMan probe that rely on the second template switch for amplification revealed similar levels of late reverse transcription (LRT) product formation in f/f and −/− cells (Fig. 3A). Host-mediated nonhomologous DNA end-joining acts on a small fraction of cDNA in the nucleus, yielding a circular ligation product containing two copies of the viral long terminal repeat (2-LTR circle) (Li et al. 2001
). Knockout cells supported 2-LTR circle formation at levels equal to or above those observed in f/f cells (Fig. 3B), indicating that nuclear migration was unimpaired by the knockout. Levels of HIV-1 integration into human DNA can be quantified by nested real-time Alu-PCR; as Alu is a primate-specific repeat (Deininger and Batzer 1999
), a novel assay utilizing mouse B1, B2, and LINE-1 repeats, referred to as BBL-PCR, was designed to measure integration. Knockout cells supported ~11% of the level of integration detected in f/f cells (Fig. 3C).
HIV-1 is strongly biased toward integration into transcriptionally active chromatin (Schroder et al. 2002
; Mitchell et al. 2004
). Monitoring Luc activity alongside an expanded set (n = 7) of BBL-PCR assays supported preliminary observations that knockout cells display a larger infectivity defect than integration defect (~8% integration, corresponding to ~1.5% residual infection). This suggested that a significant proportion of knockout cell integrations might occur within unfavorable regions such as transcriptionally silent DNA. Indeed, partial depletion of LEDGF/p75 in human 293T cells yielded a modest reduction of HIV-1 integration into TUs in the absence of an accompanying infection defect (Ciuffi et al. 2005
). Integration levels in Ledgf-null MEFs afforded the recovery of sufficient viral–chromosomal junctions for downstream statistical analyses. In total, 326 and 408 unique sites isolated from primary E1f/+ and E2−/− cells, respectively, were unambiguously mapped on the draft mouse genome. TUs (based on Ensembl definitions) hosted 68.7% and 47.3% of HIV-1 integration events in E1f/+ and E2−/− cells, respectively (Fig. 4A), a highly statistically significant (P = 10⁻⁸) difference. The suppression of gene targeting in Ledgf-null cells is quite remarkable, considering that the random level derived from computer simulation was 39.6%. Although E2−/− cells supported 7.7 percentage points more gene-specific integration than random (P = 0.01), this residual level was lower than those observed for ASLV (49.6%), MLV (49.6%), and adeno-associated virus (AAV) (50.9%), a parvovirus, in human cells (Fig. 4A; Supplementary Table S1).
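The comparison of TU targeting between the two cell types can be illustrated with a simple two-proportion test. The sketch below is not the authors' statistical procedure, which is not described in this section; it is a generic pooled z-test, and the hit counts are an assumption, reconstructed by rounding the reported percentages against the reported site totals.

```python
import math


def two_proportion_z_test(hits_a, total_a, hits_b, total_b):
    """Return (z, two-sided p) for the difference between two proportions."""
    p_a = hits_a / total_a
    p_b = hits_b / total_b
    # Pooled proportion under the null hypothesis of no difference.
    pooled = (hits_a + hits_b) / (total_a + total_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    z = (p_a - p_b) / se
    # Two-sided tail probability of a standard normal variate.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Assumption: counts are back-calculated from the reported percentages.
control_in_tu = round(0.687 * 326)   # E1f/+ sites inside TUs
knockout_in_tu = round(0.473 * 408)  # E2-/- sites inside TUs

z, p = two_proportion_z_test(control_in_tu, 326, knockout_in_tu, 408)
print(f"z = {z:.2f}, two-sided p = {p:.1e}")
```

Under these assumptions the test gives a p-value of the same order of magnitude as the one reported in the text, although the exact value depends on the test chosen.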
2.6-fold (P < 0.0001) and 1.7-fold (P < 0.01) more frequently, respectively, in E2−/− as compared with E1f/+ cells (Fig. 4B,C; Supplementary Table S1). At least one in every six proviruses resided within 5 kb of a promoter in Ledgf-null cells.
The G/C content of genomic HIV-1 integration sites is significantly lower than that of other retroviruses. As noted by Berry et al. (2006)
, this is paradoxical, since HIV-1 primarily targets TUs, which tend to be G/C-rich. Knockdown of LEDGF/p75 in human cells increased the average G/C content of HIV-1 integration sites (Ciuffi et al. 2005
). Our analyses fully confirmed this observation, as HIV-1 tended to integrate into G/C-rich regions in the absence of Ledgf (Supplementary Fig. S1).
Using oligonucleotide microarrays, we determined the transcriptional profiles of three matched pairs of primary f/+ and −/− MEFs, including the E1/E2 pair used for the integration site analyses. Overall, Ledgf ablation did not lead to global changes in transcription profiles, with <200 genes significantly (>1.5-fold) and consistently (false discovery rate <5%) up- or down-regulated in knockout cells (data not shown). The similarities in gene expression profiles are in accord with high prenatal survival rates among knockout animals (Sutherland et al. 2006
; J.E. Daigle and A. Engelman, unpubl.). Gene expression and HIV-1 integration data sets were cross-correlated to determine the effect of transcriptional activity on integration site selection. Ensembl TUs represented by probes on arrays were divided into five equal bins based on relative expression level, and integrations into TUs in each bin were counted (Fig. 5). In E1f/+ cells, integration strongly correlated with gene expression (P ≈ 10⁻¹⁵). The mouse model therefore faithfully replicates the preference of HIV-1 to integrate into active genes (Schroder et al. 2002
; Mitchell et al. 2004
). Interestingly, the trend appears to break for genes expressed at the highest levels (Fig. 5, f/+ bin 5). This has been noted in human cells, presumably illustrating that very high levels of gene expression can interfere with retroviral integration (Mitchell et al. 2004
). Overall, integration in E2−/− cells displayed a weaker correlation with gene expression (P = 0.02 for the difference between E1f/+ and E2−/−). While not significantly influencing the frequency of integration into genes expressed at the lowest and highest levels (bins 1, 2, and 5), Ledgf ablation led to approximately twofold reductions in integration into genes expressed at medium to high levels (bins 3 and 4). Therefore, Ledgf-null cells not only hosted a larger proportion of integration events in transcriptionally silent regions, but a larger proportion of their TU-specific integrations also ended up in genes that were expressed at lower levels.
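The binning step described above (ranking TUs by expression, splitting them into five equal bins, and counting integrations per bin) can be sketched as follows. This is a minimal illustration under stated assumptions: the function name, the gene identifiers, and the toy expression values are hypothetical, and the sketch does not reproduce the array normalization or the statistics used in the study.

```python
def bin_integrations_by_expression(expression, integration_genes, n_bins=5):
    """Rank genes by expression, split them into n_bins near-equal bins,
    and count how many integration events fall into genes of each bin."""
    ranked = sorted(expression, key=expression.get)  # lowest expression first
    bin_of = {
        gene: rank * n_bins // len(ranked)  # bin index 0 .. n_bins - 1
        for rank, gene in enumerate(ranked)
    }
    counts = [0] * n_bins
    for gene in integration_genes:
        if gene in bin_of:  # ignore hits in genes absent from the array
            counts[bin_of[gene]] += 1
    return counts


# Hypothetical toy data: ten genes with increasing expression values.
expression = {f"g{i}": float(i) for i in range(10)}
hits = ["g0", "g4", "g5", "g5", "g7", "g9", "gX"]  # "gX" is not on the array

counts = bin_integrations_by_expression(expression, hits)
print(counts)  # one count per bin, from lowest to highest expression
```

Comparing the per-bin counts between two cell types, for example with a contingency-table test, would then show whether the expression dependence of integration differs between them.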
Retroviruses display short palindromic consensi in the immediate vicinity of their integration sites, which appear to be virus- and/or genus-specific (Carteau et al. 1998
; Holman and Coffin 2005
; Berry et al. 2006
). The TDG
GTWACCHA consensus, wherein the virus plus-strand becomes joined to the underlined nucleotide, has been elaborated for HIV-1 (Holman and Coffin 2005
). Integration site sequences recovered from E1f/+ and E2−/− cells were aligned to determine if Ledgf/p75 influences the selection of local target DNA sequence. The resulting nucleotide frequencies for positions −8 to +12 are shown in Figure 6. Under both conditions, a strong bias was evident at positions −3 to +7, with positions 0 and +4 displaying the highest degree of selection (P < 10⁻²³). The consensus derived from our alignments, TDG
(G/V)TNA(C/B)CHA, is very similar to the reported sequence (Holman and Coffin 2005
) and clearly independent of Ledgf cell content.
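The alignment analysis above amounts to tabulating base frequencies at each position relative to the point of viral DNA joining. A minimal sketch is given below; the function name and the toy sequences are hypothetical, and real input would be genomic windows spanning positions −8 to +12 around each mapped integration site, oriented consistently with respect to the provirus.

```python
from collections import Counter


def position_frequencies(aligned_sites, first_position):
    """Return {position: {base: fraction}} for equal-length aligned sequences.

    first_position is the coordinate assigned to the first column,
    for example -8 when the window runs from -8 to +12.
    """
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all sites must have the same length")
    table = {}
    for offset in range(length):
        column = Counter(site[offset].upper() for site in aligned_sites)
        total = sum(column.values())
        table[first_position + offset] = {
            base: column.get(base, 0) / total for base in "ACGT"
        }
    return table


# Hypothetical toy alignment of four short sites, first column at position -1.
sites = ["ACGT", "ACGA", "TCGT", "ACCT"]
table = position_frequencies(sites, first_position=-1)
for position in sorted(table):
    print(position, table[position])
```

Positions where one or two bases dominate, relative to the background genome composition, are the ones that contribute to a consensus of the kind described in the text.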
Recent results have highlighted that an IN multimer possessing dyad symmetry, likely a tetramer, catalyzes HIV-1 cDNA integration (Li et al. 2006
). Finding the palindromic target DNA consensus sequence intact in Ledgf-null cells suggested that basic IN catalytic function might persist in the absence of the host factor. Levels of IN 3' processing and DNA strand transfer activities were measured to directly address this supposition.
The 3' ends of the reverse transcript must be processed by IN prior to integration. To determine 3' processing activity, unintegrated DNA extracted from acutely infected cells was analyzed by indirect end-labeling following denaturing polyacrylamide gel electrophoresis to visualize substrate termini (Fig. 7A). The nascent U3 minus-strand, for example, is detected as 103 nucleotides (nt) after HindIII digestion; IN processing yields a 101-mer product. As expected, the NN active site mutant protein failed to detectably process either viral DNA end, regardless of Ledgf cell content (Fig. 7B, lanes 2,4). As both ends were similarly processed by wild-type IN in f/f and −/− cells (Fig. 7B, lanes 1,3), we conclude that its in vivo activity does not depend on Ledgf/p75 binding. In vitro integration assays were conducted with extracted, native PICs to assess IN DNA strand transfer activity. Most experiments utilized a sensitive qPCR design to quantify the level of U3 end integration into a circular plasmid target DNA (Engelman 2007). As expected, NN-Luc PICs supported background levels of integration activity (Fig. 7C). HIV-Luc PICs extracted from −/− cell cytoplasm at 7 h post-infection supported the same level of integration activity as f/f cell PICs. In one experiment, PICs were extracted a second time at 12 h: The f/f cell complexes were marginally less active than the corresponding 7 h samples, yet, notably, knockout cell PICs displayed the same activity as f/f cell PICs at both time points (Fig. 7C). Nuclear PICs were also analyzed, in this case by Southern blotting with a viral-specific probe to visualize the full-length DNA recombination product (Engelman 2007). The cDNA substrate is detected in its full-length 10.7-kb form, whereas integration into linearized 5.4-kb φX174 DNA yields a linear 16.1-kb integration product (IP) (Fig. 7D). Consistent with our interpretation that the knockout does not perturb PIC nuclear localization, cDNA levels in −/− cell nuclei equaled or exceeded those observed in control cells (Fig. 7D, lanes 1,3). Importantly, control and knockout cell nuclear PICs displayed indistinguishable levels of integration activity (Fig. 7D, lanes 2,4).
| Discussion |
|---|
|
|
|---|
LEDGF/p75 domains and amino acid residues important for HIV-1 function
MLV efficiently infected Ledgf-null cells, and stable expression of human LEDGF/p75 fully restored their susceptibility to HIV-1 infection (Fig. 2). Moreover, the D366N missense mutant failed to complement the infectivity defect in knockout cells (Fig. 2E), revealing a critical role for the specific IN-LEDGF/p75 interaction in HIV-1 replication and integration. Of note, while the D366N mutation abrogates the IN-LEDGF/p75 interaction, it does not affect the interaction with JPO2, the cellular transcription factor that likewise associates with LEDGF/p75 through the IBD (Maertens et al. 2006
). Consistent with recent results (Llano et al. 2006a
), the combined ΔPWWP/ΔATh deletion mutant, which is defective for chromatin binding (Llano et al. 2006b), failed to rescue HIV-1 (Fig. 2E). By analyzing separate ATh and PWWP mutations, we can conclude that the PWWP domain is the dominant of the two conserved N-terminal regions in terms of HIV-1 function. The PRGR sequence comprises the heart of the ATh DNA-binding motif, and full-length LEDGF/p75 containing dual RGR > AAA substitutions retained ~7% residual DNA-binding activity in vitro (Turlure et al. 2006). Combining these changes with the ΔPWWP deletion rendered LEDGF/p75 inactive (Fig. 2E), consistent with the notion that these sequences are likely to function as AThs during infection. However, it is unclear if the predominant ATh function is engagement of AT-rich target DNA sequences.
Our integration site libraries revealed that HIV-1 favors integrating into G/C-rich regions in the absence of Ledgf (Supplementary Fig. S1), akin to the result reported using initial knockdown conditions (Ciuffi et al. 2005
). The propensity to normally integrate within A/T-rich regions extends beyond local DNA sequence, as analyzing short (1-kb; P = 0.002), medium (5-kb; P < 0.001), and relatively long (30-kb; P = 0.008) sequence windows around the proviruses revealed shifts toward higher G/C content in all cases (Supplementary Fig. S1). LEDGF/p75 would therefore appear to direct HIV-1 to A/T-enriched regions of chromatin. As the ΔATh deletion and ATh missense mutants rescued wild-type levels of infectivity (Fig. 2E), the propensity for HIV-1 to target AT-rich sequences may very well require LEDGF/p75 interaction(s) that extend beyond direct ATh–DNA binding.
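The windowed G/C analysis mentioned above can be sketched as a calculation of G/C fraction in sequence windows of different sizes around each provirus. The code below is an illustration only: the function names are hypothetical, the windows are assumed to be centered on the integration site (the source does not state how they were positioned), and the toy chromosome string stands in for real genomic sequence.

```python
def gc_fraction(sequence):
    """Fraction of G or C among called bases (A, C, G, T); NaN if none."""
    seq = sequence.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        return float("nan")
    return (seq.count("G") + seq.count("C")) / called


def window_gc(chromosome, site, window):
    """G/C fraction in a window of the given size centered on site.

    Assumption: the window is centered and clipped at chromosome ends.
    """
    half = window // 2
    start = max(0, site - half)
    end = min(len(chromosome), site + half)
    return gc_fraction(chromosome[start:end])


# Hypothetical toy chromosome: an A/T-rich half followed by a G/C-rich half.
chromosome = "AT" * 10 + "GC" * 10

for site, window in [(5, 4), (20, 8), (30, 10)]:
    print(site, window, window_gc(chromosome, site, window))
```

With real data, the same calculation would be repeated with 1-kb, 5-kb, and 30-kb windows for every integration site, and the resulting distributions compared between control and knockout cells.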
The mechanism of LEDGF/p75 function during HIV-1 integration
PICs isolated from knockout cells supported the same level of in vitro integration activity as matched control complexes (Fig. 7C,D). Integration site sequence alignments moreover revealed the HIV-1 target consensus site maintained in Ledgf-null cells (Fig. 6). Taken together, we conclude that IN is structurally and functionally intact in the absence of cellular LEDGF/p75 and, by extension, that the propensity for different retroviruses to integrate at weakly preferred target DNA sequences (Holman and Coffin 2005
) is mechanistically separable from the targeting machineries that attract them to different chromatin structural elements.
In stark contrast, the propensity for HIV-1 to integrate into TUs was in large part countermanded by the knockout, yielding an overall frequency of gene targeting that was similar to MLV, ASLV, and AAV (Fig. 4A). Furthermore, the dramatic drop in gene-specific targeting was accompanied by a surge in integration activity near gene start sites and CpG islands (Fig. 4B,C; Supplementary Table S1). From these results, we conclude that LEDGF/p75 is a bona fide targeting factor for lentiviral DNA integration. In its absence, HIV-1 suffers an overall integration defect, yet is able to access promoter- and CpG island-proximal regions within chromatin to accomplish significant fractions of its residual integration events.
Compared with other retroviruses, ASLV is the least biased toward genomic DNA features (Fig. 4A–C; Mitchell et al. 2004
; Narezkina et al. 2004
). Rep-deficient AAV vectors, which integrate via cellular DNA double-strand breaks (Miller et al. 2004
), can be considered a benchmark for random mobile DNA insertion in the human genome. The gene targeting reduction observed for HIV-1 in Ledgf-null cells, to levels normally seen for AAV and ASLV in human cells, suggests that HIV-1 PIC trafficking and resulting integration site distributions could be governed by general and/or nebulous parameters such as chromosomal DNA accessibility. We note, however, that at present we cannot rule out that additional interactions between the PIC and chromatin-associated factors dictate augmented integration near CpG islands and promoter regions (Fig. 4B,C) or residual levels of gene targeting (Figs. 4A, 5) in Ledgf-null cells. Our preliminary experiments revealed that when overexpressed, human HRP2 (Cherepanov et al. 2004
) can rescue the block to HIV-1 infection in Ledgf knockout cells, hinting that other IN-binding proteins might possibly contribute to HIV-1 integration.
LEDGF/p75 is known to function as a molecular tether, whereby cellular JPO2 (Maertens et al. 2006
) or HIV-1 IN (Maertens et al. 2003
; Emiliani et al. 2005
; Vanegas et al. 2005
) bound to the IBD is tethered to chromatin via the N-terminal PWWP domain and AThs (Llano et al. 2006b
; Turlure et al. 2006
). The IN–IBD interaction, also critical for HIV-1 infection (Fig. 2E; Llano et al. 2006a
), is well understood at the molecular level (Cherepanov et al. 2005a
), but the mechanism of chromatin engagement is far from clear. Although a considerable body of evidence implicates LEDGF/p75 in transcriptional processes (Ge et al. 1998
; Shinohara et al. 2002
), a clear model for its cellular function has not been formulated. Our results strongly indicate that LEDGF/p75 would associate with transcriptionally active genes and, moreover, that a significant fraction of the protein could be expected to footprint approximately equally along their lengths (Mitchell et al. 2004
). LEDGF/p75 has been proposed to interact directly with RNA polymerase II subunits (Ge et al. 1998
). Alternatively, specific histone modifications are associated with transcriptional elongation (for review, see Saunders et al. 2006
) and the PWWP domain, which plays an important role in HIV-1 infection (Fig. 2E), is structurally analogous to other modular protein domains like Tudor and Chromo that interact with specific histone tail modifications (Kim et al. 2006
). Available evidence strongly indicates that reverse transcription, PIC assembly, and nuclear import proceed normally in the absence of LEDGF/p75 (Figs. 3, 7; Llano et al. 2006a
; Vandekerckhove et al. 2006
; Zielske and Stevenson 2006
). We propose that upon engaging TU-associated LEDGF/p75, HIV-1 is encouraged to integrate into a nearby region. Although IN DNA strand transfer activity does not absolutely require the cofactor, the interaction sufficiently biases resulting integration site distributions toward LEDGF/p75-associated regions. Accordingly, recombinant HIV-1 IN protein is active in the absence of cellular factors (Engelman et al. 1991
), and its activity can be drastically increased by LEDGF/p75 (Cherepanov et al. 2003
, 2004
; Turlure et al. 2006
). LEDGF/p75 engagement, however, is unlikely to be an all-or-none decision. The conservation of the integration site consensus in Ledgf-null cells suggests that IN has sufficient opportunity to select its appropriate target DNA sequence independent of LEDGF/p75 binding. This model explains why a certain percentage of HIV-1 proviruses are normally found in gene-poor regions, and also why partial LEDGF/p75 knockdown could reveal a difference in integration site distribution without a concomitant decrease in infectivity (Ciuffi et al. 2005
). Ledgf ablation would appear to leave the PIC little choice but to rely on basal IN strand transfer activity, leading to ~10-fold reductions in overall integration (Fig. 3). While hijacking a cellular factor to direct most of their integrations into active TUs, lentiviruses have avoided a complete dependence on it for IN catalytic function. Such behavior has important implications for both disease progression and viral latency.
Our results establish LEDGF/p75 as a critical lentiviral-specific integration targeting factor. Retrotransposons have evolved largely site-specific integration mechanisms to help maintain the integrity of the host cell genome (for review, see Lesage and Todeschini 2005
). The molecular mechanism of targeted integration is best understood for Ty5, whereby heterochromatin-associated Sir4p interacts with the C terminus of Ty5 IN to direct integration. Akin to the results reported here, disruption of the IN-tethering factor interaction significantly reduced overall levels of Ty5 transposition (Xie et al. 2001
). Evolution has apparently ensured that yeast and human retroelement INs maintain close relationships with their respective tethering factors. An exciting avenue of research will be to identify cellular factors that commandeer the integration machineries of other retroviruses, such as MLV or SFV, whose telltale profiles hint of specific targeting mechanisms at play (Fig. 4A–C; Wu et al. 2003
; Mitchell et al. 2004
; Nowrouzi et al. 2006
; Trobridge et al. 2006
). Additional research into the mechanisms of retroelement targeting will help to assess and improve the safety of viral-based gene therapy vectors, and may lead to novel antiviral therapies.
| Materials and methods |
|---|
|
|
|---|
Plasmids were constructed using standard techniques, details of which are given in the Supplemental Material. Regions of all plasmids that underwent PCR amplification were verified by DNA sequencing.
Cells, viruses, and infections
Linearized pCP75KO was electroporated into TC1 ES cells (derived from the 129SvEv strain), and G418-resistant clones carrying successfully targeted Psip1/f alleles were identified by Southern blotting after prescreening by PCR with AE2331/AE2334 (primer sequences listed in Supplementary Table S2). One clone, B5a, microinjected into C57BL/6 blastocysts generated high-percentage chimera mice, which were subsequently bred to C57BL/6 animals. MEFs were isolated from 13.5-d-old embryos.
HIV-Luc is a near-full-length derivative of HIV-1NL4-3 that expresses Luc from the viral nef position, whereas HIV-SIN-Luc is a minimal transfer vector that achieves expression from a heterologous cytomegalovirus immediate early promoter (Shun et al. 2007). MLV-Luc, derived from pFB-Luc (Stratagene), expresses Luc from the MLV promoter. Viral supernatants were produced from transfected 293T cells as described (Shun et al. 2007). HIV-Luc and HIV-SIN-Luc titered using a ³²P-based RT assay were treated for 1 h at 37°C with 40 U/mL Turbo DNase (Ambion). MEFs (4 × 10⁴ per well) plated in 12-well trays 16 h before infection were infected with 1–4 × 10⁶ RT-cpm for 8 h. MLV vectors (10 mL) were concentrated to 300 µL by ultracentrifugation, and cells were infected with 800 µL of a 1:20 dilution of concentrated virus. At 44 h post-infection, infected cells were processed for determination of Luc activity as described (Shun et al. 2007). Cells transduced with VSV-G-pseudotyped pLPCX-LEDGF/p75 or pLPCX-Ledgf/p52-HA were selected in puromycin. Cells transfected with pIRES2-eGFP expression vectors and sorted by FACS were lysed for Western blotting or plated for infection, which was conducted 10 h after seeding.
DNA and RNA analyses
The Southern blotting probe in Figure 1 was constructed by PCR using AE2772/AE2773. Relative DNA content in extrachromosomal and chromosomal DNA fractions (Vandegraaff et al. 2001) was determined by qPCR using mouse mitochondrial (AE2507/AE2508) and superoxide dismutase-specific (AE2697/AE2698) primers, respectively.
LRT products in extrachromosomal DNA fractions at 7 h post-infection were analyzed by qPCR using MH531/MH532 primers, LRT-P probe, and pTY-CMVLuc (Shun et al. 2007) to generate the standard curve. 2-LTR circle formation at 24 h post-infection was quantified using primers AE2621/AE2622 and AE2623 probe. Chromosomal DNA integration at 24 h post-infection was quantified using BBL-PCR: First-round products amplified essentially as described (Vandegraaff et al. 2001) using AE2257 and a mixture of target primers annealing to murine B1 (AE2604; AE2605), B2 (AE2606; AE2607), and LINE-1 (AE2608; AE2609) repeats were diluted 1:2000 for qPCR using AE989/AE990 primers and AE995 probe. Serial dilutions of f/f cell Hirt supernatant and pellet DNAs were utilized to generate 2-LTR circle and BBL-PCR standard curves, respectively. HindIII-digested Hirt supernatant DNA was separated through sequencing gels and transferred to Duralon-UV membrane (Stratagene) in 0.3× TBE for 1 h at 50 mA using a TransBlot SD Cell (Bio-Rad) to visualize HIV-1 3' ends. EcoRI-linearized pCR2.1-U3 and pCR2.1-U5 were used to generate strand-specific riboprobes. PICs were isolated from cells essentially as previously described (Brown et al. 1987); see the Supplemental Material for details.
RNA extracted using the RNeasy Mini Kit (Qiagen) and quantified by spectrophotometry was analyzed by qRT-PCR using Psip1 exon 2/3 primers AE2624/AE2625. Standard curves were constructed by analyzing serial fivefold dilutions of RNA extracted from Ledgf/p75-expressing cells as described (Vandegraaff et al. 2006).
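The standard-curve quantification described above can be illustrated with a minimal sketch. It assumes the usual log-linear relationship between threshold cycle (Ct) and input amount; the Ct values, function names, and dilution count are hypothetical and are not taken from the study, which used its own instrument software and published protocol.

```python
import math

def fit_standard_curve(relative_inputs, ct_values):
    """Least-squares fit of Ct = slope * log10(input) + intercept."""
    xs = [math.log10(q) for q in relative_inputs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def quantify(ct, slope, intercept):
    """Interpolate the relative input amount for a sample Ct."""
    return 10 ** ((ct - intercept) / slope)

# Hypothetical standards: serial fivefold dilutions (1, 1/5, 1/25, ...).
dilutions = [5.0 ** -i for i in range(5)]
# Illustrative Ct values, spaced as expected for perfect doubling per cycle.
cts = [20.0 + i * math.log2(5) for i in range(5)]

slope, intercept = fit_standard_curve(dilutions, cts)
efficiency = 10 ** (-1 / slope) - 1  # 1.0 corresponds to 100% efficiency

# A sample whose Ct matches the third standard maps back to 1/25 of the input.
sample_amount = quantify(cts[2], slope, intercept)
```

With real data the fitted slope deviates from the ideal value, and the efficiency estimate is a convenient check that the dilution series behaved log-linearly before unknowns are interpolated.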
Western blotting
Proteins were extracted from whole cells (Vandegraaff et al. 2006) or isolated nuclei (Maertens et al. 2006) as described, and concentrations were determined using Dc Protein or Bradford Assays (Bio-Rad), respectively. Anti-Ledgf/p75 antibodies were from BD Biosciences, Abnova, or Bethyl; HA antibodies were from Roche.
Gene expression and integration site cloning analyses
Total RNA extracted from primary MEFs with Trizol (Invitrogen) was purified on RNeasy spin columns (Qiagen). Four independent RNA samples were isolated in parallel from each of two MEF lines, and Cy5-labeled probes were prepared using Agilent cRNA linear amplification, labeling, and fragmentation reagents. Cy3-labeled internal control cRNA was made using pooled Standard Mouse RNA (Stratagene). Cy5- and Cy3-labeled probes were cohybridized to eight 44k Whole-Mouse Genome oligonucleotide arrays (4 × 44k format, Agilent). The arrays were processed according to the manufacturer's instructions and scanned using a G2505B Microarray Scanner (Agilent). Feature Extraction software version 9.5 was used to process the expression data. Spike-in controls included in amplification reactions indicated linear dose signal response in both channels. For analysis of differential gene expression, microarray data for E1f/+ and E2-/- cells were normalized and compared using GeneSpring GX software (Agilent). Signal intensity in the red channel was used to estimate relative abundance of transcripts within the samples. The Cy5 signal for each probe was normalized to the mean Cy5 signal over the whole array and averaged among four arrays. Probe IDs were matched to the corresponding Ensembl IDs using BioMart (http://www.ensembl.org). All Ensembl transcripts detected on the 44k arrays were ranked based on their expression level (averaged signal was taken for transcripts represented by two or more probes) and divided into five equal bins, each containing ~4965 transcripts. TUs that hosted HIV-1 integration or simulated random integration were counted in each bin to generate Figure 5.
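The normalization and binning steps described above can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the pipeline used in the study: the function names, the toy probe and transcript identifiers, and the requirement that every array reports the same probe set are all hypothetical.

```python
from collections import defaultdict

def normalize_and_average(arrays):
    """arrays: list of dicts (probe ID -> raw Cy5 signal), one per array.
    Each signal is divided by its array mean, then averaged across arrays."""
    averaged = defaultdict(float)
    for signals in arrays:
        array_mean = sum(signals.values()) / len(signals)
        for probe, value in signals.items():
            averaged[probe] += value / array_mean / len(arrays)
    return dict(averaged)

def transcript_levels(probe_signal, probe_to_transcript):
    """Average the signal of all probes that map to the same transcript."""
    grouped = defaultdict(list)
    for probe, signal in probe_signal.items():
        grouped[probe_to_transcript[probe]].append(signal)
    return {t: sum(v) / len(v) for t, v in grouped.items()}

def assign_bins(levels, n_bins=5):
    """Rank transcripts by expression and split them into equal bins
    (bin 0 = lowest expression, bin n_bins - 1 = highest)."""
    ranked = sorted(levels, key=levels.get)
    size = len(ranked) / n_bins
    return {t: min(int(i / size), n_bins - 1) for i, t in enumerate(ranked)}

def count_hits_per_bin(bins, hit_transcripts, n_bins=5):
    """Count integration-hosting transcripts falling in each bin."""
    counts = [0] * n_bins
    for t in hit_transcripts:
        if t in bins:
            counts[bins[t]] += 1
    return counts

# Toy example: 10 probes on 2 arrays, one probe per transcript (hypothetical).
arrays = [
    {f"p{i}": float(i + 1) for i in range(10)},
    {f"p{i}": 2.0 * (i + 1) for i in range(10)},
]
probe_to_transcript = {f"p{i}": f"t{i}" for i in range(10)}

levels = transcript_levels(normalize_and_average(arrays), probe_to_transcript)
bins = assign_bins(levels)
counts = count_hits_per_bin(bins, ["t9", "t8", "t7", "t1"])
```

The same counting function would be applied to both the experimental integration sites and the simulated random sites, so that the two per-bin distributions can be compared directly.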
HIV-1 integration sites were cloned via ligation-mediated (LM)-PCR essentially as previously described (Schroder et al. 2002). Genomic DNA digested with a mixture of AvrII, NheI, and SpeI was ligated overnight with AE2844/AE2845 linker. Sequences amplified by PCR with AE2814/SB-76 were nested using AE2815/ASB-1. Resulting products subcloned into pCR4.1-TOPO (Invitrogen) were sequenced (SeqWright) using AE2865. Sequences were parsed to remove U3 and linker-derived portions; low-quality sequences and sequences that did not contain the processed 5'-TTAGCCCTTCCA-3' U3 terminus, or with <16 base pairs (bp) of genomic DNA between the processed U3 end and beginning of the linker sequence, were discarded. Details of integration site sequence analyses are given in the Supplemental Material.
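The read-filtering rules stated above can be expressed as a short sketch. Only the U3 terminus sequence and the 16-bp threshold come from the text; the linker sequence below is a hypothetical placeholder (the AE2844/AE2845 linker sequence is not given here), quality filtering is omitted, and discarding reads with no detectable linker is an assumption.

```python
U3_TERMINUS = "TTAGCCCTTCCA"  # processed U3 end, as stated in the text
MIN_GENOMIC_BP = 16           # minimum genomic DNA length, as stated in the text

def extract_genomic(read, linker_start):
    """Return the genomic DNA between the processed U3 end and the linker,
    or None if the read fails either filter.

    linker_start: the first bases of the linker as they appear in the read
    (hypothetical placeholder in this sketch).
    """
    read = read.upper()
    u3_pos = read.find(U3_TERMINUS)
    if u3_pos == -1:
        return None  # no processed U3 terminus: discard
    genomic_start = u3_pos + len(U3_TERMINUS)
    linker_pos = read.find(linker_start.upper(), genomic_start)
    if linker_pos == -1:
        return None  # assumption: reads without a linker are discarded
    genomic = read[genomic_start:linker_pos]
    return genomic if len(genomic) >= MIN_GENOMIC_BP else None

# Hypothetical linker prefix and reads, for illustration only.
LINKER = "GGGGCCCCGGGG"
good_read = "acgt" + U3_TERMINUS + "ATCGATCGATCGATCGATCG" + LINKER
short_read = "acgt" + U3_TERMINUS + "ATCGATCGAT" + LINKER
no_u3_read = "acgtacgtacgt" + "ATCGATCGATCGATCGATCG" + LINKER

kept = extract_genomic(good_read, LINKER)
```

The returned genomic segment is what would then be aligned to the reference genome to place the integration site.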
| Footnotes |
|---|
5 Present address: Avexa Limited, Richmond, Victoria 3121, Australia
E-MAIL alan_engelman{at}dfci.harvard.edu; FAX (617) 632-3113.
7 E-MAIL p.cherepanov{at}imperial.ac.uk; FAX 44-20-7594-3906.
Supplemental material is available at http://www.genesdev.org.
Article is online at http://www.genesdev.org/cgi/doi/10.1101/gad.1565107
| References |
|---|
Berry, C., Hannenhalli, S., Leipzig, J., and Bushman, F.D. 2006. Selection of target sites for mobile DNA integration in the human genome. PLoS Comput. Biol. 2: e157. doi: 10.1371/journal.pcbi.0020157.
Brown, P.O., Bowerman, B., Varmus, H.E., and Bishop, J.M. 1987. Correct integration of retroviral DNA in vitro. Cell 49: 347–356.
Bushman, F., Lewinski, M., Ciuffi, A., Barr, S., Leipzig, J., Hannenhalli, S., and Hoffmann, C. 2005. Genome-wide analysis of retroviral DNA integration. Nat. Rev. Microbiol. 3: 848–858.
Busschots, K., Vercammen, J., Emiliani, S., Benarous, R., Engelborghs, Y., Christ, F., and Debyser, Z. 2005. The interaction of LEDGF/p75 with integrase is lentivirus-specific and promotes DNA binding. J. Biol. Chem. 280: 17841–17847.
Carteau, S., Hoffmann, C., and Bushman, F. 1998. Chromosome structure and human immunodeficiency virus type 1 cDNA integration: Centromeric alphoid repeats are a disfavored target. J. Virol. 72: 4005–4014.
Cherepanov, P. 2007. LEDGF/p75 interacts with divergent lentiviral integrases and modulates their enzymatic activity in vitro. Nucleic Acids Res. 35: 113–124.
Cherepanov, P., Maertens, G., Proost, P., Devreese, B., Van Beeumen, J., Engelborghs, Y., De Clercq, E., and Debyser, Z. 2003. HIV-1 integrase forms stable tetramers and associates with LEDGF/p75 protein in human cells. J. Biol. Chem. 278: 372–381.
Cherepanov, P., Devroe, E., Silver, P.A., and Engelman, A. 2004. Identification of an evolutionarily conserved domain in human lens epithelium-derived growth factor/transcriptional co-activator p75 (LEDGF/p75) that binds HIV-1 integrase. J. Biol. Chem. 279: 48883–48892.
Cherepanov, P., Ambrosio, A.L., Rahman, S., Ellenberger, T., and Engelman, A. 2005a. Structural basis for the recognition between HIV-1 integrase and transcriptional coactivator p75. Proc. Natl. Acad. Sci. 102: 17308–17313.
Cherepanov, P., Sun, Z.-Y., Rahman, S., Maertens, G., Wagner, G., and Engelman, A. 2005b. Solution structure of the HIV-1 integrase-binding domain in LEDGF/p75. Nat. Struct. Mol. Biol. 12: 526–532.
Ciuffi, A., Llano, M., Poeschla, E., Hoffmann, C., Leipzig, J., Shinn, P., Ecker, J.R., and Bushman, F. 2005. A role for LEDGF/p75 in targeting HIV DNA integration. Nat. Med. 11: 1287–1289.
Craigie, R., Fujiwara, T., and Bushman, F. 1990. The IN protein of Moloney murine leukemia virus processes the viral DNA ends and accomplishes their integration in vitro. Cell 62: 829–837.
Crise, B., Li, Y., Yuan, C., Morcock, D.R., Whitby, D., Munroe, D.J., Arthur, L.O., and Wu, X. 2005. Simian immunodeficiency virus integration preference is similar to that of human immunodeficiency virus type 1. J. Virol. 79: 12199–12204.
Deininger, P.L. and Batzer, M.A. 1999. Alu repeats and human disease. Mol. Genet. Metab. 67: 183–193.
Emiliani, S., Mousnier, A., Busschots, K., Maroun, M., Van Maele, B., Tempe, D., Vandekerckhove, L., Moisant, F., Ben-Slama, L., Witvrouw, M., et al. 2005. Integrase mutants defective for interaction with LEDGF/p75 are impaired in chromosome tethering and HIV-1 replication. J. Biol. Chem. 280: 25517–25523.
Engelman, A. 2007. Isolation and analysis of HIV-1 preintegration complexes. In HIV protocols, 2nd edition (eds. V.R. Prasad and G.V. Kalpana). Humana Press, Totowa, NJ. (in press).
Engelman, A., Mizuuchi, K., and Craigie, R. 1991. HIV-1 DNA integration: Mechanism of viral DNA cleavage and DNA strand transfer. Cell 67: 1211–1221.
Ge, H., Si, Y., and Roeder, R.G. 1998. Isolation of cDNAs encoding novel transcription coactivators p52 and p75 reveals an alternate regulatory mechanism of transcriptional activation. EMBO J. 17: 6723–6729.
Goff, S.P. 2007. Host factors exploited by retroviruses. Nat. Rev. Microbiol. 5: 253–263.
Hayashi, S., Lewis, P., Pevny, L., and McMahon, A.P. 2002. Efficient gene modulation in mouse epiblast using a Sox2Cre transgenic mouse strain. Mech. Dev. 119 (Suppl. 1): S97–S101. doi: 10.1016/S0925-4773(03)00099-6.
Holman, A.G. and Coffin, J.M. 2005. Symmetrical base preferences surrounding HIV-1, avian sarcoma/leukosis virus, and murine leukemia virus integration sites. Proc. Natl. Acad. Sci. 102: 6103–6107.
Katz, R.A., Merkel, G., Kulkosky, J., Leis, J., and Skalka, A.M. 1990. The avian retroviral IN protein is both necessary and sufficient for integrative recombination in vitro. Cell 63: 87–95.
Kim, J., Daniel, J., Espejo, A., Lake, A., Krishna, M., Xia, L., Zhang, Y., and Bedford, M.T. 2006. Tudor, MBT and chromo domains gauge the degree of lysine methylation. EMBO Rep. 7: 397–403.
Lesage, P. and Todeschini, A.L. 2005. Happy together: The life and times of Ty retrotransposons and their hosts. Cytogenet. Genome Res. 110: 70–90.
Lewinski, M.K., Yamashita, M., Emerman, M., Ciuffi, A., Marshall, H., Crawford, G., Collins, F., Shinn, P., Leipzig, J., Hannenhalli, S., et al. 2006. Retroviral DNA integration: Viral and cellular determinants of target-site selection. PLoS Pathog. 2: e60. doi: 10.1371/journal.ppat.0020060.
Li, L., Olvera, J.M., Yoder, K.E., Mitchell, R.S., Butler, S.L., Lieber, M., Martin, S.L., and Bushman, F.D. 2001. Role of the non-homologous DNA end joining pathway in the early steps of retroviral infection. EMBO J. 20: 3272–3281.
Li, M., Mizuuchi, M., Burke, T.R., and Craigie, R. 2006. Retroviral DNA integration: Reaction pathway and critical intermediates. EMBO J. 25: 1295–1304.
Llano, M., Delgado, S., Vanegas, M., and Poeschla, E.M. 2004a. Lens epithelium-derived growth factor/p75 prevents proteasomal degradation of HIV-1 integrase. J. Biol. Chem. 279: 55570–55577.
Llano, M., Vanegas, M., Fregoso, O., Saenz, D., Chung, S., Peretz, M., and Poeschla, E.M. 2004b. LEDGF/p75 determines cellular trafficking of diverse lentiviral but not murine oncoretroviral integrase proteins and is a component of functional lentiviral preintegration complexes. J. Virol. 78: 9524–9537.
Llano, M., Saenz, D.T., Meehan, A., Wongthida, P., Peretz, M., Walker, W.H., Teo, W., and Poeschla, E.M. 2006a. An essential role for LEDGF/p75 in HIV integration. Science 314: 461–464.