|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
J. Biol. Chem., Vol. 281, Issue 5, 2711-2720, February 3, 2006
Dissection of a Circumscribed Recombination Hot Spot in HIV-1 after a Single Infectious Cycle*
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The mechanisms underlying recombination in HIV-1 have been intensively studied during the last decade. Most recombination events have been shown to occur by template switching during synthesis of the first DNA strand, when the reverse transcriptase (RT) uses the genomic RNA as a template (3, 10-12). In retroviruses, reverse transcription is coupled to the degradation of the RNA template by the RT-encoded RNase H activity, a process that is mandatory for template switching (13-15). The degradation of the template RNA (donor RNA) is required to leave the trailing nascent DNA in a single-stranded form, available for annealing onto the other copy of genomic RNA present within the viral particle (acceptor RNA). This interaction would then drive the ultimate transfer of the growing 3' end of the nascent DNA onto the second RNA moiety, leading to the generation of a chimeric DNA carrying genetic information from both copies of genomic RNA. Different causes have been proposed to be responsible for template switching (16), such as breaks on the genomic RNA (17, 18) or strong pause sites during reverse transcription (13, 19, 20). The role of stalling of reverse transcription in response to these obstacles to DNA synthesis would be to allow a more extensive degradation of the RNA template by the RT-encoded RNase H activity (13, 19-21).
The idea that template switching could occur preferentially at strong pause sites of reverse transcription has been supported by the correlation between a high rate of template switching, and the nearby presence of a strong pause site observed in cell-free reconstituted systems with several templates (13, 19, 22, 23). Stalling of reverse transcription is indeed supposed to enhance the efficiency of template switching by allotting more time to the RNase H activity to degrade the donor RNA (24-26) and by increasing the time of residence of the reverse transcriptase in the given sequence interval (23, 27). However, mounting evidence indicates that preferential sites for recombination do not necessarily correlate with stalling of reverse transcription. A particularly well documented case is the occurrence of preferential transfer in structured regions of the RNA template (20, 23, 27-33). In some of these cases the hairpin structures were suggested to induce stalling of reverse transcription at their base favoring the annealing of the acceptor RNA on the nascent DNA (20).
Using an original system to study recombination after a single cycle of infection of cells in culture, we recently identified a hot spot within the coding portion of the gp120 gene from the LAI strain of HIV-1, where the recombination rate per nucleotide was up to 10 times higher than those observed in the surrounding regions (34). In that work, which provided the first demonstration of the existence of a hot spot for recombination in infected cells, copy choice was studied on a 400-nt long region that was subdivided into five subregions named R1 to R5 following the sense of (-)-DNA synthesis. The hot region corresponded to region R2, which spanned the 58 nt that make the upper part of a hairpin (C2 hairpin), whose structure was deduced by enzymatic probing in vitro (27). In addition, by introducing mutations in the lower part of the stem in such a way as to alter the stability of the hairpin, we were able to affect the efficiency of recombination in R2. In this study, we have addressed the question of which determinants, within R2 itself, are responsible for the high degree of transfer observed.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
R8.2 (35), coding for HIV-1 gag, pol, and accessory proteins; and pHCMV-G (36), which carries the gene for the G protein of the vesicular stomatitis virus envelope. For assays with vector particles lacking accessory proteins the pCMV
R8.74 transcomplementation plasmid was used (37), together with the pHCMV-G plasmid. Cells293T cells were grown in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum, penicillin, and streptomycin (from Invitrogen), and maintained at 37 °C with 10% CO2. MT4 cells were maintained in RPMI 1640 medium supplemented with 10% fetal calf serum and antibiotics at 37 °C with 5% CO2.
Production of Vector ParticlesHIV-1-based vectors were produced by transient transfection of 293T cells using the calcium phosphate method as described in Ref. 34. Briefly, cells were transfected with an HIV-1 encapsidation plasmid (pCMV
R8.2) (35) and a vesicular stomatitis virus envelope expression plasmid (pHCMV-G) (36), together with two plasmids that generate the defective genomic RNAs shown in Fig. 1A. For the generation of viral vectors devoid of accessory proteins, pCMV
R8.2 was replaced by pCMV
R8.74 (37). To eliminate non-internalized DNA from viral preparations, the supernatants from transfected cells were DNase I treated prior to concentration using Vivaspin Ultrafiltration Concentrators (molecular weight cut-off 50,000). The amount of p24 present in vector preparations was determined by using a HIV-1 p24 enzyme-linked immunosorbent assay kit (PerkinElmer Life Sciences).
Single Cycle Infection AssaysMT4 cells were transduced with 200 ng of p24 antigen per 106 cells (corresponding to an approximate multiplicity of infection of 20) in a 500-µl volume in 35-mm dishes. Two-hours post-transduction the cells were diluted up to a 4-ml volume with supplemented RPMI medium and maintained at 37 °C in a 5% CO2 incubator for 40 h. The reverse transcription products (RTP) were purified from the cytoplasmic fraction of transduced cells using the method described by Hirt (38), because most of the RTP remain in an unintegrated form and, additionally, the genomic vectors lack the FLAP sequence shown to enhance nuclear import of RTP (39). Briefly, cells were lysed, high molecular weight DNA was removed by precipitation, and the lysates were cleared by ultracentrifugation. After phenol chloroform extraction from the supernatant, low molecular weight DNA was ethanol-precipitated and purified using the NucleoSpin® Extract clean-up kit (Macherey-Nagel). The purified double-stranded DNA was digested with DpnI to eliminate possible contaminating DNA of bacterial origin prior to PCR amplification (20 cycles) with primers BH and SH, and cloning in Escherichia coli. Plating on isopropyl 1-thio-
-D-galactopyranoside/5-bromo-4-chloro-3-indolyl-
-D-galactopyranoside (IPTG/X-gal) containing dishes allowed blue/white screening of recombinant and parental colonies, respectively (34). To determine the regions where strand transfer occurred, 48 recombinant clones were analyzed for each triplicate assay. A full panel of controls run in parallel to ascertain that the recombinant molecules were generated during reverse transcription has been previously described (34).
Estimation of the Recombination RatesThe transfection of equal amounts of pLac+ and pLac- plasmids leads to the production of similar quantities of each of the genomic RNAs as previously reported (34), because the same promoter sequence is present in both genomic plasmids. Given that the RNAs also share the same sequences for dimerization and encapsidation, these processes are expected to yield 50% of heterozygous, and 50% of homozygous particles, with an equal amount of lac +/+ and lac-/- vectors, as usually assumed (40). After transduction of MT4 cells, the RTP are amplified by PCR and cloned in E. coli after digestion with BamHI and SacII. This procedure will allow cloning BamHI+/lac- and BamHI+/lac+ RTP, which can be generated by reverse transcription in homozygous lac-/- and in heterozygous particles (34). As a result, assuming that only one molecule of double-stranded DNA is generated from each viral particle, one-third of the total amount of colonies will correspond to RTP issued from lac-/- vectors. The total number of colonies is therefore multiplied by 2/3 to consider only the RTP issued from heterozygous particles. The possibility of cloning products of cellular origin was also ruled out as described in Ref. 34. The number of white colonies (N) is corrected by a factor given by n/48, where n is the number of colonies that resulted from cloning of RTP after analysis of 48 white colonies. The global frequency of recombination (F) is given by: F = b/{2/3[N(n/48) + b]}, where b is the number of blue colonies. The recombination rate per nucleotide (f) within a given interval (i) is given by: f = F(xi/X)/zi, where F is as above, xi is the number of colonies analyzed where recombination was identified to have occurred within the interval considered, X is the total number of colonies on which mapping was performed, and zi is the size in nucleotides of the interval.
Primer Extension AssaysReverse transcription was primed using a 5'-terminal-labeled deoxyoligonucleotide and, for each RNA analyzed, 3 pmol were annealed with the primer at a molar ratio of primer to RNA of 10:1. Annealing was performed at 65 °C for 5 min in a buffer containing 50 mM Tris-HCl (pH 7.8) and 75 mM KCl, followed by slow cooling to 40 °C. After incubation on ice for 2 min, dithiothreitol was added at a final concentration of 1 mM, together with 100 units of RNasin (Promega). The nucleocapsid protein (NC) was then added at a ratio of 1 molecule of NC for 8 nt of total RNA and incubated at 37 °C for 5 min. HIV-1 RT was added at a final concentration of 400 nM and incubated for 5 min at 37 °C to allow the formation of the reverse transcription complex. The reaction was started by addition of the four dNTPs (1 mM each, final concentration) and MgCl2 (at a final concentration of 7 mM), and stopped at various time intervals by addition of EDTA and SDS at a final concentration of 25 mM and 0.4%, respectively. The samples were incubated for 1 h at 56°C and ethanol precipitated after phenol and chloroform extraction. The products were analyzed by autoradiography using a PhosphorImager (Amersham Biosciences) after electrophoresis on 8% (w/v) polyacrylamide gels containing 8 M urea, using a loading buffer with formamide at a final concentration of 22.5%.
| RESULTS |
|---|
|
|
|---|
|
|
Effect of HIV-1 Accessory Proteins on the Distribution of Strand Transfer EventsVarious viral accessory proteins have been shown to be present in HIV-1 cores (41), possibly participating in the formation of the reverse transcription complex and affecting the mutation rate during reverse transcription (42). To determine whether these viral proteins have any role in the generation of the identified recombination hot spot, we carried out a recombination assay on C2 RNA after generating vector particles deprived of vif, vpr, vpu, and nef proteins, as described under "Experimental Procedures." Template switching occurred at recombination rates similar to those found in the presence of the accessory proteins, as previously observed (43), and with preference for the same genomic region (Fig. 3A, right panel). Altogether, these observations suggest that the major determinants for the generation of the hot spot in R2c are not related to the presence of these cofactors and might therefore be determined by other parameters, as could be the primary sequence of this region, or its position in the C2 hairpin.
Position Effect on RecombinationThe observation that only the descending part of the hairpin constitutes a hot sequence for recombination raises the question of which determinants are responsible for this behavior. We first checked whether the location of R2c in the descending portion was essential for the high efficiency of transfer observed. To this end, we constructed a mutant hairpin where the portions of R2a and R2c engaged in the formation of the upper part of the hairpin were exchanged, as shown in Fig. 2B (SW RNA). The exchange was made maintaining the 3' to 5' polarity of the swapped sequences as in the C2 RNA, in such a way that the sequences the RT will come across within R2a and R2c will be the same as in C2 RNA (Fig. 2B). Based on folding prediction analyses of the 118 nt that constitute the hairpin using the m-fold program (44), these rearrangements should result in the formation of a hairpin similar to the one of C2 RNA, and no alternative structures were predicted to be formed (supplementary materials). As shown in Fig. 3B (SW RNA), the rate of recombination in R2c decreased by a factor 15 (from 10.5 to 0.7 x 10-4 template switching events per nt), and no longer constituted a hot spot. In parallel, positioning the sequence R2a in the descending portion of the SW structure turned it into a recombination hot spot displaying a recombination rate per nucleotide 20 times higher than that observed on the same sequence in C2 RNA (23.9 versus 1.2 x 10-4 per nt). These observations suggest that the descending part of the stem constitutes a crucial position for the high recombination rate observed in the hot spot. Noteworthy, the frequency found in R2a with SW RNA is approximately twice (Fig. 3) that found in the same region of the hairpin on C2 RNA, where that position was occupied by region R2c (Fig. 2). This suggests that the primary structure of the region influences the efficiency of recombination also, although at a lesser extent than its position in the secondary structure.
To further investigate the contribution of the position effect and the RNA sequence on the efficiency of recombination suggested by these results, we generated a new mutant where swapping of R2a and R2c was performed using the antiparallel sequences. In this way, while the base content of this portion of the hairpin remains unaltered, the primary structure within these regions is different than in SW RNA (SWap, Fig. 2B). This mutant RNA maintained, as judged by prediction analyses, the ability to fold in a structure comparable with the one of the C2 hairpin, albeit two other potential folding configurations (with a comparable free energy) were predicted in this case, indicating a possible equilibrium among the three forms (see supplementary materials). In this RNA, no preferential spots for recombination could be highlighted among the 8 regions comprising the homologous sequence (Fig. 3B, right panel).
|
Inside R2c in C2 RNATo investigate more in depth to which extent the primary structure of the RNA contributes to the high degree of transfer observed in a given region of the RNA, we focused on the wild type RNA (C2), where R2c constituted the hot region. We first attempted to define more precisely the hot region within R2c. To this end another donor RNA was made, carrying a mutation in a median location of R2c (m10 RNA), to split this region in approximately two halves. In this mutant the U residue at position 10 of the hot spot was replaced by a C (Fig. 2A), allowing to preserve the possibility of forming the hairpin by replacing the U-G pairing of C2 RNA with a C-G pairing in the mutant. Strikingly, the use of this RNA in combination with C2 as acceptor template led to a remarkable drop in the occurrence of recombination in the whole R2c region with respect to the C2 RNA (from 10.5 to 0.7 x 10-4 per nt). Recombination rates for each of the other regions were not significantly changed (not shown). We reasoned that two possible explanations could be provided for this result. One could be that sequence determinants required for efficient switching on the donor RNA were abolished in m10 RNA. In particular the interruption of a poly(U) stretch could have been detrimental to the transfer process, in accord to what has been previously suggested (45). An alternative explanation could have been that the presence of a base substitution between donor and acceptor RNA in the region of transfer severely affected the efficiency of the process. On this basis, we examined the contribution of the primary structure of the RNA and sequence divergence between donor and acceptor RNA on the transfer process.
Role of the Primary Structure of the RNA in R2cWe first addressed the issue of the role of the primary structure on the efficiency of transfer. This was achieved by constructing four mutant RNAs, each carrying two consecutive base substitutions in different regions of R2c. The mutations were introduced on the donor and acceptor RNAs in such a way as to maintain a perfect identity between these two RNA moieties inside the R2c region.
Four mutants were constructed, targeting the part near the apical loop (m2-3, see Fig. 1A), the small lateral loop in the descending stem (m6-7), the stretch of four uridine residues in the central part of the hot spot (m10-11), and the sequence close to the border with region R3 (m13-14). In the three cases where the mutated residues were located in a double-stranded portion, the mutations were introduced in such a way as to maximize the likelihood of preserving the hairpin structure by introducing compensative mutations in R2a. In all cases these mutations were, accordingly, predicted to preserve the possibility of forming a hairpin as the one in C2. However, in two cases (m6-7 and m10-11) alternative conformations were also predicted to be likely (see supplementary materials). The four mutants yielded different results with the pattern of recombination essentially not altered with m2-3 and m13-14, and a 2-3-fold decrease in the efficiency of transfer in R2c for m6-7 and m10-11. As a result, no significant preferential sites for recombination were present on these two RNAs. It is worth noting that, instead, the compensatory mutations introduced in R2a did not modify the recombination rate in this region in any of the mutants (Fig. 3C).
These results suggest that sequence specificity determinants influence the efficiency of recombination, confirming the observations made with SWap RNA. The fact that the recombination rates within R2c were not significantly affected when residues 2-3 and 13-14 were mutated (Fig. 3C) could be interpreted as an indication that these residues are located outside the actual hot region. However, this conclusion would rely on the assumption that all mutations within the region important for transfer have a negative effect on template switching. No rationale supports such an interpretation, since mutations could in principle also be neutral or beneficial for strand transfer. We therefore reasoned that another likely explanation could be that the substitutions introduced in m6-7 and m10-11 modified crucial parameters for the efficiency of transfer that were not affected in m2-3 and m13-14. These hypotheses were tested below.
Primary Structure and Kinetics of Reverse TranscriptionAs mentioned above, a likely parameter that can affect copy choice is constituted by the pattern of pausing during reverse transcription. Based on in vitro primer extensions assays, pausing of DNA synthesis has been related in several cases to the efficiency of strand transfer in vitro (13, 19, 20, 22, 23, 28). The kinetic of reverse transcription was therefore analyzed on the two RNAs carrying double point mutations for which recombination in R2c was decreased with respect to C2 RNA together with wild type RNA and, for comparison, a mutant RNA where recombination was not affected (m13-14). Reverse transcription was performed in the presence of HIV-1 NC, a nucleic acid chaperon protein tightly bound to the genomic RNA during reverse transcription in vivo.
An overview of the pausing pattern along the region of homology used in the recombination assays is presented in Fig. 4. The kinetics of reverse transcription observed here were similar along the four RNAs, with no particular signature of the pausing pattern in R2c compared with other regions. Only minor differences between the four RNAs were found, almost exclusively at the level of the regions harboring the mutations (region R2c for m6-7 RNA, and regions R2a and R2c for m10-11 and m13-14 RNAs, Fig. 3).
Repeating the same analysis at a higher resolution allowed us to estimate the degree of pausing for each of the 18 nt constituting R2c (Fig. 5). Two significant pause sites are observed in the wild type R2c region and in the three versions generated by point mutations (Fig. 5B). One site, indicated by a diamond in Fig. 5, is located near the border between R2c and R3 (nt 17-G in Fig. 5A). Its intensity is comparable in C2, m6-7, and m13-14 RNAs, whereas it only constitutes a minor site for stalling of reverse transcription in m10-11 RNA. The other pause site corresponds to the first of two guanine residues transcribed after the 4-U tract present in C2 RNA (position 13G, indicated by an asterisk in Fig. 5). In this case, pausing is similar for C2, m10-11, and m6-7 RNAs, whereas an attenuation of stalling was observed in the m13-14 RNA.
In conclusion, this analysis does not highlight any significant correlation between the presence of pause sites and the recombination rates in these four RNAs. This is further supported by the analysis of pausing on SW and SWap RNAs (Fig. 5C), where a strong pause site was detected at the border of the region of interest in SWap RNA, where no hot spot for recombination was detected, and no significant pausing was perceived in SW, where the presence of a hot spot was evident (Fig. 3B).
Sequence Divergence within R2cAs mentioned above, the dramatic inhibition of copy choice in R2c observed when using virions carrying the pair m10/C2 RNAs (donor/acceptor, respectively) was likely indicative of a detrimental role on recombination of the local sequence divergence in proximity of the region of transfer. We therefore reasoned that using slightly divergent donor and acceptor RNAs could be of use to define the region important for template switching. The rationale was that sequence heterogeneity between donor and acceptor RNAs in the area of transfer would hamper template switching. If the mutations were introduced ahead of the position of transfer (following the sense of reverse transcription), or too far behind it, no significant effect should be observed.
|
| DISCUSSION |
|---|
|
|
|---|
A series of mutant RNAs were designed to address the question of which features of this region are responsible for the high recombination rate observed. In a first instance the entire hot region was swapped in position with the sequence that was located in the mounting part of the hairpin (R2a). This was done either maintaining the original 5' to 3' orientation of the sequences, or using the antiparallel sequences. These experiments highlighted the existence of both a strong position effect within the secondary structure of the RNA and the implication of the sequence of the hot spot itself. Indeed, the location in the descending portion of the stem appeared crucial because, in both mutants, R2c underwent a 15-fold drop in the recombination rate in the ectopic position. In addition, for one of the mutants (SW RNA, Fig. 3B) a spectacular increase in recombination was observed for R2a. Altogether, these observations downplayed the role of the primary structure of the RNA in the hot spot. However, the result that, in the other mutant, the insertion of R2a in the antiparallel orientation in the descending limb of the stem failed to turn it into a hot spot (R2a-ap in SWap RNA) underscored that the RNA sequence also contributes to the efficiency of copy choice. The participation of the sequence in determining the efficiency of recombination was confirmed by the results obtained with four constructs that carried, in the R2c region, two base substitutions each, at different positions. For two mutants the ability of R2c to harbor template switching remained unaffected, whereas for two others it decreased by an approximate 3-fold factor.
|
|
|
Although this model remains speculative in its details, the occurrence of strand exchange between a DNA/RNA heteroduplex and a double-stranded acceptor RNA necessarily implies a delicate equilibrium between the conformations of both these reactants. How can the substitution of only two residues in the hot region, as in m10-11 and m6-7 RNAs, influence this process? One possibility could be that the disparity in the recombination rates observed with the different RNAs reflects different efficiencies of the migration process. Indeed, branch migration has recently been shown to be highly non-uniform with some sequences resulting in stable conformations in which migration is almost blocked, alternated to sequences where migration is fast (48). Such discrepancies could account for the differences in recombination observed with m6-7 and m10-11 mutants.
However, the mutations present in one of the aforementioned mutants (m6-7) involved two bulged nucleotides and it is therefore expected to affect branch migration only marginally. In the context of the infected cell, viral and, possibly, cellular proteins assist the reverse transcription process (42). In particular, the viral NC protein is expected to modulate the equilibrium between the various structures of the nucleic acids in the transfer process (46). In this regard, it has been shown that the identity of bulged residues as well as subtle sequence changes in the double-stranded region of hairpins can trigger important differences in the stability of these structures, both at the RNA and cDNA levels, and in the modulation of template switching by NC (49-52). Here, it is possible that the identity of the residues present in the bulge could influence template switching by such a mechanism.
Another possible explanation, not exclusive with the previous ones, is related to the folding of the RNA in the hot region. The different mutants used in this study were designed in such a way as to preserve the possibility for the RNA to fold into a hairpin similar to the one found in C2 RNA, whose structure has been previously determined (27). However, exclusively in those cases where the hot spot was erased (m6-7, m10-11, and also for the transposition mutant SWap), this hairpin was predicted to be in equilibrium with alternative structures of comparable stabilities (see supplementary materials), suggesting that these RNAs could frequently shift from one to another of these structural conformations, decreasing the proportion of molecules under the appropriate conformation to promote efficient recombination when reverse transcription reaches this region.
A major effect on recombination, stronger than that of the primary structure of the hot spot, is exerted by the degree of sequence identity in the region of transfer. The most logical explanation for the dramatic inhibition of template switching, observed even with a single nucleotide of sequence discordance between donor and acceptor RNA, is that it results in a destabilization of the heteroduplex between the nascent DNA and the acceptor RNA, because of the presence of mismatched residues. This implies that whenever a mutation interferes with transfer, this is probably indicative of the participation of the mutated residues in the formation of such heteroduplex and, consequently, that transfer occurs downstream from the mutation, in the sense of reverse transcription. Because the presence of divergent residues between donor and acceptor RNAs affected recombination for all the positions tested, it can be deduced that the transfer probably occurs on a restricted area near the 5' end of R2c (close to the border with R3). The inhibition observed for all the mutations tested suggests that, essentially, the whole R2c region is important for the transfer. For a transfer event occurring close to the R2c/R3 border, the size of R2c well correlates with that of the region that, on the nascent DNA, is certainly in a double-stranded form, i.e. the 18 nt that separate the polymerase and the RNase H active sites of the RT (53). It should be noted that for branch migration between DNA molecules, the presence of a single mismatch is sufficient to block the process (54).
Overall, the present work indicates that at least three factors contribute to the high rate of copy choice in R2c. The secondary structure of the RNA and the presence of a perfect identity between donor and acceptor RNAs appear pivotal, triggering changes in the rate of recombination in the 15-20-fold range in the first case, and leading to the total abolishment of recombination for as little as one or two discordant residues between donor and acceptor RNAs. The third factor, contributing to a lesser extent to the efficiency of transfer (effect in the 2-3-fold range) is constituted by the primary structure of the genomic RNA. In view of these observations, successful copy choice in infected cells appears to be the outcome of a delicate thermodynamic equilibrium.
| FOOTNOTES |
|---|
The on-line version of this article (available at http://www.jbc.org) contains a supplemental Figure. ![]()
1 Recipient of a fellowship from Sidaction. ![]()
2 To whom correspondence should be addressed. Tel.: 33-145-688-505; Fax: 33-145-688-399; E-mail: matteo{at}pasteur.fr.
3 The abbreviations used are: HIV-1, human immunodeficiency virus type 1; RT, reverse transcriptase; RTP, reverse transcription products; nt, nucleotides; NC, nucleocapsid protein. ![]()
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|