| Literature DB >> 25203725 |
Lei Jia1, Lin Li1, Hanping Li1, Siyang Liu1, Xiaolin Wang1, Zuoyi Bao1, Tianyi Li1, Daomin Zhuang1, Yongjian Liu1, Jingyun Li1.
Abstract
BACKGROUND: Recombination is one of the major mechanisms underlying the generation of HIV-1 variability. Currently 61 circulating recombinant forms of HIV-1 have been identified. With the development of recombination detection techniques and accumulation of HIV-1 reference stains, more accurate mosaic structures of circulating recombinant forms (CRFs), like CRF04 and CRF06, have undergone repeated analysis and upgrades. Such revisions may also be necessary for other CRFs. Unlike previous studies, whose results are based primarily on a single recombination detection program, the current study was based on multiple recombination analysis, which may have produced more impartial results.Entities:
Mesh:
Year: 2014 PMID: 25203725 PMCID: PMC4159329 DOI: 10.1371/journal.pone.0107349
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information of 6 reanalyzed CRFs.
| Name | Reference strain | Accession number | Methods determining original mosaic structure |
| CRF07_BC | 97CN001_C54 | AF286226 | RIP |
| CRF08_BC | 97CNGX_6F | AY008715 | Simplot |
| CRF23_BG | CB118 | AY900571 | Recombinant structures were determined by Simplot. The programs Genconv, MaxChi, and GARD were used to locate more precisely the positions of breakpoints. |
| CRF24_BG | CB378 | AY900574 | |
| CRF38_BF | UY03_3389 | FJ213783 | Simplot |
| CRF44_BF | CH80 | FJ358521 | Simplot |
Lists of methods and algorithms of the four recombination analysis tools.
| Program | Method | Implemented algorithms | Publication Year |
| SimPlot | Phylogenetic methods, Pairwise sequence comparisons | Similarity/distance plot, Bootscanning | 1999 |
| RIP | Pairwise sequence comparisons | Similarity/distance plot | 1995 |
| jpHMM | Jumping profile hidden Markov models, Hidden Markov models | 2009 | |
| RDP3 | Phylogenetic methods, Patterns of sites, Pairwise sequence comparisons | Sawyer's statistical test for gene conversion, Maximum chi-squared, Similarity/distance plot, Bootscanning, Difference in Sums of Squares method, Graphical recombination detection using Phylogenetic Profiles, Likelihood Analysis of Recombination in DNA, Sister scanning method, RDP method, Automated bootscanning (Recscan), Recombination detection using multiple approaches, Recombination detection using hyper-geometric random walks. | 2010 |
Figure 1Recombination and phylogenetic reanalysis of CRF07_BC.
(A) Genome maps of CRF07_BC from different sources. The top picture shows the original mosaic structure from the Los Alamos HIV database. The middle image is the jpHMM-derived mosaic structure. The bottom image is the RDP3-derived mosaic structure. The standard representatives are marked by different colors, as indicated. (B) Phylogenetic relationship of the region spanning HXB2 nt 790–2053 in gag with the representatives of the major HIV-1 (group M) subtypes based on a new mosaic structure. The tree was constructed using the Maximum likelihood method implemented in Mega 5. Values at the nodes indicate the percent bootstraps in which the cluster to the right was supported. Bootstrap support values of ≥70% were considered significant. Only bootstraps of 70% and higher are shown. Subtype C clades are identified by brackets. Branch lengths are drawn to scale. Positions are shown beside the tree.
Comparison of newly identified segment assignment and breakpoint positions of CRF07_BC with original data.
| Method of recombination analysis | Segment assignment and breakpoint positions of CRF07_BC | ||||
| RIP | B1: 1270 1410 | B2: 2081 2620 | B3: 3011 3310 | B4: 5700 6310 | B5: 8797 9058 |
| jphMM | - | 2073 2546 | 2979 3203 | 5851 6428 | 8840 9059 |
| RDP3 | - | 2054 2523 | 2924 3192 | 6057 6420 | 8845 9051 |
| RDP3 plus original reference sequences | - | 2059 2546 | 2914 3185 | 5670 6431 | 8869 9059 |
-indicates that the segment was not detected using this method.
Figure 2Recombination and phylogenetic reanalysis of CRF08_BC.
(A) Genome maps of CRF08_BC from different sources. The first mosaic structure in the panel is from the Los Alamos HIV database. The second is from jpHMM. The third is from RDP3. The standard representatives are marked with different colors, as indicated. (B) Phylogenetic relationship of the region spanning HXB2 nt 9027–9417 in nef with the representatives of the major HIV-1 (group M) subtypes based on newly inferred mosaic structure. The tree was constructed using the maximum likelihood method implemented in Mega 5. Bootstrap support values of ≥70% are considered to be significant.
Comparison of newly identified segment assignment and breakpoint positions of CRF08_BC with original data.
| Method of recombination analysis | Segment assignment and breakpoint positions of CRF08_BC | ||
| Simplot | B1: 1234 1680 | B2: 2853 3149 | B3: 8797 9417 |
| jphMM | - | 2842 3177 | 8864 9026 |
| RDP3 | 1325 1649 | 2908 3171 | 8849 9002 |
| RDP3 plus original reference sequences | 1268 1641 | 2866 3171 | 8869 9028 |
-indicates that the segment was not detected using this method.
Figure 3Recombination and phylogenetic reanalysis of CRF24_BG and CRF44_BF.
(A) Results of CRF24 from both jpHMM and RDP3 indicate there is a whole B segment in the pol region rather than three separate segments in the previous structure. The result was further confirmed by the phylogenetic tree. The position spans 2588–4153 is shown above the tree. The tree was constructed using PhyML program implemented in the RDP3 software package. Bootstrap replications of ≥350 (i.e., bootstrap support values of ≥70%) are considered to be significant. The very small part of the B segment (8697–8750) in env was not detected by either newly-used program. (B) Both jpHMM and RDP3 indicate that in CRF44, there is a whole B segment in the pol region rather than two separate segments in previous mosaic structure. The result was further confirmed by the phylogenetic analysis. The position spans 2470–3705 is shown above the tree. The tree was constructed using PhyML program implemented in the RDP3 software package. Bootstrap replications of ≥350 (i.e., bootstrap support values of ≥70%) are considered to be significant. A very small part of B segment (6342–6446) in env was not detected by either new program.
Comparison of newly identified segment assignment and breakpoint positions of CRF24_BG with original data.
| Method of recombination analysis | Segment assignment and breakpoint positions of CRF24_BG | |||
| Simplot | B1: 2552 2842 | B2: 2975 3292 | B3: 3380 4148 | B4:8697 8750 |
| jphMM | 2574 4208 | - | ||
| RDP3 | 2588 4153 | - | ||
| RDP3 plus original reference sequences | 2591 4153 | - | ||
-indicates that the segment was not detected using this method.
Comparison of newly identified segment assignment and breakpoint positions of CRF44_BF with original data.
| Method of recombination analysis | Segment assignment and breakpoint positions of CRF44_BF | ||
| Simplot | B1: 2470 2724 | B2: 2935 3695 | B3: 6342 6446 |
| jphMM | 2465 3729 | - | |
| RDP3 | 2470 3705 | - | |
-indicates that the segment was not detected using this method.
Figure 4Reanalysis of CRF23_BG and CRF38_BF reveal the challenges of recombination analysis.
(A) Three recombination programs indicate similar endpoints but significantly different starting points of the B segment in pol region of CRF23. (B) These three recombination programs indicate similar starting points but different endpoints of the B segment in pol region of CRF38.