| Literature DB >> 19443440 |
Anne-Kathrin Schultz1, Ming Zhang, Ingo Bulla, Thomas Leitner, Bette Korber, Burkhard Morgenstern, Mario Stanke.
Abstract
Previously, we developed jumping profile hidden Markov model (jpHMM), a new method to detect recombinations in HIV-1 genomes. The jpHMM predicts recombination breakpoints in a query sequence and assigns to each position of the sequence one of the major HIV-1 subtypes. Since incorrect subtype assignment or recombination prediction may lead to wrong conclusions in epidemiological or vaccine research, information about the reliability of the predicted parental subtypes and breakpoint positions is valuable. For this reason, we extended the output of jpHMM to include such information in terms of 'uncertainty' regions in the recombination prediction and an interval estimate of the breakpoint. Both types of information are computed based on the posterior probabilities of the subtypes at each query sequence position. Our results show that this extension strongly improves the reliability of the jpHMM recombination prediction. The jpHMM is available online at http://jphmm.gobics.de/.Entities:
Mesh:
Year: 2009 PMID: 19443440 PMCID: PMC2703979 DOI: 10.1093/nar/gkp371
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Part of the jpHMM web server output for an artificial recombinant containing alternating B/F1 fragments of lengths 1500/300 nt. Above the genome map of the predicted recombination is shown (drawn with the HIV Sequence Locator Tool, http://hiv.lanl.gov/), below the posterior probabilities of the subtypes. Breakpoint intervals are shown by an interfingering of the colors of the two predicted subtypes, uncertainty regions by an interfingering of grey and the color of the predicted subtype. For the uncertainty region around position 3438 the posterior probabilities give a hint to the correct subtype F1.
Comparison of the accuracy of breakpoint intervals (BPI) predicted by jpHMM and the accuracy of BPI of fixed length
| Threshold | BPI length | Percentage of BP found using | ||
|---|---|---|---|---|
| Average | Minimum/ Maximum | Fixed BPI length | ||
| 0.75 | 16.12 | 0/113 | 50.28 | |
| 0.85 | 22.46 | 0/121 | 56.39 | |
| 0.90 | 26.89 | 2/135 | 59.44 | |
| 0.95 | 34.05 | 2/202 | 65.00 | |
| 0.99 | 48.58 | 5/233 | 71.94 | |
| 0.9999 | 84.77 | 11/492 | 81.39 | |
Results are shown for the first dataset. In column 1, the threshold tBPI is given. The average length of the BPI defined by tBPI is given in column 2, the minimal and maximal length in column 3. In column 4, the percentage of real breakpoints detected with these BPI is shown, in column 5 the percentage of breakpoints using the average length of the predicted BPI as fixed BPI length (naïve method). For each threshold the highest value is marked in bold face.