| Literature DB >> 20646301 |
Adam F Allred1, Guang Wu, Tuya Wulan, Kael F Fischer, Michael R Holbrook, Robert B Tesh, David Wang.
Abstract
BACKGROUND: All infectious disease oriented clinical diagnostic assays in use today focus on detecting the presence of a single, well defined target agent or a set of agents. In recent years, microarray-based diagnostics have been developed that greatly facilitate the highly parallel detection of multiple microbes that may be present in a given clinical specimen. While several algorithms have been described for interpretation of diagnostic microarrays, none of the existing approaches is capable of incorporating training data generated from positive control samples to improve performance.Entities:
Mesh:
Year: 2010 PMID: 20646301 PMCID: PMC2921407 DOI: 10.1186/1471-2105-11-384
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Viruses hybridized to the diagnostic microarray
| Virus | Family | Causes HF | # of strains hybridized |
|---|---|---|---|
| Amapari virus | No | 1 | |
| Guanarito virus | Yes | 4 | |
| Ippy virus | No | 1 | |
| Junin virus | Yes | 1 | |
| Lassa virus | Yes | 2 | |
| Lymphocytic choriomeningitis virus | No | 1 | |
| Machupo virus | Yes | 1 | |
| Mobala virus | No | 1 | |
| Mopeia virus | No | 1 | |
| Sabia virus | Yes | 1 | |
| Tacaribe virus | No | 1 | |
| California encephalitis virus | No | 1 | |
| Crimean-Congo hemorrhagic fever virus | Yes | 4 | |
| Hantaan virus | Yes | 1 | |
| La Crosse virus | No | 1 | |
| Ngari virus | Yes | 1 | |
| Puumala virus | Yes | 1 | |
| Rift Valley fever virus | Yes | 3 | |
| Seoul virus | Yes | 1 | |
| Toscana virus | No | 1 | |
| Angola marburgvirus | Yes | 1 | |
| Reston ebolavirus | No | 1 | |
| Sudan ebolavirus | Yes | 1 | |
| Zaire ebolavirus | Yes | 1 | |
| Gabon ebolavirus | Yes | 1 | |
| Dengue virus 1 | Yes | 2 | |
| Dengue virus 2 | Yes | 2 | |
| Dengue virus 3 | Yes | 2 | |
| Dengue virus 4 | Yes | 2 | |
| Kyasanur Forest disease virus | Yes | 2 | |
| Omsk hemorrhagic fever virus | Yes | 4 | |
| Rocio virus | No | 1 | |
| Yellow fever virus | Yes | 2 | |
Figure 1Flow of VIPR. The VIPR probabilistic model incorporates both empirical array data as well as sequence data from GenBank to calculate likelihoods for each candidate virus. A) Posterior probabilities are calculated for each probe. B) The On or Off posterior is chosen for each probe based on predicted binding to candidate genomes. C) Posteriors are multiplied to obtain a likelihood for each candidate virus.
Figure 2Examples of . A) One representative probe with highly resolved On and Off distributions based on the training set data. B) One representative probe where the On and Off distributions overlap. Empirical distributions (blue = Off, red = On) and estimated distributions (cyan = Off, pink = On) are shown.
Five highest scoring candidates for a Dengue 3 hybridization.
| Rank | Virus | Family | log(L) | p-value |
|---|---|---|---|---|
| 1 | Dengue virus 3 | -352 | 0 | |
| 2 | Dengue virus 4 | -391 | 0 | |
| 3 | Dengue virus 2 | -539 | 0 | |
| 4 | Dengue virus 1 | -599 | 0 | |
| 5 | Psittacid herpesvirus 1 | -433 | 1.0 | |
The six arrays that were misclassified by VIPR.
| False positives | ||
|---|---|---|
| Chip# | Hybridized virus | Top scoring virus (p < 5e-4) |
| 207 | Dengue virus 3 | Dengue virus 4 |
| Chip# | Hybridized virus | Top scoring virus (p < 5e-4) |
| 462 | Kyasanur Forest disease virus | none |
| 463 | Kyasanur Forest disease virus | none |
| 464 | Kyasanur Forest disease virus | none |
| 221 | Ippy virus | none |
| 245 | Ippy virus | none |
Figure 3Cross-validation results for different combinations of prior pairs.
Accuracy of VIPR compared to other methods for this dataset.
| Algorithm | Accuracy (%) |
|---|---|
| VIPR | 94 |
| DetectiV | 76-83 |
| E-Predict | 61 |
| PhyloDetect | 49 |