| Literature DB >> 17868443 |
Michael Watson1, Juliet Dukes, Abu-Bakr Abu-Median, Donald P King, Paul Britton.
Abstract
DNA microarrays offer the possibility of testing for the presence of thousands of micro-organisms in a single experiment. However, there is a lack of reliable bioinformatics tools for the analysis of such data. We have developed DetectiV, a package for the statistical software R. DetectiV offers powerful yet simple visualization, normalization and significance testing tools. We show that DetectiV performs better than previously published software on a large, publicly available dataset.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17868443 PMCID: PMC2375028 DOI: 10.1186/gb-2007-8-9-r190
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Flow of information, and steps taken, when analyzing pathogen detection microarray data using DetectiV.
Figure 2GSM40814 by family. Example barplot from DetectiV showing data from a virus detection microarray. The sample included amplified RNA from nasal lavage, positive for respiratory syncytial virus by DFA. Oligos have been averaged over replicates and grouped according to virus family. Each unique oligo is represented by a single bar. Each virus family has a unique background color. The y-axis is raw intensity.
Figure 3GSM40814 Paramyxoviridae by species. Example barplot from DetectiV showing data from a virus detection microarray. The sample included amplified RNA from nasal lavage, positive for respiratory syncytial virus by DFA. Only oligos representing species from the Paramyxoviridae family are shown. Oligos have been averaged over replicates and grouped according to virus species. Each unique oligo is represented by a single bar. Each virus species has a unique background color. The y-axis is raw intensity.
DetectiV normalization methods
| Method | Normalized statistic | Terms |
| Median | Where | |
| Control | Where | |
| Array | Where |
Explanation of the three normalized statistics offered by DetectiV.
E-Predict parameters
| Parameter | Value |
| user_wts | MV_72worst_medRaw500_badYdens |
| norm_opt | Sum |
| energy_filter | undef |
| ematrix | 22/07/2004 |
| ematrix_norm | Quadratic |
| ematrix_efilter | 30 |
| dist_metric | Pearson Uncentered |
| iterate | 2 |
| top_oligos | 5 |
| top_genomes | 5 |
| top_fams | 5 |
| sort_by | Distance|P value |
| eclust | None |
Parameters used for input into E-Predict.
Typical results from DetectiV
| GSM40806 | GSM40810 | GSM40820 | ||||||
| Virus | Mean | Virus | Mean | Virus | Mean | |||
| Human papillomavirus type 18 | 4.1E-10 | 6.8 | Human rhinovirus sp. | 9.9E-12 | 4.1 | Human herpesvirus 5 | 5.3E-16 | 0.57 |
| Human endogenous retrovirus K115 | 0.000016 | 4 | Human rhinovirus A | 2.3E-09 | 4.1 | Respiratory syncytial virus | 1.1E-09 | 4.26 |
| Halovirus HF2 | 0.0017 | 2.1 | Enterobacteria phage M13 | 2.2E-07 | 5.7 | Human rhinovirus sp. | 5.9E-08 | 0.75 |
| Human papillomavirus type 45 | 0.002 | 3.3 | Human rhinovirus 16 | 6.2E-07 | 3.5 | Human rhinovirus B | 1.4E-07 | 0.47 |
| Subterranean clover stunt virus | 0.0032 | 2.6 | Human rhinovirus 1B | 0.000001 | 3.5 | Human rhinovirus A | 6E-07 | 0.75 |
Top five hits from three microarrays showing typical results from DetectiV. All have been sorted by p value. GSM40806 and GSM40810 have been filtered such that mean ≥ 1.
Incorrect DetectiV result
| Virus | Mean | |
| Human herpesvirus 7 | 8.60E-06 | 1.7 |
| Bovine respiratory syncytial virus | 2.70E-04 | 2 |
| Respiratory syncytial virus | 3.30E-04 | 3.2 |
| Ictalurid herpesvirus 1 | 1.50E-03 | 1.7 |
| Human herpesvirus 6B | 1.50E-03 | 1.8 |
Top five hits from the DetectiV method from array GSM40816. The sample for this array was found to contain respiratory syncytial virus by DFA.
Incorrect E-Predict results
| GSM40809 | GSM40821 | GSM40847 | ||||||
| Virus | Similarity | Virus | Similarity | Virus | Similarity | |||
| Human enterovirus D | 0.000043 | 0.258894 | Orangutan hepadnavirus | 0.002291 | 0.148865 | Human enterovirus B | 0.000014 | 0.386095 |
| Human rhinovirus B | 0.000045 | 0.267815 | Hepatitis B virus | 0.002376 | 0.147182 | Human enterovirus A | 0.000016 | 0.378912 |
| Human enterovirus C | 0.000052 | 0.254504 | Woodchuck hepatitis B virus | 0.002716 | 0.10964 | Human echovirus 1 | 0.000022 | 0.414618 |
| Enterovirus Yanbian 96-83csf | 0.000094 | 0.276873 | Woolly monkey hepatitis B Virus | 0.00284 | 0.128919 | Enterovirus Yanbian 96-83csf | 0.000022 | 0.412299 |
| Human echovirus 1 | 0.000134 | 0.253816 | Arctic ground squirrel hepatitis B virus | 0.003227 | 0.103357 | Human enterovirus D | 0.000026 | 0.296065 |
Top five results from the E-Predict.dist method for arrays GSM40809, GSM40821 and GSM40847. In all cases results are ordered by p value.
DetectiV results for SARS array
| Virus | Mean | |
| SARS | 8.43E-09 | 1.906095 |
| Human herpesvirus 7 | 3.29E-06 | 1.292008 |
| Simian retrovirus 2 | 4.27E-05 | 1.328653 |
| Coliphage alpha3 | 6.08E-05 | 1.113462 |
| Transmissible gastroenteritis virus | 7.88E-05 | 1.463675 |
Top five results from the DetectiV method of analyzing array GSM8528 from GEO accession GSE546. The sample hybridized to the array contained the SARS virus.
Top hit for GSE8746
| Array | RNA | Top hit | Mean | |
| GSM216542 | Amplified RNA from cell cultured FMDV type O | FMDO | 1.51E-25 | 2.296645 |
| GSM217164 | Amplified RNA from cell cultured FMDV type O | FMDO | 1.07E-45 | 3.513068 |
| GSM217167 | Amplified RNA from cell cultured FMDV type O | FMDO | 2.36E-48 | 3.446262 |
| GSM217169 | Amplified RNA from cell cultured FMDV type O | FMDO | 5.91E-30 | 2.827877 |
| GSM217172 | Amplified RNA from cell cultured FMDV type A | FMDA | 6.96E-30 | 3.560941 |
| GSM217175 | Amplified RNA from cell cultured FMDV type A | FMDA | 8.71E-14 | 1.553392 |
| GSM217177 | Amplified RNA from sheep infected with FMDV type O | FMDO | 1.12E-27 | 2.431874 |
| GSM217180 | Amplified RNA from cell cultured FMDV type A | FMDA | 2.97E-33 | 3.609092 |
| GSM217183 | Amplified RNA from cell cultured Avian IBV | IBV | 1.05E-21 | 5.262134 |
| GSM217184 | Amplified RNA from cell cultured Avian IBV | IBV | 3.49E-33 | 7.958662 |
| GSM217186 | Amplified RNA from cell cultured Avian IBV | IBV | 6.20E-33 | 7.827526 |
| GSM217188 | Amplified RNA from cell cultured Avian IBV | IBV | 1.44E-35 | 8.0118 |
The top hit from DetectiV for the 12 arrays from the GSE8746 dataset. DetectiV produces the correct result in all 12 cases.