| Literature DB >> 15904493 |
Shea N Gardner1, Mark C Wagner.
Abstract
BACKGROUND: Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15904493 PMCID: PMC1156889 DOI: 10.1186/1471-2164-6-73
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Description of the output files created. Examples for the organisms analyzed here can be found at
| list of all the SNPs or PCR-RFLPs found in the input genomes | |
| lists the genome groupings that correspond to each of the character haplotypes | |
| lists the co-segregating SNPs or PCR-RFLPs that characterize each haplotype | |
| for each genome, lists haplotype combinations ("sets") that will discriminate the specified genome to the maximum degree possible | |
| lists combinations of sets of haplotypes to discriminate all the input genomes to the maximum degree possible using the fewest haplotypes. Each row is a unique combination that has the best score found, where the score is the number of haplotypes required. |
Summary of predictions for SNPs
| Organism | Number Sequences | Number Sites | Number Haplotypes | Number Unresolved clusters | Minimum Number Assays |
| Mumps | 17 | 171 | 85 | 11 | 10 |
| SARS | 102 | 218 | 164 | 65 | 114 |
The minimum number of assays indicates the fewest haplotypes that need to be queried to discriminate all the input genomes to the maximum degree possible, that is, the number of assays that would need to be done to classify an unknown isolate in terms of its similarity to the currently sequenced strains using this type of forensic marker.
Summary of predictions for PCR without restriction digest
| Organism | Number Variable Amplicons | No. Combinations of Amplicon X Unique Fragment Length Distributions | No. Haplo-types | No. Unresolved clusters | Min. No. Assays |
| Mumps | 0 | 0 | 0 | 1 | NA |
| SARS | 39 | 178 | 51 | 26 | 31 |
The number combinations of Amplicon X Unique Fragment Length Distributions is the number of unique fragment length distributions summed over all variable amplicons.
Summary of predictions for PCR-RFLP
| Organism | Number Variable Amplicon X Enzyme Combinations | Number Combinations of Amplicon X Enzyme X Unique Fragment Length Distributions | No. Haplo-types | No. Unresolved clusters | Min. No. Assays |
| Mumps | 1,070 | 3,113 | 288 | 13 | 13 |
| SARS | 1,694 | 7,344 | 440 | 75 | 99 |
The number of combinations of Amplicon X Enzyme X Unique Fragment Length Distributions is summed over all variable amplicon, enzyme combinations in the PCR-RFLP analyses.
Figure 2Phylogram of SARS created based on a multiple sequence alignment.
Figure 3Phylogram of SARS created based on all SNP haplotypes.
Figure 4Phylogram of SARS created based on all PCR-RFLP haplotypes.
Figure 5Phylogram of SARS created based on only SNP haplotypes in one optimal solution set (that maximally discriminates each of the input genomes using the fewest total haplotypes).
Enzyme frequency for creating fragment length polymorphisms in SARS and mumps viruses
| Enzyme | SARS | Mumps | Enzyme | SARS | Mumps |
| Tru9I | 335 | 149 | NciI | 54 | 32 |
| Hsp92II | 320 | 98 | EcoRV | 52 | 36 |
| Rsal | 318 | 101 | TthlllI | 52 | 15 |
| DdeI | 317 | 104 | EclHKI | 52 | 11 |
| Alul | 270 | 135 | Csp45I | 51 | 29 |
| Bsp1286I | 227 | 48 | Bbul | 47 | 14 |
| TaqI | 213 | 121 | SphI | 47 | 14 |
| CfoI | 195 | 54 | Bst98I | 44 | 20 |
| HhaI | 195 | 54 | XbaI | 41 | 28 |
| MboI | 190 | 135 | XmnI | 39 | 29 |
| Sau3AI | 190 | 135 | NheI | 39 | 14 |
| HinfI | 179 | 105 | AccB7I | 39 | 8 |
| AccI | 170 | 45 | NcoI | 35 | 8 |
| VspI | 161 | 62 | HpaI | 34 | 8 |
| BstOI | 160 | 77 | EcoRI | 32 | 23 |
| MspAI | 140 | 55 | AvaI | 30 | 32 |
| Styl | 133 | 39 | NruI | 24 | 4 |
| Sau96I | 132 | 74 | BclI | 23 | 33 |
| AvalI | 128 | 57 | HindIII | 22 | 27 |
| SinI | 128 | 57 | Sall | 21 | 15 |
| HincII | 126 | 24 | BssHII | 20 | 4 |
| BsrSI | 119 | 49 | Eco72I | 19 | 6 |
| RvuII | 114 | 40 | Smal | 17 | 12 |
| Dral | 113 | 26 | Xmal | 17 | 12 |
| Pstl | 113 | 13 | XhoI | 13 | 14 |
| XhoII | 112 | 69 | Mlul | 13 | 0 |
| Haell | 112 | 18 | Clal | 9 | 12 |
| HaelIl | 110 | 83 | SnaBI | 8 | 4 |
| Scal | 103 | 16 | Nael | 6 | 8 |
| HpaII | 91 | 82 | NgoMI | 6 | 8 |
| MspI | 91 | 82 | Eco47III | 3 | 18 |
| Sspl | 91 | 33 | Stul | 3 | 17 |
| BanII | 91 | 18 | BamHl | 0 | 24 |
| EcoICRI | 91 | 8 | AcyI | 0 | 18 |
| Spel | 85 | 21 | BalI | 0 | 18 |
| Ndel | 83 | 20 | Hsp92I | 0 | 18 |
| Nsil | 82 | 8 | Bgll | 0 | 12 |
| Acc65I | 74 | 9 | Agel | 0 | 11 |
| KpnI | 74 | 9 | BsaOI | 0 | 9 |
| BstXI | 73 | 6 | SacI | 0 | 6 |
| Bsu36I | 72 | 16 | AatII | 0 | 4 |
| BsrBRI | 65 | 41 | Apal | 0 | 4 |
| BglIl | 64 | 35 | Pvul | 0 | 4 |
| BanI | 62 | 17 | BstZI | 0 | 3 |
| BstEII | 61 | 8 | Eco52I | 0 | 3 |
| Alw441 | 59 | 6 | SacII | 0 | 2 |
Figure 1Combinations, or sets, of haplotypes can uniquely distinguish an individual genome if the intersection of the genomes across that set is uniquely the one genome in question.
Description of the user-specified parameters for SNP and PCR-RFLP analyses
| minimum length of conserved sequence upstream (5') of a SNP | |
| minimum length of conserved sequence downstream (3') of a SNP | |
| maximum amplicon length allowed for PCR-RFLP analysis | |
| the series of sequences input to primer3 for amplicon generation for PCR-RFLP analyses are chosen by sliding a window along the consensus gestalt. Each new window must start at least jump bases from the start of the previous window. | |
| there must be at least one difference in fragment lengths among all input genomes that is at least this long for the given amplicon+enzyme combination to be considered variable enough for further PCR-RFLP examination. | |
| number of restriction enzymes used in an PCR-RFLP analysis before a single electrophoretic determination of fragment length distributions is made. This number may range from 0–3. | |
| maximum number of haplotypes per combination tested in a multi-locus solution set to maximally discriminate all the input sequences. Thus, combinatorics are performed on number haplotypessCnumber_combinations_to_test total combinations. |