| Literature DB >> 19667082 |
Jill L Wegrzyn1, Jennifer M Lee, John Liechty, David B Neale.
Abstract
UNLABELLED: The Pine Alignment and SNP Identification Pipeline (PineSAP) provides a high-throughput solution to single nucleotide polymorphism (SNP) prediction using multiple sequence alignments from re-sequencing data. This pipeline integrates a hybrid of customized scripting, existing utilities and machine learning in order to increase the speed and accuracy of SNP calls. The implementation of this pipeline results in significantly improved multiple sequence alignments and SNP identifications when compared with existing solutions. The use of machine learning in the SNP identifications extends the pipeline's application to any eukaryotic species where full genome sequence information is unavailable. AVAILABILITY: All code used for this pipeline is freely available at the Dendrome project website (http://dendrome.ucdavis.edu/adept2/resequencing.html)Entities:
Mesh:
Year: 2009 PMID: 19667082 PMCID: PMC2752621 DOI: 10.1093/bioinformatics/btp477
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Results of SNP prediction on the test sequence data
| Evaluation | J48 | Polyphred | Polybayes |
|---|---|---|---|
| Accuracy | 93.6 | 76.25 | 78.02 |
| Sensitivity | 88.21 | 83.22 | 86.54 |
| Specificity | 98.73 | N/A | N/A |