MOTIVATION: Recent studies have revealed the importance of considering quality scores of reads generated by next-generation sequence (NGS) platforms in various downstream analyses. It is also known that probabilistic alignments based on marginal probabilities (e.g. aligned-column and/or gap probabilities) provide more accurate alignment than conventional maximum score-based alignment. There exists, however, no study about probabilistic alignment that considers quality scores explicitly, although the method is expected to be useful in SNP/indel callers and bisulfite mapping, because accurate estimation of aligned columns or gaps is important in those analyses. RESULTS: In this study, we propose methods of probabilistic alignment that consider quality scores of (one of) the sequences as well as a usual score matrix. The method is based on posterior decoding techniques in which various marginal probabilities are computed from a probabilistic model of alignments with quality scores, and can arbitrarily trade-off sensitivity and positive predictive value (PPV) of prediction (aligned columns and gaps). The method is directly applicable to read mapping (alignment) toward accurate detection of SNPs and indels. Several computational experiments indicated that probabilistic alignments can estimate aligned columns and gaps accurately, compared with other mapping algorithms e.g. SHRiMP2, Stampy, BWA and Novoalign. The study also suggested that our approach yields favorable precision for SNP/indel calling.
MOTIVATION: Recent studies have revealed the importance of considering quality scores of reads generated by next-generation sequence (NGS) platforms in various downstream analyses. It is also known that probabilistic alignments based on marginal probabilities (e.g. aligned-column and/or gap probabilities) provide more accurate alignment than conventional maximum score-based alignment. There exists, however, no study about probabilistic alignment that considers quality scores explicitly, although the method is expected to be useful in SNP/indel callers and bisulfite mapping, because accurate estimation of aligned columns or gaps is important in those analyses. RESULTS: In this study, we propose methods of probabilistic alignment that consider quality scores of (one of) the sequences as well as a usual score matrix. The method is based on posterior decoding techniques in which various marginal probabilities are computed from a probabilistic model of alignments with quality scores, and can arbitrarily trade-off sensitivity and positive predictive value (PPV) of prediction (aligned columns and gaps). The method is directly applicable to read mapping (alignment) toward accurate detection of SNPs and indels. Several computational experiments indicated that probabilistic alignments can estimate aligned columns and gaps accurately, compared with other mapping algorithms e.g. SHRiMP2, Stampy, BWA and Novoalign. The study also suggested that our approach yields favorable precision for SNP/indel calling.
Authors: Walter J Sandoval-Espinola; Satya T Makwana; Mari S Chinn; Michael R Thon; M Andrea Azcárate-Peril; José M Bruno-Bárcena Journal: Microbiology (Reading) Date: 2013-09-25 Impact factor: 2.777
Authors: Olivier Lamiable; Franca Ronchese; Johannes U Mayer; Kerry L Hilligan; Jodie S Chandler; David A Eccles; Samuel I Old; Rita G Domingues; Jianping Yang; Greta R Webb; Luis Munoz-Erazo; Evelyn J Hyde; Kirsty A Wakelin; Shiau-Choot Tang; Sally C Chappell; Sventja von Daake; Frank Brombacher; Charles R Mackay; Alan Sher; Roxane Tussiwand; Lisa M Connor; David Gallego-Ortega; Dragana Jankovic; Graham Le Gros; Matthew R Hepworth Journal: Nat Immunol Date: 2021-11-18 Impact factor: 25.606
Authors: Federica Torri; Ivo D Dinov; Alen Zamanyan; Sam Hobel; Alex Genco; Petros Petrosyan; Andrew P Clark; Zhizhong Liu; Paul Eggert; Jonathan Pierce; James A Knowles; Joseph Ames; Carl Kesselman; Arthur W Toga; Steven G Potkin; Marquis P Vawter; Fabio Macciardi Journal: Genes (Basel) Date: 2012-08-30 Impact factor: 4.096
Authors: Kristof De Beuf; Joachim De Schrijver; Olivier Thas; Wim Van Criekinge; Rafael A Irizarry; Lieven Clement Journal: BMC Bioinformatics Date: 2012-11-15 Impact factor: 3.169
Authors: Zhemin Zhou; Inge Lundstrøm; Alicia Tran-Dien; Sebastián Duchêne; Nabil-Fareed Alikhan; Martin J Sergeant; Gemma Langridge; Anna K Fotakis; Satheesh Nair; Hans K Stenøien; Stian S Hamre; Sherwood Casjens; Axel Christophersen; Christopher Quince; Nicholas R Thomson; François-Xavier Weill; Simon Y W Ho; M Thomas P Gilbert; Mark Achtman Journal: Curr Biol Date: 2018-07-19 Impact factor: 10.834