Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 BM-map: Bayesian mapping of multireads for next-generation sequencing data.

Literature DB >> 21517792

BM-map: Bayesian mapping of multireads for next-generation sequencing data.

Yuan Ji¹, Yanxun Xu, Qiong Zhang, Kam-Wah Tsui, Yuan Yuan, Clift Norris, Shoudan Liang, Han Liang.

Abstract

Next-generation sequencing (NGS) technology generates millions of short reads, which provide valuable information for various aspects of cellular activities and biological functions. A key step in NGS applications (e.g., RNA-Seq) is to map short reads to correct genomic locations within the source genome. While most reads are mapped to a unique location, a significant proportion of reads align to multiple genomic locations with equal or similar numbers of mismatches; these are called multireads. The ambiguity in mapping the multireads may lead to bias in downstream analyses. Currently, most practitioners discard the multireads in their analysis, resulting in a loss of valuable information, especially for the genes with similar sequences. To refine the read mapping, we develop a Bayesian model that computes the posterior probability of mapping a multiread to each competing location. The probabilities are used for downstream analyses, such as the quantification of gene expression. We show through simulation studies and RNA-Seq analysis of real life data that the Bayesian method yields better mapping than the current leading methods. We provide a C++ program for downloading that is being packaged into a user-friendly software.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Substances：
RNA

Year: 2011 PMID： 21517792 PMCID： PMC3190637 DOI： 10.1111/j.1541-0420.2011.01605.x

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

19 in total

1. Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing.

Authors: Ryan Morin; Matthew Bainbridge; Anthony Fejes; Martin Hirst; Martin Krzywinski; Trevor Pugh; Helen McDonald; Richard Varhol; Steven Jones; Marco Marra
Journal: Biotechniques Date: 2008-07 Impact factor: 1.993

2. Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors: Heng Li; Jue Ruan; Richard Durbin
Journal: Genome Res Date: 2008-08-19 Impact factor: 9.043

3. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors: John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal: Genome Res Date: 2008-06-11 Impact factor: 9.043

4. The transcriptional landscape of the yeast genome defined by RNA sequencing.

Authors: Ugrappa Nagalakshmi; Zhong Wang; Karl Waern; Chong Shou; Debasish Raha; Mark Gerstein; Michael Snyder
Journal: Science Date: 2008-05-01 Impact factor: 47.728

5. Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors: Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal: Nat Methods Date: 2008-05-30 Impact factor: 28.547

6. Stem cell transcriptome profiling via massive-scale mRNA sequencing.

Authors: Nicole Cloonan; Alistair R R Forrest; Gabriel Kolle; Brooke B A Gardiner; Geoffrey J Faulkner; Mellissa K Brown; Darrin F Taylor; Anita L Steptoe; Shivangi Wani; Graeme Bethel; Alan J Robertson; Andrew C Perkins; Stephen J Bruce; Clarence C Lee; Swati S Ranade; Heather E Peckham; Jonathan M Manning; Kevin J McKernan; Sean M Grimmond
Journal: Nat Methods Date: 2008-05-30 Impact factor: 28.547

7. BM-BC: a Bayesian method of base calling for Solexa sequence data.

Authors: Yuan Ji; Riten Mitra; Fernando Quintana; Alejandro Jara; Peter Mueller; Ping Liu; Yue Lu; Shoudan Liang
Journal: BMC Bioinformatics Date: 2012-08-24 Impact factor: 3.169

8. SHRiMP: accurate mapping of short color-space reads.

Authors: Stephen M Rumble; Phil Lacroute; Adrian V Dalca; Marc Fiume; Arend Sidow; Michael Brudno
Journal: PLoS Comput Biol Date: 2009-05-22 Impact factor: 4.475

9. Using quality scores and longer reads improves accuracy of Solexa read mapping.

Authors: Andrew D Smith; Zhenyu Xuan; Michael Q Zhang
Journal: BMC Bioinformatics Date: 2008-02-28 Impact factor: 3.169

10. Probabilistic base calling of Solexa sequencing data.

Authors: Jacques Rougemont; Arnaud Amzallag; Christian Iseli; Laurent Farinelli; Ioannis Xenarios; Felix Naef
Journal: BMC Bioinformatics Date: 2008-10-13 Impact factor: 3.169

15 in total

Review 1. The role of transposable elements in health and diseases of the central nervous system.

Authors: Matthew T Reilly; Geoffrey J Faulkner; Joshua Dubnau; Igor Ponomarev; Fred H Gage
Journal: J Neurosci Date: 2013-11-06 Impact factor: 6.167

2. A Bayesian Model for SNP Discovery Based on Next-Generation Sequencing Data.

Authors: Yanxun Xu; Xiaofeng Zheng; Yuan Yuan; Marcos R Estecio; Jean-Pierre Issa; Yuan Ji; Shoudan Liang
Journal: IEEE Int Workshop Genomic Signal Process Stat Date: 2012-12

3. Ultra-fast local-haplotype variant calling using paired-end DNA-sequencing data reveals somatic mosaicism in tumor and normal blood samples.

Authors: Subhajit Sengupta; Kamalakar Gulukota; Yitan Zhu; Carole Ober; Katherine Naughton; William Wentworth-Sheilds; Yuan Ji
Journal: Nucleic Acids Res Date: 2015-09-29 Impact factor: 16.971

4. MBD-seq as a cost-effective approach for methylome-wide association studies: demonstration in 1500 case--control samples.

Authors: Karolina A Aberg; Joseph L McClay; Srilaxmi Nerella; Lin Y Xie; Shaunna L Clark; Alexandra D Hudson; Jozsef Bukszár; Daniel Adkins; Christina M Hultman; Patrick F Sullivan; Patrik K E Magnusson; Edwin J C G van den Oord
Journal: Epigenomics Date: 2012-12 Impact factor: 4.778