Literature DB >> 21517792

BM-map: Bayesian mapping of multireads for next-generation sequencing data.

Yuan Ji1, Yanxun Xu, Qiong Zhang, Kam-Wah Tsui, Yuan Yuan, Clift Norris, Shoudan Liang, Han Liang.   

Abstract

Next-generation sequencing (NGS) technology generates millions of short reads, which provide valuable information for various aspects of cellular activities and biological functions. A key step in NGS applications (e.g., RNA-Seq) is to map short reads to correct genomic locations within the source genome. While most reads are mapped to a unique location, a significant proportion of reads align to multiple genomic locations with equal or similar numbers of mismatches; these are called multireads. The ambiguity in mapping the multireads may lead to bias in downstream analyses. Currently, most practitioners discard the multireads in their analysis, resulting in a loss of valuable information, especially for the genes with similar sequences. To refine the read mapping, we develop a Bayesian model that computes the posterior probability of mapping a multiread to each competing location. The probabilities are used for downstream analyses, such as the quantification of gene expression. We show through simulation studies and RNA-Seq analysis of real life data that the Bayesian method yields better mapping than the current leading methods. We provide a C++ program for downloading that is being packaged into a user-friendly software.
© 2011, The International Biometric Society.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21517792      PMCID: PMC3190637          DOI: 10.1111/j.1541-0420.2011.01605.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  19 in total

1.  Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing.

Authors:  Ryan Morin; Matthew Bainbridge; Anthony Fejes; Martin Hirst; Martin Krzywinski; Trevor Pugh; Helen McDonald; Richard Varhol; Steven Jones; Marco Marra
Journal:  Biotechniques       Date:  2008-07       Impact factor: 1.993

2.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

3.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

4.  The transcriptional landscape of the yeast genome defined by RNA sequencing.

Authors:  Ugrappa Nagalakshmi; Zhong Wang; Karl Waern; Chong Shou; Debasish Raha; Mark Gerstein; Michael Snyder
Journal:  Science       Date:  2008-05-01       Impact factor: 47.728

5.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

6.  Stem cell transcriptome profiling via massive-scale mRNA sequencing.

Authors:  Nicole Cloonan; Alistair R R Forrest; Gabriel Kolle; Brooke B A Gardiner; Geoffrey J Faulkner; Mellissa K Brown; Darrin F Taylor; Anita L Steptoe; Shivangi Wani; Graeme Bethel; Alan J Robertson; Andrew C Perkins; Stephen J Bruce; Clarence C Lee; Swati S Ranade; Heather E Peckham; Jonathan M Manning; Kevin J McKernan; Sean M Grimmond
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

7.  BM-BC: a Bayesian method of base calling for Solexa sequence data.

Authors:  Yuan Ji; Riten Mitra; Fernando Quintana; Alejandro Jara; Peter Mueller; Ping Liu; Yue Lu; Shoudan Liang
Journal:  BMC Bioinformatics       Date:  2012-08-24       Impact factor: 3.169

8.  SHRiMP: accurate mapping of short color-space reads.

Authors:  Stephen M Rumble; Phil Lacroute; Adrian V Dalca; Marc Fiume; Arend Sidow; Michael Brudno
Journal:  PLoS Comput Biol       Date:  2009-05-22       Impact factor: 4.475

9.  Using quality scores and longer reads improves accuracy of Solexa read mapping.

Authors:  Andrew D Smith; Zhenyu Xuan; Michael Q Zhang
Journal:  BMC Bioinformatics       Date:  2008-02-28       Impact factor: 3.169

10.  Probabilistic base calling of Solexa sequencing data.

Authors:  Jacques Rougemont; Arnaud Amzallag; Christian Iseli; Laurent Farinelli; Ioannis Xenarios; Felix Naef
Journal:  BMC Bioinformatics       Date:  2008-10-13       Impact factor: 3.169

View more
  15 in total

Review 1.  The role of transposable elements in health and diseases of the central nervous system.

Authors:  Matthew T Reilly; Geoffrey J Faulkner; Joshua Dubnau; Igor Ponomarev; Fred H Gage
Journal:  J Neurosci       Date:  2013-11-06       Impact factor: 6.167

2.  A Bayesian Model for SNP Discovery Based on Next-Generation Sequencing Data.

Authors:  Yanxun Xu; Xiaofeng Zheng; Yuan Yuan; Marcos R Estecio; Jean-Pierre Issa; Yuan Ji; Shoudan Liang
Journal:  IEEE Int Workshop Genomic Signal Process Stat       Date:  2012-12

3.  Ultra-fast local-haplotype variant calling using paired-end DNA-sequencing data reveals somatic mosaicism in tumor and normal blood samples.

Authors:  Subhajit Sengupta; Kamalakar Gulukota; Yitan Zhu; Carole Ober; Katherine Naughton; William Wentworth-Sheilds; Yuan Ji
Journal:  Nucleic Acids Res       Date:  2015-09-29       Impact factor: 16.971

4.  MBD-seq as a cost-effective approach for methylome-wide association studies: demonstration in 1500 case--control samples.

Authors:  Karolina A Aberg; Joseph L McClay; Srilaxmi Nerella; Lin Y Xie; Shaunna L Clark; Alexandra D Hudson; Jozsef Bukszár; Daniel Adkins; Christina M Hultman; Patrick F Sullivan; Patrik K E Magnusson; Edwin J C G van den Oord
Journal:  Epigenomics       Date:  2012-12       Impact factor: 4.778

5.  Next generation quantitative genetics in plants.

Authors:  José M Jiménez-Gómez
Journal:  Front Plant Sci       Date:  2011-11-15       Impact factor: 5.753

6.  Exploring the gonad transcriptome of two extreme male pigs with RNA-seq.

Authors:  Anna Esteve-Codina; Robert Kofler; Nicola Palmieri; Giovanni Bussotti; Cedric Notredame; Miguel Pérez-Enciso
Journal:  BMC Genomics       Date:  2011-11-08       Impact factor: 3.969

7.  Hobbes: optimized gram-based methods for efficient read alignment.

Authors:  Athena Ahmadi; Alexander Behm; Nagesh Honnalli; Chen Li; Lingjie Weng; Xiaohui Xie
Journal:  Nucleic Acids Res       Date:  2011-12-22       Impact factor: 16.971

8.  Transposable elements in TDP-43-mediated neurodegenerative disorders.

Authors:  Wanhe Li; Ying Jin; Lisa Prazak; Molly Hammell; Josh Dubnau
Journal:  PLoS One       Date:  2012-09-05       Impact factor: 3.240

9.  Estimation of CpG coverage in whole methylome next-generation sequencing studies.

Authors:  Edwin J C G van den Oord; Jozsef Bukszar; Gábor Rudolf; Srilaxmi Nerella; Joseph L McClay; Lin Y Xie; Karolina A Aberg
Journal:  BMC Bioinformatics       Date:  2013-02-12       Impact factor: 3.169

10.  BM-Map: an efficient software package for accurately allocating multireads of RNA-sequencing data.

Authors:  Yuan Yuan; Clift Norris; Yanxun Xu; Kam-Wah Tsui; Yuan Ji; Han Liang
Journal:  BMC Genomics       Date:  2012-12-17       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.