Literature DB >> 25621316

Read-mapping using personalized diploid reference genome for RNA sequencing data reduced bias for detecting allele-specific expression.

Shuai Yuan1, Zhaohui Qin2.   

Abstract

Next generation sequencing (NGS) technologies have been applied extensively in many areas of genetics and genomics research. A fundamental problem when comes to analyzing NGS data is mapping short sequencing reads back to the reference genome. Most of existing software packages rely on a single uniform reference genome and do not automatically take into the consideration of genetic variants. On the other hand, large proportions of incorrectly mapped reads affect the correct interpretation of the NGS experimental results. As an example, Degner et al. showed that detecting allele-specific expression from RNA sequencing data was biased toward the reference allele. In this study, we developed a method that utilize DirectX 11 enabled graphics processing unit (GPU)'s parallel computing power to produces a personalized diploid reference genome based on all known genetic variants of that particular individual. We show that using such a personalized diploid reference genome can improve mapping accuracy and significantly reduce the bias toward reference allele in allele-specific expression analysis. Our method can be applied to any individual that has genotype information obtained either from array-based genotyping or resequencing. Besides the reference genome, no additional changes to alignment algorithm are needed for performing read mapping therefore one can utilize any of the existing read mapping tools and achieve the improved read mapping result. C++ and GPU compute shader source code of the software program is available at: http://code.google.com/p/diploid-mapping/downloads/list.

Entities:  

Keywords:  Allele specific expression; GPU programming; RNA-sequencing; read mapping; reference genome; single nucleotide polymorphism

Year:  2012        PMID: 25621316      PMCID: PMC4304670          DOI: 10.1109/BIBMW.2012.6470225

Source DB:  PubMed          Journal:  IEEE Int Conf Bioinform Biomed Workshops


  22 in total

1.  Allelic variation in human gene expression.

Authors:  Hai Yan; Weishi Yuan; Victor E Velculescu; Bert Vogelstein; Kenneth W Kinzler
Journal:  Science       Date:  2002-08-16       Impact factor: 47.728

2.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

3.  Allele-specific gene expression patterns in primary leukemic cells reveal regulation of gene expression by CpG site methylation.

Authors:  Lili Milani; Anders Lundmark; Jessica Nordlund; Anna Kiialainen; Trond Flaegstad; Gudmundur Jonmundsson; Jukka Kanerva; Kjeld Schmiegelow; Kevin L Gunderson; Gudmar Lönnerholm; Ann-Christine Syvänen
Journal:  Genome Res       Date:  2008-11-07       Impact factor: 9.043

4.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

5.  Simultaneous genotyping, gene-expression measurement, and detection of allele-specific expression with oligonucleotide arrays.

Authors:  James Ronald; Joshua M Akey; Jacqueline Whittle; Erin N Smith; Gael Yvert; Leonid Kruglyak
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

6.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

Review 7.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

8.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.

Authors:  Jacob F Degner; John C Marioni; Athma A Pai; Joseph K Pickrell; Everlyne Nkadori; Yoav Gilad; Jonathan K Pritchard
Journal:  Bioinformatics       Date:  2009-10-06       Impact factor: 6.937

9.  A new strategy to reduce allelic bias in RNA-Seq readmapping.

Authors:  Ravi Vijaya Satya; Nela Zavaljevski; Jaques Reifman
Journal:  Nucleic Acids Res       Date:  2012-05-14       Impact factor: 16.971

10.  Differential allelic expression in the human genome: a robust approach to identify genetic and epigenetic cis-acting mechanisms regulating gene expression.

Authors:  David Serre; Scott Gurd; Bing Ge; Robert Sladek; Donna Sinnett; Eef Harmsen; Marina Bibikova; Eugene Chudin; David L Barker; Todd Dickinson; Jian-Bing Fan; Thomas J Hudson
Journal:  PLoS Genet       Date:  2008-02-29       Impact factor: 5.917

View more
  8 in total

1.  Buffering of Genetic Regulatory Networks in Drosophila melanogaster.

Authors:  Justin M Fear; Luis G León-Novelo; Alison M Morse; Alison R Gerken; Kjong Van Lehmann; John Tower; Sergey V Nuzhdin; Lauren M McIntyre
Journal:  Genetics       Date:  2016-05-18       Impact factor: 4.562

2.  Variation graph toolkit improves read mapping by representing genetic variation in the reference.

Authors:  Erik Garrison; Jouni Sirén; Adam M Novak; Glenn Hickey; Jordan M Eizenga; Eric T Dawson; William Jones; Shilpa Garg; Charles Markello; Michael F Lin; Benedict Paten; Richard Durbin
Journal:  Nat Biotechnol       Date:  2018-08-20       Impact factor: 54.908

3.  Is it time to change the reference genome?

Authors:  Sara Ballouz; Alexander Dobin; Jesse A Gillis
Journal:  Genome Biol       Date:  2019-08-09       Impact factor: 13.583

4.  CROSSMAPPER: estimating cross-mapping rates and optimizing experimental design in multi-species sequencing studies.

Authors:  Hrant Hovhannisyan; Ahmed Hafez; Carlos Llorens; Toni Gabaldón
Journal:  Bioinformatics       Date:  2020-02-01       Impact factor: 6.937

5.  The value of genotype-specific reference for transcriptome analyses in barley.

Authors:  Wenbin Guo; Max Coulter; Robbie Waugh; Runxuan Zhang
Journal:  Life Sci Alliance       Date:  2022-04-22

6.  One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.

Authors:  Shuai Yuan; H Richard Johnston; Guosheng Zhang; Yun Li; Yi-Juan Hu; Zhaohui S Qin
Journal:  PLoS Comput Biol       Date:  2015-08-12       Impact factor: 4.475

7.  The genomic basis of environmental adaptation in house mice.

Authors:  Megan Phifer-Rixey; Ke Bi; Kathleen G Ferris; Michael J Sheehan; Dana Lin; Katya L Mack; Sara M Keeble; Taichi A Suzuki; Jeffrey M Good; Michael W Nachman
Journal:  PLoS Genet       Date:  2018-09-24       Impact factor: 5.917

8.  Reference flow: reducing reference bias using multiple population genomes.

Authors:  Nae-Chyun Chen; Brad Solomon; Taher Mun; Sheila Iyer; Ben Langmead
Journal:  Genome Biol       Date:  2021-01-04       Impact factor: 13.583

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.