Literature DB >> 24144111

eALPS: estimating abundance levels in pooled sequencing using available genotyping data.

Itamar Eskin1, Farhad Hormozdiari, Lucia Conde, Jacques Riby, Christine F Skibola, Eleazar Eskin, Eran Halperin.   

Abstract

The recent advances in high-throughput sequencing technologies bring the potential of a better characterization of the genetic variation in humans and other organisms. In many occasions, either by design or by necessity, the sequencing procedure is performed on a pool of DNA samples with different abundances, where the abundance of each sample is unknown. Such a scenario is naturally occurring in the case of metagenomics analysis where a pool of bacteria is sequenced, or in the case of population studies involving DNA pools by design. Particularly, various pooling designs were recently suggested that can identify carriers of rare alleles in large cohorts, dramatically reducing the cost of such large-scale sequencing projects. A fundamental problem with such approaches for population studies is that the uncertainty of DNA proportions from different individuals in the pools might lead to spurious associations. Fortunately, it is often the case that the genotype data of at least some of the individuals in the pool is known. Here, we propose a method (eALPS) that uses the genotype data in conjunction with the pooled sequence data in order to accurately estimate the proportions of the samples in the pool, even in cases where not all individuals in the pool were genotyped (eALPS-LD). Using real data from a sequencing pooling study of non-Hodgkin's lymphoma, we demonstrate that the estimation of the proportions is crucial, since otherwise there is a risk for false discoveries. Additionally, we demonstrate that our approach is also applicable to the problem of quantification of species in metagenomics samples (eALPS-BCR) and is particularly suitable for metagenomic quantification of closely related species.

Entities:  

Mesh:

Year:  2013        PMID: 24144111      PMCID: PMC4013753          DOI: 10.1089/cmb.2013.0105

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  32 in total

1.  Pooled association tests for rare variants in exon-resequencing studies.

Authors:  Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2010-05-13       Impact factor: 11.025

2.  Gut flora in health and disease: potential role of probiotics.

Authors:  M Heselmans; G Reid; L M A Akkermans; H Savelkoul; H Timmerman; F M Rombouts
Journal:  Curr Issues Intest Microbiol       Date:  2005-03

3.  Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays.

Authors:  Hajime Matsuzaki; Shoulian Dong; Halina Loi; Xiaojun Di; Guoying Liu; Earl Hubbell; Jane Law; Tam Berntsen; Monica Chadha; Henry Hui; Geoffrey Yang; Giulia C Kennedy; Teresa A Webster; Simon Cawley; P Sean Walsh; Keith W Jones; Stephen P A Fodor; Rui Mei
Journal:  Nat Methods       Date:  2004-11       Impact factor: 28.547

4.  Multilocus association mapping using variable-length Markov chains.

Authors:  Sharon R Browning
Journal:  Am J Hum Genet       Date:  2006-04-07       Impact factor: 11.025

5.  Molecular analysis of human forearm superficial skin bacterial biota.

Authors:  Zhan Gao; Chi-hong Tseng; Zhiheng Pei; Martin J Blaser
Journal:  Proc Natl Acad Sci U S A       Date:  2007-02-09       Impact factor: 11.205

6.  Overlapping pools for high-throughput targeted resequencing.

Authors:  Snehit Prabhu; Itsik Pe'er
Journal:  Genome Res       Date:  2009-05-15       Impact factor: 9.043

7.  Common sequence variants on 20q11.22 confer melanoma susceptibility.

Authors:  Kevin M Brown; Stuart Macgregor; Grant W Montgomery; David W Craig; Zhen Zhen Zhao; Kelly Iyadurai; Anjali K Henders; Nils Homer; Megan J Campbell; Mitchell Stark; Shane Thomas; Helen Schmid; Elizabeth A Holland; Elizabeth M Gillanders; David L Duffy; Judith A Maskiell; Jodie Jetann; Megan Ferguson; Dietrich A Stephan; Anne E Cust; David Whiteman; Adele Green; Håkan Olsson; Susana Puig; Paola Ghiorzo; Johan Hansson; Florence Demenais; Alisa M Goldstein; Nelleke A Gruis; David E Elder; Julia Newton Bishop; Richard F Kefford; Graham G Giles; Bruce K Armstrong; Joanne F Aitken; John L Hopper; Nicholas G Martin; Jeffrey M Trent; Graham J Mann; Nicholas K Hayward
Journal:  Nat Genet       Date:  2008-05-18       Impact factor: 38.330

8.  Genome-wide association study of follicular lymphoma identifies a risk locus at 6p21.32.

Authors:  Lucia Conde; Eran Halperin; Nicholas K Akers; Kevin M Brown; Karin E Smedby; Nathaniel Rothman; Alexandra Nieters; Susan L Slager; Angela Brooks-Wilson; Luz Agana; Jacques Riby; Jianjun Liu; Hans-Olov Adami; Hatef Darabi; Henrik Hjalgrim; Hui-Qi Low; Keith Humphreys; Mads Melbye; Ellen T Chang; Bengt Glimelius; Wendy Cozen; Scott Davis; Patricia Hartge; Lindsay M Morton; Maryjean Schenk; Sophia S Wang; Bruce Armstrong; Anne Kricker; Sam Milliken; Mark P Purdue; Claire M Vajdic; Peter Boyle; Qing Lan; Shelia H Zahm; Yawei Zhang; Tongzhang Zheng; Nikolaus Becker; Yolanda Benavente; Paolo Boffetta; Paul Brennan; Katja Butterbach; Pierluigi Cocco; Lenka Foretova; Marc Maynadié; Silvia de Sanjosé; Anthony Staines; John J Spinelli; Sara J Achenbach; Timothy G Call; Nicola J Camp; Martha Glenn; Neil E Caporaso; James R Cerhan; Julie M Cunningham; Lynn R Goldin; Curtis A Hanson; Neil E Kay; Mark C Lanasa; Jose F Leis; Gerald E Marti; Kari G Rabe; Laura Z Rassenti; Logan G Spector; Sara S Strom; Celine M Vachon; J Brice Weinberg; Elizabeth A Holly; Stephen Chanock; Martyn T Smith; Paige M Bracci; Christine F Skibola
Journal:  Nat Genet       Date:  2010-07-18       Impact factor: 38.330

9.  On optimal pooling designs to identify rare variants through massive resequencing.

Authors:  Joon Sang Lee; Murim Choi; Xiting Yan; Richard P Lifton; Hongyu Zhao
Journal:  Genet Epidemiol       Date:  2011-01-19       Impact factor: 2.135

10.  Accurate genome relative abundance estimation based on shotgun metagenomic reads.

Authors:  Li C Xia; Jacob A Cram; Ting Chen; Jed A Fuhrman; Fengzhu Sun
Journal:  PLoS One       Date:  2011-12-06       Impact factor: 3.240

View more
  3 in total

1.  Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA.

Authors:  Guido H Jajamovich; Alexandros Iliadis; Dimitris Anastassiou; Xiaodong Wang
Journal:  BMC Bioinformatics       Date:  2013-09-08       Impact factor: 3.169

2.  Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.

Authors:  Vikas Bansal; Ondrej Libiger
Journal:  BMC Bioinformatics       Date:  2015-01-16       Impact factor: 3.169

Review 3.  Sequencing pools of individuals - mining genome-wide polymorphism data without big funding.

Authors:  Christian Schlötterer; Raymond Tobler; Robert Kofler; Viola Nolte
Journal:  Nat Rev Genet       Date:  2014-09-23       Impact factor: 53.242

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.