
A probabilistic method for the detection and genotyping of small indels from population-scale sequence data.

Vikas Bansal, Ondrej Libiger.

Abstract

MOTIVATION: High-throughput sequencing technologies have made population-scale studies of human genetic variation possible. Accurate and comprehensive detection of DNA sequence variants is crucial for the success of these studies. Small insertions and deletions represent the second most frequent class of variation in the human genome after single nucleotide polymorphisms (SNPs). Although several alignment tools for the gapped alignment of sequence reads to a reference genome are available, computational methods for discriminating indels from sequencing errors and genotyping indels directly from sequence reads are needed.
RESULTS: We describe a probabilistic method for the accurate detection and genotyping of short indels from population-scale sequence data. In this approach, aligned sequence reads from a population of individuals are used to automatically account for context-specific sequencing errors associated with indels. We applied this approach to population sequence datasets from the 1000 Genomes exon pilot project generated using the Roche 454 and Illumina sequencing platforms, and were able to detect a significantly greater number of indels than reported previously. Comparison to indels identified in the 1000 Genomes pilot project demonstrated the sensitivity of our method. The consistency in the number of indels and the fraction of indels whose length is a multiple of three across different human populations and two different sequencing platforms indicated that our method has a low false discovery rate. Finally, the method represents a general approach for the detection and genotyping of small-scale DNA sequence variants for population-scale sequencing projects.
AVAILABILITY: A program implementing this method is available at http://polymorphism.scripps.edu/~vbansal/software/piCALL/
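The consistency check mentioned in RESULTS can be illustrated with a small calculation. In coding sequence, an indel whose length is a multiple of three preserves the reading frame, so the fraction of such indels is a useful summary to compare across populations and platforms. The sketch below is only an illustration under stated assumptions: the function names, the `(group, length)` input layout, and the toy values are hypothetical, and none of it is taken from the piCALL program.

```python
from collections import defaultdict


def frac_multiple_of_three(indel_lengths):
    """Return the fraction of indels whose length is a multiple of three.

    Lengths may be signed (negative for deletions); zero-length entries
    are ignored because they are not indels.
    """
    lengths = [abs(n) for n in indel_lengths if n != 0]
    if not lengths:
        raise ValueError("no indels supplied")
    in_frame = sum(1 for n in lengths if n % 3 == 0)
    return in_frame / len(lengths)


def frac_by_group(calls):
    """Compute the in-frame fraction separately for each group.

    `calls` is an iterable of (group_label, signed_indel_length) pairs;
    a group label could be a population or a sequencing platform.
    """
    by_group = defaultdict(list)
    for group, length in calls:
        by_group[group].append(length)
    return {g: frac_multiple_of_three(v) for g, v in by_group.items()}


if __name__ == "__main__":
    # Hypothetical toy calls, not data from the study.
    toy_calls = [
        ("group_A", 3), ("group_A", -1), ("group_A", -6), ("group_A", 2),
        ("group_B", -3), ("group_B", 1), ("group_B", 9), ("group_B", -2),
    ]
    for group, frac in sorted(frac_by_group(toy_calls).items()):
        print(f"{group}: {frac:.2f}")
```

Similar fractions across groups are what the abstract describes as consistent with a low false discovery rate; a group whose fraction departs sharply from the others would be a candidate for closer inspection.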

Year:  2011        PMID: 21653520      PMCID: PMC3137221          DOI: 10.1093/bioinformatics/btr344

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


