Literature DB >> 20980555

Dindel: accurate indel calls from short-read data.

Cornelis A Albers1, Gerton Lunter, Daniel G MacArthur, Gilean McVean, Willem H Ouwehand, Richard Durbin.   

Abstract

Small insertions and deletions (indels) are a common and functionally important type of sequence polymorphism. Most of the focus of studies of sequence variation is on single nucleotide variants (SNVs) and large structural variants. In principle, high-throughput sequencing studies should allow identification of indels just as SNVs. However, inference of indels from next-generation sequence data is challenging, and so far methods for identifying indels lag behind methods for calling SNVs in terms of sensitivity and specificity. We propose a Bayesian method to call indels from short-read sequence data in individuals and populations by realigning reads to candidate haplotypes that represent alternative sequence to the reference. The candidate haplotypes are formed by combining candidate indels and SNVs identified by the read mapper, while allowing for known sequence variants or candidates from other methods to be included. In our probabilistic realignment model we account for base-calling errors, mapping errors, and also, importantly, for increased sequencing error indel rates in long homopolymer runs. We show that our method is sensitive and achieves low false discovery rates on simulated and real data sets, although challenges remain. The algorithm is implemented in the program Dindel, which has been used in the 1000 Genomes Project call sets.

Mesh:

Year:  2010        PMID: 20980555      PMCID: PMC3106329          DOI: 10.1101/gr.112326.110

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  29 in total

1.  The ENCODE (ENCyclopedia Of DNA Elements) Project.

Authors: 
Journal:  Science       Date:  2004-10-22       Impact factor: 47.728

2.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

3.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

4.  Rapid whole-genome mutational profiling using next-generation sequencing technologies.

Authors:  Douglas R Smith; Aaron R Quinlan; Heather E Peckham; Kathryn Makowsky; Wei Tao; Betty Woolf; Lei Shen; William F Donahue; Nadeem Tusneem; Michael P Stromberg; Donald A Stewart; Lu Zhang; Swati S Ranade; Jason B Warner; Clarence C Lee; Brittney E Coleman; Zheng Zhang; Stephen F McLaughlin; Joel A Malek; Jon M Sorenson; Alan P Blanchard; Jarrod Chapman; David Hillman; Feng Chen; Daniel S Rokhsar; Kevin J McKernan; Thomas W Jeffries; Gabor T Marth; Paul M Richardson
Journal:  Genome Res       Date:  2008-09-04       Impact factor: 9.043

5.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

6.  An initial map of insertion and deletion (INDEL) variation in the human genome.

Authors:  Ryan E Mills; Christopher T Luttig; Christine E Larkins; Adam Beauchamp; Circe Tsui; W Stephen Pittard; Scott E Devine
Journal:  Genome Res       Date:  2006-08-10       Impact factor: 9.043

7.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

Authors:  Ewan Birney; John A Stamatoyannopoulos; Anindya Dutta; Roderic Guigó; Thomas R Gingeras; Elliott H Margulies; Zhiping Weng; Michael Snyder; Emmanouil T Dermitzakis; Robert E Thurman; Michael S Kuehn; Christopher M Taylor; Shane Neph; Christoph M Koch; Saurabh Asthana; Ankit Malhotra; Ivan Adzhubei; Jason A Greenbaum; Robert M Andrews; Paul Flicek; Patrick J Boyle; Hua Cao; Nigel P Carter; Gayle K Clelland; Sean Davis; Nathan Day; Pawandeep Dhami; Shane C Dillon; Michael O Dorschner; Heike Fiegler; Paul G Giresi; Jeff Goldy; Michael Hawrylycz; Andrew Haydock; Richard Humbert; Keith D James; Brett E Johnson; Ericka M Johnson; Tristan T Frum; Elizabeth R Rosenzweig; Neerja Karnani; Kirsten Lee; Gregory C Lefebvre; Patrick A Navas; Fidencio Neri; Stephen C J Parker; Peter J Sabo; Richard Sandstrom; Anthony Shafer; David Vetrie; Molly Weaver; Sarah Wilcox; Man Yu; Francis S Collins; Job Dekker; Jason D Lieb; Thomas D Tullius; Gregory E Crawford; Shamil Sunyaev; William S Noble; Ian Dunham; France Denoeud; Alexandre Reymond; Philipp Kapranov; Joel Rozowsky; Deyou Zheng; Robert Castelo; Adam Frankish; Jennifer Harrow; Srinka Ghosh; Albin Sandelin; Ivo L Hofacker; Robert Baertsch; Damian Keefe; Sujit Dike; Jill Cheng; Heather A Hirsch; Edward A Sekinger; Julien Lagarde; Josep F Abril; Atif Shahab; Christoph Flamm; Claudia Fried; Jörg Hackermüller; Jana Hertel; Manja Lindemeyer; Kristin Missal; Andrea Tanzer; Stefan Washietl; Jan Korbel; Olof Emanuelsson; Jakob S Pedersen; Nancy Holroyd; Ruth Taylor; David Swarbreck; Nicholas Matthews; Mark C Dickson; Daryl J Thomas; Matthew T Weirauch; James Gilbert; Jorg Drenkow; Ian Bell; XiaoDong Zhao; K G Srinivasan; Wing-Kin Sung; Hong Sain Ooi; Kuo Ping Chiu; Sylvain Foissac; Tyler Alioto; Michael Brent; Lior Pachter; Michael L Tress; Alfonso Valencia; Siew Woh Choo; Chiou Yu Choo; Catherine Ucla; Caroline Manzano; Carine Wyss; Evelyn Cheung; Taane G Clark; James B Brown; Madhavan Ganesh; Sandeep Patel; Hari Tammana; Jacqueline Chrast; Charlotte N Henrichsen; Chikatoshi Kai; Jun Kawai; Ugrappa Nagalakshmi; Jiaqian Wu; Zheng Lian; Jin Lian; Peter Newburger; Xueqing Zhang; Peter Bickel; John S Mattick; Piero Carninci; Yoshihide Hayashizaki; Sherman Weissman; Tim Hubbard; Richard M Myers; Jane Rogers; Peter F Stadler; Todd M Lowe; Chia-Lin Wei; Yijun Ruan; Kevin Struhl; Mark Gerstein; Stylianos E Antonarakis; Yutao Fu; Eric D Green; Ulaş Karaöz; Adam Siepel; James Taylor; Laura A Liefer; Kris A Wetterstrand; Peter J Good; Elise A Feingold; Mark S Guyer; Gregory M Cooper; George Asimenos; Colin N Dewey; Minmei Hou; Sergey Nikolaev; Juan I Montoya-Burgos; Ari Löytynoja; Simon Whelan; Fabio Pardi; Tim Massingham; Haiyan Huang; Nancy R Zhang; Ian Holmes; James C Mullikin; Abel Ureta-Vidal; Benedict Paten; Michael Seringhaus; Deanna Church; Kate Rosenbloom; W James Kent; Eric A Stone; Serafim Batzoglou; Nick Goldman; Ross C Hardison; David Haussler; Webb Miller; Arend Sidow; Nathan D Trinklein; Zhengdong D Zhang; Leah Barrera; Rhona Stuart; David C King; Adam Ameur; Stefan Enroth; Mark C Bieda; Jonghwan Kim; Akshay A Bhinge; Nan Jiang; Jun Liu; Fei Yao; Vinsensius B Vega; Charlie W H Lee; Patrick Ng; Atif Shahab; Annie Yang; Zarmik Moqtaderi; Zhou Zhu; Xiaoqin Xu; Sharon Squazzo; Matthew J Oberley; David Inman; Michael A Singer; Todd A Richmond; Kyle J Munn; Alvaro Rada-Iglesias; Ola Wallerman; Jan Komorowski; Joanna C Fowler; Phillippe Couttet; Alexander W Bruce; Oliver M Dovey; Peter D Ellis; Cordelia F Langford; David A Nix; Ghia Euskirchen; Stephen Hartman; Alexander E Urban; Peter Kraus; Sara Van Calcar; Nate Heintzman; Tae Hoon Kim; Kun Wang; Chunxu Qu; Gary Hon; Rosa Luna; Christopher K Glass; M Geoff Rosenfeld; Shelley Force Aldred; Sara J Cooper; Anason Halees; Jane M Lin; Hennady P Shulha; Xiaoling Zhang; Mousheng Xu; Jaafar N S Haidar; Yong Yu; Yijun Ruan; Vishwanath R Iyer; Roland D Green; Claes Wadelius; Peggy J Farnham; Bing Ren; Rachel A Harte; Angie S Hinrichs; Heather Trumbower; Hiram Clawson; Jennifer Hillman-Jackson; Ann S Zweig; Kayla Smith; Archana Thakkapallayil; Galt Barber; Robert M Kuhn; Donna Karolchik; Lluis Armengol; Christine P Bird; Paul I W de Bakker; Andrew D Kern; Nuria Lopez-Bigas; Joel D Martin; Barbara E Stranger; Abigail Woodroffe; Eugene Davydov; Antigone Dimas; Eduardo Eyras; Ingileif B Hallgrímsdóttir; Julian Huppert; Michael C Zody; Gonçalo R Abecasis; Xavier Estivill; Gerard G Bouffard; Xiaobin Guan; Nancy F Hansen; Jacquelyn R Idol; Valerie V B Maduro; Baishali Maskeri; Jennifer C McDowell; Morgan Park; Pamela J Thomas; Alice C Young; Robert W Blakesley; Donna M Muzny; Erica Sodergren; David A Wheeler; Kim C Worley; Huaiyang Jiang; George M Weinstock; Richard A Gibbs; Tina Graves; Robert Fulton; Elaine R Mardis; Richard K Wilson; Michele Clamp; James Cuff; Sante Gnerre; David B Jaffe; Jean L Chang; Kerstin Lindblad-Toh; Eric S Lander; Maxim Koriabine; Mikhail Nefedov; Kazutoyo Osoegawa; Yuko Yoshinaga; Baoli Zhu; Pieter J de Jong
Journal:  Nature       Date:  2007-06-14       Impact factor: 49.962

8.  A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1.

Authors:  Y Miki; J Swensen; D Shattuck-Eidens; P A Futreal; K Harshman; S Tavtigian; Q Liu; C Cochran; L M Bennett; W Ding
Journal:  Science       Date:  1994-10-07       Impact factor: 47.728

9.  Loss of ACTN3 gene function alters mouse muscle metabolism and shows evidence of positive selection in humans.

Authors:  Daniel G MacArthur; Jane T Seto; Joanna M Raftery; Kate G Quinlan; Gavin A Huttley; Jeff W Hook; Frances A Lemckert; Anthony J Kee; Michael R Edwards; Yemima Berman; Edna C Hardeman; Peter W Gunning; Simon Easteal; Nan Yang; Kathryn N North
Journal:  Nat Genet       Date:  2007-09-09       Impact factor: 38.330

10.  Probabilistic whole-genome alignments reveal high indel rates in the human and mouse genomes.

Authors:  Gerton Lunter
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

View more
  238 in total

1.  Mutations in STX1B, encoding a presynaptic protein, cause fever-associated epilepsy syndromes.

Authors:  Julian Schubert; Aleksandra Siekierska; Mélanie Langlois; Patrick May; Clément Huneau; Felicitas Becker; Hiltrud Muhle; Arvid Suls; Johannes R Lemke; Carolien G F de Kovel; Holger Thiele; Kathryn Konrad; Amit Kawalia; Mohammad R Toliat; Thomas Sander; Franz Rüschendorf; Almuth Caliebe; Inga Nagel; Bernard Kohl; Angela Kecskés; Maxime Jacmin; Katia Hardies; Sarah Weckhuysen; Erik Riesch; Thomas Dorn; Eva H Brilstra; Stephanie Baulac; Rikke S Møller; Helle Hjalgrim; Bobby P C Koeleman; Karin Jurkat-Rott; Frank Lehman-Horn; Jared C Roach; Gustavo Glusman; Leroy Hood; David J Galas; Benoit Martin; Peter A M de Witte; Saskia Biskup; Peter De Jonghe; Ingo Helbig; Rudi Balling; Peter Nürnberg; Alexander D Crawford; Camila V Esguerra; Yvonne G Weber; Holger Lerche
Journal:  Nat Genet       Date:  2014-11-02       Impact factor: 38.330

2.  Polymorphic NumtS trace human population relationships.

Authors:  Martin Lang; Marco Sazzini; Francesco Maria Calabrese; Domenico Simone; Alessio Boattini; Giovanni Romeo; Donata Luiselli; Marcella Attimonelli; Giuseppe Gasparre
Journal:  Hum Genet       Date:  2011-12-08       Impact factor: 4.132

3.  SNP calling using genotype model selection on high-throughput sequencing data.

Authors:  Na You; Gabriel Murillo; Xiaoquan Su; Xiaowei Zeng; Jian Xu; Kang Ning; Shoudong Zhang; Jiankang Zhu; Xinping Cui
Journal:  Bioinformatics       Date:  2012-01-16       Impact factor: 6.937

4.  Heterozygous missense mutations in SMARCA2 cause Nicolaides-Baraitser syndrome.

Authors:  Jeroen K J Van Houdt; Beata Anna Nowakowska; Sérgio B Sousa; Barbera D C van Schaik; Eve Seuntjens; Nelson Avonce; Alejandro Sifrim; Omar A Abdul-Rahman; Marie-José H van den Boogaard; Armand Bottani; Marco Castori; Valérie Cormier-Daire; Matthew A Deardorff; Isabel Filges; Alan Fryer; Jean-Pierre Fryns; Simone Gana; Livia Garavelli; Gabriele Gillessen-Kaesbach; Bryan D Hall; Denise Horn; Danny Huylebroeck; Jakub Klapecki; Malgorzata Krajewska-Walasek; Alma Kuechler; Matthew A Lines; Saskia Maas; Kay D Macdermot; Shane McKee; Alex Magee; Stella A de Man; Yves Moreau; Fanny Morice-Picard; Ewa Obersztyn; Jacek Pilch; Elizabeth Rosser; Nora Shannon; Irene Stolte-Dijkstra; Patrick Van Dijck; Catheline Vilain; Annick Vogels; Emma Wakeling; Dagmar Wieczorek; Louise Wilson; Orsetta Zuffardi; Antoine H C van Kampen; Koenraad Devriendt; Raoul Hennekam; Joris Robert Vermeesch
Journal:  Nat Genet       Date:  2012-02-26       Impact factor: 38.330

5.  SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples.

Authors:  Si Quang Le; Richard Durbin
Journal:  Genome Res       Date:  2010-10-27       Impact factor: 9.043

6.  MultiGeMS: detection of SNVs from multiple samples using model selection on high-throughput sequencing data.

Authors:  Gabriel H Murillo; Na You; Xiaoquan Su; Wei Cui; Muredach P Reilly; Mingyao Li; Kang Ning; Xinping Cui
Journal:  Bioinformatics       Date:  2016-01-18       Impact factor: 6.937

7.  A Hybrid Drug Limits Resistance by Evading the Action of the Multiple Antibiotic Resistance Pathway.

Authors:  Kathy K Wang; Laura K Stone; Tami D Lieberman; Michal Shavit; Timor Baasov; Roy Kishony
Journal:  Mol Biol Evol       Date:  2015-11-03       Impact factor: 16.240

8.  16GT: a fast and sensitive variant caller using a 16-genotype probabilistic model.

Authors:  Ruibang Luo; Michael C Schatz; Steven L Salzberg
Journal:  Gigascience       Date:  2017-07-01       Impact factor: 6.524

9.  A recurrent inactivating mutation in RHOA GTPase in angioimmunoblastic T cell lymphoma.

Authors:  Hae Yong Yoo; Min Kyung Sung; Seung Ho Lee; Sangok Kim; Haeseung Lee; Seongjin Park; Sang Cheol Kim; Byungwook Lee; Kyoohyoung Rho; Jong-Eun Lee; Kwang-Hwi Cho; Wankyu Kim; Hyunjung Ju; Jaesang Kim; Seok Jin Kim; Won Seog Kim; Sanghyuk Lee; Young Hyeh Ko
Journal:  Nat Genet       Date:  2014-03-02       Impact factor: 38.330

Review 10.  Clinical analysis and interpretation of cancer genome data.

Authors:  Eliezer M Van Allen; Nikhil Wagle; Mia A Levy
Journal:  J Clin Oncol       Date:  2013-04-15       Impact factor: 44.544

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.