Literature DB >> 35410384

Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes.

Jana Ebler1, Peter Ebert1, Wayne E Clarke2, Tobias Rausch3,4, Peter A Audano5, Torsten Houwaart6, Yafei Mao5, Jan O Korbel3, Evan E Eichler5,7, Michael C Zody2, Alexander T Dilthey6,8,9, Tobias Marschall10.   

Abstract

Typical genotyping workflows map reads to a reference genome before identifying genetic variants. Generating such alignments introduces reference biases and comes with substantial computational burden. Furthermore, short-read lengths limit the ability to characterize repetitive genomic regions, which are particularly challenging for fast k-mer-based genotypers. In the present study, we propose a new algorithm, PanGenie, that leverages a haplotype-resolved pangenome reference together with k-mer counts from short-read sequencing data to genotype a wide spectrum of genetic variation-a process we refer to as genome inference. Compared with mapping-based approaches, PanGenie is more than 4 times faster at 30-fold coverage and achieves better genotype concordances for almost all variant types and coverages tested. Improvements are especially pronounced for large insertions (≥50 bp) and variants in repetitive regions, enabling the inclusion of these classes of variants in genome-wide association studies. PanGenie efficiently leverages the increasing amount of haplotype-resolved assemblies to unravel the functional impact of previously inaccessible variants while being faster compared with alignment-based workflows.
© 2022. The Author(s).

Entities:  

Mesh:

Year:  2022        PMID: 35410384      PMCID: PMC9005351          DOI: 10.1038/s41588-022-01043-w

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


  56 in total

1.  Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism.

Authors:  Stephan J Sanders; A Gulhan Ercan-Sencicek; Vanessa Hus; Rui Luo; Michael T Murtha; Daniel Moreno-De-Luca; Su H Chu; Michael P Moreau; Abha R Gupta; Susanne A Thomson; Christopher E Mason; Kaya Bilguvar; Patricia B S Celestino-Soper; Murim Choi; Emily L Crawford; Lea Davis; Nicole R Davis Wright; Rahul M Dhodapkar; Michael DiCola; Nicholas M DiLullo; Thomas V Fernandez; Vikram Fielding-Singh; Daniel O Fishman; Stephanie Frahm; Rouben Garagaloyan; Gerald S Goh; Sindhuja Kammela; Lambertus Klei; Jennifer K Lowe; Sabata C Lund; Anna D McGrew; Kyle A Meyer; William J Moffat; John D Murdoch; Brian J O'Roak; Gordon T Ober; Rebecca S Pottenger; Melanie J Raubeson; Youeun Song; Qi Wang; Brian L Yaspan; Timothy W Yu; Ilana R Yurkiewicz; Arthur L Beaudet; Rita M Cantor; Martin Curland; Dorothy E Grice; Murat Günel; Richard P Lifton; Shrikant M Mane; Donna M Martin; Chad A Shaw; Michael Sheldon; Jay A Tischfield; Christopher A Walsh; Eric M Morrow; David H Ledbetter; Eric Fombonne; Catherine Lord; Christa Lese Martin; Andrew I Brooks; James S Sutcliffe; Edwin H Cook; Daniel Geschwind; Kathryn Roeder; Bernie Devlin; Matthew W State
Journal:  Neuron       Date:  2011-06-09       Impact factor: 17.173

2.  High frequencies of de novo CNVs in bipolar disorder and schizophrenia.

Authors:  Dheeraj Malhotra; Shane McCarthy; Jacob J Michaelson; Vladimir Vacic; Katherine E Burdick; Seungtai Yoon; Sven Cichon; Aiden Corvin; Sydney Gary; Elliot S Gershon; Michael Gill; Maria Karayiorgou; John R Kelsoe; Olga Krastoshevsky; Verena Krause; Ellen Leibenluft; Deborah L Levy; Vladimir Makarov; Abhishek Bhandari; Anil K Malhotra; Francis J McMahon; Markus M Nöthen; James B Potash; Marcella Rietschel; Thomas G Schulze; Jonathan Sebat
Journal:  Neuron       Date:  2011-12-22       Impact factor: 17.173

3.  Rare chromosomal deletions and duplications in attention-deficit hyperactivity disorder: a genome-wide analysis.

Authors:  Nigel M Williams; Irina Zaharieva; Andrew Martin; Kate Langley; Kiran Mantripragada; Ragnheidur Fossdal; Hreinn Stefansson; Kari Stefansson; Pall Magnusson; Olafur O Gudmundsson; Omar Gustafsson; Peter Holmans; Michael J Owen; Michael O'Donovan; Anita Thapar
Journal:  Lancet       Date:  2010-09-29       Impact factor: 79.321

4.  Rare structural variants disrupt multiple genes in neurodevelopmental pathways in schizophrenia.

Authors:  Tom Walsh; Jon M McClellan; Shane E McCarthy; Anjené M Addington; Sarah B Pierce; Greg M Cooper; Alex S Nord; Mary Kusenda; Dheeraj Malhotra; Abhishek Bhandari; Sunday M Stray; Caitlin F Rippey; Patricia Roccanova; Vlad Makarov; B Lakshmi; Robert L Findling; Linmarie Sikich; Thomas Stromberg; Barry Merriman; Nitin Gogtay; Philip Butler; Kristen Eckstrand; Laila Noory; Peter Gochman; Robert Long; Zugen Chen; Sean Davis; Carl Baker; Evan E Eichler; Paul S Meltzer; Stanley F Nelson; Andrew B Singleton; Ming K Lee; Judith L Rapoport; Mary-Claire King; Jonathan Sebat
Journal:  Science       Date:  2008-03-27       Impact factor: 47.728

5.  Strong association of de novo copy number mutations with autism.

Authors:  Jonathan Sebat; B Lakshmi; Dheeraj Malhotra; Jennifer Troge; Christa Lese-Martin; Tom Walsh; Boris Yamrom; Seungtai Yoon; Alex Krasnitz; Jude Kendall; Anthony Leotta; Deepa Pai; Ray Zhang; Yoon-Ha Lee; James Hicks; Sarah J Spence; Annette T Lee; Kaija Puura; Terho Lehtimäki; David Ledbetter; Peter K Gregersen; Joel Bregman; James S Sutcliffe; Vaidehi Jobanputra; Wendy Chung; Dorothy Warburton; Mary-Claire King; David Skuse; Daniel H Geschwind; T Conrad Gilliam; Kenny Ye; Michael Wigler
Journal:  Science       Date:  2007-03-15       Impact factor: 47.728

6.  Haplotype-resolved diverse human genomes and integrated analysis of structural variation.

Authors:  Peter Ebert; Peter A Audano; Qihui Zhu; Bernardo Rodriguez-Martin; Charles Lee; Jan O Korbel; Tobias Marschall; Evan E Eichler; David Porubsky; Marc Jan Bonder; Arvis Sulovari; Jana Ebler; Weichen Zhou; Rebecca Serra Mari; Feyza Yilmaz; Xuefang Zhao; PingHsun Hsieh; Joyce Lee; Sushant Kumar; Jiadong Lin; Tobias Rausch; Yu Chen; Jingwen Ren; Martin Santamarina; Wolfram Höps; Hufsah Ashraf; Nelson T Chuang; Xiaofei Yang; Katherine M Munson; Alexandra P Lewis; Susan Fairley; Luke J Tallon; Wayne E Clarke; Anna O Basile; Marta Byrska-Bishop; André Corvelo; Uday S Evani; Tsung-Yu Lu; Mark J P Chaisson; Junjie Chen; Chong Li; Harrison Brand; Aaron M Wenger; Maryam Ghareghani; William T Harvey; Benjamin Raeder; Patrick Hasenfeld; Allison A Regier; Haley J Abel; Ira M Hall; Paul Flicek; Oliver Stegle; Mark B Gerstein; Jose M C Tubio; Zepeng Mu; Yang I Li; Xinghua Shi; Alex R Hastie; Kai Ye; Zechen Chong; Ashley D Sanders; Michael C Zody; Michael E Talkowski; Ryan E Mills; Scott E Devine
Journal:  Science       Date:  2021-02-25       Impact factor: 47.728

7.  An integrated map of structural variation in 2,504 human genomes.

Authors:  Peter H Sudmant; Tobias Rausch; Eugene J Gardner; Robert E Handsaker; Alexej Abyzov; John Huddleston; Yan Zhang; Kai Ye; Goo Jun; Markus Hsi-Yang Fritz; Miriam K Konkel; Ankit Malhotra; Adrian M Stütz; Xinghua Shi; Francesco Paolo Casale; Jieming Chen; Fereydoun Hormozdiari; Gargi Dayama; Ken Chen; Maika Malig; Mark J P Chaisson; Klaudia Walter; Sascha Meiers; Seva Kashin; Erik Garrison; Adam Auton; Hugo Y K Lam; Xinmeng Jasmine Mu; Can Alkan; Danny Antaki; Taejeong Bae; Eliza Cerveira; Peter Chines; Zechen Chong; Laura Clarke; Elif Dal; Li Ding; Sarah Emery; Xian Fan; Madhusudan Gujral; Fatma Kahveci; Jeffrey M Kidd; Yu Kong; Eric-Wubbo Lameijer; Shane McCarthy; Paul Flicek; Richard A Gibbs; Gabor Marth; Christopher E Mason; Androniki Menelaou; Donna M Muzny; Bradley J Nelson; Amina Noor; Nicholas F Parrish; Matthew Pendleton; Andrew Quitadamo; Benjamin Raeder; Eric E Schadt; Mallory Romanovitch; Andreas Schlattl; Robert Sebra; Andrey A Shabalin; Andreas Untergasser; Jerilyn A Walker; Min Wang; Fuli Yu; Chengsheng Zhang; Jing Zhang; Xiangqun Zheng-Bradley; Wanding Zhou; Thomas Zichner; Jonathan Sebat; Mark A Batzer; Steven A McCarroll; Ryan E Mills; Mark B Gerstein; Ali Bashir; Oliver Stegle; Scott E Devine; Charles Lee; Evan E Eichler; Jan O Korbel
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Multi-platform discovery of haplotype-resolved structural variation in human genomes.

Authors:  Mark J P Chaisson; Ashley D Sanders; Xuefang Zhao; Ankit Malhotra; David Porubsky; Tobias Rausch; Eugene J Gardner; Oscar L Rodriguez; Li Guo; Ryan L Collins; Xian Fan; Jia Wen; Robert E Handsaker; Susan Fairley; Zev N Kronenberg; Xiangmeng Kong; Fereydoun Hormozdiari; Dillon Lee; Aaron M Wenger; Alex R Hastie; Danny Antaki; Thomas Anantharaman; Peter A Audano; Harrison Brand; Stuart Cantsilieris; Han Cao; Eliza Cerveira; Chong Chen; Xintong Chen; Chen-Shan Chin; Zechen Chong; Nelson T Chuang; Christine C Lambert; Deanna M Church; Laura Clarke; Andrew Farrell; Joey Flores; Timur Galeev; David U Gorkin; Madhusudan Gujral; Victor Guryev; William Haynes Heaton; Jonas Korlach; Sushant Kumar; Jee Young Kwon; Ernest T Lam; Jong Eun Lee; Joyce Lee; Wan-Ping Lee; Sau Peng Lee; Shantao Li; Patrick Marks; Karine Viaud-Martinez; Sascha Meiers; Katherine M Munson; Fabio C P Navarro; Bradley J Nelson; Conor Nodzak; Amina Noor; Sofia Kyriazopoulou-Panagiotopoulou; Andy W C Pang; Yunjiang Qiu; Gabriel Rosanio; Mallory Ryan; Adrian Stütz; Diana C J Spierings; Alistair Ward; AnneMarie E Welch; Ming Xiao; Wei Xu; Chengsheng Zhang; Qihui Zhu; Xiangqun Zheng-Bradley; Ernesto Lowy; Sergei Yakneen; Steven McCarroll; Goo Jun; Li Ding; Chong Lek Koh; Bing Ren; Paul Flicek; Ken Chen; Mark B Gerstein; Pui-Yan Kwok; Peter M Lansdorp; Gabor T Marth; Jonathan Sebat; Xinghua Shi; Ali Bashir; Kai Ye; Scott E Devine; Michael E Talkowski; Ryan E Mills; Tobias Marschall; Jan O Korbel; Evan E Eichler; Charles Lee
Journal:  Nat Commun       Date:  2019-04-16       Impact factor: 17.694

9.  DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors:  Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

10.  A structural variation reference for medical and population genetics.

Authors:  Ryan L Collins; Harrison Brand; Konrad J Karczewski; Xuefang Zhao; Jessica Alföldi; Laurent C Francioli; Amit V Khera; Chelsea Lowther; Laura D Gauthier; Harold Wang; Nicholas A Watts; Matthew Solomonson; Anne O'Donnell-Luria; Alexander Baumann; Ruchi Munshi; Mark Walker; Christopher W Whelan; Yongqing Huang; Ted Brookings; Ted Sharpe; Matthew R Stone; Elise Valkanas; Jack Fu; Grace Tiao; Kristen M Laricchia; Valentin Ruano-Rubio; Christine Stevens; Namrata Gupta; Caroline Cusick; Lauren Margolin; Kent D Taylor; Henry J Lin; Stephen S Rich; Wendy S Post; Yii-Der Ida Chen; Jerome I Rotter; Chad Nusbaum; Anthony Philippakis; Eric Lander; Stacey Gabriel; Benjamin M Neale; Sekar Kathiresan; Mark J Daly; Eric Banks; Daniel G MacArthur; Michael E Talkowski
Journal:  Nature       Date:  2020-05-27       Impact factor: 69.504

View more
  6 in total

1.  Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies.

Authors:  Alexander S Leonard; Danang Crysnanto; Zih-Hua Fang; Michael P Heaton; Brian L Vander Ley; Carolina Herrera; Heinrich Bollwein; Derek M Bickhart; Kristen L Kuhn; Timothy P L Smith; Benjamin D Rosen; Hubert Pausch
Journal:  Nat Commun       Date:  2022-05-31       Impact factor: 17.694

2.  Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders.

Authors:  David Porubsky; Wolfram Höps; Hufsah Ashraf; PingHsun Hsieh; Bernardo Rodriguez-Martin; Feyza Yilmaz; Jana Ebler; Pille Hallast; Flavia Angela Maria Maggiolini; William T Harvey; Barbara Henning; Peter A Audano; David S Gordon; Peter Ebert; Patrick Hasenfeld; Eva Benito; Qihui Zhu; Charles Lee; Francesca Antonacci; Matthias Steinrücken; Christine R Beck; Ashley D Sanders; Tobias Marschall; Evan E Eichler; Jan O Korbel
Journal:  Cell       Date:  2022-05-06       Impact factor: 66.850

3.  Expanding the conservation genomics toolbox: Incorporating structural variants to enhance genomic studies for species of conservation concern.

Authors:  Jana Wold; Klaus-Peter Koepfli; Stephanie J Galla; David Eccles; Carolyn J Hogg; Marissa F Le Lec; Joseph Guhlin; Anna W Santure; Tammy E Steeves
Journal:  Mol Ecol       Date:  2021-09-12       Impact factor: 6.622

4.  High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.

Authors:  Marta Byrska-Bishop; Uday S Evani; Xuefang Zhao; Anna O Basile; Haley J Abel; Allison A Regier; André Corvelo; Wayne E Clarke; Rajeeva Musunuri; Kshithija Nagulapalli; Susan Fairley; Alexi Runnels; Lara Winterkorn; Ernesto Lowy; Soren Germer; Harrison Brand; Ira M Hall; Michael E Talkowski; Giuseppe Narzisi; Michael C Zody
Journal:  Cell       Date:  2022-09-01       Impact factor: 66.850

5.  BubbleGun: Enumerating Bubbles and Superbubbles in Genome Graphs.

Authors:  Fawaz Dabbaghie; Jana Ebler; Tobias Marschall
Journal:  Bioinformatics       Date:  2022-07-07       Impact factor: 6.931

6.  KAGE: fast alignment-free graph-based genotyping of SNPs and short indels.

Authors:  Ivar Grytten; Knut Dagestad Rand; Geir Kjetil Sandve
Journal:  Genome Biol       Date:  2022-10-04       Impact factor: 17.906

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.