Literature DB >> 33347570

Parliament2: Accurate structural variant calling at scale.

Samantha Zarate1,2, Andrew Carroll1, Medhat Mahmoud3, Olga Krasheninina3, Goo Jun4, William J Salerno3, Michael C Schatz2, Eric Boerwinkle3,4, Richard A Gibbs3, Fritz J Sedlazeck3.   

Abstract

BACKGROUND: Structural variants (SVs) are critical contributors to genetic diversity and genomic disease. To predict the phenotypic impact of SVs, there is a need for better estimates of both the occurrence and frequency of SVs, preferably from large, ethnically diverse cohorts. Thus, the current standard approach requires the use of short paired-end reads, which remain challenging to detect, especially at the scale of hundreds to thousands of samples.
FINDINGS: We present Parliament2, a consensus SV framework that leverages multiple best-in-class methods to identify high-quality SVs from short-read DNA sequence data at scale. Parliament2 incorporates pre-installed SV callers that are optimized for efficient execution in parallel to reduce the overall runtime and costs. We demonstrate the accuracy of Parliament2 when applied to data from NovaSeq and HiSeq X platforms with the Genome in a Bottle (GIAB) SV call set across all size classes. The reported quality score per SV is calibrated across different SV types and size classes. Parliament2 has the highest F1 score (74.27%) measured across the independent gold standard from GIAB. We illustrate the compute performance by processing all 1000 Genomes samples (2,691 samples) in <1 day on GRCH38. Parliament2 improves the runtime performance of individual methods and is open source (https://github.com/slzarate/parliament2), and a Docker image, as well as a WDL implementation, is available.
CONCLUSION: Parliament2 provides both a highly accurate single-sample SV call set from short-read DNA sequence data and enables cost-efficient application over cloud or cluster environments, processing thousands of samples.
© The Author(s) 2020. Published by Oxford University Press GigaScience.

Entities:  

Keywords:  high-throughput sequencing; next-generation sequencing; structural variation

Mesh:

Year:  2020        PMID: 33347570      PMCID: PMC7751401          DOI: 10.1093/gigascience/giaa145

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   7.658


  23 in total

1.  CREST maps somatic structural variation in cancer genomes with base-pair resolution.

Authors:  Jianmin Wang; Charles G Mullighan; John Easton; Stefan Roberts; Sue L Heatley; Jing Ma; Michael C Rusch; Ken Chen; Christopher C Harris; Li Ding; Linda Holmfeldt; Debbie Payne-Turner; Xian Fan; Lei Wei; David Zhao; John C Obenauer; Clayton Naeve; Elaine R Mardis; Richard K Wilson; James R Downing; Jinghui Zhang
Journal:  Nat Methods       Date:  2011-06-12       Impact factor: 28.547

2.  Resolving the complexity of the human genome using single-molecule sequencing.

Authors:  Mark J P Chaisson; John Huddleston; Megan Y Dennis; Peter H Sudmant; Maika Malig; Fereydoun Hormozdiari; Francesca Antonacci; Urvashi Surti; Richard Sandstrom; Matthew Boitano; Jane M Landolin; John A Stamatoyannopoulos; Michael W Hunkapiller; Jonas Korlach; Evan E Eichler
Journal:  Nature       Date:  2014-11-10       Impact factor: 49.962

3.  BreakDancer: an algorithm for high-resolution mapping of genomic structural variation.

Authors:  Ken Chen; John W Wallis; Michael D McLellan; David E Larson; Joelle M Kalicki; Craig S Pohl; Sean D McGrath; Michael C Wendl; Qunyuan Zhang; Devin P Locke; Xiaoqi Shi; Robert S Fulton; Timothy J Ley; Richard K Wilson; Li Ding; Elaine R Mardis
Journal:  Nat Methods       Date:  2009-08-09       Impact factor: 28.547

4.  Major Impacts of Widespread Structural Variation on Gene Expression and Crop Improvement in Tomato.

Authors:  Michael Alonge; Xingang Wang; Matthias Benoit; Sebastian Soyk; Lara Pereira; Lei Zhang; Hamsini Suresh; Srividya Ramakrishnan; Florian Maumus; Danielle Ciren; Yuval Levy; Tom Hai Harel; Gili Shalev-Schlosser; Ziva Amsellem; Hamid Razifard; Ana L Caicedo; Denise M Tieman; Harry Klee; Melanie Kirsche; Sergey Aganezov; T Rhyker Ranallo-Benavidez; Zachary H Lemmon; Jennifer Kim; Gina Robitaille; Melissa Kramer; Sara Goodwin; W Richard McCombie; Samuel Hutton; Joyce Van Eck; Jesse Gillis; Yuval Eshed; Fritz J Sedlazeck; Esther van der Knaap; Michael C Schatz; Zachary B Lippman
Journal:  Cell       Date:  2020-06-17       Impact factor: 66.850

5.  An integrated map of structural variation in 2,504 human genomes.

Authors:  Peter H Sudmant; Tobias Rausch; Eugene J Gardner; Robert E Handsaker; Alexej Abyzov; John Huddleston; Yan Zhang; Kai Ye; Goo Jun; Markus Hsi-Yang Fritz; Miriam K Konkel; Ankit Malhotra; Adrian M Stütz; Xinghua Shi; Francesco Paolo Casale; Jieming Chen; Fereydoun Hormozdiari; Gargi Dayama; Ken Chen; Maika Malig; Mark J P Chaisson; Klaudia Walter; Sascha Meiers; Seva Kashin; Erik Garrison; Adam Auton; Hugo Y K Lam; Xinmeng Jasmine Mu; Can Alkan; Danny Antaki; Taejeong Bae; Eliza Cerveira; Peter Chines; Zechen Chong; Laura Clarke; Elif Dal; Li Ding; Sarah Emery; Xian Fan; Madhusudan Gujral; Fatma Kahveci; Jeffrey M Kidd; Yu Kong; Eric-Wubbo Lameijer; Shane McCarthy; Paul Flicek; Richard A Gibbs; Gabor Marth; Christopher E Mason; Androniki Menelaou; Donna M Muzny; Bradley J Nelson; Amina Noor; Nicholas F Parrish; Matthew Pendleton; Andrew Quitadamo; Benjamin Raeder; Eric E Schadt; Mallory Romanovitch; Andreas Schlattl; Robert Sebra; Andrey A Shabalin; Andreas Untergasser; Jerilyn A Walker; Min Wang; Fuli Yu; Chengsheng Zhang; Jing Zhang; Xiangqun Zheng-Bradley; Wanding Zhou; Thomas Zichner; Jonathan Sebat; Mark A Batzer; Steven A McCarroll; Ryan E Mills; Mark B Gerstein; Ali Bashir; Oliver Stegle; Scott E Devine; Charles Lee; Evan E Eichler; Jan O Korbel
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

6.  DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors:  Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

7.  SpeedSeq: ultra-fast personal genome analysis and interpretation.

Authors:  Colby Chiang; Ryan M Layer; Gregory G Faust; Michael R Lindberg; David B Rose; Erik P Garrison; Gabor T Marth; Aaron R Quinlan; Ira M Hall
Journal:  Nat Methods       Date:  2015-08-10       Impact factor: 28.547

8.  LUMPY: a probabilistic framework for structural variant discovery.

Authors:  Ryan M Layer; Colby Chiang; Aaron R Quinlan; Ira M Hall
Journal:  Genome Biol       Date:  2014-06-26       Impact factor: 13.583

9.  Accurate detection of complex structural variations using single-molecule sequencing.

Authors:  Fritz J Sedlazeck; Philipp Rescheneder; Moritz Smolka; Han Fang; Maria Nattestad; Arndt von Haeseler; Michael C Schatz
Journal:  Nat Methods       Date:  2018-04-30       Impact factor: 28.547

10.  A robust benchmark for detection of germline large deletions and insertions.

Authors:  Justin M Zook; Nancy F Hansen; Nathan D Olson; Lesley Chapman; James C Mullikin; Chunlin Xiao; Stephen Sherry; Sergey Koren; Adam M Phillippy; Paul C Boutros; Sayed Mohammad E Sahraeian; Vincent Huang; Alexandre Rouette; Noah Alexander; Christopher E Mason; Iman Hajirasouliha; Camir Ricketts; Joyce Lee; Rick Tearle; Ian T Fiddes; Alvaro Martinez Barrio; Jeremiah Wala; Andrew Carroll; Noushin Ghaffari; Oscar L Rodriguez; Ali Bashir; Shaun Jackman; John J Farrell; Aaron M Wenger; Can Alkan; Arda Soylev; Michael C Schatz; Shilpa Garg; George Church; Tobias Marschall; Ken Chen; Xian Fan; Adam C English; Jeffrey A Rosenfeld; Weichen Zhou; Ryan E Mills; Jay M Sage; Jennifer R Davis; Michael D Kaiser; John S Oliver; Anthony P Catalano; Mark J P Chaisson; Noah Spies; Fritz J Sedlazeck; Marc Salit
Journal:  Nat Biotechnol       Date:  2020-06-15       Impact factor: 54.908

View more
  12 in total

1.  High prevalence of multilocus pathogenic variation in neurodevelopmental disorders in the Turkish population.

Authors:  Tadahiro Mitani; Sedat Isikay; Alper Gezdirici; Elif Yilmaz Gulec; Jaya Punetha; Jawid M Fatih; Isabella Herman; Gulsen Akay; Haowei Du; Daniel G Calame; Akif Ayaz; Tulay Tos; Gozde Yesil; Hatip Aydin; Bilgen Geckinli; Nursel Elcioglu; Sukru Candan; Ozlem Sezer; Haktan Bagis Erdem; Davut Gul; Emine Demiral; Muhsin Elmas; Osman Yesilbas; Betul Kilic; Serdal Gungor; Ahmet C Ceylan; Sevcan Bozdogan; Ozge Ozalp; Salih Cicek; Huseyin Aslan; Sinem Yalcintepe; Vehap Topcu; Yavuz Bayram; Christopher M Grochowski; Angad Jolly; Moez Dawood; Ruizhi Duan; Shalini N Jhangiani; Harsha Doddapaneni; Jianhong Hu; Donna M Muzny; Dana Marafi; Zeynep Coban Akdemir; Ender Karaca; Claudia M B Carvalho; Richard A Gibbs; Jennifer E Posey; James R Lupski; Davut Pehlivan
Journal:  Am J Hum Genet       Date:  2021-09-28       Impact factor: 11.025

2.  A comprehensive benchmarking of WGS-based deletion structural variant callers.

Authors:  Varuni Sarwal; Sebastian Niehus; Ram Ayyala; Minyoung Kim; Aditya Sarkar; Sei Chang; Angela Lu; Neha Rajkumar; Nicholas Darfci-Maher; Russell Littman; Karishma Chhugani; Arda Soylev; Zoia Comarova; Emily Wesel; Jacqueline Castellanos; Rahul Chikka; Margaret G Distler; Eleazar Eskin; Jonathan Flint; Serghei Mangul
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  The impact of rare germline variants on human somatic mutation processes.

Authors:  Mischan Vali-Pour; Ben Lehner; Fran Supek
Journal:  Nat Commun       Date:  2022-06-28       Impact factor: 17.694

Review 4.  Guidelines for bioinformatics of single-cell sequencing data analysis in Alzheimer's disease: review, recommendation, implementation and application.

Authors:  Minghui Wang; Won-Min Song; Chen Ming; Qian Wang; Xianxiao Zhou; Peng Xu; Azra Krek; Yonejung Yoon; Lap Ho; Miranda E Orr; Guo-Cheng Yuan; Bin Zhang
Journal:  Mol Neurodegener       Date:  2022-03-02       Impact factor: 18.879

5.  A monoallelic SEC23A variant E599K associated with cranio-lenticulo-sutural dysplasia.

Authors:  Katarina Cisarova; Livia Garavelli; Stefano Giuseppe Caraffi; Francesca Peluso; Lara Valeri; Giancarlo Gargano; Sara Gavioli; Gabriele Trimarchi; Alberto Neri; Belinda Campos-Xavier; Andrea Superti-Furga
Journal:  Am J Med Genet A       Date:  2021-09-28       Impact factor: 2.578

6.  Dysgu: efficient structural variant calling using short or long reads.

Authors:  Kez Cleal; Duncan M Baird
Journal:  Nucleic Acids Res       Date:  2022-05-20       Impact factor: 19.160

7.  PopDel identifies medium-size deletions simultaneously in tens of thousands of genomes.

Authors:  Sebastian Niehus; Hákon Jónsson; Janina Schönberger; Eythór Björnsson; Doruk Beyter; Hannes P Eggertsson; Patrick Sulem; Kári Stefánsson; Bjarni V Halldórsson; Birte Kehr
Journal:  Nat Commun       Date:  2021-02-01       Impact factor: 14.919

Review 8.  Structural variant detection in cancer genomes: computational challenges and perspectives for precision oncology.

Authors:  Ianthe A E M van Belzen; Alexander Schönhuth; Patrick Kemmeren; Jayne Y Hehir-Kwa
Journal:  NPJ Precis Oncol       Date:  2021-03-02

9.  CNV-P: a machine-learning framework for predicting high confident copy number variations.

Authors:  Taifu Wang; Jinghua Sun; Xiuqing Zhang; Wen-Jing Wang; Qing Zhou
Journal:  PeerJ       Date:  2021-12-02       Impact factor: 2.984

Review 10.  Towards accurate and reliable resolution of structural variants for clinical diagnosis.

Authors:  Zhichao Liu; Ruth Roberts; Timothy R Mercer; Joshua Xu; Fritz J Sedlazeck; Weida Tong
Journal:  Genome Biol       Date:  2022-03-03       Impact factor: 17.906

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.