Literature DB >> 33789087

Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.

Xuefang Zhao1, Ryan L Collins2, Wan-Ping Lee3, Alexandra M Weber4, Yukyung Jun3, Qihui Zhu3, Ben Weisburd5, Yongqing Huang6, Peter A Audano7, Harold Wang8, Mark Walker9, Chelsea Lowther1, Jack Fu1, Mark B Gerstein10, Scott E Devine11, Tobias Marschall12, Jan O Korbel13, Evan E Eichler14, Mark J P Chaisson15, Charles Lee16, Ryan E Mills4, Harrison Brand1, Michael E Talkowski17.   

Abstract

Virtually all genome sequencing efforts in national biobanks, complex and Mendelian disease programs, and medical genetic initiatives are reliant upon short-read whole-genome sequencing (srWGS), which presents challenges for the detection of structural variants (SVs) relative to emerging long-read WGS (lrWGS) technologies. Given this ubiquity of srWGS in large-scale genomics initiatives, we sought to establish expectations for routine SV detection from this data type by comparison with lrWGS assembly, as well as to quantify the genomic properties and added value of SVs uniquely accessible to each technology. Analyses from the Human Genome Structural Variation Consortium (HGSVC) of three families captured ~11,000 SVs per genome from srWGS and ~25,000 SVs per genome from lrWGS assembly. Detection power and precision for SV discovery varied dramatically by genomic context and variant class: 9.7% of the current GRCh38 reference is defined by segmental duplication (SD) and simple repeat (SR), yet 91.4% of deletions that were specifically discovered by lrWGS localized to these regions. Across the remaining 90.3% of reference sequence, we observed extremely high (93.8%) concordance between technologies for deletions in these datasets. In contrast, lrWGS was superior for detection of insertions across all genomic contexts. Given that non-SD/SR sequences encompass 95.9% of currently annotated disease-associated exons, improved sensitivity from lrWGS to discover novel pathogenic deletions in these currently interpretable genomic regions is likely to be incremental. However, these analyses highlight the considerable added value of assembly-based lrWGS to create new catalogs of insertions and transposable elements, as well as disease-associated repeat expansions in genomic sequences that were previously recalcitrant to routine assessment.
Copyright © 2021. Published by Elsevier Inc.

Entities:  

Keywords:  copy number variation; genome assembly; long-read technology; segmental duplication; simple repeats; structural variation; whole-genome sequencing

Mesh:

Year:  2021        PMID: 33789087      PMCID: PMC8206509          DOI: 10.1016/j.ajhg.2021.03.014

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  55 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

2.  Single-cell template strand sequencing by Strand-seq enables the characterization of individual homologs.

Authors:  Ashley D Sanders; Ester Falconer; Mark Hills; Diana C J Spierings; Peter M Lansdorp
Journal:  Nat Protoc       Date:  2017-05-11       Impact factor: 13.491

3.  Paternally inherited cis-regulatory structural variants are associated with autism.

Authors:  William M Brandler; Danny Antaki; Madhusudan Gujral; Morgan L Kleiber; Joe Whitney; Michelle S Maile; Oanh Hong; Timothy R Chapman; Shirley Tan; Prateek Tandon; Timothy Pang; Shih C Tang; Keith K Vaux; Yan Yang; Eoghan Harrington; Sissel Juul; Daniel J Turner; Bhooma Thiruvahindrapuram; Gaganjot Kaur; Zhuozhi Wang; Stephen F Kingsmore; Joseph G Gleeson; Denis Bisson; Boyko Kakaradov; Amalio Telenti; J Craig Venter; Roser Corominas; Claudio Toma; Bru Cormand; Isabel Rueda; Silvina Guijarro; Karen S Messer; Caroline M Nievergelt; Maria J Arranz; Eric Courchesne; Karen Pierce; Alysson R Muotri; Lilia M Iakoucheva; Amaia Hervas; Stephen W Scherer; Christina Corsello; Jonathan Sebat
Journal:  Science       Date:  2018-04-20       Impact factor: 47.728

4.  An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder.

Authors:  Donna M Werling; Harrison Brand; Joon-Yong An; Matthew R Stone; Lingxue Zhu; Joseph T Glessner; Ryan L Collins; Shan Dong; Ryan M Layer; Eirene Markenscoff-Papadimitriou; Andrew Farrell; Grace B Schwartz; Harold Z Wang; Benjamin B Currall; Xuefang Zhao; Jeanselle Dea; Clif Duhn; Carolyn A Erdman; Michael C Gilson; Rachita Yadav; Robert E Handsaker; Seva Kashin; Lambertus Klei; Jeffrey D Mandell; Tomasz J Nowakowski; Yuwen Liu; Sirisha Pochareddy; Louw Smith; Michael F Walker; Matthew J Waterman; Xin He; Arnold R Kriegstein; John L Rubenstein; Nenad Sestan; Steven A McCarroll; Benjamin M Neale; Hilary Coon; A Jeremy Willsey; Joseph D Buxbaum; Mark J Daly; Matthew W State; Aaron R Quinlan; Gabor T Marth; Kathryn Roeder; Bernie Devlin; Michael E Talkowski; Stephan J Sanders
Journal:  Nat Genet       Date:  2018-04-26       Impact factor: 38.330

5.  Repetitive elements may comprise over two-thirds of the human genome.

Authors:  A P Jason de Koning; Wanjun Gu; Todd A Castoe; Mark A Batzer; David D Pollock
Journal:  PLoS Genet       Date:  2011-12-01       Impact factor: 5.917

6.  Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology.

Authors:  Weichen Zhou; Sarah B Emery; Diane A Flasch; Yifan Wang; Kenneth Y Kwan; Jeffrey M Kidd; John V Moran; Ryan E Mills
Journal:  Nucleic Acids Res       Date:  2020-02-20       Impact factor: 16.971

7.  Telomere-to-telomere assembly of a complete human X chromosome.

Authors:  Karen H Miga; Sergey Koren; Arang Rhie; Mitchell R Vollger; Ariel Gershman; Andrey Bzikadze; Shelise Brooks; Edmund Howe; David Porubsky; Glennis A Logsdon; Valerie A Schneider; Tamara Potapova; Jonathan Wood; William Chow; Joel Armstrong; Jeanne Fredrickson; Evgenia Pak; Kristof Tigyi; Milinn Kremitzki; Christopher Markovic; Valerie Maduro; Amalia Dutra; Gerard G Bouffard; Alexander M Chang; Nancy F Hansen; Amy B Wilfert; Françoise Thibaud-Nissen; Anthony D Schmitt; Jon-Matthew Belton; Siddarth Selvaraj; Megan Y Dennis; Daniela C Soto; Ruta Sahasrabudhe; Gulhan Kaya; Josh Quick; Nicholas J Loman; Nadine Holmes; Matthew Loose; Urvashi Surti; Rosa Ana Risques; Tina A Graves Lindsay; Robert Fulton; Ira Hall; Benedict Paten; Kerstin Howe; Winston Timp; Alice Young; James C Mullikin; Pavel A Pevzner; Jennifer L Gerton; Beth A Sullivan; Evan E Eichler; Adam M Phillippy
Journal:  Nature       Date:  2020-07-14       Impact factor: 49.962

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Characterizing the Major Structural Variant Alleles of the Human Genome.

Authors:  Peter A Audano; Arvis Sulovari; Tina A Graves-Lindsay; Stuart Cantsilieris; Melanie Sorensen; AnneMarie E Welch; Max L Dougherty; Bradley J Nelson; Ankeeta Shah; Susan K Dutcher; Wesley C Warren; Vincent Magrini; Sean D McGrath; Yang I Li; Richard K Wilson; Evan E Eichler
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

10.  Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

Authors:  Daniel Taliun; Daniel N Harris; Michael D Kessler; Jedidiah Carlson; Zachary A Szpiech; Raul Torres; Sarah A Gagliano Taliun; André Corvelo; Stephanie M Gogarten; Hyun Min Kang; Achilleas N Pitsillides; Jonathon LeFaive; Seung-Been Lee; Xiaowen Tian; Brian L Browning; Sayantan Das; Anne-Katrin Emde; Wayne E Clarke; Douglas P Loesch; Amol C Shetty; Thomas W Blackwell; Albert V Smith; Quenna Wong; Xiaoming Liu; Matthew P Conomos; Dean M Bobo; François Aguet; Christine Albert; Alvaro Alonso; Kristin G Ardlie; Dan E Arking; Stella Aslibekyan; Paul L Auer; John Barnard; R Graham Barr; Lucas Barwick; Lewis C Becker; Rebecca L Beer; Emelia J Benjamin; Lawrence F Bielak; John Blangero; Michael Boehnke; Donald W Bowden; Jennifer A Brody; Esteban G Burchard; Brian E Cade; James F Casella; Brandon Chalazan; Daniel I Chasman; Yii-Der Ida Chen; Michael H Cho; Seung Hoan Choi; Mina K Chung; Clary B Clish; Adolfo Correa; Joanne E Curran; Brian Custer; Dawood Darbar; Michelle Daya; Mariza de Andrade; Dawn L DeMeo; Susan K Dutcher; Patrick T Ellinor; Leslie S Emery; Celeste Eng; Diane Fatkin; Tasha Fingerlin; Lukas Forer; Myriam Fornage; Nora Franceschini; Christian Fuchsberger; Stephanie M Fullerton; Soren Germer; Mark T Gladwin; Daniel J Gottlieb; Xiuqing Guo; Michael E Hall; Jiang He; Nancy L Heard-Costa; Susan R Heckbert; Marguerite R Irvin; Jill M Johnsen; Andrew D Johnson; Robert Kaplan; Sharon L R Kardia; Tanika Kelly; Shannon Kelly; Eimear E Kenny; Douglas P Kiel; Robert Klemmer; Barbara A Konkle; Charles Kooperberg; Anna Köttgen; Leslie A Lange; Jessica Lasky-Su; Daniel Levy; Xihong Lin; Keng-Han Lin; Chunyu Liu; Ruth J F Loos; Lori Garman; Robert Gerszten; Steven A Lubitz; Kathryn L Lunetta; Angel C Y Mak; Ani Manichaikul; Alisa K Manning; Rasika A Mathias; David D McManus; Stephen T McGarvey; James B Meigs; Deborah A Meyers; Julie L Mikulla; Mollie A Minear; Braxton D Mitchell; Sanghamitra Mohanty; May E Montasser; Courtney Montgomery; Alanna C Morrison; Joanne M Murabito; Andrea Natale; Pradeep Natarajan; Sarah C Nelson; Kari E North; Jeffrey R O'Connell; Nicholette D Palmer; Nathan Pankratz; Gina M Peloso; Patricia A Peyser; Jacob Pleiness; Wendy S Post; Bruce M Psaty; D C Rao; Susan Redline; Alexander P Reiner; Dan Roden; Jerome I Rotter; Ingo Ruczinski; Chloé Sarnowski; Sebastian Schoenherr; David A Schwartz; Jeong-Sun Seo; Sudha Seshadri; Vivien A Sheehan; Wayne H Sheu; M Benjamin Shoemaker; Nicholas L Smith; Jennifer A Smith; Nona Sotoodehnia; Adrienne M Stilp; Weihong Tang; Kent D Taylor; Marilyn Telen; Timothy A Thornton; Russell P Tracy; David J Van Den Berg; Ramachandran S Vasan; Karine A Viaud-Martinez; Scott Vrieze; Daniel E Weeks; Bruce S Weir; Scott T Weiss; Lu-Chen Weng; Cristen J Willer; Yingze Zhang; Xutong Zhao; Donna K Arnett; Allison E Ashley-Koch; Kathleen C Barnes; Eric Boerwinkle; Stacey Gabriel; Richard Gibbs; Kenneth M Rice; Stephen S Rich; Edwin K Silverman; Pankaj Qasba; Weiniu Gan; George J Papanicolaou; Deborah A Nickerson; Sharon R Browning; Michael C Zody; Sebastian Zöllner; James G Wilson; L Adrienne Cupples; Cathy C Laurie; Cashell E Jaquish; Ryan D Hernandez; Timothy D O'Connor; Gonçalo R Abecasis
Journal:  Nature       Date:  2021-02-10       Impact factor: 69.504

View more
  12 in total

1.  Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes.

Authors:  Jana Ebler; Peter Ebert; Wayne E Clarke; Tobias Rausch; Peter A Audano; Torsten Houwaart; Yafei Mao; Jan O Korbel; Evan E Eichler; Michael C Zody; Alexander T Dilthey; Tobias Marschall
Journal:  Nat Genet       Date:  2022-04-11       Impact factor: 38.330

2.  A comprehensive benchmarking of WGS-based deletion structural variant callers.

Authors:  Varuni Sarwal; Sebastian Niehus; Ram Ayyala; Minyoung Kim; Aditya Sarkar; Sei Chang; Angela Lu; Neha Rajkumar; Nicholas Darfci-Maher; Russell Littman; Karishma Chhugani; Arda Soylev; Zoia Comarova; Emily Wesel; Jacqueline Castellanos; Rahul Chikka; Margaret G Distler; Eleazar Eskin; Jonathan Flint; Serghei Mangul
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  Rare coding variation provides insight into the genetic architecture and phenotypic context of autism.

Authors:  Jack M Fu; F Kyle Satterstrom; Minshi Peng; Harrison Brand; Ryan L Collins; Shan Dong; Brie Wamsley; Lambertus Klei; Lily Wang; Stephanie P Hao; Christine R Stevens; Caroline Cusick; Mehrtash Babadi; Eric Banks; Brett Collins; Sheila Dodge; Stacey B Gabriel; Laura Gauthier; Samuel K Lee; Lindsay Liang; Alicia Ljungdahl; Behrang Mahjani; Laura Sloofman; Andrey N Smirnov; Mafalda Barbosa; Catalina Betancur; Alfredo Brusco; Brian H Y Chung; Edwin H Cook; Michael L Cuccaro; Enrico Domenici; Giovanni Battista Ferrero; J Jay Gargus; Gail E Herman; Irva Hertz-Picciotto; Patricia Maciel; Dara S Manoach; Maria Rita Passos-Bueno; Antonio M Persico; Alessandra Renieri; James S Sutcliffe; Flora Tassone; Elisabetta Trabetti; Gabriele Campos; Simona Cardaropoli; Diana Carli; Marcus C Y Chan; Chiara Fallerini; Elisa Giorgio; Ana Cristina Girardi; Emily Hansen-Kiss; So Lun Lee; Carla Lintas; Yunin Ludena; Rachel Nguyen; Lisa Pavinato; Margaret Pericak-Vance; Isaac N Pessah; Rebecca J Schmidt; Moyra Smith; Claudia I S Costa; Slavica Trajkova; Jaqueline Y T Wang; Mullin H C Yu; David J Cutler; Silvia De Rubeis; Joseph D Buxbaum; Mark J Daly; Bernie Devlin; Kathryn Roeder; Stephan J Sanders; Michael E Talkowski
Journal:  Nat Genet       Date:  2022-08-18       Impact factor: 41.307

4.  High-Density Mapping of Quantitative Trait Loci Controlling Agronomically Important Traits in Quinoa (Chenopodium quinoa Willd.).

Authors:  Nathaly Maldonado-Taipe; Federico Barbier; Karl Schmid; Christian Jung; Nazgol Emrani
Journal:  Front Plant Sci       Date:  2022-06-09       Impact factor: 6.627

5.  Local adaptation and archaic introgression shape global diversity at human structural variant loci.

Authors:  Stephanie M Yan; Rachel M Sherman; Dylan J Taylor; Divya R Nair; Andrew N Bortvin; Michael C Schatz; Rajiv C McCoy
Journal:  Elife       Date:  2021-09-16       Impact factor: 8.140

Review 6.  Best practices for the interpretation and reporting of clinical whole genome sequencing.

Authors:  Christina A Austin-Tse; Vaidehi Jobanputra; Hutton M Kearney; Heidi L Rehm; Denise L Perry; David Bick; Ryan J Taft; Eric Venner; Richard A Gibbs; Ted Young; Sarah Barnett; John W Belmont; Nicole Boczek; Shimul Chowdhury; Katarzyna A Ellsworth; Saurav Guha; Shashikant Kulkarni; Cherisse Marcou; Linyan Meng; David R Murdock; Atteeq U Rehman; Elizabeth Spiteri; Amanda Thomas-Wilson
Journal:  NPJ Genom Med       Date:  2022-04-08       Impact factor: 6.083

Review 7.  Lessons learned: next-generation sequencing applied to undiagnosed genetic diseases.

Authors:  Bryce A Schuler; Erica T Nelson; Mary Koziura; Joy D Cogan; Rizwan Hamid; John A Phillips
Journal:  J Clin Invest       Date:  2022-04-01       Impact factor: 14.808

8.  SvAnna: efficient and accurate pathogenicity prediction of coding and regulatory structural variants in long-read genome sequencing.

Authors:  Daniel Danis; Julius O B Jacobsen; Parithi Balachandran; Qihui Zhu; Feyza Yilmaz; Justin Reese; Matthias Haimel; Gholson J Lyon; Ingo Helbig; Christopher J Mungall; Christine R Beck; Charles Lee; Damian Smedley; Peter N Robinson
Journal:  Genome Med       Date:  2022-04-28       Impact factor: 15.266

9.  TT-Mars: structural variants assessment based on haplotype-resolved assemblies.

Authors:  Jianzhi Yang; Mark J P Chaisson
Journal:  Genome Biol       Date:  2022-05-06       Impact factor: 17.906

10.  Discovery of genomic variation across a generation.

Authors:  Brett Trost; Livia O Loureiro; Stephen W Scherer
Journal:  Hum Mol Genet       Date:  2021-10-01       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.