Literature DB >> 28873962

A recurrence-based approach for validating structural variation using long-read sequencing technology.

Xuefang Zhao1, Alexandra M Weber1, Ryan E Mills1,2.   

Abstract

Although numerous algorithms have been developed to identify structural variations (SVs) in genomic sequences, there is a dearth of approaches that can be used to evaluate their results. This is significant as the accurate identification of structural variation is still an outstanding but important problem in genomics. The emergence of new sequencing technologies that generate longer sequence reads can, in theory, provide direct evidence for all types of SVs regardless of the length of the region through which it spans. However, current efforts to use these data in this manner require the use of large computational resources to assemble these sequences as well as visual inspection of each region. Here we present VaPoR, a highly efficient algorithm that autonomously validates large SV sets using long-read sequencing data. We assessed the performance of VaPoR on SVs in both simulated and real genomes and report a high-fidelity rate for overall accuracy across different levels of sequence depths. We show that VaPoR can interrogate a much larger range of SVs while still matching existing methods in terms of false positive validations and providing additional features considering breakpoint precision and predicted genotype. We further show that VaPoR can run quickly and efficiency without requiring a large processing or assembly pipeline. VaPoR provides a long read-based validation approach for genomic SVs that requires relatively low read depth and computing resources and thus will provide utility with targeted or low-pass sequencing coverage for accurate SV assessment. The VaPoR Software is available at: https://github.com/mills-lab/vapor.
© The Authors 2017. Published by Oxford University Press.

Entities:  

Keywords:  copy number variation; sequence analysis; structural variation

Mesh:

Year:  2017        PMID: 28873962      PMCID: PMC5737365          DOI: 10.1093/gigascience/gix061

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  21 in total

1.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

2.  Cryptic and complex chromosomal aberrations in early-onset neuropsychiatric disorders.

Authors:  Harrison Brand; Vamsee Pillalamarri; Ryan L Collins; Stacey Eggert; Colm O'Dushlaine; Ellen B Braaten; Matthew R Stone; Kimberly Chambert; Nathan D Doty; Carrie Hanscom; Jill A Rosenfeld; Hillary Ditmars; Jessica Blais; Ryan Mills; Charles Lee; James F Gusella; Steven McCarroll; Jordan W Smoller; Michael E Talkowski; Alysa E Doyle
Journal:  Am J Hum Genet       Date:  2014-10-02       Impact factor: 11.025

3.  A flexible and efficient template format for circular consensus sequencing and SNP detection.

Authors:  Kevin J Travers; Chen-Shan Chin; David R Rank; John S Eid; Stephen W Turner
Journal:  Nucleic Acids Res       Date:  2010-06-22       Impact factor: 16.971

4.  PBSIM: PacBio reads simulator--toward accurate genome assembly.

Authors:  Yukiteru Ono; Kiyoshi Asai; Michiaki Hamada
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

5.  Integrative genomics viewer.

Authors:  James T Robinson; Helga Thorvaldsdóttir; Wendy Winckler; Mitchell Guttman; Eric S Lander; Gad Getz; Jill P Mesirov
Journal:  Nat Biotechnol       Date:  2011-01       Impact factor: 54.908

6.  Complex reorganization and predominant non-homologous repair following chromosomal breakage in karyotypically balanced germline rearrangements and transgenic integration.

Authors:  Colby Chiang; Jessie C Jacobsen; Carl Ernst; Carrie Hanscom; Adrian Heilbut; Ian Blumenthal; Ryan E Mills; Andrew Kirby; Amelia M Lindgren; Skye R Rudiger; Clive J McLaughlan; C Simon Bawden; Suzanne J Reid; Richard L M Faull; Russell G Snell; Ira M Hall; Yiping Shen; Toshiro K Ohsumi; Mark L Borowsky; Mark J Daly; Charles Lee; Cynthia C Morton; Marcy E MacDonald; James F Gusella; Michael E Talkowski
Journal:  Nat Genet       Date:  2012-03-04       Impact factor: 38.330

7.  Hybrid error correction and de novo assembly of single-molecule sequencing reads.

Authors:  Sergey Koren; Michael C Schatz; Brian P Walenz; Jeffrey Martin; Jason T Howard; Ganeshkumar Ganapathy; Zhong Wang; David A Rasko; W Richard McCombie; Erich D Jarvis
Journal:  Nat Biotechnol       Date:  2012-07-01       Impact factor: 54.908

8.  An integrated encyclopedia of DNA elements in the human genome.

Authors: 
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

9.  DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors:  Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

10.  LUMPY: a probabilistic framework for structural variant discovery.

Authors:  Ryan M Layer; Colby Chiang; Aaron R Quinlan; Ira M Hall
Journal:  Genome Biol       Date:  2014-06-26       Impact factor: 13.583

View more
  10 in total

1.  Assessment of human diploid genome assembly with 10x Linked-Reads data.

Authors:  Lu Zhang; Xin Zhou; Ziming Weng; Arend Sidow
Journal:  Gigascience       Date:  2019-11-01       Impact factor: 6.524

2.  Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.

Authors:  Xuefang Zhao; Ryan L Collins; Wan-Ping Lee; Alexandra M Weber; Yukyung Jun; Qihui Zhu; Ben Weisburd; Yongqing Huang; Peter A Audano; Harold Wang; Mark Walker; Chelsea Lowther; Jack Fu; Mark B Gerstein; Scott E Devine; Tobias Marschall; Jan O Korbel; Evan E Eichler; Mark J P Chaisson; Charles Lee; Ryan E Mills; Harrison Brand; Michael E Talkowski
Journal:  Am J Hum Genet       Date:  2021-03-30       Impact factor: 11.025

3.  Haplotype-resolved diverse human genomes and integrated analysis of structural variation.

Authors:  Peter Ebert; Peter A Audano; Qihui Zhu; Bernardo Rodriguez-Martin; Charles Lee; Jan O Korbel; Tobias Marschall; Evan E Eichler; David Porubsky; Marc Jan Bonder; Arvis Sulovari; Jana Ebler; Weichen Zhou; Rebecca Serra Mari; Feyza Yilmaz; Xuefang Zhao; PingHsun Hsieh; Joyce Lee; Sushant Kumar; Jiadong Lin; Tobias Rausch; Yu Chen; Jingwen Ren; Martin Santamarina; Wolfram Höps; Hufsah Ashraf; Nelson T Chuang; Xiaofei Yang; Katherine M Munson; Alexandra P Lewis; Susan Fairley; Luke J Tallon; Wayne E Clarke; Anna O Basile; Marta Byrska-Bishop; André Corvelo; Uday S Evani; Tsung-Yu Lu; Mark J P Chaisson; Junjie Chen; Chong Li; Harrison Brand; Aaron M Wenger; Maryam Ghareghani; William T Harvey; Benjamin Raeder; Patrick Hasenfeld; Allison A Regier; Haley J Abel; Ira M Hall; Paul Flicek; Oliver Stegle; Mark B Gerstein; Jose M C Tubio; Zepeng Mu; Yang I Li; Xinghua Shi; Alex R Hastie; Kai Ye; Zechen Chong; Ashley D Sanders; Michael C Zody; Michael E Talkowski; Ryan E Mills; Scott E Devine
Journal:  Science       Date:  2021-02-25       Impact factor: 47.728

4.  Multi-platform discovery of haplotype-resolved structural variation in human genomes.

Authors:  Mark J P Chaisson; Ashley D Sanders; Xuefang Zhao; Ankit Malhotra; David Porubsky; Tobias Rausch; Eugene J Gardner; Oscar L Rodriguez; Li Guo; Ryan L Collins; Xian Fan; Jia Wen; Robert E Handsaker; Susan Fairley; Zev N Kronenberg; Xiangmeng Kong; Fereydoun Hormozdiari; Dillon Lee; Aaron M Wenger; Alex R Hastie; Danny Antaki; Thomas Anantharaman; Peter A Audano; Harrison Brand; Stuart Cantsilieris; Han Cao; Eliza Cerveira; Chong Chen; Xintong Chen; Chen-Shan Chin; Zechen Chong; Nelson T Chuang; Christine C Lambert; Deanna M Church; Laura Clarke; Andrew Farrell; Joey Flores; Timur Galeev; David U Gorkin; Madhusudan Gujral; Victor Guryev; William Haynes Heaton; Jonas Korlach; Sushant Kumar; Jee Young Kwon; Ernest T Lam; Jong Eun Lee; Joyce Lee; Wan-Ping Lee; Sau Peng Lee; Shantao Li; Patrick Marks; Karine Viaud-Martinez; Sascha Meiers; Katherine M Munson; Fabio C P Navarro; Bradley J Nelson; Conor Nodzak; Amina Noor; Sofia Kyriazopoulou-Panagiotopoulou; Andy W C Pang; Yunjiang Qiu; Gabriel Rosanio; Mallory Ryan; Adrian Stütz; Diana C J Spierings; Alistair Ward; AnneMarie E Welch; Ming Xiao; Wei Xu; Chengsheng Zhang; Qihui Zhu; Xiangqun Zheng-Bradley; Ernesto Lowy; Sergei Yakneen; Steven McCarroll; Goo Jun; Li Ding; Chong Lek Koh; Bing Ren; Paul Flicek; Ken Chen; Mark B Gerstein; Pui-Yan Kwok; Peter M Lansdorp; Gabor T Marth; Jonathan Sebat; Xinghua Shi; Ali Bashir; Kai Ye; Scott E Devine; Michael E Talkowski; Ryan E Mills; Tobias Marschall; Jan O Korbel; Evan E Eichler; Charles Lee
Journal:  Nat Commun       Date:  2019-04-16       Impact factor: 17.694

5.  Hecaton: reliably detecting copy number variation in plant genomes using short read sequencing data.

Authors:  Raúl Y Wijfjes; Sandra Smit; Dick de Ridder
Journal:  BMC Genomics       Date:  2019-11-07       Impact factor: 3.969

6.  Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology.

Authors:  Weichen Zhou; Sarah B Emery; Diane A Flasch; Yifan Wang; Kenneth Y Kwan; Jeffrey M Kidd; John V Moran; Ryan E Mills
Journal:  Nucleic Acids Res       Date:  2020-02-20       Impact factor: 16.971

7.  Comprehensive evaluation of structural variant genotyping methods based on long-read sequencing data.

Authors:  Xiaoke Duan; Mingpei Pan; Shaohua Fan
Journal:  BMC Genomics       Date:  2022-04-23       Impact factor: 4.547

8.  TT-Mars: structural variants assessment based on haplotype-resolved assemblies.

Authors:  Jianzhi Yang; Mark J P Chaisson
Journal:  Genome Biol       Date:  2022-05-06       Impact factor: 17.906

9.  High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.

Authors:  Marta Byrska-Bishop; Uday S Evani; Xuefang Zhao; Anna O Basile; Haley J Abel; Allison A Regier; André Corvelo; Wayne E Clarke; Rajeeva Musunuri; Kshithija Nagulapalli; Susan Fairley; Alexi Runnels; Lara Winterkorn; Ernesto Lowy; Soren Germer; Harrison Brand; Ira M Hall; Michael E Talkowski; Giuseppe Narzisi; Michael C Zody
Journal:  Cell       Date:  2022-09-01       Impact factor: 66.850

10.  Cas9 targeted enrichment of mobile elements using nanopore sequencing.

Authors:  Torrin L McDonald; Weichen Zhou; Christopher P Castro; Camille Mumm; Jessica A Switzenberg; Ryan E Mills; Alan P Boyle
Journal:  Nat Commun       Date:  2021-06-11       Impact factor: 14.919

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.