Literature DB >> 30559433

Long-read sequence and assembly of segmental duplications.

Mitchell R Vollger1, Philip C Dishuck1, Melanie Sorensen1, AnneMarie E Welch1, Vy Dang1, Max L Dougherty1, Tina A Graves-Lindsay2, Richard K Wilson3,4, Mark J P Chaisson5, Evan E Eichler6,7.   

Abstract

We have developed a computational method based on polyploid phasing of long sequence reads to resolve collapsed regions of segmental duplications within genome assemblies. Segmental Duplication Assembler (SDA; https://github.com/mvollger/SDA ) constructs graphs in which paralogous sequence variants define the nodes and long-read sequences provide attraction and repulsion edges, enabling the partition and assembly of long reads corresponding to distinct paralogs. We apply it to single-molecule, real-time sequence data from three human genomes and recover 33-79 megabase pairs (Mb) of duplications in which approximately half of the loci are diverged (<99.8%) compared to the reference genome. We show that the corresponding sequence is highly accurate (>99.9%) and that the diverged sequence corresponds to copy-number-variable paralogs that are absent from the human reference genome. Our method can be applied to other complex genomes to resolve the last gene-rich gaps, improve duplicate gene annotation, and better understand copy-number-variant genetic diversity at the base-pair level.

Entities:  

Mesh:

Year:  2018        PMID: 30559433      PMCID: PMC6382464          DOI: 10.1038/s41592-018-0236-3

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


  47 in total

1.  HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.

Authors:  Sergey Nurk; Brian P Walenz; Arang Rhie; Mitchell R Vollger; Glennis A Logsdon; Robert Grothe; Karen H Miga; Evan E Eichler; Adam M Phillippy; Sergey Koren
Journal:  Genome Res       Date:  2020-08-14       Impact factor: 9.043

Review 2.  Long-read human genome sequencing and its applications.

Authors:  Glennis A Logsdon; Mitchell R Vollger; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2020-06-05       Impact factor: 53.242

3.  Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.

Authors:  Timofey Prodanov; Vikas Bansal
Journal:  Nucleic Acids Res       Date:  2020-11-04       Impact factor: 16.971

Review 4.  Structural variation in the sequencing era.

Authors:  Steve S Ho; Alexander E Urban; Ryan E Mills
Journal:  Nat Rev Genet       Date:  2019-11-15       Impact factor: 53.242

5.  Sequence diversity analyses of an improved rhesus macaque genome enhance its biomedical utility.

Authors:  Wesley C Warren; R Alan Harris; Marina Haukness; Ian T Fiddes; Shwetha C Murali; Jason Fernandes; Philip C Dishuck; Jessica M Storer; Muthuswamy Raveendran; LaDeana W Hillier; David Porubsky; Yafei Mao; David Gordon; Mitchell R Vollger; Alexandra P Lewis; Katherine M Munson; Elizabeth DeVogelaere; Joel Armstrong; Mark Diekhans; Jerilyn A Walker; Chad Tomlinson; Tina A Graves-Lindsay; Milinn Kremitzki; Sofie R Salama; Peter A Audano; Merly Escalona; Nicholas W Maurer; Francesca Antonacci; Ludovica Mercuri; Flavia A M Maggiolini; Claudia Rita Catacchio; Jason G Underwood; David H O'Connor; Ashley D Sanders; Jan O Korbel; Betsy Ferguson; H Michael Kubisch; Louis Picker; Ned H Kalin; Douglas Rosene; Jon Levine; David H Abbott; Stanton B Gray; Mar M Sanchez; Zsofia A Kovacs-Balint; Joseph W Kemnitz; Sara M Thomasy; Jeffrey A Roberts; Erin L Kinnally; John P Capitanio; J H Pate Skene; Michael Platt; Shelley A Cole; Richard E Green; Mario Ventura; Roger W Wiseman; Benedict Paten; Mark A Batzer; Jeffrey Rogers; Evan E Eichler
Journal:  Science       Date:  2020-12-18       Impact factor: 47.728

6.  Nanopore Guided Assembly of Segmental Duplications near Telomeres.

Authors:  Eleni Adam; Tunazzina Islam; Desh Ranjan; Harold Riethman
Journal:  Proc IEEE Int Symp Bioinformatics Bioeng       Date:  2019-12-26

7.  Sex-Based Analysis of De Novo Variants in Neurodevelopmental Disorders.

Authors:  Tychele N Turner; Amy B Wilfert; Trygve E Bakken; Raphael A Bernier; Micah R Pepper; Zhancheng Zhang; Rebecca I Torene; Kyle Retterer; Evan E Eichler
Journal:  Am J Hum Genet       Date:  2019-11-27       Impact factor: 11.025

Review 8.  Genetic Variation, Comparative Genomics, and the Diagnosis of Disease.

Authors:  Evan E Eichler
Journal:  N Engl J Med       Date:  2019-07-04       Impact factor: 91.245

9.  Characterizing the Major Structural Variant Alleles of the Human Genome.

Authors:  Peter A Audano; Arvis Sulovari; Tina A Graves-Lindsay; Stuart Cantsilieris; Melanie Sorensen; AnneMarie E Welch; Max L Dougherty; Bradley J Nelson; Ankeeta Shah; Susan K Dutcher; Wesley C Warren; Vincent Magrini; Sean D McGrath; Yang I Li; Richard K Wilson; Evan E Eichler
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

10.  Large X-Linked Palindromes Undergo Arm-to-Arm Gene Conversion across Mus Lineages.

Authors:  Callie M Swanepoel; Emma R Gerlinger; Jacob L Mueller
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.