Literature DB >> 33632895

Haplotype-resolved diverse human genomes and integrated analysis of structural variation.

Peter Ebert1, Peter A Audano2, Qihui Zhu3, Bernardo Rodriguez-Martin4, Charles Lee5,6,7, Jan O Korbel8,9, Tobias Marschall10, Evan E Eichler11,12, David Porubsky2, Marc Jan Bonder4,13, Arvis Sulovari2, Jana Ebler1, Weichen Zhou14, Rebecca Serra Mari1, Feyza Yilmaz3, Xuefang Zhao15,16, PingHsun Hsieh2, Joyce Lee17, Sushant Kumar18, Jiadong Lin19, Tobias Rausch4, Yu Chen20, Jingwen Ren21, Martin Santamarina22,23, Wolfram Höps4, Hufsah Ashraf1, Nelson T Chuang24, Xiaofei Yang25, Katherine M Munson2, Alexandra P Lewis2, Susan Fairley9, Luke J Tallon24, Wayne E Clarke26, Anna O Basile26, Marta Byrska-Bishop26, André Corvelo26, Uday S Evani26, Tsung-Yu Lu21, Mark J P Chaisson21, Junjie Chen27, Chong Li27, Harrison Brand15,16, Aaron M Wenger28, Maryam Ghareghani29,30,1, William T Harvey2, Benjamin Raeder4, Patrick Hasenfeld4, Allison A Regier31, Haley J Abel31, Ira M Hall32, Paul Flicek9, Oliver Stegle4,13, Mark B Gerstein18, Jose M C Tubio22,23, Zepeng Mu33, Yang I Li34, Xinghua Shi27, Alex R Hastie17, Kai Ye19,35, Zechen Chong20, Ashley D Sanders4, Michael C Zody26, Michael E Talkowski15,16, Ryan E Mills14,35, Scott E Devine24.   

Abstract

Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.
Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33632895      PMCID: PMC8026704          DOI: 10.1126/science.abf7117

Source DB:  PubMed          Journal:  Science        ISSN: 0036-8075            Impact factor:   47.728


  172 in total

1.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

2.  SVA elements: a hominid-specific retroposon family.

Authors:  Hui Wang; Jinchuan Xing; Deepak Grover; Dale J Hedges; Kyudong Han; Jerilyn A Walker; Mark A Batzer
Journal:  J Mol Biol       Date:  2005-10-19       Impact factor: 5.469

3.  RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference.

Authors:  Brian K Maples; Simon Gravel; Eimear E Kenny; Carlos D Bustamante
Journal:  Am J Hum Genet       Date:  2013-08-01       Impact factor: 11.025

4.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.

Authors:  Zhihong Zhu; Futao Zhang; Han Hu; Andrew Bakshi; Matthew R Robinson; Joseph E Powell; Grant W Montgomery; Michael E Goddard; Naomi R Wray; Peter M Visscher; Jian Yang
Journal:  Nat Genet       Date:  2016-03-28       Impact factor: 38.330

5.  HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.

Authors:  Sergey Nurk; Brian P Walenz; Arang Rhie; Mitchell R Vollger; Glennis A Logsdon; Robert Grothe; Karen H Miga; Evan E Eichler; Adam M Phillippy; Sergey Koren
Journal:  Genome Res       Date:  2020-08-14       Impact factor: 9.043

6.  The diabetes susceptibility gene Clec16a regulates mitophagy.

Authors:  Scott A Soleimanpour; Aditi Gupta; Marina Bakay; Alana M Ferrari; David N Groff; João Fadista; Lynn A Spruce; Jake A Kushner; Leif Groop; Steven H Seeholzer; Brett A Kaufman; Hakon Hakonarson; Doris A Stoffers
Journal:  Cell       Date:  2014-06-19       Impact factor: 41.582

7.  Versatile genome assembly evaluation with QUAST-LG.

Authors:  Alla Mikheenko; Andrey Prjibelski; Vladislav Saveliev; Dmitry Antipov; Alexey Gurevich
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  DELLY: structural variant discovery by integrated paired-end and split-read analysis.

Authors:  Tobias Rausch; Thomas Zichner; Andreas Schlattl; Adrian M Stütz; Vladimir Benes; Jan O Korbel
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

Review 10.  The Role of Somatic L1 Retrotransposition in Human Cancers.

Authors:  Emma C Scott; Scott E Devine
Journal:  Viruses       Date:  2017-05-31       Impact factor: 5.048

View more
  85 in total

1.  Mining the gaps of chromosome 8.

Authors:  Glennis A Logsdon; Evan E Eichler
Journal:  Nature       Date:  2021-05-14       Impact factor: 49.962

2.  StainedGlass: Interactive visualization of massive tandem repeat structures with identity heatmaps.

Authors:  Mitchell R Vollger; Peter Kerpedjiev; Adam M Phillippy; Evan E Eichler
Journal:  Bioinformatics       Date:  2022-01-10       Impact factor: 6.937

3.  Characterizing mobile element insertions in 5675 genomes.

Authors:  Yiwei Niu; Xueyi Teng; Honghong Zhou; Yirong Shi; Yanyan Li; Yiheng Tang; Peng Zhang; Huaxia Luo; Quan Kang; Tao Xu; Shunmin He
Journal:  Nucleic Acids Res       Date:  2022-03-21       Impact factor: 16.971

4.  Targeted long-read sequencing identifies missing disease-causing variation.

Authors:  Danny E Miller; Arvis Sulovari; Tianyun Wang; Hailey Loucks; Kendra Hoekzema; Katherine M Munson; Alexandra P Lewis; Edith P Almanza Fuerte; Catherine R Paschal; Tom Walsh; Jenny Thies; James T Bennett; Ian Glass; Katrina M Dipple; Karynne Patterson; Emily S Bonkowski; Zoe Nelson; Audrey Squire; Megan Sikes; Erika Beckman; Robin L Bennett; Dawn Earl; Winston Lee; Rando Allikmets; Seth J Perlman; Penny Chow; Anne V Hing; Tara L Wenger; Margaret P Adam; Angela Sun; Christina Lam; Irene Chang; Xue Zou; Stephanie L Austin; Erin Huggins; Alexias Safi; Apoorva K Iyengar; Timothy E Reddy; William H Majoros; Andrew S Allen; Gregory E Crawford; Priya S Kishnani; Mary-Claire King; Tim Cherry; Jessica X Chong; Michael J Bamshad; Deborah A Nickerson; Heather C Mefford; Dan Doherty; Evan E Eichler
Journal:  Am J Hum Genet       Date:  2021-07-02       Impact factor: 11.025

5.  Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.

Authors:  Xuefang Zhao; Ryan L Collins; Wan-Ping Lee; Alexandra M Weber; Yukyung Jun; Qihui Zhu; Ben Weisburd; Yongqing Huang; Peter A Audano; Harold Wang; Mark Walker; Chelsea Lowther; Jack Fu; Mark B Gerstein; Scott E Devine; Tobias Marschall; Jan O Korbel; Evan E Eichler; Mark J P Chaisson; Charles Lee; Ryan E Mills; Harrison Brand; Michael E Talkowski
Journal:  Am J Hum Genet       Date:  2021-03-30       Impact factor: 11.025

Review 6.  Genomic Tackling of Human Satellite DNA: Breaking Barriers through Time.

Authors:  Mariana Lopes; Sandra Louzada; Margarida Gama-Carvalho; Raquel Chaves
Journal:  Int J Mol Sci       Date:  2021-04-29       Impact factor: 5.923

Review 7.  Genetic mosaicism in the human brain: from lineage tracing to neuropsychiatric disorders.

Authors:  Sara Bizzotto; Christopher A Walsh
Journal:  Nat Rev Neurosci       Date:  2022-03-23       Impact factor: 34.870

8.  Next-generation cytogenetics: Comprehensive assessment of 52 hematological malignancy genomes by optical genome mapping.

Authors:  Kornelia Neveling; Tuomo Mantere; Susan Vermeulen; Michiel Oorsprong; Ronald van Beek; Ellen Kater-Baats; Marc Pauper; Guillaume van der Zande; Dominique Smeets; Daniel Olde Weghuis; Marian J P L Stevens-Kroef; Alexander Hoischen
Journal:  Am J Hum Genet       Date:  2021-07-07       Impact factor: 11.025

Review 9.  Towards population-scale long-read sequencing.

Authors:  Wouter De Coster; Matthias H Weissensteiner; Fritz J Sedlazeck
Journal:  Nat Rev Genet       Date:  2021-05-28       Impact factor: 53.242

10.  Cas9 targeted enrichment of mobile elements using nanopore sequencing.

Authors:  Torrin L McDonald; Weichen Zhou; Christopher P Castro; Camille Mumm; Jessica A Switzenberg; Ryan E Mills; Alan P Boyle
Journal:  Nat Commun       Date:  2021-06-11       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.