Literature DB >> 25383537

Resolving the complexity of the human genome using single-molecule sequencing.

Mark J P Chaisson1, John Huddleston2, Megan Y Dennis1, Peter H Sudmant1, Maika Malig1, Fereydoun Hormozdiari1, Francesca Antonacci3, Urvashi Surti4, Richard Sandstrom1, Matthew Boitano5, Jane M Landolin5, John A Stamatoyannopoulos1, Michael W Hunkapiller5, Jonas Korlach5, Evan E Eichler2.   

Abstract

The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchromatic gaps remain and aspects of its structural variation remain poorly understood ten years after its completion. To identify missing sequence and genetic variation, here we sequence and analyse a haploid human genome (CHM1) using single-molecule, real-time DNA sequencing. We close or extend 55% of the remaining interstitial gaps in the human GRCh37 reference genome--78% of which carried long runs of degenerate short tandem repeats, often several kilobases in length, embedded within (G+C)-rich genomic regions. We resolve the complete sequence of 26,079 euchromatic structural variants at the base-pair level, including inversions, complex insertions and long tracts of tandem repeats. Most have not been previously reported, with the greatest increases in sensitivity occurring for events less than 5 kilobases in size. Compared to the human reference, we find a significant insertional bias (3:1) in regions corresponding to complex insertions and long short tandem repeats. Our results suggest a greater complexity of the human genome in the form of variation of longer and more complex repetitive DNA that can now be largely resolved with the application of this longer-read sequencing technology.

Entities:  

Mesh:

Year:  2014        PMID: 25383537      PMCID: PMC4317254          DOI: 10.1038/nature13907

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  25 in total

1.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

2.  A high-resolution recombination map of the human genome.

Authors:  Augustine Kong; Daniel F Gudbjartsson; Jesus Sainz; Gudrun M Jonsdottir; Sigurjon A Gudjonsson; Bjorgvin Richardsson; Sigrun Sigurdardottir; John Barnard; Bjorn Hallbeck; Gisli Masson; Adam Shlien; Stefan T Palsson; Michael L Frigge; Thorgeir E Thorgeirsson; Jeffrey R Gulcher; Kari Stefansson
Journal:  Nat Genet       Date:  2002-06-10       Impact factor: 38.330

3.  The International HapMap Project.

Authors: 
Journal:  Nature       Date:  2003-12-18       Impact factor: 49.962

Review 4.  An assessment of the sequence gaps: unfinished business in a finished human genome.

Authors:  Evan E Eichler; Royden A Clark; Xinwei She
Journal:  Nat Rev Genet       Date:  2004-05       Impact factor: 53.242

5.  Molecular cloning of a translocation breakpoint hotspot in 22q11.

Authors:  Hiroki Kurahashi; Hidehito Inagaki; Eriko Hosoba; Takema Kato; Tamae Ohye; Hiroshi Kogo; Beverly S Emanuel
Journal:  Genome Res       Date:  2007-01-31       Impact factor: 9.043

6.  CENSOR--a program for identification and elimination of repetitive elements from DNA sequences.

Authors:  J Jurka; P Klonowski; V Dagman; P Pelton
Journal:  Comput Chem       Date:  1996-03

7.  Miropeats: graphical DNA sequence comparisons.

Authors:  J D Parsons
Journal:  Comput Appl Biosci       Date:  1995-12

8.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

9.  Finishing the euchromatic sequence of the human genome.

Authors: 
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

10.  Single haplotype assembly of the human genome from a hydatidiform mole.

Authors:  Karyn Meltz Steinberg; Valerie A Schneider; Tina A Graves-Lindsay; Robert S Fulton; Richa Agarwala; John Huddleston; Sergey A Shiryev; Aleksandr Morgulis; Urvashi Surti; Wesley C Warren; Deanna M Church; Evan E Eichler; Richard K Wilson
Journal:  Genome Res       Date:  2014-11-04       Impact factor: 9.043

View more
  304 in total

1.  Whole genome?

Authors: 
Journal:  Nat Genet       Date:  2015-09       Impact factor: 38.330

Review 2.  Completing the human genome: the progress and challenge of satellite DNA assembly.

Authors:  Karen H Miga
Journal:  Chromosome Res       Date:  2015-09       Impact factor: 5.239

3.  Counting copy number and calories.

Authors:  Stefan White
Journal:  Nat Genet       Date:  2015-08       Impact factor: 38.330

4.  Variant discovery and breakpoint region prediction for studying the human 22q11.2 deletion using BAC clone and whole genome sequencing analysis.

Authors:  Xingyi Guo; Maria Delio; Nousin Haque; Raquel Castellanos; Matthew S Hestand; Joris R Vermeesch; Bernice E Morrow; Deyou Zheng
Journal:  Hum Mol Genet       Date:  2016-07-19       Impact factor: 6.150

5.  Discovery and quality analysis of a comprehensive set of structural variants and short tandem repeats.

Authors:  David Jakubosky; Erin N Smith; Matteo D'Antonio; Marc Jan Bonder; William W Young Greenwald; Agnieszka D'Antonio-Chronowska; Hiroko Matsui; Oliver Stegle; Stephen B Montgomery; Christopher DeBoever; Kelly A Frazer
Journal:  Nat Commun       Date:  2020-06-10       Impact factor: 14.919

6.  LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.

Authors:  Gui-Cai Xu; Tian-Jun Xu; Rui Zhu; Yan Zhang; Shang-Qi Li; Hong-Wei Wang; Jiong-Tang Li
Journal:  Gigascience       Date:  2019-01-01       Impact factor: 6.524

7.  Identification of large rearrangements in cancer genomes with barcode linked reads.

Authors:  Li C Xia; John M Bell; Christina Wood-Bouwens; Jiamin J Chen; Nancy R Zhang; Hanlee P Ji
Journal:  Nucleic Acids Res       Date:  2018-02-28       Impact factor: 16.971

8.  Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Authors:  Anish M S Shrestha; Martin C Frith; Kiyoshi Asai; Hugues Richard
Journal:  Nucleic Acids Res       Date:  2018-02-16       Impact factor: 16.971

9.  An Incomplete Understanding of Human Genetic Variation.

Authors:  John Huddleston; Evan E Eichler
Journal:  Genetics       Date:  2016-04       Impact factor: 4.562

10.  Evaluating droplet digital PCR for the quantification of human genomic DNA: converting copies per nanoliter to nanograms nuclear DNA per microliter.

Authors:  David L Duewer; Margaret C Kline; Erica L Romsos; Blaza Toman
Journal:  Anal Bioanal Chem       Date:  2018-03-19       Impact factor: 4.142

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.