Literature DB >> 11381028

Segmental duplications: organization and impact within the current human genome project assembly.

J A Bailey1, A M Yavor, H F Massa, B J Trask, E E Eichler.   

Abstract

Segmental duplications play fundamental roles in both genomic disease and gene evolution. To understand their organization within the human genome, we have developed the computational tools and methods necessary to detect identity between long stretches of genomic sequence despite the presence of high copy repeats and large insertion-deletions. Here we present our analysis of the most recent genome assembly (January 2001) in which we focus on the global organization of these segments and the role they play in the whole-genome assembly process. Initially, we considered only large recent duplication events that fell well-below levels of draft sequencing error (alignments 90%-98% similar and > or =1 kb in length). Duplications (90%-98%; > or =1 kb) comprise 3.6% of all human sequence. These duplications show clustering and up to 10-fold enrichment within pericentromeric and subtelomeric regions. In terms of assembly, duplicated sequences were found to be over-represented in unordered and unassigned contigs indicating that duplicated sequences are difficult to assign to their proper position. To assess coverage of these regions within the genome, we selected BACs containing interchromosomal duplications and characterized their duplication pattern by FISH. Only 47% (106/224) of chromosomes positive by FISH had a corresponding chromosomal position by comparison. We present data that indicate that this is attributable to misassembly, misassignment, and/or decreased sequencing coverage within duplicated regions. Surprisingly, if we consider putative duplications >98% identity, we identify 10.6% (286 Mb) of the current assembly as paralogous. The majority of these alignments, we believe, represent unmerged overlaps within unique regions. Taken together the above data indicate that segmental duplications represent a significant impediment to accurate human genome assembly, requiring the development of specialized techniques to finish these exceptional regions of the genome. The identification and characterization of these highly duplicated regions represents an important step in the complete sequencing of a human reference genome.

Entities:  

Mesh:

Year:  2001        PMID: 11381028      PMCID: PMC311093          DOI: 10.1101/gr.gr-1871r

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  40 in total

1.  The DNA sequence of human chromosome 21.

Authors:  M Hattori; A Fujiyama; T D Taylor; H Watanabe; T Yada; H S Park; A Toyoda; K Ishii; Y Totoki; D K Choi; Y Groner; E Soeda; M Ohki; T Takagi; Y Sakaki; S Taudien; K Blechschmidt; A Polley; U Menzel; J Delabar; K Kumpf; R Lehmann; D Patterson; K Reichwald; A Rump; M Schillhabel; A Schudy; W Zimmermann; A Rosenthal; J Kudoh; K Schibuya; K Kawasaki; S Asakawa; A Shintani; T Sasaki; K Nagamine; S Mitsuyama; S E Antonarakis; S Minoshima; N Shimizu; G Nordsiek; K Hornischer; P Brant; M Scharfe; O Schon; A Desario; J Reichelt; G Kauer; H Blocker; J Ramser; A Beck; S Klages; S Hennig; L Riesselmann; E Dagand; T Haaf; S Wehrmeyer; K Borzym; K Gardiner; D Nizetic; F Francis; H Lehrach; R Reinhardt; M L Yaspo
Journal:  Nature       Date:  2000-05-18       Impact factor: 49.962

Review 2.  Structure of chromosomal duplicons and their role in mediating human genomic disorders.

Authors:  Y Ji; E E Eichler; S Schwartz; R D Nicholls
Journal:  Genome Res       Date:  2000-05       Impact factor: 9.043

3.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

4.  The physical maps for sequencing human chromosomes 1, 6, 9, 10, 13, 20 and X.

Authors:  D R Bentley; P Deloukas; A Dunham; L French; S G Gregory; S J Humphray; A J Mungall; M T Ross; N P Carter; I Dunham; C E Scott; K J Ashcroft; A L Atkinson; K Aubin; D M Beare; G Bethel; N Brady; J C Brook; D C Burford; W D Burrill; C Burrows; A P Butler; C Carder; J J Catanese; C M Clee; S M Clegg; V Cobley; A J Coffey; C G Cole; J E Collins; J S Conquer; R A Cooper; K M Culley; E Dawson; F L Dearden; R M Durbin; P J de Jong; P D Dhami; M E Earthrowl; C A Edwards; R S Evans; C J Gillson; J Ghori; L Green; R Gwilliam; K S Halls; S Hammond; G L Harper; R W Heathcott; J L Holden; E Holloway; B L Hopkins; P J Howard; G R Howell; E J Huckle; J Hughes; P J Hunt; S E Hunt; M Izmajlowicz; C A Jones; S S Joseph; G Laird; C F Langford; M H Lehvaslaiho; M A Leversha; O T McCann; L M McDonald; J McDowall; G L Maslen; D Mistry; N K Moschonas; V Neocleous; D M Pearson; K J Phillips; K M Porter; S R Prathalingam; Y H Ramsey; S A Ranby; C M Rice; J Rogers; L J Rogers; T Sarafidou; D J Scott; G J Sharp; C J Shaw-Smith; L J Smink; C Soderlund; E C Sotheran; H E Steingruber; J E Sulston; A Taylor; R G Taylor; A A Thorpe; E Tinsley; G L Warry; A Whittaker; P Whittaker; S H Williams; T E Wilmer; R Wooster; C L Wright
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

Review 5.  Centromeres of mammalian chromosomes.

Authors:  H F Willard
Journal:  Trends Genet       Date:  1990-12       Impact factor: 11.639

6.  A physical map of the human genome.

Authors:  J D McPherson; M Marra; L Hillier; R H Waterston; A Chinwalla; J Wallis; M Sekhon; K Wylie; E R Mardis; R K Wilson; R Fulton; T A Kucaba; C Wagner-McPherson; W B Barbazuk; S G Gregory; S J Humphray; L French; R S Evans; G Bethel; A Whittaker; J L Holden; O T McCann; A Dunham; C Soderlund; C E Scott; D R Bentley; G Schuler; H C Chen; W Jang; E D Green; J R Idol; V V Maduro; K T Montgomery; E Lee; A Miller; S Emerling; R Gibbs; S Scherer; J H Gorrell; E Sodergren; K Clerc-Blankenburg; P Tabor; S Naylor; D Garcia; P J de Jong; J J Catanese; N Nowak; K Osoegawa; S Qin; L Rowen; A Madan; M Dors; L Hood; B Trask; C Friedman; H Massa; V G Cheung; I R Kirsch; T Reid; R Yonescu; J Weissenbach; T Bruls; R Heilig; E Branscomb; A Olsen; N Doggett; J F Cheng; T Hawkins; R M Myers; J Shang; L Ramirez; J Schmutz; O Velasquez; K Dixon; N E Stone; D R Cox; D Haussler; W J Kent; T Furey; S Rogic; S Kennedy; S Jones; A Rosenthal; G Wen; M Schilhabel; G Gloeckner; G Nyakatura; R Siebert; B Schlegelberger; J Korenberg; X N Chen; A Fujiyama; M Hattori; A Toyoda; T Yada; H S Park; Y Sakaki; N Shimizu; S Asakawa; K Kawasaki; T Sasaki; A Shintani; A Shimizu; K Shibuya; J Kudoh; S Minoshima; J Ramser; P Seranski; C Hoff; A Poustka; R Reinhardt; H Lehrach
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

7.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

8.  Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome.

Authors:  S Brenner; G Elgar; R Sandford; A Macrae; B Venkatesh; S Aparicio
Journal:  Nature       Date:  1993-11-18       Impact factor: 49.962

9.  A novel human DNA polymorphism resulting from transfer of DNA from chromosome 6 to chromosome 16.

Authors:  Z Wong; N J Royle; A J Jeffreys
Journal:  Genomics       Date:  1990-06       Impact factor: 5.736

10.  Integration of cytogenetic landmarks into the draft sequence of the human genome.

Authors:  V G Cheung; N Nowak; W Jang; I R Kirsch; S Zhao; X N Chen; T S Furey; U J Kim; W L Kuo; M Olivier; J Conroy; A Kasprzyk; H Massa; R Yonescu; S Sait; C Thoreen; A Snijders; E Lemyre; J A Bailey; A Bruzel; W D Burrill; S M Clegg; S Collins; P Dhami; C Friedman; C S Han; S Herrick; J Lee; A H Ligon; S Lowry; M Morley; S Narasimhan; K Osoegawa; Z Peng; I Plajzer-Frick; B J Quade; D Scott; K Sirotkin; A A Thorpe; J W Gray; J Hudson; D Pinkel; T Ried; L Rowen; G L Shen-Ong; R L Strausberg; E Birney; D F Callen; J F Cheng; D R Cox; N A Doggett; N P Carter; E E Eichler; D Haussler; J R Korenberg; C C Morton; D Albertson; G Schuler; P J de Jong; B J Trask
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

View more
  307 in total

1.  The evolutionary origin of human subtelomeric homologies--or where the ends begin.

Authors:  Christa Lese Martin; Andrew Wong; Alyssa Gross; June Chung; Judy A Fantes; David H Ledbetter
Journal:  Am J Hum Genet       Date:  2002-03-01       Impact factor: 11.025

2.  Systematic evaluation of map quality: human chromosome 22.

Authors:  Tara C Matise; Christopher J Porter; Steven Buyske; A Jamie Cuttichia; Erik P Sulman; Peter S White
Journal:  Am J Hum Genet       Date:  2002-04-19       Impact factor: 11.025

3.  Segmental duplications in euchromatic regions of human chromosome 5: a source of evolutionary instability and transcriptional innovation.

Authors:  Anouk Courseaux; Florence Richard; Josiane Grosgeorge; Christine Ortola; Agnes Viale; Claude Turc-Carel; Bernard Dutrillaux; Patrick Gaudray; Jean-Louis Nahon
Journal:  Genome Res       Date:  2003-03       Impact factor: 9.043

4.  Comparison of whole genome assemblies of the human genome.

Authors:  Eric C Rouchka; Warren Gish; David J States
Journal:  Nucleic Acids Res       Date:  2002-11-15       Impact factor: 16.971

5.  Complex evolution of 7E olfactory receptor genes in segmental duplications.

Authors:  Tera Newman; Barbara J Trask
Journal:  Genome Res       Date:  2003-05       Impact factor: 9.043

6.  Duplication, coclustering, and selection of human Alu retrotransposons.

Authors:  Jerzy Jurka; Oleksiy Kohany; Adam Pavlicek; Vladimir V Kapitonov; Michael V Jurka
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-21       Impact factor: 11.205

7.  Mapping and initial analysis of human subtelomeric sequence assemblies.

Authors:  Harold Riethman; Anthony Ambrosini; Carlos Castaneda; Jeffrey Finklestein; Xue-Lan Hu; Uma Mudunuri; Sheila Paul; Jun Wei
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

8.  Pericentromeric duplications in the laboratory mouse.

Authors:  James W Thomas; Mary G Schueler; Tyrone J Summers; Robert W Blakesley; Jennifer C McDowell; Pamela J Thomas; Jacquelyn R Idol; Valerie V B Maduro; Shih-Queen Lee-Lin; Jeffrey W Touchman; Gerard G Bouffard; Stephen M Beckstrom-Sternberg; Eric D Green
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

9.  Co-localization of centromere activity, proteins and topoisomerase II within a subdomain of the major human X alpha-satellite array.

Authors:  Jennifer M Spence; Ricky Critcher; Thomas A Ebersole; Manuel M Valdivia; William C Earnshaw; Tatsuo Fukagawa; Christine J Farr
Journal:  EMBO J       Date:  2002-10-01       Impact factor: 11.598

10.  Extensive normal copy number variation of a beta-defensin antimicrobial-gene cluster.

Authors:  E J Hollox; J A L Armour; J C K Barber
Journal:  Am J Hum Genet       Date:  2003-08-12       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.