Computational comparison of human genomic sequence assemblies for a region of chromosome 4.

Colin A M Semple, Stewart W Morris, David J Porteous, Kathryn L Evans.

Abstract

Much of the available human genomic sequence data exist in a fragmentary draft state following the completion of the initial high-volume sequencing performed by the International Human Genome Sequencing Consortium (IHGSC) and Celera Genomics (CG). We compared six draft genome assemblies over a region of chromosome 4p (D4S394-D4S403): two consecutive releases by the IHGSC at University of California, Santa Cruz (UCSC), two consecutive releases from the National Center for Biotechnology Information (NCBI), the public release from CG, and a hybrid assembly we have produced using IHGSC and CG sequence data. This region presents particular problems for genomic sequence assembly algorithms as it contains a large tandem repeat and is sparsely covered by draft sequences. The six assemblies differed both in terms of their relative coverage of sequence data from the region and in their estimated rates of misassembly. The CG assembly method attained the lowest level of misassembly, whereas NCBI and UCSC assemblies had the highest levels of coverage. All assemblies examined included <60% of the publicly available sequence from the region. At least 6% of the sequence data within the CG assembly for the D4S394-D4S403 region were not present in publicly available sequence data. We also show that even in a problematic region, existing software tools can be used with high-quality mapping data to produce genomic sequence contigs with a low rate of rearrangements.

Year:  2002        PMID: 11875030      PMCID: PMC155292          DOI: 10.1101/gr.207902

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043