| Literature DB >> 18477386 |
Charlotte G Cole1, Owen T McCann, John E Collins, Karen Oliver, David Willey, Susan M Gribble, Fengtang Yang, Karen McLaren, Jane Rogers, Zemin Ning, David M Beare, Ian Dunham.
Abstract
BACKGROUND: Although the human genome sequence was declared complete in 2004, the sequence was interrupted by 341 gaps of which 308 lay in an estimated approximately 28 Mb of euchromatin. While these gaps constitute only approximately 1% of the sequence, knowledge of the full complement of human genes and regulatory elements is incomplete without their sequences.Entities:
Mesh:
Year: 2008 PMID: 18477386 PMCID: PMC2441464 DOI: 10.1186/gb-2008-9-5-r78
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schematic view of sequence contigs (blue boxes) covering chromosome 22q in [8] (1999, left) and in this report (2008, right). Contigs are drawn approximately to scale and shown in relation to a simple representation of the chromosome 22 ideogram. Gaps are indicated by the arrows, and are labeled according to the terminology used in this report.
Figure 2Sequence analysis of deleted BAC CTA-437G10 (AL022330) as originally finished in [8]. A sequence identity dot plot analysis of the revised sequence of this region (x-axis) against the originally submitted sequence (y-axis) generated using dotter [48]. The revised sequence (x-axis) encompasses AL133397.1, AL390209.1, AL133398.2, AL121885.22, AL390210.1 assembled as detailed at [49], and numbered from the start of AL133397.1. Numbering is in base-pairs. Arrows indicate the positions of tandem repeated sequences at the boundary of the deletion in this BAC, with a core region of (AAAG)46 at the centromeric (left) side and a core of (AAAG)68 at the telomeric (right) side.
Sources of additional sequence for gaps 1-7 in human chromosome 22q13
| Source of sequence | Number | Sequence contributed (kb) |
| Flow sorted chromosome 22 fosmids | 14 | 307 |
| Whole genome fosmids | 6 | 151 |
| BAC/PAC | 3 | 137 |
| PCR products | 28 | 129 |
Note that due to the arbitrary nature of the positions of clipping of submitted sequences, and their contribution to the final golden path, the new sequence contribution numbers are, of necessity, approximated.
Figure 3DNA fiber FISH using pools of long PCR products at gap 2 prior to final closure by long PCR. (a) Schematic representation of the three long PCR products from either side of the gap that were labeled and hybridized to DNA fibers. See Figure S5 in Additional data file 7 for identities. Note that non-repetitive sequence represented 71% of 19,858 kb on the left-hand side of the gap and only 50% of 6,239 kb on the right-hand side, as determined by Repeatmasker. (b) Three example DNA fibers showing detection by FISH of hybridized centromeric and telomeric PCR product pools in red (Texas Red) and green (FITC), respectively. Estimation from the DNA fiber FISH images suggested that the remaining gap was 2-3 kb in size and this was closed by the PCR product c658c926rcL (sequence CU210860).