| Literature DB >> 10617197 |
X Lin1, S Kaul, S Rounsley, T P Shea, M I Benito, C D Town, C Y Fujii, T Mason, C L Bowman, M Barnstead, T V Feldblyum, C R Buell, K A Ketchum, J Lee, C M Ronning, H L Koo, K S Moffat, L A Cronin, M Shen, G Pai, S Van Aken, L Umayam, L J Tallon, J E Gill, M D Adams, A J Carrera, T H Creasy, H M Goodman, C R Somerville, G P Copenhaver, D Preuss, W C Nierman, O White, J A Eisen, S L Salzberg, C M Fraser, J C Venter.
Abstract
Arabidopsis thaliana (Arabidopsis) is unique among plant model organisms in having a small genome (130-140 Mb), excellent physical and genetic maps, and little repetitive DNA. Here we report the sequence of chromosome 2 from the Columbia ecotype in two gap-free assemblies (contigs) of 3.6 and 16 megabases (Mb). The latter represents the longest published stretch of uninterrupted DNA sequence assembled from any organism to date. Chromosome 2 represents 15% of the genome and encodes 4,037 genes, 49% of which have no predicted function. Roughly 250 tandem gene duplications were found in addition to large-scale duplications of about 0.5 and 4.5 Mb between chromosomes 2 and 1 and between chromosomes 2 and 4, respectively. Sequencing of nearly 2 Mb within the genetically defined centromere revealed a low density of recognizable genes, and a high density and diverse range of vestigial and presumably inactive mobile elements. More unexpected is what appears to be a recent insertion of a continuous stretch of 75% of the mitochondrial genome into chromosome 2.Entities:
Mesh:
Substances:
Year: 1999 PMID: 10617197 DOI: 10.1038/45471
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962