| Literature DB >> 11305933 |
Abstract
The completion of the Arabidopsis thaliana (mustard weed) genome sequence constitutes a major breakthrough in plant biology. It will revolutionize how we answer questions about the biology and evolution of plants as well as how we confront and resolve world-wide agricultural problems.Entities:
Mesh:
Substances:
Year: 2001 PMID: 11305933 PMCID: PMC138921 DOI: 10.1186/gb-2001-2-4-comment2004
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1The Arabidopsis Genome Initiative (AGI), an international project for sequencing the genome of the flowering plant Arabidopsis thaliana. For each chromosome, black represents centromeres; grey telomeres; green rDNA repeats; and red contiguous DNA sequence. Sequencing groups are: TIGR, The Institute for Genomic Research (PIs: Craig Venter and Claire Fraser) [24]; Kazusa DNA research Institute (PI: Satoshi Tabata); SPPC, The SPP Consortium, Stanford, University of Pennsylvania, PGEC-USDA and University of California Berkeley (PIs: Ronald Davis, Joseph Ecker, and Athanasios Theologis) [25]; EUC-1 and EUC-2, European Union Consortia for Chromosomes 3, 4 and 5 (PIs: Michael Bevan and Francis Quetier); and the CSH/WU/ABI consortium of Cold Spring Harbor Laboratory, Washington University and Applied Biosystems, Inc. (PIs: Richard McCombie, Richard Wilson and Ellson Chen). PI, principle investigator.
Figure 2Sequencing strategies for (a) chromosomes and (b) individual BACs.
Some features of the Arabidopsis chromosomes and proteome
| Number | % | |
| (a) The five DNA molecules | ||
| Total length (Mb) | ~115 | |
| Number of genes | ~25,500 | |
| Average gene density (kb per gene) | ~4.5 | |
| Average gene length (kb) | ~2 | |
| Average peptide length (amino acid) | 430 | |
| Number of genes with ESTs | 15,300 | 60% |
| Number of ESTs | 105,733 | |
| (b) The proteome | ||
| Total proteins | ~25,500 | 100 |
| Proteins with similarity to GenBank entries | 17,850 | 70 |
| Unknown proteins | 5865 | 23 |
| Hypothetical proteins | 7140 | 28 |
| Proteins with putative functions | 12,495 | 49 |
| (c) Classification of proteins with putative functions | ||
| Cellular metabolism | 22.5 | |
| Transcription | 16.9 | |
| Defense | 11.5 | |
| Signaling | 10.4 | |
| Growth | 11.7 | |
| Protein fate | 9.9 | |
| Intracellular transport | 8.3 | |
| Transport | 4.8 | |
| Protein synthesis | 4.1 |
~denotes approximate numbers.
Gene content among sequenced eukaryotes
| Organism | Approximate number of genes | Reference |
| Yeast | 6,000 | [6,54] |
| Worm | 19,000 | [7] |
| Fruitfly | 14,000 | [8] |
| Mustard weed | 25,500 | [4] |
| Human | 25,000-40,000 | [9,10] |
Figure 3Frequency distribution of genes in multigene families with tandem gene arrangements. Figure adapted from [4].