David Swarbreck, Christopher Wilks, Philippe Lamesch, Tanya Z Berardini, Margarita Garcia-Hernandez, Hartmut Foerster, Donghui Li, Tom Meyer, Robert Muller, Larry Ploetz, Amie Radenbaugh, Shanker Singh, Vanessa Swing, Christophe Tissier, Peifen Zhang, Eva Huala.
Abstract
The Arabidopsis Information Resource (TAIR, http://arabidopsis.org) is the model organism database for the fully sequenced and intensively studied model plant Arabidopsis thaliana. Data in TAIR is derived in large part from manual curation of the Arabidopsis research literature and direct submissions from the research community. New developments at TAIR include the addition of the GBrowse genome viewer to the TAIR site; a redesigned home page, navigation structure and portal pages that make the site more intuitive and easier to use; the launch of several TAIR web services; and a new genome annotation release (TAIR7) in April 2007. A combination of manual and computational methods was used to generate this release, which contains 27,029 protein-coding genes, 3889 pseudogenes or transposable elements and 1123 ncRNAs (32,041 genes in all, 37,019 gene models). A total of 681 new genes and 1002 new splice variants were added. Overall, 10,098 loci (one-third of all loci from the previous TAIR6 release) were updated for the TAIR7 release.
Year: 2007 PMID: 17986450 PMCID: PMC2238962 DOI: 10.1093/nar/gkm965
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. TAIR GBrowse. The TAIR GBrowse tool allows navigation of the five A. thaliana nuclear chromosomes plus the mitochondrial and chloroplast genomes. A 10 kb region including AT4G39680 and AT4G39690 is shown (coding regions in dark blue and UTRs in light blue). Selected tracks shown here include cDNAs (dark green), ESTs (light green for forward orientation and light brown for reverse), T-DNA and transposon insertions (orange triangles), polymorphisms (yellow diamonds) and the VISTA plot showing sequence similarity with poplar. Additional tracks (data not shown) include CDS segments, markers and GC content. All elements shown in GBrowse can be clicked on to access the TAIR detail page for that object.
Evolution of the A. thaliana genome annotation from the initial A. thaliana sequencing project to the latest TAIR release
| | Nature | TIGR1 | TIGR2 | TIGR3 | TIGR4 | TIGR5 | TAIR6 | TAIR7 |
|---|---|---|---|---|---|---|---|---|
| Release date | 12/14/00 | 1/17/01 | 9/11/01 | 8/2/02 | 4/18/03 | 1/29/04 | 11/11/05 | 4/24/07 |
| Genome size (Mb) | 115.410 | 116.238 | 117.227 | 117.077 | 119.055 | 118.998 | 119.186 | 119.186 |
| Protein-coding genes | 25 498 | 25 554 | 26 156 | 27 117 | 27 170 | 26 207 | 26 541 | 26 819 |
| Transposons and pseudogenes | n/a | 1274 | 1305 | 1967 | 2218 | 3786 | 3818 | 3889 |
| Genes annotated with alternative splice-variants | n/a | 0 | 28 | 162 | 1267 | 2330 | 3159 | 3866 |
| Gene density (kb per gene) | 4.5 | 4.55 | 4.48 | 4.32 | 4.38 | 4.54 | 4.48 | 4.44 |
| Exons/gene model | 5.2 | 5.23 | 5.25 | 5.24 | 5.31 | 5.42 | 5.64 | 5.79 |
| Average exon length (bp) | 250 | 256 | 265 | 266 | 279 | 276 | 269 | 268 |
| Average intron length (bp) | 168 | 168 | 167 | 166 | 166 | 164 | 164 | 165 |
TIGR values from Haas et al. (5). Numbers of protein-coding genes do not include those present on mitochondrial and chloroplast genomes.
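
The gene density row can be cross-checked against the genome size and protein-coding gene rows. The sketch below assumes that density is simply genome size (in kb) divided by the number of protein-coding genes; the source does not state the formula, so this definition, along with the names `RELEASES`, `gene_density_kb` and `density_table`, is illustrative only.

```python
# Rows transcribed from the table above: genome size (Mb), protein-coding
# gene count, and the reported gene density (kb per gene).
RELEASES = {
    "Nature": (115.410, 25498, 4.5),
    "TIGR1": (116.238, 25554, 4.55),
    "TIGR2": (117.227, 26156, 4.48),
    "TIGR3": (117.077, 27117, 4.32),
    "TIGR4": (119.055, 27170, 4.38),
    "TIGR5": (118.998, 26207, 4.54),
    "TAIR6": (119.186, 26541, 4.48),
    "TAIR7": (119.186, 26819, 4.44),
}


def gene_density_kb(genome_size_mb: float, gene_count: int) -> float:
    """Assumed definition: genome size in kb divided by protein-coding gene count."""
    return genome_size_mb * 1000.0 / gene_count


def density_table(releases=RELEASES):
    """Map each release name to (computed density, reported density)."""
    return {
        name: (round(gene_density_kb(size_mb, genes), 2), reported)
        for name, (size_mb, genes, reported) in releases.items()
    }


if __name__ == "__main__":
    for name, (computed, reported) in density_table().items():
        print(f"{name}: computed {computed:.2f} kb/gene, reported {reported}")
```

Under this assumption the computed values agree with the reported row to within 0.05 kb per gene. The small residual differences may reflect rounding or a slightly different gene set in the original calculation.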