| Literature DB >> 22080559 |
Martha B Arnaud1, Gustavo C Cerqueira, Diane O Inglis, Marek S Skrzypek, Jonathan Binkley, Marcus C Chibucos, Jonathan Crabtree, Clinton Howarth, Joshua Orvis, Prachi Shah, Farrell Wymore, Gail Binkley, Stuart R Miyasato, Matt Simison, Gavin Sherlock, Jennifer R Wortman.
Abstract
The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available, web-based resource for researchers studying fungi of the genus Aspergillus, which includes organisms of clinical, agricultural and industrial importance. AspGD curators have now completed comprehensive review of the entire published literature about Aspergillus nidulans and Aspergillus fumigatus, and this annotation is provided with streamlined, ortholog-based navigation of the multispecies information. AspGD facilitates comparative genomics by providing a full-featured genomics viewer, as well as matched and standardized sets of genomic information for the sequenced aspergilli. AspGD also provides resources to foster interaction and dissemination of community information and resources. We welcome and encourage feedback at aspergillus-curator@lists.stanford.edu.Entities:
Mesh:
Year: 2011 PMID: 22080559 PMCID: PMC3245136 DOI: 10.1093/nar/gkr875
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
AspGD curation statistics
| Number of ORFs | 11 286 | 9887 | 14 071 |
| Number of tRNAs | 222 | 178 | 264 |
| Manual GO annotations | 20 233 | 16 994 | N/A |
| Features with manual GO annotations | 8781 | 7786 | N/A |
| Orthology-based GO annotations | 8295 | 9220 | 9637 |
| Features with orthology-based GO annotations | 1973 | 2229 | 2342 |
| Protein-domain (InterPro)-based GO annotations | 14 505 | 13 068 | 16 614 |
| Features with protein-domain (InterPro)-based GO annotations | 5748 | 5226 | 6348 |
| Features with orthology-based description lines | 5016 | 5430 | N/A |
AspGD curation statistics were determined as of 1 September 2011. The number of Open Reading Frames (ORFs) includes characterized (verified) and uncharacterized genes that are predicted to encode protein products. The tRNA complement was predicted at AspGD using tRNAscan-SE (8). GO annotations have been assigned based on manual curation of the scientific literature, predicted based on orthology to characterized gene products from S. cerevisiae or other aspergilli, and predicted based on characterized InterPro protein domains (7). Manual curation of the scientific literature about A. nidulans and A. fumigatus is complete and up-to-date; curation of A. niger is commencing in September 2011. In species for which manual curation is complete, predicted gene products that are uncharacterized, but which have orthologs, are given descriptions containing orthology-based information.
Figure 1.Locus Summary page for A. fumigatus fre2. (A) Navigation tabs lead to expanded reports on Locus History, relevant literature, GO and phenotype annotations and protein details. (B) Basic Information section provides gene names and aliases, a headline-like description of the gene product, links to orthologous genes in other Aspergillus species in AspGD and links to orthologs in S. cerevisiae at SGD. (C) GO section lists functional annotations derived from manual curation of available data and from orthology-based and protein domain-based computational analysis. (D) Mutant Phenotype section outlines literature-derived experimental data for mutant phenotypes. (E) Sequence Information section gives an overview of the gene model and provides access to sequences (parts of this section have been removed from the figure to conserve space). (F) Additional Information section includes other notes and the list of references cited on the Locus Summary page. The comprehensive citation list for the gene is found on the gene’s Literature Guide, accessible from the Literature tab near the top of the Locus Summary page.
Figure 2.Multispecies search tools. (A) Quick Search Results page is divided into sections for general results, such as GO terms or authors and for species-specific results, such as gene names, names of orthologous genes, as well as words found in descriptions, notes and phenotypes. (B) Gene and Sequence Resources tool retrieves DNA sequences for genes or chromosomal regions; the arrow points to the new pull-down menu for species selection. (C) Advanced Search tool allows retrieval of genes based on their properties; arrows indicate options for selection of a species, types of features, other properties and GO terms (lower part of the page is removed to conserve space).
Figure 3.Search and visualization tools in the Sybil comparative genomics module. (A) Synteny gradient indicating the location of homologous blocks relative to a reference sequence, in this example A. nidulans FGSC A4. Vertical bars with the same shade of color represent orthologous genes across genomes. (B) Ortholog/Paralog cluster search tool, that allows researchers to retrieve homolog clusters meeting user-defined criteria regarding strain or species distribution. (C) Genomic region comparison depicting the structural annotation of syntenic and non-syntenic genes in a homologous genomic region. The trapezoids connect orthologous genes. (D) Whole-genome display showing the gene density and GC content along the sequence of A. nidulans chromosome IV and A. fumigatus chromosome 8. Colored vertical bars indicate gene locations.
Figure 4.Examples of modifications to A. niger gene models based on analysis with PASA. The changes include (A) extension of coding sequence (CDS), (B) update of internal exon structure and (C) gene merging. Rectangles represent gene exons and are colored according to the legend. Displayed with each structure diagram is the name of the gene modified by the PASA analysis or the GenBank accession number of each EST and full-length cDNA sequence supporting the modifications.