| Literature DB >> 21075795 |
Fiona M McCarthy1, Cathy R Gresham, Teresia J Buza, Philippe Chouvarine, Lakshmi R Pillai, Ranjit Kumar, Seval Ozkan, Hui Wang, Prashanti Manda, Tony Arick, Susan M Bridges, Shane C Burgess.
Abstract
AgBase (http://www.agbase.msstate.edu/) provides resources to facilitate modeling of functional genomics data and structural and functional annotation of agriculturally important animal, plant, microbe and parasite genomes. The website is redesigned to improve accessibility and ease of use, including improved search capabilities. Expanded capabilities include new dedicated pages for horse, cat, dog, cotton, rice and soybean. We currently provide 590 240 Gene Ontology (GO) annotations to 105 454 gene products in 64 different species, including GO annotations linked to transcripts represented on agricultural microarrays. For many of these arrays, this provides the only functional annotation available. GO annotations are available for download and we provide comprehensive, species-specific GO annotation files for 18 different organisms. The tools available at AgBase have been expanded and several existing tools improved based upon user feedback. One of seven new tools available at AgBase, GOModeler, supports hypothesis testing from functional genomics data. We host several associated databases and provide genome browsers for three agricultural pathogens. Moreover, we provide comprehensive training resources (including worked examples and tutorials) via links to Educational Resources at the AgBase website.Entities:
Mesh:
Year: 2010 PMID: 21075795 PMCID: PMC3013706 DOI: 10.1093/nar/gkq1115
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Publication of agricultural microarray data. On average, less than half of the agricultural microarray data sets submitted to the NCBI GEO database are published. These statistics are based upon gene expression data submitted to the NCBI GEO database. Records are shown for as at 9 September 2010. For each species the number of expression data sets linked to a PubMed record was used to determine the proportion of unpublished data sets. Note that this approach is likely to under-estimate the amount of unpublished data sets as it does not include data not submitted to this database.
Figure 2.GO annotation strategies for AgBase and the EBI’s Gene Ontology Annotation (EBI GOA) group. Both AgBase and the EBI GOA Project provide GO annotations for chicken and cow. This annotation effort is complementary. (A) The percentage of biocuration for each group is shown for literature biocuration projects for chicken and cow. (B) Complementary biocuration of chicken and cow gene products. The EBI GOA Project provide computational GO annotation for UniProtKB records while the AgBase computational annotation effort focuses on Genbank gene products not represented in the UniProtKB database (typically these are gene products represented on commonly used arrays).
Figure 3.AgBase tools for functional analysis using the GO. This figure shows the individual AgBase tools for functional analysis of the GO and how they can be used sequentially as a pipeline to provide GO annotations and functional modeling for data sets. Square boxes represent AgBase tools; octagons represent manual steps or checks in the process.
Annotation updates for agricultural arrays
| Platform ID | Array name | Submitted | Last update |
|---|---|---|---|
| Chicken | |||
| GPL3213 | Affymetrix Chicken Genome Array | November 2005 | June 2009 |
| GPL5480 | ARK-Genomics G. gallus 20K v1.0 | July 2007 | July 2007 |
| GPL1731 | DEL-MAR 14K Integrated Systems | December 2004 | March 2006 |
| Bovine | |||
| GPL2853 | UIUC Bos taurus 13.2K 70-mer oligoarray | September 2005 | March 2007 |
| GPL2864 | UIUC Cattle 7,872-element cDNA - alternate version | September 2005 | March 2007 |
| GPL2112 | Affymetrix Bovine Genome Array | May 2005 | June 2009 |
| Pig | |||
| GPL7435 | Swine Protein-Annotated Oligonucleotide Microarray | October 2008 | November 2008 |
| GPL3608 | DIAS_PIG_55K3_v1 | March 2006 | May 2009 |
| GPL1881 | Qiagen-NRSP-8 porcine oligo array | February 2005 | May 2005 |
| Horse | |||
| GPL10248 | Agilent 4x44k Horse Gene Expression microarrays | March 2010 | March 2010 |
| GPL8582 | MacLeod custom equine cartilage 10K cDNA microarray version 3 | May 2009 | October 2009 |
| Maize | |||
| GPL4032 | Affymetrix Maize Genome Array | July 2006 | June 2009 |
| GPL3538 | SAM3.0 | March 2006 | November 2006 |
| GPL3333 | SAM1.1a | January 2006 | March 2006 |
| GPL1996 | Maize cDNA Generation II Version B | April 2005 | May 2005 |
| Rice | |||
| GPL1829 | Rice Genome Oligo Set V1.0 | January 2005 | October 2008 |
| GPL892 | Agilent-012106 Rice Oligo Microarray G4138A | January 2004 | September 2008 |
| GPL8161 | NSF Rice Oligonucleotide Array 45K One Chip Version | February 2009 | February 2009 |
| Soybean | |||
| GPL3015 | Keck Glycine max 18kA cDNA Prints101-108 | October 2005 | October 2005 |
| GPL1012 | Gm-r1088 | February 2004 | May 2005 |
| GPL229 | Gm-r1070 | December 2002 | October 2005 |
| Tomato | |||
| GPL9923 | CombiMatrix 90K TomatArray 1.0 | January 2010 | August 2010 |
| GPL4741 | Affymetrix Tomato Genome Array | January 2007 | June 2009 |
| GPL3034 | Cornell-CGEP Tomato 13K vTOM1 | October 2005 | November 2005 |
Arrays for agricultural species with the greatest numbers of data sets submitted to the NCBI GEO database (as at 9 September 2010) are shown, along with information about when the array platform data was submitted and its last update. Updates typically include ID mapping; updated functional information for transcripts represented on arrays is not always included and is harder to assess collectively.
Figure 4.Overview of functional modeling strategies. This figure shows how the AgBase tools can be used as part of a larger functional modeling strategy that incorporates other, existing functional analysis tools. Square boxes represent AgBase tools; arrow shaped boxes represent overall modeling approaches and octagonal boxes contain representative example of non-AgBase tools that are commonly used for functional modeling.