| Literature DB >> 25414324 |
Vivek Krishnakumar1, Matthew R Hanlon2, Sergio Contrino3, Erik S Ferlanti4, Svetlana Karamycheva4, Maria Kim4, Benjamin D Rosen4, Chia-Yi Cheng4, Walter Moreira2, Stephen A Mock2, Joseph Stubbs2, Julie M Sullivan3, Konstantinos Krampis4, Jason R Miller4, Gos Micklem3, Matthew Vaughn2, Christopher D Town4.
Abstract
The Arabidopsis Information Portal (https://www.araport.org) is a new online resource for plant biology research. It houses the Arabidopsis thaliana genome sequence and associated annotation. It was conceived as a framework that allows the research community to develop and release 'modules' that integrate, analyze and visualize Arabidopsis data that may reside at remote sites. The current implementation provides an indexed database of core genomic information. These data are made available through feature-rich web applications that provide search, data mining, and genome browser functionality, and also by bulk download and web services. Araport uses software from the InterMine and JBrowse projects to expose curated data from TAIR, GO, BAR, EBI, UniProt, PubMed and EPIC CoGe. The site also hosts 'science apps,' developed as prototypes for community modules that use dynamic web pages to present data obtained on-demand from third-party servers via RESTful web services. Designed for sustainability, the Arabidopsis Information Portal strategy exploits existing scientific computing infrastructure, adopts a practical mixture of data integration technologies and encourages collaborative enhancement of the resource by its user community.Entities:
Mesh:
Year: 2014 PMID: 25414324 PMCID: PMC4383980 DOI: 10.1093/nar/gku1200
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of the datasets used to populate ThaleMine and their corresponding sources
| Dataset | Data source |
|---|---|
| Genome sequence and annotation | TAIR (version 10; 8/31/13) |
| Protein sequence and properties | UniprotKB |
| Protein interactions | Bio-Analytic Resource (BAR) |
| Affymetrix expression data | AtGenExpress via BAR |
| Electronic Fluorescent Pictographs | Bio-Analytic Resource |
| Publications | UniprotKB and NCBI Entrez |
| Orthologs | PANTHER and PhytoMine |
List of commonly executed searches provided as ‘Template searches’
| Input (accepts wildcard characters) | Output (reported as a table, exportable in different file formats) |
|---|---|
| Gene (identifier/alias) | Related protein identifiers |
| List of interactors | |
| Set of homologous genes | |
| Expression values | |
| Publications referencing this gene | |
| FASTA sequences of the CDS | |
| FASTA sequence of the proteins | |
| FASTA sequences of the 5′/3′ UTRs | |
| Protein (identifier/domain) | Related gene identifiers |
| Publications referencing this protein | |
| Ontology term | List of genes |
Figure 1.A Gene report page showing: (A) attributes like standard gene locus identifier, symbols/synonyms, TAIR curator summary, confidence rating, etc. Other useful information is segregated into the aspects, as follows: (B) ‘Genomics’ showing gene structure information; (C) ‘Function’ displaying the Gene Ontology annotation; (D) ‘Proteins’ with links to the relevant protein records populated with data from UniProt; (E) ‘Homology’ listing the computed orthologs and paralogs across a diverse set of species; (F) ‘Interactions’ showing a visual representation of the genes’ physical/genetic interactions (tabular format is also available); (G) ‘Expression’ reporting gene expression levels based on AtGenExpress project data, also visualized using an embedded eFP view; (H) ‘Publications’ reports papers associated with the current gene; (I) ‘Links’ to other InterMine databases where homologs of the current gene exist, and external links to several important Arabidopsis and plant genomics data providers.
Figure 2.(A) (i) List of tracks available via the hierarchical track selector; (ii) a link to access the faceted track selector; (iii) track display panel showing the currently selected tracks for viewing (green inset: track chosen from the faceted selector). (B) Faceted track selector offers a search and filtering interface to enable choosing tracks based on the experiment metadata. (C) File upload capability ensures privacy to view ones own datasets alongside the Arabidopsis genome, and flexibility to work with large datasets such as short read alignments.