| Literature DB >> 18801157 |
Chris Tomlinson1, Manjula Thimma, Stelios Alexandrakis, Tito Castillo, Jayne L Dennis, Anthony Brooks, Thomas Bradley, Carly Turnbull, Ekaterini Blaveri, Geraint Barton, Norie Chiba, Klio Maratou, Pat Soutter, Tim Aitman, Laurence Game.
Abstract
BACKGROUND: Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data.Entities:
Mesh:
Year: 2008 PMID: 18801157 PMCID: PMC2572073 DOI: 10.1186/1471-2105-9-379
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The three-tier hardware and software architecture of MiMiR comprises a storage layer, an application services layer and an access/user interface layer (delimited by dashed lines). The MiMiR backend database is physically protected by two firewalls (FW) and is only accessible through the middle tier application servers (MDSA) which act as trusted middleware service and security gateway. The demilitarised zone (DMZ) sits between the two firewalls. All requests and retrieval of data to and from the web servers in the access layer are done in an encrypted format (marked 'https' with shaded arrows). MiMiR staff access internal tools for experimental annotation (GUI) using a less secure data transmission.
Figure 2Screen shots of the Curation and Annotation Tools. a: Visualisation using the Curation Tool of experimental information submitted online. An object model of the experiment is programmatically built and represented graphically, where nodes represent the experiment and biomaterials (organisms with a prefix BS_ and samples with a prefix BSM_), and treatments are represented by arcs. Nodes are colour-coded to represent different stages of sample preparation; for example, beige and pink nodes correspond to the extracted total RNA and hybridisation cocktail, respectively. The biomaterial information supplied by users via the Online Annotation Tool is automatically displayed in the Curation Tool and assigned with MGED Ontology terms as indicated by the prefix "MO:" (bottom table views). Where appropriate NCI Metathesaurus terms and accession numbers are also automatically assigned and indicated by the "NCI:" prefix. b: Table views of biosample details in the Annotation Tool. The table view enables rapid validation of detailed information including sample types, descriptions and names assigned to samples by users. Double clicking on an item in the table opens a pop-up window (insert) where more detailed ontology information can be viewed and edited.
Figure 3Screen shots of MiMiR Online. a: Screen shot of MiMiR Online showing the data sharing functionality. b: Screen shot of MiMiR Online showing the left hand tree view icons for navigation and the right hand panel showing the Experiment Details of a public experiment in MiMiR. Several experiments can be opened simultaneously and users can toggle between them using the top panel of tabs. Information including the study description, number of biosources (organisms), biosamples and hybridisations performed, chip type(s) used, private/public status, ArrayExpress accession number and date of public release (if relevant), as well as the active PubMed link if the experiment has been published, can be accessed. MGED Ontology terms are used systematically e.g. Category MO:PerturbationalDesign, Value MO:compound_treatment_design) and experimental factors (e.g. Category MO:compound, Value H2O2_-0.04). The tree view also gives access to information on the Biomaterials i.e. Biosources (whole organisms), Biosamples (material derived from the biosources) and the consecutive treatment steps generating the labelled extract to be hybridised on arrays. Different icons are used in the tree to visually facilitate navigation between the different procedure stages.
Figure 4Screen shots of the MiMiR-EMAAS interface, showing a guest user viewing and selecting data and meta-data from a MiMiR experiment to export into EMAAS for analysis.