| Literature DB >> 26320941 |
Sai Lakshmi Subramanian1, Robert R Kitchen2,3,4, Roger Alexander5, Bob S Carter6, Kei-Hoi Cheung7, Louise C Laurent8, Alexander Pico9, Lewis R Roberts10, Matthew E Roth1, Joel S Rozowsky2,4, Andrew I Su11, Mark B Gerstein2,4,12, Aleksandar Milosavljevic13.
Abstract
The large diversity and volume of extracellular RNA (exRNA) data that will form the basis of the exRNA Atlas generated by the Extracellular RNA Communication Consortium pose a substantial data integration challenge. We here present the strategy that is being implemented by the exRNA Data Management and Resource Repository, which employs metadata, biomedical ontologies and Linked Data technologies, such as Resource Description Framework to integrate a diverse set of exRNA profiles into an exRNA Atlas and enable integrative exRNA analysis. We focus on the following three specific data integration tasks: (a) selection of samples from a virtual biorepository for exRNA profiling and for inclusion in the exRNA Atlas; (b) retrieval of a data slice from the exRNA Atlas for integrative analysis and (c) interpretation of exRNA analysis results in the context of pathways and networks. As exRNA profiling gains wide adoption in the research community, we anticipate that the strategies discussed here will increasingly be required to enable data reuse and to facilitate integrative analysis of exRNA data.Entities:
Keywords: DMRR; ERC Consortium; exRNA; exRNA Atlas; exRNA Portal
Year: 2015 PMID: 26320941 PMCID: PMC4553261 DOI: 10.3402/jev.v4.27497
Source DB: PubMed Journal: J Extracell Vesicles ISSN: 2001-3078
Fig. 1Data slicing and pathway enrichment analysis. This illustration is based on a hypothetical example of sequencing-based exRNA profiling of cerebrospinal fluid (CSF) from a brain tumour patient. Based on metadata about the selected samples, (a) “data slice” is extracted for further downstream analysis using pathway/network modules to detect activation of a metastatic brain tumour pathway. Panel b details selection of samples for profiling and inclusion in the exRNA Atlas using sample (CSF) and disease (CNS neoplasm) ontology traversals. Panel c details sequencing assay selection process using assay and experiment ontology traversals. The highlighted ontologies “CNS neoplasm” and “sequencing assay” are examples of terms that occur within an “ontology slim.” Ontology traversal in panel d identifies RNA species of interest. (a) “Data slice” defined by selections (b–d) is analysed to obtain a set of exRNA genes that show a pattern of coordinated changes. The metastatic brain tumour pathway (www.wikipathways.org/index.php/Pathway:WP2249) in panel e shows enrichment for the exRNA genes overexpressed in this hypothetical case.