| Literature DB >> 29322930 |
Edison Ong1, Jiangan Xie2, Zhaohui Ni2, Qingping Liu2, Sirarat Sarntivijai3, Yu Lin4, Daniel Cooper4,5, Raymond Terryn4,5, Vasileios Stathias4,5, Caty Chung5,6, Stephan Schürer7,8,9, Yongqun He10,11.
Abstract
BACKGROUND: Aiming to understand cellular responses to different perturbations, the NIH Common Fund Library of Integrated Network-based Cellular Signatures (LINCS) program involves many institutes and laboratories working on over a thousand cell lines. The community-based Cell Line Ontology (CLO) is selected as the default ontology for LINCS cell line representation and integration.Entities:
Keywords: Cell line; Cell line ontology; ChEMBL; Data integration; Lincs; Ontology
Mesh:
Substances:
Year: 2017 PMID: 29322930 PMCID: PMC5763302 DOI: 10.1186/s12859-017-1981-5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Project pipeline of integrating cell lines from LINCS data portal and ChEMBL into CLO
Fig. 2Cell line-related data types of the data downloaded from LINCS Data Portal and ChEMBL. a Data types from LINCS Data Portal. b Data types from ChEMBL. Red-highlighted items (e.g., ChEMBL ID) were not covered in CLO, which were added later to CLO in this study
Fig. 3Basic CLO design pattern model for integrating LINCS cell line information from LINCS and ChEMBL
Fig. 4CLO design pattern model for using the new shortcut relation ‘derives originally from patient having disease’. a General design pattern; b an example to illustrate the design pattern. The shortcut relation makes it more efficient to represent the relation between a cell line cell and a disease when the parent term of the cell line cell includes sufficient information about the cell type and tissue/organ. In this illustration, the classes as shown in the dotted boxes are redundant and are not needed
Fig. 5SPARQL query of the number of cell lines with LINCS ID annotation. The query was performed using Ontobee SPARQL (http://www.ontobee.org/sparql)
Fig. 6The DOID hierarchy of 121 diseases of patients from whom 1133 LINCS cell lines were derived. The red color numbers represent the number of LINCS cell lines that are associated with corresponding diseases
Fig. 7Identification of additional cell lines under cervix carcinoma by SPARQL querying LINCS-CLOview. Note that DOID_2893 is the class ‘cervix carcinoma’, and CLO_0000015 is the object property ‘derives originally from human having disease’. In total 15 cell line cells were identified. Here only 6 are shown up. The query was conducted using Ontobee SPARQL (http://www.ontobee.org/sparql)
Fig. 8Part of the UBERON hierarchical structure in LINCS-CLOview. This structure shows the location of ‘uterine cervix’. This is a screenshot of the UBERON term in the LINCS-CLOview Ontobee web page