| Literature DB >> 32097025 |
Adam Struck1, Brian Walsh1, Alexander Buchanan1, Jordan A Lee1, Ryan Spangler1, Joshua M Stuart2,3, Kyle Ellrott1.
Abstract
PURPOSE: The analysis of cancer biology data involves extremely heterogeneous data sets, including information from RNA sequencing, genome-wide copy number, DNA methylation data reporting on epigenetic regulation, somatic mutations from whole-exome or whole-genome analyses, pathology estimates from imaging sections or subtyping, drug response or other treatment outcomes, and various other clinical and phenotypic measurements. Bringing these different resources into a common framework, with a data model that allows for complex relationships as well as dense vectors of features, will unlock integrated data set analysis.Entities:
Year: 2020 PMID: 32097025 PMCID: PMC7049249 DOI: 10.1200/CCI.19.00110
Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN: 2473-4276
BioMedical Evidence Graph Data Sources and Their Licenses Used to Build the Graph
FIG 1.The BioMedical Evidence Graph schema showing the vertex types and connections of the graph. Numbers on vertices represent the total instances of a specific type defined by the vertex (eg, the Gene vertex includes 63,677 distinct protein-coding, microRNAs, and other gene entries); numbers on an edge connecting two vertices represent the total connections between any instance of the first vertex to any instance of the second vertex (eg, there are 214,804 connections from transcripts to the genes that encode them). Pfam, protein families.
FIG 2.Example queries. A diagram showing how each of the different queries described in this article traverse the graph. Each separate query is labeled by the example number in the text.
Gene Expression in Transcripts per Million Across Cell Lines With Variants in CTRP
FIG 3.BioMedical Evidence Graph (BMEG) architecture diagram. (A) The Extract Transform Load (ETL) processes used to build the graph. (B) The database and query engine used to power the bmeg.io site. (C) The different client-side options for communicating with the system. (D) Graph engines that can be used with the BMEG export code to move the BMEG data to other graph databases. GripQL, Graph Integration Platform query language.