| Literature DB >> 35164854 |
Abstract
OBJECTIVES: A novel graph data model of non-small cell lung cancer clinical and genomic data has been constructed with two aims: (1) provide a suitable model for facilitating graph analytics within the Neo4j framework or through tools which can interact through existing Neo4j APIs; and (2) provide a base model extensible to other cancer types and additional datasets such as those derived from electronic health records and other real world sources. DATA DESCRIPTION: Clinical and genomic data integrated with a novel property graph database schema from publicly available datasets and analyses based on The Cancer Genome Atlas lung cancer datasets augmented by with subgraphs patient-patient social network from similarity and correlation as well as individual based biological networks.Entities:
Keywords: Non-small cell lung cancer; Property graph database; The Cancer Genome Atlas
Mesh:
Year: 2022 PMID: 35164854 PMCID: PMC8842806 DOI: 10.1186/s13104-022-05912-9
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Overview of data files/data sets
| Label | Name of data file/data set | File types (file extension) | Data repository and identifier |
|---|---|---|---|
| Data file 1 | lung-cancer-graph-neo4j-2021-07-15T022024.bin | binary dump file (bin) | Harvard Dataverse [ |
| Data file 2 | README.md | text/markdown | Harvard Dataverse [ |
| Data file 3 | ACancerGraphSchema.png | image (png) | Harvard Dataverse [ |
| Data file 4 | makeIndexesConstraints.cql | cypher (cql) | Harvard Dataverse [ |
| Data file 5 | ACancerGraphLoader.cql | cypher (cql) | Harvard Dataverse [ |
| Data file 6 | schema.json | json | Harvard Dataverse [ |
| Data set 1 | Input files (csv format) to create database | comma separated values (csv) | Harvard Dataverse [ |
| Data file set 2 | Input files (csv format) to create supportive data | comma separated values (csv) | Harvard Dataverse [ |