| Literature DB >> 28732082 |
Jason Glover1, Tsz-Kwong Man2, Donald A Barkauskas3, David Hall4, Tanya Tello4, Mary Beth Sullivan4, Richard Gorlick5, Katherine Janeway6, Holcombe Grier6, Ching Lau7, Jeffrey A Toretsky8, Scott C Borinstein9, Chand Khanna10, Timothy M Fan11.
Abstract
The prospective banking of osteosarcoma tissue samples to promote research endeavors has been realized through the establishment of a nationally centralized biospecimen repository, the Children's Oncology Group (COG) biospecimen bank located at the Biopathology Center (BPC)/Nationwide Children's Hospital in Columbus, Ohio. Although the physical inventory of osteosarcoma biospecimens is substantive (>15,000 sample specimens), the nature of these resources remains exhaustible. Despite judicious allocation of these high-value biospecimens for conducting sarcoma-related research, a deeper understanding of osteosarcoma biology, in particular metastases, remains unrealized. In addition the identification and development of novel diagnostics and effective therapeutics remain elusive. The QuadW-COG Childhood Sarcoma Biostatistics and Annotation Office (CSBAO) has developed the High Dimensional Data (HDD) platform to complement the existing physical inventory and to promote in silico hypothesis testing in sarcoma biology. The HDD is a relational biologic database derived from matched osteosarcoma biospecimens in which diverse experimental readouts have been generated and digitally deposited. As proof-of-concept, we demonstrate that the HDD platform can be utilized to address previously unrealized biologic questions though the systematic juxtaposition of diverse datasets derived from shared biospecimens. The continued population of the HDD platform with high-value, high-throughput and mineable datasets allows a shared and reusable resource for researchers, both experimentalists and bioinformatics investigators, to propose and answer questions in silico that advance our understanding of osteosarcoma biology.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28732082 PMCID: PMC5521774 DOI: 10.1371/journal.pone.0181204
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Imported digital data within High Dimensional Database platform available for in silico research.
| Study Identifier | Investigator | N = Samples | Readout | Methodology | Reference |
|---|---|---|---|---|---|
| COG | 974 | Necrosis | Histopathology | ||
| Koshkina | 88 | Fas | SNPs | [ | |
| Ebb | 96 | ErbB-2 | IHC | [ | |
| Lau | 143 | TARGET | Sequencing | ||
| Borinstein | 164 | IGF-1 axis | ELISA | [ | |
| Borinstein | 255 | IGFBP-2 | ELISA | [ | |
| Gorlick | 149 | ErbB-2 | IHC | [ | |
| Lau | 229 | SPECS | cDNA array | ||
| Ragg | 148 | Biomarker discovery | Proteomics | ||
| Squire | 45 | CIN | FISH | ||
| Dome | 123 | TERT | PCR |
SNPs- single nucleotide polymorphism; IHC- immunohistochemistry; ELISA- enzyme linked immunosorbent assay; CIN- chromosomal instability; FISH- fluorescent in situ hybridization; TERT- telomerase reverse transcriptase; PCR- polymerase chain reaction; TARGET- therapeutically applicable research to generate effective treatments; SPECS- strategic partnering to evaluate cancer signatures
Number of common biospecimens (≥ 10) used to derive unique assay data.
| Overlapping Biospecimens | Pairs | Trios | Quartets | Quintets |
|---|---|---|---|---|
| 149 | 104 | 62 | 28 | |
| 12 | 10 | 10 | 10 | |
| 33 | 15.5 | 14.5 | 14 |
Fig 1Representative Venn diagrams of shared and overlapping biospecimens used to derive unique assay data (pairs, trios, or quartets) from divergent investigations.
Fig 2(A) Heat map showing the RNA expression of 3 genes C4orf3, USP9Y, and DDX3Y inversely correlated with circulating concentrations of IGFBP2. (B) Unsupervised hierarchal clustering analysis of gene expression profiles of the primary tumors based upon quartile circulating IGFBP2 concentrations. (C) The plot of primary tumor IGFBP2 mRNA expressions against circulating IGFBP2 concentrations of the matched samples.
Fig 3Proposed workflow for continued population of HDD with new digital datasets derived from voluntary sharing of research results provided by principal investigators utilizing exhaustible biospecimens.