| Literature DB >> 26085786 |
Scott White1, Karoline Laske2, Marij Jp Welters3, Nicole Bidmon4, Sjoerd H van der Burg3, Cedrik M Britten4, Jennifer Enzor5, Janet Staats6, Kent J Weinhold7, Cécile Gouttefangeas2, Cliburn Chan1.
Abstract
With the recent results of promising cancer vaccines and immunotherapy1-5, immune monitoring has become increasingly relevant for measuring treatment-induced effects on T cells, and an essential tool for shedding light on the mechanisms responsible for a successful treatment. Flow cytometry is the canonical multi-parameter assay for the fine characterization of single cells in solution, and is ubiquitously used in pre-clinical tumor immunology and in cancer immunotherapy trials. Current state-of-the-art polychromatic flow cytometry involves multi-step, multi-reagent assays followed by sample acquisition on sophisticated instruments capable of capturing up to 20 parameters per cell at a rate of tens of thousands of cells per second. Given the complexity of flow cytometry assays, reproducibility is a major concern, especially for multi-center studies. A promising approach for improving reproducibility is the use of automated analysis borrowing from statistics, machine learning and information visualization21-23, as these methods directly address the subjectivity, operator-dependence, labor-intensive and low fidelity of manual analysis. However, it is quite time-consuming to investigate and test new automated analysis techniques on large data sets without some centralized information management system. For large-scale automated analysis to be practical, the presence of consistent and high-quality data linked to the raw FCS files is indispensable. In particular, the use of machine-readable standard vocabularies to characterize channel metadata is essential when constructing analytic pipelines to avoid errors in processing, analysis and interpretation of results. For automation, this high-quality metadata needs to be programmatically accessible, implying the need for a consistent Application Programming Interface (API). In this manuscript, we propose that upfront time spent normalizing flow cytometry data to conform to carefully designed data models enables automated analysis, potentially saving time in the long run. The ReFlow informatics framework was developed to address these data management challenges.Entities:
Keywords: Flow cytometry; REST API; automated analysis; data management; data provenance; laboratory informatics; metadata; reproducible analysis
Year: 2015 PMID: 26085786 PMCID: PMC4463798 DOI: 10.4137/CIN.S16346
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Overview of ReFlow hardware components.
Figure 2Overview of ReFlow software components.
Figure 3Data schema for ReFlow showing the mapping of table names to flow cytometry domain concepts. Arrows indicate foreign key relationships between database tables.
Figure 4Data schema detail for ReFlow showing the relationships that define the Panel Template and Site Panel domain concepts.
Figure 5Data schema for ReFlow showing the process request models.
Figure 6ReFlow data schema illustrating clustering models used to store process request results.
Figure 7ReFlow REST API URL schema illustrating various component labels.
Figure 8Sample upload snapshot.
Figure 9Panel creation snapshot.
Figure 10ReFlow screen shot demonstrating the visualization of clustering data. A detailed explanation of the viewing options is presented in the main text.