| Literature DB >> 34972102 |
Rolando Barajas1, Brionna Hair2, Gabriel Lai1, Melissa Rotunno1, Marissa M Shams-White1, Elizabeth M Gillanders1, Leah E Mechanic1.
Abstract
Systems epidemiology offers a more comprehensive and holistic approach to studies of cancer in populations by considering high dimensionality measures from multiple domains, assessing the inter-relationships among risk factors, and considering changes over time. These approaches offer a framework to account for the complexity of cancer and contribute to a broader understanding of the disease. Therefore, NCI sponsored a workshop in February 2019 to facilitate discussion about the opportunities and challenges of the application of systems epidemiology approaches for cancer research. Eight key themes emerged from the discussion: transdisciplinary collaboration and a problem-based approach; methods and modeling considerations; interpretation, validation, and evaluation of models; data needs and opportunities; sharing of data and models; enhanced training practices; dissemination of systems models; and building a systems epidemiology community. This manuscript summarizes these themes, highlights opportunities for cancer systems epidemiology research, outlines ways to foster this research area, and introduces a collection of papers, "Cancer System Epidemiology Insights and Future Opportunities" that highlight findings based on systems epidemiology approaches.Entities:
Mesh:
Year: 2021 PMID: 34972102 PMCID: PMC8719747 DOI: 10.1371/journal.pone.0255328
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Major themes identified to advance systems epidemiology research.
| Theme | Description |
|---|---|
|
| The ability to perform systems epidemiology research is contingent on the engagement of experts from varying fields to holistically address a scientific problem. Needs for transdisciplinary collaboration included: encouraging a focus on research problems holistically, bringing researchers together, addressing communication barriers, and sustaining transdisciplinary collaboration. |
|
| Whether data-driven or hypothesis-driven, the overall methodology for systems epidemiology must incorporate an iterative approach where models evolve over time based on results. Several methods exist to apply systems modeling. Newer improved methods should incorporate changes over time, bridging multiple scales (e.g., cell, individual, and neighborhood), and dealing with unknown contributions of chance. |
|
| The complexity of systems models results in challenges for interpretation, validation, and evaluation. Comparative modeling, using common datasets or controls, and reproducibility pipelines are possible strategies to address these issues. |
|
| Despite numerous rich datasets in support of epidemiology research, data gaps remain. These gaps include the need for data from populations underrepresented in biomedical sciences, health behaviors, built environment, and health care provider information. Opportunities exist to leverage data from wearable devices, electronic health records, and large cohorts and initiatives. Challenges were noted regarding combining data from multiple sources and research domains. |
|
| Promotion of systems epidemiology depends on the ability to share models and data. Effective sharing and reuse requires sufficient documentation and mechanisms to assess quality and support findability. Some mechanisms and infrastructures, including existing sharing platforms, could be leveraged to help address these needs. |
|
| The evolving field of systems epidemiology will need to facilitate training for both students and current researchers in systems modeling, transdisciplinary research, data sciences, informatics, and computational modeling. |
|
| Successful dissemination depends on effective communication with content experts and the non-research community. Through direct engagement of various stakeholders, systems methods are more likely to be translated, utilized, and accepted to inform biological interpretations, interventions, or policies. |
|
| Sustainability of systems epidemiology may depend on cultivating a systems epidemiology community. This can be facilitated by establishing organizations, interest groups, or other platforms for sharing ideas and discussing models. Specialized funding initiatives and review panels may further support systems epidemiology research. |
Example platforms supporting sharing of data and analytical models or methods.
| Data Sharing Platforms/Solutions | Description |
| All of Us Research Workbench | The All of Us cohort has a goal of enrolling one million participants in the United States to improve health sciences research. With the goal of oversampling for diverse participants, the data being collected includes various measures such as: health questionnaires, electronic health records (EHRs), physical measurements, the use of digital health technology, and biospecimens. |
| Link: | |
| Database of Genotypes and Phenotypes (dbGaP) | dbGaP stores and distributes results of studies that have investigated the association between genotypes and phenotypes. |
| Link: | |
| NCI Cancer Research Data Commons (CRDC) | The NCI Cancer Research Data Commons (CRDC) is a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data. |
| Link: | |
| Model Sharing platforms/Analysis platforms | Description |
| Bioconductor | Bioconductor provides tools for the analysis and comprehension of high-throughput genomic data. |
| Link: | |
| Galaxy | Galaxy is a web-based platform that enables multi-omics data integration and analysis workflows. |
| Link: | |
| Kepler project (and bioKepler) | Kepler is designed to harmonize data by allowing scientists to create, execute, and share models and analyses across a broad range of scientific disciplines. |
| Link: | |
| KNIME | KNIME is an analytics platform that supports data science workflows and reusable components. |
| Link: | |
| Taverna | Taverna is a suite of tools used to design and execute scientific workflows. |
| Link: | |
| Combined Analysis and Data platforms | Description |
| Biosphere | Biosphere is an open-source platform developed by the Broad Institute that can operate across several different platforms (e.g., Terra, Gen3, and Dockstore) to create an interoperable data environment for the biomedical community. |
| Link: | |
| Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) | AnVIL is a scalable and interoperable resource for the genomic scientific community. It leverages a cloud-based infrastructure for genomic data access, sharing and computing across various data sets. |
| Link: | |
| NCI Cloud Resources | The NCI Cloud Resources are components of the NCI Cancer Research Data Commons that allow researchers to download, store, and analyze vast datasets in the cloud. The platform gives users access to tools and pipelines already implemented or lets them upload their own data or analytical methods to workspaces. |
| Link: | |
| St. Jude Cloud-based Repository | St. Jude Cloud is a large pediatric genomics dataset that offers a suite of unique analysis tools and visualizations. It supports access to more than 700 paired tumor/germline samples for common and rare pediatric cancers, sequenced as part of the Pediatric Cancer Genome Project (PCGP). |
| Link: | |
| Terra | Terra is a scalable and secure platform for biomedical researchers to access Broad hosted data, upload their own data, and combine data to run on analytic tools. The platform also contains functions that promote sharing of data to facilitate collaboration between scientists. |
| Link: |
Example opportunities for a systems epidemiology approach in cancer research.
| Major Opportunity Areas | Example Research Questions |
|---|---|
| • Study obesity via a systems approach to discover the dynamic (feedback/feedforward) role obesity (both child and adult) has on cancer etiology and survivorship. | |
| • Behaviors are often assessed individually and in absence of environmental context. However, a systems approach can support the evaluation of how factors like sexual behaviors, nutrition, tobacco usage, physical activity, sedentary behavior, circadian rhythm (sleep) disruptions, social networks, and infectious disease transmission work in tandem and vary within different environments (e.g., rural vs. urban settings) to contribute to cancer. | |
| • Certain groups may have a higher risk of certain cancers due to many factors such as stress, low access to care, education, environmental exposures, and genetics. | |
| • Examine how dynamic social determinants (e.g., diet, lifestyle, environmental exposures, behaviors, etc.) work in conjunction with biological processes (e.g., the microbiome) to influence treatment responses. | |
| • Utilize a systems approach to better prioritize at-risk populations and tailor interventions beyond a single behavior. | |
| • Use system modeling approaches to test a policy prior to implementation or examine impact of policy under different conditions. |