| Literature DB >> 35629113 |
Teresa Torres Moral1,2,3, Albert Sanchez-Niubo1,4,5, Anna Monistrol-Mula1, Chiara Gerardi6, Rita Banzi6, Paula Garcia7, Jacques Demotes-Mainard7, Josep Maria Haro1,4.
Abstract
Personalized medicine requires large cohorts for patient stratification and validation of patient clustering. However, standards and harmonized practices on the methods and tools to be used for the design and management of cohorts in personalized medicine remain to be defined. This study aims to describe the current state-of-the-art in this area. A scoping review was conducted searching in PubMed, EMBASE, Web of Science, Psycinfo and Cochrane Library for reviews about tools and methods related to cohorts used in personalized medicine. The search focused on cancer, stroke and Alzheimer's disease and was limited to reports in English, French, German, Italian and Spanish published from 2005 to April 2020. The screening process was reported through a PRISMA flowchart. Fifty reviews were included, mostly including information about how data were generated (25/50) and about tools used for data management and analysis (24/50). No direct information was found about the quality of data and the requirements to monitor associated clinical data. A scarcity of information and standards was found in specific areas such as sample size calculation. With this information, comprehensive guidelines could be developed in the future to improve the reproducibility and robustness in the design and management of cohorts in personalized medicine studies.Entities:
Keywords: cohorts; personalized medicine; sample size; stratification
Year: 2022 PMID: 35629113 PMCID: PMC9144352 DOI: 10.3390/jpm12050688
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1PRISMA flowchart describing process for article selection.
Summary of the quantity of information found by type of data.
| Methods and Tools | Most Frequent Strategy Used | |
|---|---|---|
| Within-Subject Correlation |
To quantify intraclass correlation: Modifications of Pearson’s product-moment correlation coefficient. Comparisons based on the generalized estimating equations generated by mixed-effect models. |
Analysis of data using a mixed-effect linear model that can accommodate a dependent variance-covariance structure. |
| Multiplicity |
Controlling the family-wise error rate (Tukey, Bonferroni, Scheffe and other). Approach to controlling the false discovery rate used in biomarker studies: Benjamini and Hochber. |
Analysis of data using a methodology that controls the family-wise error rate. |
| Multiple Clinical Endpoints |
The selection of a single primary endpoint for formal statistical inference, considering that the endpoints are possibly biologically related and positively correlated. Creating a univariate outcome by combining multiple clinical endpoints (weighted measures taking into account the relevance of each endpoint). To compare the two samples based on the endpoint of highest priority first, and, if no winner can be determined, would one move to the endpoint of the next highest priority. |
Analysis of data by prioritizing the relevant endpoints or by using a composite endpoint. |
| Selection bias |
To adjust for age, stage, treatment and so forth. Matched samples. |
Analysis of data using a multivariate model to simultaneously adjust for confounders Obtention of matched samples. Propensity score weighted. |
| Publication bias |
To publish positive and negative results. |
Encourage the objective assessment of molecular signatures by reporting both positive and negative outcomes. Make data publicly available after publication. |
A summary of methods, tools and strategies proposed to avoid different types of bias.
| Methods and Tools | Most Frequent Strategy Used | |
|---|---|---|
| Within-Subject Correlation |
To quantify intraclass correlation: Modifications of Pearson’s product-moment correlation coefficient. Comparisons based on the generalized estimating equations generated by mixed-effect models. |
Analysis of data using a mixed-effect linear model that can accommodate a dependent variance-covariance structure. |
| Multiplicity |
Controlling the family-wise error rate (Tukey, Bonferroni, Scheffe and other). Approach to controlling the false discovery rate used in biomarker studies: Benjamini and Hochber. |
Analysis of data using a methodology that controls the family-wise error rate. |
| Multiple Clinical Endpoints |
The selection of a single primary endpoint for formal statistical inference, considering that the endpoints are possibly biologically related and positively correlated. Creating a univariate outcome by combining multiple clinical endpoints (weighted measures taking into account the relevance of each endpoint). To compare the two samples based on the endpoint of highest priority first, and, if no winner can be determined, would one move to the endpoint of the next highest priority. |
Analysis of data by prioritizing the relevant endpoints or by using a composite endpoint. |
| Selection bias |
To adjust for age, stage, treatment and so forth. Matched samples. |
Analysis of data using a multivariate model to simultaneously adjust for confounders Obtention of matched samples. Propensity score weighted. |
| Publication bias |
To publish positive and negative results. |
Encourage the objective assessment of molecular signatures by reporting both positive and negative outcomes. Make data publicly available after publication. |