| Literature DB >> 30298124 |
Ceyda Oksel1, Sadia Haider1, Sara Fontanella1, Clement Frainay2,3, Adnan Custovic1.
Abstract
Advances in big data analytics have created an opportunity for a step change in unraveling mechanisms underlying the development of complex diseases such as asthma, providing valuable insights that drive better diagnostic decision-making in clinical practice, and opening up paths to individualized treatment plans. However, translating findings from data-driven analyses into meaningful insights and actionable solutions requires approaches and tools which move beyond mining and patterning longitudinal data. The purpose of this review is to summarize recent advances in phenotyping of asthma, to discuss key hurdles currently hampering the translation of phenotypic variation into mechanistic insights and clinical setting, and to suggest potential solutions that may address these limitations and accelerate moving discoveries into practice. In order to advance the field of phenotypic discovery, greater focus should be placed on investigating the extent of within-phenotype variation. We advocate a more cautious modeling approach by "supervising" the findings to delineate more precisely the characteristics of the individual trajectories assigned to each phenotype. Furthermore, it is important to employ different methods within a study to compare the stability of derived phenotypes, and to assess the immutability of individual assignments to phenotypes. If we are to make a step change toward precision (stratified or personalized) medicine and capitalize on the available big data assets, we have to develop genuine cross-disciplinary collaborations, wherein data scientists who turn data into information using algorithms and machine learning, team up with medical professionals who provide deep insights on specific subjects from a clinical perspective.Entities:
Keywords: asthma; big data; disease progression; longitudinal data; machine learning; phenotypes
Year: 2018 PMID: 30298124 PMCID: PMC6160736 DOI: 10.3389/fped.2018.00258
Source DB: PubMed Journal: Front Pediatr ISSN: 2296-2360 Impact factor: 3.418
Different approaches for phenotypic discovery with the associated advantages and disadvantages.
| ( | 1–6 | 54 | Subjective sub-typing |
- Phenotypes are observable expressions - Choice of cutoff guided by investigator expertise - Simple |
- Predefined or hypothesized criteria needed - Rare patterns may be missed - Risk of over- or under- fitting as there are no objective statistical criteria for judging fit - Subjective cut-offs need to be recalibrated when new data becomes available - Un-validated cut-offs pose challenge for comparing findings across studies |
| ( | 1–6 | 826 | |||
| ( | 1–6 | 6265 | Latent class analysis |
- Probabilistic class allocation. - No prior knowledge is needed. - Hidden patterns may be uncovered that could not be a priori. - Hypothesis generating - Objective statistical criteria for judging whether phenotypes represent true variation |
- Discovered sub-types are latent and retrospective by nature - Within-class heterogeneity arising from individuals whose patterns do not exemplify any phenotype - Meaningful clinical interpretation required to explain the patterns - Number of derived phenotypes may be related to the frequency and timing of data collection - Unclear to what extent established phenotype labels convey temporal patterns |
| ( | 1–7 | 689 | |||
| ( | 1–9 | 953 | |||
| ( | 1–8 | 5760 | |||
| 1–8 | 2810 | ||||
| ( | 1–8 | 1184 | |||
| ( | 8–12 | 3890 | |||
| ( | 3–5 | 946 | Principal component analysis |
- Accounts for - coexisting symptoms - Reduces the variable dimensions in complex diseases |
- Difficult clinical interpretation - Not useful for categorical and longitudinal data unless properly specified |
| ( | 7–35 | 925 | Exploratory factor analysis | ||
| ( | 6–18 | 613 | Hierarchical clustering |
- No a priori info about the number of classes required |
- Risk of misclassifying distinct phenotypes that are present at low frequency |
Figure 1From phenotype discovery to clinical utility.