
Trajectories, bifurcations, and pseudo-time in large clinical datasets: applications to myocardial infarction and diabetes data.

Sergey E Golovenkin, Jonathan Bac, Alexander Chervov, Evgeny M Mirkes, Yuliya V Orlova, Emmanuel Barillot, Alexander N Gorban, Andrei Zinovyev

Abstract

BACKGROUND: Large observational clinical datasets are becoming increasingly available for mining associations between various disease traits and administered therapy. These datasets can be considered as representations of the landscape of all possible disease conditions, in which a concrete disease state develops through stereotypical routes, characterized by "points of no return" and "final states" (such as lethal or recovery states). Extracting this information directly from the data remains challenging, especially in the case of synchronic (with a short-term follow-up) observations.
RESULTS: Here we suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values, through modeling the geometrical data structure as a bouquet of bifurcating clinical trajectories. The methodology is based on the application of elastic principal graphs, which can simultaneously address the tasks of dimensionality reduction, data visualization, clustering, feature selection, and quantification of geodesic distances (pseudo-time) in partially ordered sequences of observations. The methodology allows a patient to be positioned on a particular clinical trajectory (pathological scenario) and the degree of progression along it to be characterized with a qualitative estimate of the uncertainty of the prognosis. We developed a tool, ClinTrajan, for clinical trajectory analysis, implemented in the Python programming language. We test the methodology on 2 large publicly available datasets: myocardial infarction complications data and readmission of diabetic patients data.
CONCLUSIONS: Our pseudo-time quantification-based approach makes it possible to apply the methods developed for dynamical disease phenotyping and illness trajectory analysis (diachronic data analysis) to synchronic observational data.
© The Author(s) 2020. Published by Oxford University Press GigaScience.
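
Illustration (not part of the original abstract): the idea of pseudo-time as a geodesic distance along a branching trajectory can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions and does not use the ClinTrajan API. The tree (node positions, edges, root) is written by hand here rather than fitted as an elastic principal graph, each observation is simply assigned to its nearest node rather than projected onto an edge, and the function name tree_pseudotime and all data values are hypothetical.

```python
import heapq
import numpy as np


def tree_pseudotime(points, node_positions, edges, root):
    """Return (nearest node index, path length from root to that node) per point.

    Simplifying assumption: a point is snapped to its nearest tree node;
    a fuller treatment would project it onto the nearest edge instead.
    """
    nodes = np.asarray(node_positions, dtype=float)
    pts = np.asarray(points, dtype=float)

    # Build an undirected adjacency list weighted by Euclidean edge length.
    adjacency = {i: [] for i in range(len(nodes))}
    for a, b in edges:
        length = float(np.linalg.norm(nodes[a] - nodes[b]))
        adjacency[a].append((b, length))
        adjacency[b].append((a, length))

    # Dijkstra's algorithm: geodesic distance from the root to every node.
    dist = np.full(len(nodes), np.inf)
    dist[root] = 0.0
    heap = [(0.0, root)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, length in adjacency[u]:
            candidate = d + length
            if candidate < dist[v]:
                dist[v] = candidate
                heapq.heappush(heap, (candidate, v))

    # Assign every observation to its nearest node (squared Euclidean distance).
    sq_dist = ((pts[:, None, :] - nodes[None, :, :]) ** 2).sum(axis=2)
    nearest = sq_dist.argmin(axis=1)
    return nearest, dist[nearest]


# Hypothetical Y-shaped tree: a shared stem (0 -> 1) that bifurcates into
# two branches (1 -> 2 and 1 -> 3), standing in for two clinical trajectories.
node_positions = [(0.0, 0.0), (1.0, 0.0), (2.0, 1.0), (2.0, -1.0)]
edges = [(0, 1), (1, 2), (1, 3)]

# Three hypothetical observations: one near the root, one near each branch end.
observations = [(0.1, 0.0), (1.9, 0.9), (2.1, -1.2)]

nearest, pseudotime = tree_pseudotime(observations, node_positions, edges, root=0)
for node, t in zip(nearest, pseudotime):
    print(f"node={node} pseudotime={t:.3f}")
```

In this toy setting the two observations near the branch ends receive the same pseudo-time but sit on different branches, which is the sense in which a patient is described by both a trajectory and a degree of progression along it.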

Keywords:  clinical data; clinical trajectory; data analysis; diabetes; dimensionality reduction; dynamical diseases phenotyping; myocardial infarction; patient disease pathway; principal trees; pseudo-time

Year: 2020 | PMID: 33241287 | PMCID: PMC7688475 | DOI: 10.1093/gigascience/giaa128

Source DB: PubMed | Journal: GigaScience | ISSN: 2047-217X | Impact factor: 6.524


