| Literature DB >> 30654776 |
Hugo Loureiro1,2, Eunice Carrasquinha1,2, Irina Alho3, Arlindo R Ferreira3, Luís Costa3, Alexandra M Carvalho4,5, Susana Vinga6,7.
Abstract
BACKGROUND: Joint models (JM) have emerged as a promising statistical framework to concurrently analyse survival data and multiple longitudinal responses. This is particularly relevant in clinical studies where the goal is to estimate the association between time-to-event data and the biomarkers evolution. In the context of oncological data, JM can indeed provide interesting prognostic markers for the event under study and thus support clinical decisions and treatment choices. However, several problems arise when dealing with this type of data, such as the high-dimensionality of the covariates space, the lack of knowledge about the function structure of the time series and the presence of missing data, facts that may hamper the accurate estimation of the JM.Entities:
Keywords: Bone metastasis; Cancer studies; Fuzzy clustering; Joint models; Longitudinal analysis; Survival analysis
Mesh:
Substances:
Year: 2019 PMID: 30654776 PMCID: PMC6337820 DOI: 10.1186/s12911-018-0728-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Global approach flowchart. Representation of the analytical approach followed. The raw data (with missing values) is processed in parallel by three different imputation techniques: omitting or ignoring the corresponding missing entries (Omit), Last Observation Carried Forward (LOCF) and Optimal Completion Strategy (OCS). These three imputed datasets are then separately analysed by the Extended Cox Model and the Joint Models (JM), thus generating three Extended Cox models and nine JM
Number of patients per type of cancer that have NTX measurements on month t (NTXt)
| Cancer type | Baseline | NTX1 | NTX3 | NTX6 | NTX9 | NTX12 |
|---|---|---|---|---|---|---|
| Breast | 90 | 67 | 58 | 48 | 32 | 26 |
| Prostate | 26 | 20 | 17 | 15 | 11 | 5 |
| Others | 31 | 18 | 15 | 9 | 5 | 3 |
| Totals | 147 | 87 | 90 | 72 | 48 | 34 |
Others include the following types of cancer: lung, kidney, gastric, sarcoma, hepatobiliary, bladder, endometrium, cervix, neuroendocrine, osteoblastoma and unknown primary tumor
Fig. 2Comparison between NTX and log(NTX) trajectories. Graphical representation of the values of NTX and log(NTX) for all the patients. Panel a represents the original values and panel b represents the log-transformed values
Log Rank and Wald tests p-values for each feature
| Feature | |||
|---|---|---|---|
| Log rank | Wald test | ||
| Baseline | Age Diagnosis | - |
|
| Sex |
|
| |
| Primary Cancer | 0.0872 | - | |
| X-Ray Pattern | 0.1264 | - | |
| NSRE | 0.8027 | 0.5627 | |
| ExtraMets |
|
| |
| Longitudinal | NTX3 | - | 0.4438 |
| log(NTX3) | - |
| |
| NTX3 >100 | 0.1615 | 0.1515 | |
| NTX12 | - | 0.0737 | |
| log(NTX12) | - | 0.0533 | |
| NTX12 >64 |
|
|
The features were divided in Baseline (Time-independent) and Longitudinal (Time-dependent). The values in bold are statistically significant for a significance level of 5%
Coefficents and p-values for the multivariate Cox regression model
| Age diagnosis | Sex | ExtraMets | NTX | |||||
|---|---|---|---|---|---|---|---|---|
| Value | Value | Value | Type | Value | ||||
| 0.0220 |
| 0.2485 | 0.3073 | 0.7823 |
| NTX3 | 0.0002 | 0.5875 a |
| 0.0209 |
| 0.2421 | 0.3155 | 0.7393 |
| log(NTX3) | 0.1211 | 0.2165 a |
| 0.0225 |
| 0.2214 | 0.3614 | 0.7752 |
| NTX3 >100 | 0.1448 | 0.5831 a |
| 0.0184 | 0.1513 | 0.3709 | 0.3663 | 0.6223 | 0.0775 | NTX12 | 0.0010 | 0.2225 b |
| 0.0177 | 0.1633 | 0.3647 | 0.3756 | 0.5563 | 0.1314 | log(NTX12) | 0.1638 | 0.2556 b |
| 0.0164 | 0.1910 | 0.5141 | 0.2312 | 0.4947 | 0.1806 | NTX12 >64 | 0.6936 | 0.0781 b |
The NTX type column refers to the type of NTX feature used in each model fit. a: Analysis using only patients with NTX3 measurement (106 patients with 86 events). b: Analysis using only patients with NTX12 measurement (51 patients with 41 events)
Fig. 3Centroids obtained in the FSTS clustering algorithm. Graphical representation of the six centroids obtained in the FSTS algorithm and NTX time series. The number in the top right corner of each plot is the number of patients with partition matrix coefficient of at least 0.75 for that cluster
Fig. 4Comparison of the trajectories from different imputation techniques. Four patients were selected where the differences between the imputation techniques generate drastically different trajectories
Fig. 5Mean value of NTX for each imputation type. The Measurements curve refers to the omission of the missing values, the LOCF curve to the values imputed with Last Observation Carried Forward and OCS to the Fuzzy Clustering approach using Optimal Completion Strategy
Longitudinal functions used in the JM
| # | Model | NTX | |
|---|---|---|---|
| Omit | 1 | Rational |
|
| 2 | Exponential | ( | |
| 3 | Spline | ( | |
| LOCF | 4 | Rational |
|
| 5 | Exponential | ( | |
| 6 | Spline | ( | |
| OCS | 7 | Rational |
|
| 8 | Exponential | ( | |
| 9 | Spline | ( |
Fig. 6Population fits for the longitudinal models. The nine longitudinal fits from Table 4 are represented alongside the population mean for each of the imputation types
Coefficients and p-values of the JM
| # | Age diagnosis | Sex | ExtraMets | log(NTX) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Value | Value | Value | Value | ||||||
| Omit | 1 | 0.0185 |
| 0.3160 | 0.1499 | 0.6655 |
| 0.2093 | 0.0593 |
| 2 | 0.0184 |
| 0.3134 | 0.1530 | 0.6649 |
| 0.2175 | 0.0638 | |
| 3 | 0.0184 |
| 0.2655 | 0.2216 | 0.6051 |
| 0.1095 |
| |
| LOCF | 4 | 0.0186 |
| 0.3006 | 0.1672 | 0.6687 |
| 0.1674 | 0.0755 |
| 5 | 0.0186 |
| 0.3005 | 0.1672 | 0.6673 |
| 0.1692 | 0.0753 | |
| 6 | 0.0175 |
| 0.2930 | 0.1820 | 0.6322 |
| 0.1353 |
| |
| OCS | 7 | 0.0182 |
| 0.3048 | 0.1612 | 0.6586 |
| 0.1955 |
|
| 8 | 0.0181 |
| 0.3046 | 0.1615 | 0.6575 |
| 0.1962 |
| |
| 9 | 0.0170 |
| 0.3153 | 0.1496 | 0.6681 |
| 0.1966 |
| |
In models 1 to 3 the missing values were omitted, in 4 to 6 LOCF was used to impute the missing values and in 7 to 9 OCS was used
Coefficients and p-values for extended Cox models
| Model | Age diagnosis | Sex | ExtraMets | log(NTX) | ||||
|---|---|---|---|---|---|---|---|---|
| Value | Value | Value | Value | |||||
| Omit/LOCF | 0.0171 |
| 0.3049 | 0.1658 | 0.6485 |
| 0.1401 | 0.0819 |
| OCS | 0.0172 |
| 0.2963 | 0.1763 | 0.6458 |
| 0.1629 |
|
Since the extended Cox model considers the time-variant feature as a step function, with steps on each measurement, omitting missing values and LOCF generate the same extended Cox model