| Literature DB >> 35202406 |
April E Hebert1, Usha S Kreaden1, Ana Yankovsky1, Dongjing Guo1, Yang Li1, Shih-Hao Lee1, Yuki Liu1, Angela B Soito2, Samira Massachi3, April E Slee4.
Abstract
Survival analysis following oncological treatments require specific analysis techniques to account for data considerations, such as failure to observe the time of event, patient withdrawal, loss to follow-up, and differential follow up. These techniques can include Kaplan-Meier and Cox proportional hazard analyses. However, studies do not always report overall survival (OS), disease-free survival (DFS), or cancer recurrence using hazard ratios, making the synthesis of such oncologic outcomes difficult. We propose a hierarchical utilization of methods to extract or estimate the hazard ratio to standardize time-to-event outcomes so that study inclusion into meta-analyses can be maximized. We also provide proof-of concept results from a statistical analysis that compares OS, DFS, and cancer recurrence for robotic surgery to open and non-robotic minimally invasive surgery. In our example, use of the proposed methodology would allow for the increase in data inclusion from 108 hazard ratios reported to 240 hazard ratios reported or estimated, resulting in an increase of 122%. While there are publications summarizing the motivation for these analyses, and comprehensive papers describing strategies to obtain estimates from published time-dependent analyses, we are not aware of a manuscript that describes a prospective framework for an analysis of this scale focusing on the inclusion of a maximum number of publications reporting on long-term oncologic outcomes incorporating various presentations of statistical data.Entities:
Mesh:
Year: 2022 PMID: 35202406 PMCID: PMC8870464 DOI: 10.1371/journal.pone.0263661
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Decision tree for hazard ratio extraction: Flow chart to determine which hazard ratio estimate to use based on data provided in manuscript.
Determination of reference group for hazard ratio: Table to assist extractors in identifying reference group, determining need to invert hazard ratios.
| Hazard Ratio Direction | Open/Minimally Invasive Reference | Robotic Reference |
|---|---|---|
|
| Robot is Worse | Robot is Better |
|
| Robot is Better | Robot is Worse |
Fig 2Simulations to verify correct implementation of Guyot algorithm: Visual illustration of simulations explored for validation of Guyot algorithm.
Details can be found in S2 Appendix.
Comparison of hazard ratios Method 1 through 4.
Comparison showing information required for calculation of hazard ratio using various methods and resulting hazard ratio and 95% confidence interval.
| N | Method 1 | Method 2 | Method 3 | Method 4a | Method 4b | |
|---|---|---|---|---|---|---|
| Cohort | HR [95% CI] | Deaths | Est. event n | OS 3yr | Median Survival, Deaths | |
| Robotic | 300 | Ref | 105 | 107 | 58.6% | 3.8 yr, 105 |
| Open | 300 | 1.47 [1.14, 1.90] | 133 | 132 | 44.5% | 2.5 yr, 133 |
| p-value | 0.0032 | 0.003 | 0.003 | 0.003 | ||
| Calculated HR [95% CI] | 0.68 [0.53, 0.88] | 0.68 [0.53, 0.88] | 0.68 [0.53, 0.88] | 0.71 [0.56, 0.89] | 0.66 [0.51, 0.85] | |
Worked example using Method 2: Hazard ratio calculated using event counts.
HR = Hazard Ratio, CI = Confidence Interval, eq. = equation, est. = estimated, Vr is the inverse variance of the log hazard ratio for the robotic group, OTotal is the total number of events in the robotic plus the comparison group, “Ln” denotes the natural logarithm (loge), Φ-1 is the inverse of the standard normal distribution, and the p-value is assumed to be 2-sided and from the log-rank test if not otherwise stated. Because p-values were assumed to be two-sided, manual assignment of direction was adjusted by multiplying the term by -1 or 1 so that the result was negative when survival in the robotic group was higher/better and positive when survival in the robotic group was lower/worse.
| D | F | ||
|---|---|---|---|
| Example with equations | Example with values | ||
| Raw Data | Robotic vs Open | Robotic vs Open | |
| 4 | 300 | 300 | |
| 5 | 300 | 300 | |
| 7 | 105 | 105 | |
| 8 | 133 | 133 | |
| 9 |
| 0.003 | 0.003 |
|
| |||
| 11 |
| = D7/D4 | 0.35 |
| 12 |
| = D8/D5 | 0.443 |
| 13 |
| = D11-D12 | -0.093 |
| 14 |
| -1 | -1 |
| 16 | = (((D7+D8)*D4*D5)/((D4+D5)^2)) | 59.5 | |
| 17 |
| = 1/D16 | 0.0168 |
| 18 | = (SQRT((D7+D8)*D4*D5)/(D4+D5))*(NORM.S.INV(1-D9/2)*D14) | -22.89 | |
| 19 |
| = D18/D16 | -0.385 |
| 21 |
| = EXP(D19) | 0.68 |
| 22 |
| = EXP(D19-1.96*SQRT(D17)) | 0.53 |
| 23 |
| = EXP(D19+1.96*SQRT(D17)) | 0.88 |
Fig 3Kaplan-Meier curve worked example: Example data for Method 3 Guyot algorithm and worked example.
Panel A is a graph that might appear in a publication. Panel B shows the “digitized” version with time and KM points, and panel C shows the re-constructed individual patient data using the digitization and n at risk as input. See S1 Appendix for full R code.
Worked example using Method 4b: Hazard ratio calculated using median survival estimates.
HR = Hazard Ratio, CI = Confidence Interval, eq. = equation, “Ln” denotes the natural logarithm (loge).
| D | F | ||
|---|---|---|---|
| Example with equations | Example with values | ||
| Raw Data | Robotic vs Open | Robotic vs Open | |
| 4 |
| 300 | 300 |
| 5 |
| 300 | 300 |
| 7 | 105 | 105 | |
| 8 | 133 | 133 | |
| 9 | 3.8 | 3.8 | |
| 10 | 2.5 | 2.5 | |
|
| |||
| 12 |
| = D10/D9 | 0.66 |
| 13 |
| = SQRT(1/D7+1/D8) | 0.13 |
| 14 |
| = LN(D12)-1.96*D13 | -0.67 |
| 15 |
| = LN(D12)+1.96*D13 | -0.16 |
| 17 |
| = EXP(D14) | 0.51 |
| 18 |
| = EXP(D15) | 0.85 |