| Literature DB >> 29625055 |
Jianfang Liu1, Tara Lichtenberg2, Katherine A Hoadley3, Laila M Poisson4, Alexander J Lazar5, Andrew D Cherniack6, Albert J Kovatich7, Christopher C Benz8, Douglas A Levine9, Adrian V Lee10, Larsson Omberg11, Denise M Wolf12, Craig D Shriver13, Vesteinn Thorsson14, Hai Hu15.
Abstract
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale.Entities:
Keywords: Cox proportional hazards regression model; TCGA; The Cancer Genome Atlas; clinical data resource; disease-free interval; disease-specific survival; follow-up time; overall survival; progression-free interval; translational research
Mesh:
Year: 2018 PMID: 29625055 PMCID: PMC6066282 DOI: 10.1016/j.cell.2018.02.052
Source DB: PubMed Journal: Cell ISSN: 0092-8674 Impact factor: 41.582
Figure 1Clinical Data Analysis
(A–E) Flowchart (A) and K-M plots of the Pan-Cancer types for OS (B), PFI (C), DFI (D), and DSS (E) respectively. The tail of each K-M curve is truncated at the point when fewer than 10 patients remain at risk. See Figure S1 for plots of the 4 endpoints within each of the 33 tumor types and Tables S1, S2, S3 and S4 for more detailed information regarding endpoint derivation.
TCGA Pan-Cancer Cohort Characteristics
| Cancer Type | No. of Cases | Age | Gender M/F | Race White/Black/Other/NA | Stage | Grade |
|---|---|---|---|---|---|---|
| ACC | 92 | 47.2 ± 16.3 | 32/60 | 78/1/2/11 | 0/9/44/19/18/2 | 0/0/0/0/92 |
| BLCA | 412 | 68.1 ± 10.6 | 304/108 | 327/23/44/18 | 0/2/131/141/136/2 | 21/0/388/0/3 |
| BRCA | 1097 | 58.4 ± 13.2 | 12/1085 | 757/183/62/95 | 0/183/621/249/20/24 | 0/0/0/0/1097 |
| CESC | 307 | 48.3 ± 13.8 | 0/307 | 211/30/30/36 | 0/163/70/46/21/7 | 18/136/120/1/32 |
| CHOL | 45 | 63.6 ± 12.2 | 20/25 | 38/3/3/1 | 0/20/11/4/10/0 | 1/22/20/2/0 |
| COAD | 459 | 66.9 ± 13.1 | 243/216 | 214/59/12/174 | 0/76/178/129/65/11 | 0/0/0/0/459 |
| DLBC | 48 | 56.3 ± 13.9 | 22/26 | 29/1/18/0 | 0/8/17/5/12/6 | 0/0/0/0/48 |
| ESCA | 185 | 62.5 ± 11.9 | 158/27 | 114/5/46/20 | 0/18/79/56/9/23 | 19/77/49/0/40 |
| GBM | 596 | 57.8 ± 14.4 | 366/230 | 507/51/13/25 | 0/0/0/0/0/596 | 0/0/0/596/0 |
| HNSC | 528 | 60.9 ± 11.9 | 386/142 | 452/48/13/15 | 0/27/74/82/270/75 | 63/311/125/7/22 |
| KICH | 113 | 51.2 ± 13.9 | 62/51 | 95/12/4/2 | 0/54/33/19/7/0 | 0/0/0/0/113 |
| KIRC | 537 | 60.6 ± 12.2 | 346/191 | 466/56/8/7 | 0/269/57/125/83/3 | 14/230/207/78/8 |
| KIRP | 291 | 61.5 ± 12.1 | 214/77 | 207/61/8/15 | 0/173/21/52/15/30 | 0/0/0/0/291 |
| LAML | 200 | 55.0 ± 16.1 | 109/91 | 181/15/2/2 | 0/0/0/0/0/200 | 0/0/0/0/200 |
| LGG | 515 | 42.9 ± 13.4 | 285/230 | 475/21/9/10 | 0/0/0/0/0/515 | 0/249/265/0/1 |
| LIHC | 377 | 59.5 ± 13.5 | 255/122 | 187/17/163/10 | 0/175/87/86/5/24 | 55/180/124/13/5 |
| LUAD | 522 | 65.3 ± 10.0 | 242/280 | 393/53/9/67 | 0/279/124/85/26/8 | 0/0/0/0/522 |
| LUSC | 504 | 67.3 ± 8.6 | 373/131 | 351/31/9/113 | 0/245/163/85/7/4 | 0/0/0/0/504 |
| MESO | 87 | 63.0 ± 9.8 | 71/16 | 85/1/1/0 | 0/10/16/45/16/0 | 0/0/0/0/87 |
| OV | 587 | 59.7 ± 11.5 | 0/587 | 498/34/24/31 | 0/17/30/446/89/5 | 6/69/495/1/16 |
| PAAD | 185 | 64.9 ± 11.1 | 102/83 | 162/7/11/5 | 0/21/152/4/5/3 | 32/97/51/2/3 |
| PCPG | 179 | 47.3 ± 15.1 | 78/101 | 148/20/7/4 | 0/0/0/0/0/179 | 0/0/0/0/179 |
| PRAD | 500 | 61.0 ± 6.8 | 500/0 | 147/7/2/344 | 0/0/0/0/0/500 | 0/0/0/0/500 |
| READ | 170 | 64.5 ± 11.9 | 92/78 | 82/6/1/81 | 0/33/51/52/25/9 | 0/0/0/0/170 |
| SARC | 261 | 60.9 ± 14.7 | 119/142 | 228/18/6/9 | 0/0/0/0/0/261 | 0/0/0/0/261 |
| SKCM | 470 | 58.2 ± 15.7 | 290/180 | 447/1/12/10 | 7/77/140/171/23/52 | 0/0/0/0/470 |
| STAD | 443 | 65.7 ± 10.8 | 285/158 | 278/13/90/62 | 0/59/130/183/44/27 | 12/159/263/0/9 |
| TGCT | 134 | 32.0 ± 9.3 | 134/0 | 119/6/4/5 | 0/101/12/14/0/7 | 0/0/0/0/134 |
| THCA | 507 | 47.3 ± 15.8 | 136/371 | 334/27/53/93 | 0/285/52/113/55/2 | 0/0/0/0/507 |
| THYM | 124 | 58.2 ± 13.0 | 64/60 | 103/6/13/2 | 0/38/61/15/8/2 | 0/0/0/0/124 |
| UCEC | 548 | 63.9 ± 11.1 | 0/548 | 374/109/33/32 | 0/342/52/124/30/0 | 99/122/327/0/0 |
| UCS | 57 | 69.7 ± 9.3 | 0/57 | 44/9/3/1 | 0/22/5/20/10/0 | 0/0/0/0/57 |
| UVM | 80 | 61.6 ± 13.9 | 45/35 | 55/0/0/25 | 0/0/39/36/4/1 | 0/0/0/0/80 |
ACC, adrenocortical carcinoma; CHOL, cholangiocarcinoma; KIRC, kidney renal clear cell carcinoma; LIHC, liver hepatocellular carcinoma; READ, rectum adenocarcinoma; TGCT, testicular germ cell tumor; THCA, thyroid carcinoma; UCS, uterine carcinosarcoma; NA, not applicable.
51 cases are missing age at diagnosis, and 46 patients were 90 years of age or older and were capped at 90 years because of Health Insurance Portability and Accountability Act (HIPAA) regulations.
Included AJCC stage for most cancer types; clinical stages for CESC, DLBC, OV, UCEC, and UCS; and Masaoka stage for THYM. In the detailed data file shown in Table S1, tab TCGA-CDR, all originally reported stage types were retained.
In BLCA, G1 was for “low grade” and G3 for “high grade” in this table; UCEC had 11 high grade, which was converted to G3 (the highest for this disease) in this table. All original values were retained in Table S1, tab TCGA-CDR.
GBM is grade IV by definition. In the original TCGA dataset, the grades for GBM cases were not provided.
We realized that OV should not have a grade IV disease but reported the data as in the original TCGA dataset.
For SKCM, the majority of tumors were from bulky regional lymph node metastases or distant metastases, and the patients’ initial diagnosis years of non-metastatic diseases, including stage 0 disease, were provided (in situ; see STAR Methods). No other cancer types had a stage 0 cancer diagnosis.
Median Follow-Up Times Overall and the Median Time to Event and to Censor for the Four Clinical Outcome Endpoints
| Cancer Type | Median Follow-Up Time (Months) | OS Median Time (Months) | PFI Median Time (Months) | DFI Median Time (Months) | DSS Median Time (Months) | ||||
|---|---|---|---|---|---|---|---|---|---|
| All | To Event | To Censor | To Event | To Censor | To Event | To Censor | To Event | To Censor | |
| ACC | 38.9 | 18.1 | 47.8 | 8.1 | 49.2 | 20.0 | 61.0 | 18.0 | 47.8 |
| BLCA | 17.6 | 13.5 | 21.0 | 9.7 | 17.8 | 14.8 | 19.1 | 13.6 | 19.4 |
| BRCA | 27.7 | 41.8 | 25.0 | 26.0 | 25.0 | 25.4 | 25.0 | 32.6 | 26.0 |
| CESC | 20.9 | 19.9 | 22.6 | 13.6 | 21.7 | 15.9 | 28.3 | 18.0 | 23.0 |
| CHOL | 21.6 | 18.0 | 30.1 | 7.1 | 22.4 | 7.1 | 25.3 | 18.3 | 28.3 |
| COAD | 22.0 | 13.3 | 24.0 | 12.0 | 22.5 | 16.0 | 29.3 | 11.1 | 24.0 |
| DLBC | 26.7 | 19.5 | 31.1 | 10.3 | 29.2 | 113.7 | 31.4 | 16.2 | 29.2 |
| ESCA | 13.1 | 11.5 | 13.2 | 8.8 | 12.6 | 7.4 | 13.2 | 13.5 | 12.9 |
| GBM | 12.0 | 12.6 | 8.5 | 6.1 | 5.9 | 31.5 | 26.3 | 12.7 | 8.4 |
| HNSC | 21.2 | 14.1 | 27.9 | 9.4 | 25.7 | 7.6 | 27.5 | 13.5 | 25.9 |
| KICH | 48.3 | 24.3 | 54.2 | 11.9 | 49.7 | 52.7 | 39.6 | 28.1 | 51.0 |
| KIRC | 39.0 | 26.9 | 47.8 | 13.5 | 43.0 | 29.6 | 45.4 | 23.2 | 46.4 |
| KIRP | 25.2 | 21.1 | 25.4 | 11.0 | 25.5 | 15.5 | 25.4 | 14.2 | 26.0 |
| LAML | 12.0 | 9.0 | 23.0 | NA | NA | NA | NA | NA | NA |
| LGG | 22.1 | 26.7 | 20.7 | 15.3 | 18.7 | 19.6 | 20.1 | 25.5 | 20.7 |
| LIHC | 19.7 | 13.7 | 21.3 | 9.0 | 15.6 | 9.0 | 17.6 | 19.8 | 19.7 |
| LUAD | 21.6 | 20.3 | 22.0 | 14.4 | 20.0 | 15.7 | 22.5 | 19.9 | 21.8 |
| LUSC | 21.9 | 18.1 | 24.9 | 14.0 | 21.1 | 18.0 | 26.9 | 18.8 | 22.5 |
| MESO | 16.9 | 15.0 | 38.4 | 10.3 | 19.4 | 15.5 | 9.8 | 14.7 | 24.9 |
| OV | 33.0 | 35.3 | 27.7 | 14.7 | 14.9 | 17.9 | 26.5 | 35.3 | 28.9 |
| PAAD | 15.3 | 12.9 | 17.0 | 11.2 | 13.8 | 14.8 | 15.7 | 13.8 | 15.9 |
| PCPG | 24.8 | 14.9 | 25.2 | 19.9 | 23.8 | 27.3 | 24.5 | 17.5 | 25.0 |
| PRAD | 30.5 | 36.2 | 30.5 | 18.4 | 28.2 | 24.9 | 30.4 | 43.7 | 30.5 |
| READ | 20.0 | 22.0 | 20.0 | 19.0 | 19.0 | 27.8 | 21.0 | 20.0 | 20.0 |
| SARC | 31.1 | 21.3 | 35.9 | 10.1 | 32.7 | 11.2 | 36.5 | 22.6 | 34.9 |
| SKCM | 35.9 | 35.3 | 36.9 | 23.5 | 22.7 | 21.8 | 23.8 | 36.6 | 34.3 |
| STAD | 14.0 | 11.3 | 17.2 | 9.5 | 13.8 | 10.8 | 18.6 | 12.4 | 16.1 |
| TGCT | 41.4 | 18.6 | 41.6 | 9.1 | 35.6 | 14.8 | 31.8 | 16.9 | 41.7 |
| THCA | 31.1 | 33.5 | 31.0 | 16.0 | 30.9 | 16.2 | 31.9 | 33.5 | 31.0 |
| THYM | 41.2 | 28.0 | 41.6 | 25.2 | 41.2 | 30.8 | 42.1 | 54.9 | 41.2 |
| UCEC | 29.9 | 23.3 | 31.2 | 16.8 | 29.7 | 17.1 | 30.7 | 21.9 | 30.7 |
| UCS | 20.1 | 17.1 | 27.2 | 9.0 | 26.9 | 16.6 | 27.2 | 14.7 | 26.9 |
| UVM | 25.8 | 19.9 | 27.0 | 12.5 | 25.0 | 12.2 | 26.2 | 19.9 | 27.0 |
Assessment and Recommended Use of the Endpoints of OS, PFI, DFI, and DSS
| Type | N | OS (Accurately Defined) | PFI (Accurately Defined) | DFI (Accurately Defined) | DSS (Approximately Defined) | Explanation/Caution | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Use | Event | Censored | Use | Event | Censored | Use | Event | Censored | Use | Event | Censored | |||
| ACC | 92 | 34 | 58 | 49 | 43 | 14 | 39 | 30 | 60 | number of events is small | ||||
| BLCA | 412 | 181 | 231 | 177 | 235 | 32 | 157 | 124 | 274 | |||||
| BRCA | 1097 | 151 | 946 | 145 | 952 | 84 | 869 | 83 | 995 | need a longer follow-up for OS and DSS | ||||
| CESC | 307 | 71 | 236 | 71 | 236 | 26 | 150 | 54 | 249 | |||||
| CHOL | 45 | 22 | 23 | 23 | 22 | 10 | 18 | 18 | 24 | sample size is too small for OS, DSS, DFI, and PFI | ||||
| COAD | 459 | 102 | 357 | 123 | 336 | 24 | 166 | 64 | 379 | |||||
| DLBC | 48 | X | 9 | 39 | 12 | 36 | 4 | 24 | 4 | 44 | sample size and number of events are too small, need a longer follow-up | |||
| ESCA | 185 | 77 | 108 | 87 | 98 | 23 | 66 | 51 | 132 | |||||
| GBM | 596 | 491 | 105 | 506 | 90 | 2 | 1 | 445 | 110 | number of disease-free cases is small | ||||
| HNSC | 528 | 223 | 305 | 198 | 330 | 28 | 106 | 130 | 372 | |||||
| KICH | 113 | 13 | 100 | 17 | 96 | 6 | 65 | 10 | 103 | number of events is too small, need a longer follow-up | ||||
| KIRC | 537 | 177 | 360 | 162 | 375 | 15 | 102 | 110 | 415 | number of events is small | ||||
| KIRP | 291 | 44 | 247 | 58 | 233 | 28 | 156 | 28 | 259 | |||||
| LAML | 200 | 133 | 67 | NA | NA | NA | NA | NA | NA | NA | NA | NA | only has OS data | |
| LGG | 515 | 125 | 390 | 192 | 323 | 20 | 114 | 113 | 394 | need a longer follow-up for OS and DSS | ||||
| LIHC | 377 | 132 | 245 | 185 | 192 | 147 | 176 | 80 | 288 | need a longer follow-up DSS | ||||
| LUAD | 522 | 188 | 334 | 213 | 309 | 92 | 217 | 116 | 370 | |||||
| LUSC | 504 | 219 | 285 | 149 | 355 | 63 | 241 | 91 | 361 | |||||
| MESO | 87 | 74 | 13 | 61 | 26 | 7 | 8 | 43 | 23 | sample size for DFI is small | ||||
| OV | 587 | 349 | 236 | 414 | 173 | 196 | 90 | 302 | 246 | |||||
| PAAD | 185 | 100 | 85 | 110 | 75 | 23 | 46 | 79 | 99 | |||||
| PCPG | 179 | 6 | 173 | 21 | 158 | 4 | 156 | 4 | 175 | need a longer follow-up for OS, DSS, DFI, and PFI; number of events is small | ||||
| PRAD | 500 | 10 | 490 | 93 | 407 | 30 | 310 | 5 | 493 | need a longer follow-up for OS and DSS | ||||
| READ | 170 | 26 | 144 | 39 | 131 | 7 | 41 | 15 | 149 | need a longer follow-up for OS, DSS, and DFI; number of events for DFI is too small | ||||
| SARC | 261 | 99 | 162 | 139 | 122 | 67 | 86 | 81 | 174 | |||||
| SKCM | 470 | 216 | 247 | 309 | 154 | NA | NA | NA | 190 | 267 | no information to derive DFI | |||
| STAD | 443 | 172 | 271 | 143 | 300 | 46 | 213 | 103 | 313 | |||||
| TGCT | 134 | 4 | 130 | 35 | 99 | 27 | 78 | 3 | 131 | number of events is small for OS and DSS; need a longer follow-up | ||||
| THCA | 507 | 16 | 491 | 52 | 455 | 26 | 332 | 7 | 494 | number of events is small for OS and DSS; need a longer follow-up | ||||
| THYM | 124 | 9 | 115 | 22 | 102 | NA | NA | NA | 4 | 120 | number of events is too small for OS and DSS; need a longer follow-up; no information to derive DFI | |||
| UCEC | 548 | 91 | 457 | 124 | 424 | 57 | 369 | 60 | 486 | |||||
| UCS | 57 | 35 | 22 | 37 | 20 | 10 | 17 | 31 | 24 | sample size is small | ||||
| UVM | 80 | 23 | 57 | 30 | 50 | NA | NA | NA | 21 | 59 | no information to derive DFI | |||
recommended for use (passed at least passed one of the 3 tests in step 1 and the supplemental checks in step 2 as described in the STAR Methods);
not recommended for use;
caution, see the explanation/caution column; app., approximate; acc., accurate.
Figure 2Validation and Application Examples
(A–D) Clinical survival outcomes of breast cancer patients with ER+ or ER− tumors using (A) OS, (B) PFI, (C) DFI, and (D) DSS as the endpoint, respectively. Plots were truncated at 10 years, but the analyses were conducted using all of the data.
(E–G) Pairwise plots comparing natural logHRs for event development measured by three different clinical endpoints (OS, DSS, and PFI), with natural logHRs calculated for high-stage (III, IV) disease relative to low-stage (I, II) disease: (E) PFI versus OS, (F) PFI versus DSS, and (G) OS versus DSS. Here 14 diseases for which these three endpoints are recommended for use are analyzed. The blue diagonal lines illustrate equal logHRs measured by each endpoint pair. Error bars represents the standard errors of logHR. Red error bars show models that did not satisfy the PHs assumption.
NTE Development from Patients Who Were Never Disease-Free Compared with Those Who Were Once Disease-Free
| Type | No. | Disease-Free | With Disease | HR | 95% CI | ||
|---|---|---|---|---|---|---|---|
| No. | Event (n) | No. | Event (n) | ||||
| ACC | 92 | 52 | 13 | 11 | 10 | 14.48 | 5.59-37.5 |
| BLCA | 412 | 177 | 32 | 49 | 40 | 9.68 | 5.98-15.67 |
| BRCA | 1097 | 890 | 80 | 31 | 4 | 1.81 | 0.66-4.95 |
| CESC | 307 | 172 | 26 | 16 | 7 | 4.7 | 2.02-10.94 |
| CHOL | 45 | 23 | 7 | 2 | 2.31 | 0.48-11.19 | |
| COAD | 459 | 187 | 22 | 24 | 15 | 10.13 | 5.19-19.74 |
| DLBC | 48 | 27 | 4 | 3 | 21.99 | 2.25-215.2 | |
| ESCA | 185 | 78 | 17 | 13 | 10 | 3.84 | 1.75-8.41 |
| GBM | 596 | 2 | 17 | 14 | 6.62 | 0.83-52.61 | |
| HNSC | 528 | 132 | 28 | 7 | 9.47 | 4.01-22.41 | |
| KICH | 113 | 68 | 6 | 2 | 15.61 | 2.17-112.59 | |
| KIRC | 537 | 112 | 15 | 4 | |||
| KIRP | 291 | 175 | 24 | 4 | 5.96 | 2.04-17.45 | |
| LGG | 515 | 124 | 20 | 292 | 136 | 3.12 | 1.95-5 |
| LIHC | 377 | 277 | 131 | 16 | 11 | 1.8 | 0.97-3.34 |
| LUAD | 522 | 296 | 90 | 85 | 59 | 3.96 | 2.83-5.53 |
| LUSC | 504 | 289 | 63 | 41 | 28 | 6.68 | 4.25-10.51 |
| MESO | 87 | 14 | 7 | 7 | 1.64 | 0.57-4.71 | |
| OV | 587 | 284 | 196 | 91 | 79 | 3.78 | 2.86-5 |
| PAAD | 185 | 65 | 22 | 54 | 46 | 4.47 | 2.65-7.55 |
| PCPG | 179 | 140 | 4 | 7 | 21.13 | 5.85-76.29 | |
| PRAD | 500 | 338 | 28 | 96 | 50 | 8.47 | 5.32-13.49 |
| READ | 170 | 47 | 7 | 4 | 7.28 | 1.91-27.78 | |
| SARC | 261 | 146 | 61 | 69 | 40 | 1.92 | 1.28-2.86 |
| STAD | 443 | 236 | 42 | 64 | 49 | 8.78 | 5.7-13.53 |
| TGCT | 134 | 100 | 23 | 17 | 3 | ||
| THCA | 507 | 351 | 25 | 42 | 7 | 2.46 | 1.06-5.69 |
| UCEC | 548 | 404 | 55 | 21 | 13 | 6.95 | 3.78-12.76 |
| UCS | 57 | 25 | 10 | 11 | 11 | 20.91 | 5.58-78.29 |
| Pan-cancer | 11,160 | 5,232 | 1,060 | 1,123 | 672 | 4.47 | 4.06-4.93 |
, FDR-adjusted q value < 0.05;
, model with a nominal p value from log rank test < 0.05. Italic HR and 95% CI indicate models we recommend for additional testing of PHs assumptions (c.f. Table S1, tab Table4_PHAssumptionTests).
To overcome immortal time bias, we restricted the patient set to those surviving at least 3 months. This provides a proxy for the time required for a patient to complete treatment and be identified as disease-free.
HR and 95% CI were calculated from Cox PHs regression models, using disease-free as reference.
Italic and underscored, number of patients at risk < 10.
Comparing Outcomes from the Top Two TSSs that Provided Most Cases (at Least 50) of Each Cancer Type Studied
| Type | Site | OS | PFI | DFI | DSS | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No. | Event | HR | 95% CI | No. | Event | HR | 95% CI | No. | Event | HR | 95% CI | No. | Event | HR | 95% CI | ||
| BLCA | first | 52 | 18 | 0.58 | 0.36-0.96 | 52 | 20 | 0.67 | 0.42-1.08 | 25 | 4 | 0.61 | 0.21-1.77 | 51 | 12 | 0.57 | 0.31-1.05 |
| second | 51 | 34 | 1.16 | 0.79-1.7 | 51 | 25 | 0.92 | 0.6-1.42 | 19 | 2 | 0.31 | 0.07-1.32 | 46 | 22 | 1.13 | 0.7-1.82 | |
| others | 308 | 128 | 309 | 125 | 145 | 26 | 300 | 89 | |||||||||
| BRCA | first | 150 | 48 | 150 | 23 | 111 | 3 | 0.23 | 0.07-0.74 | 140 | 21 | 2.13 | 1.27-3.56 | ||||
| second | 102 | 15 | 102 | 16 | 83 | 9 | 0.8 | 0.4-1.61 | 102 | 10 | 1.17 | 0.59-2.3 | |||||
| others | 844 | 88 | 844 | 94 | 758 | 72 | 835 | 52 | |||||||||
| CESC | first | 64 | 29 | 1.97 | 1.12-3.47 | 64 | 19 | 1.35 | 0.75-2.42 | 31 | 8 | 1.56 | 0.65-3.75 | 62 | 18 | 1.65 | 0.85-3.2 |
| second | 54 | 20 | 2.27 | 1.23-4.19 | 54 | 23 | 2.12 | 1.23-3.66 | 30 | 4 | 0.86 | 0.28-2.62 | 54 | 18 | 2.39 | 1.24-4.62 | |
| others | 189 | 22 | 189 | 26 | 115 | 14 | 187 | 18 | |||||||||
| COAD | first | 173 | 31 | 173 | 42 | 84 | 8 | 0.65 | 0.27-1.61 | 173 | 28 | ||||||
| second | 52 | 11 | 52 | 18 | 27 | 3 | 0.7 | 0.2-2.51 | 52 | 6 | |||||||
| others | 233 | 60 | 233 | 63 | 79 | 13 | 217 | 30 | |||||||||
| GBM | first | 155 | 128 | 155 | 138 | 1.21 | 0.98-1.48 | NA | NA | NA | NA | 152 | 124 | ||||
| second | 93 | 87 | 93 | 90 | 0.84 | 0.66-1.07 | NA | NA | NA | NA | 90 | 84 | |||||
| others | 348 | 276 | 348 | 278 | 3 | 2 | 313 | 237 | |||||||||
| HNSC | first | 135 | 98 | 1.85 | 1.37-2.51 | 135 | 59 | 1.02 | 0.74-1.42 | 21 | 4 | 0.44 | 0.12-1.55 | 121 | 48 | 1.6 | 1.08-2.37 |
| second | 74 | 35 | 1.56 | 1.05-2.3 | 74 | 25 | 0.99 | 0.65-1.51 | 13 | 4 | 1.37 | 0.46-4.08 | 73 | 21 | 1.37 | 0.83-2.24 | |
| others | 318 | 90 | 318 | 105 | 99 | 20 | 307 | 61 | |||||||||
| KIRC | first | 142 | 39 | 141 | 30 | 0.85 | 0.55-1.3 | NA | NA | NA | NA | 141 | 22 | 1.03 | 0.62-1.73 | ||
| second | 107 | 70 | 107 | 59 | 2.3 | 1.63-3.26 | 29 | 6 | 1.14 | 0.39-3.34 | 102 | 45 | 2.88 | 1.89-4.38 | |||
| others | 288 | 68 | 287 | 71 | 88 | 9 | 282 | 43 | |||||||||
| LGG | first | 104 | 9 | 104 | 18 | 0.73 | 0.45-1.21 | 44 | 3 | 0.40 | 0.12-1.36 | 104 | 7 | ||||
| second | 86 | 44 | 86 | 59 | 2.00 | 1.46-2.75 | NA | NA | NA | NA | 81 | 39 | |||||
| others | 324 | 72 | 324 | 115 | 90 | 17 | 321 | 67 | |||||||||
| OV | first | 111 | 77 | 0.59 | 0.45-0.78 | 111 | 92 | 0.94 | 0.74-1.2 | 63 | 51 | 1 | 0.72-1.41 | 107 | 67 | 0.58 | 0.44-0.78 |
| second | 99 | 72 | 1.28 | 0.98-1.68 | 99 | 69 | 1.31 | 1-1.71 | 60 | 44 | 1.77 | 1.24-2.53 | 94 | 67 | 1.4 | 1.06-1.87 | |
| others | 372 | 199 | 372 | 252 | 163 | 101 | 344 | 167 | |||||||||
| PRAD | first | 97 | 0 | NA | NA | 97 | 13 | 0.58 | 0.32-1.04 | 85 | 3 | 0.27 | 0.08-0.9 | 97 | 0 | NA | #N/A |
| second | 65 | 0 | NA | NA | 65 | 6 | 0.71 | 0.33-1.56 | 43 | 1 | 0.26 | 0.03-1.9 | 65 | 0 | NA | #N/A | |
| others | 338 | 10 | 338 | 67 | 212 | 26 | 336 | 5 | |||||||||
| SKCM | first | 92 | 55 | 0.73 | 0.53-1 | 92 | 73 | NA | NA | NA | NA | 89 | 48 | 0.71 | 0.51-1 | ||
| second | 68 | 28 | 0.54 | 0.36-0.82 | 68 | 59 | NA | NA | NA | NA | 67 | 24 | 0.53 | 0.34-0.83 | |||
| others | 295 | 131 | 296 | 176 | NA | NA | 293 | 116 | |||||||||
| STAD | first | 137 | 56 | 1.57 | 1.1-2.24 | 137 | 26 | 91 | 16 | 123 | 17 | 0.66 | 0.38-1.15 | ||||
| second | 67 | 42 | 1.59 | 1.08-2.34 | 67 | 38 | 29 | 2 | 0.05-0.88 | 64 | 32 | 1.62 | 1.04-2.52 | ||||
| others | 233 | 71 | 235 | 76 | 137 | 28 | 223 | 52 | |||||||||
| THCA | first | 93 | 0 | NA | NA | 93 | 1 | 0.44 | 0.17-1.12 | 81 | 1 | 0.12 | 0.02-0.91 | 93 | 0 | NA | #N/A |
| second | 81 | 0 | NA | NA | 81 | 5 | 0.62 | 0.24-1.57 | 61 | 3 | 0.65 | 0.19-2.2 | 81 | 0 | NA | #N/A | |
| others | 333 | 16 | 333 | 42 | 216 | 22 | 327 | 7 | |||||||||
| UCEC | first | 78 | 10 | 1.66 | 0.84-3.26 | 78 | 10 | 1.02 | 0.54-1.91 | 70 | 6 | 1.16 | 0.49-2.74 | 78 | 3 | 0.66 | 0.2-2.12 |
| second | 67 | 10 | 0.97 | 0.5-1.88 | 67 | 12 | 0.95 | 0.54-1.66 | 30 | 5 | 1.03 | 0.41-2.59 | 65 | 7 | 0.95 | 0.43-2.09 | |
| others | 402 | 71 | 402 | 91 | 326 | 46 | 402 | 50 | |||||||||
, significant model with a nominal p < 0.05;
, FDR-adjusted q value < 0.05. Italic HR and 95% CI indicate models we recommend for additional testing of PHs assumptions (c.f. Table S1, tab Table5_PHAssumptionTests). K-M plots comparing the survival data from top two TSSs with those from patients of other sites (i.e., not from the top two TSSs) for each cancer type without restricting to TSSs supplying at least 50 cases are shown in Figure S5.
HR and 95% CI were calculated from Cox PHs regression models.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Deposited Data | ||
| TCGA clinical data (33 tumor types; 225 files) | Genomic Data Commons data portal | |
| Tissue Source Site Codes | National Cancer Institute | |
| Genomic Data Commons | ||
| Sample Submission Form | National Cancer Institute | |
| GDC Legacy Archive | ||
| TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR) | This paper | |
| Software and Algorithms | ||
| R 3.2.2 | ||