| Literature DB >> 29991697 |
Peter Smittenaar1, Alexandra K Walker2, Shaun McGill2, Christiana Kartsonaki3,4, Rupesh J Robinson-Vyas1, Janette P McQuillan1, Sarah Christie1, Leslie Harris1, Jonathan Lawson1, Elizabeth Henderson2, Will Howat5, Andrew Hanby6, Gareth J Thomas7, Selina Bhattarai8, Lisa Browning9,10, Anne E Kiltie11.
Abstract
BACKGROUND: Immunohistochemistry (IHC) is often used in personalisation of cancer treatments. Analysis of large data sets to uncover predictive biomarkers by specialists can be enormously time-consuming. Here we investigated crowdsourcing as a means of reliably analysing immunostained cancer samples to discover biomarkers predictive of cancer survival.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29991697 PMCID: PMC6048059 DOI: 10.1038/s41416-018-0156-0
Source DB: PubMed Journal: Br J Cancer ISSN: 0007-0920 Impact factor: 7.640
Fig. 1Typical 0.6 um TMA core, stained with DAB and haematoxylin counterstain, and split into 6 × 6 grid. Upper left panel shows contents of red bound square colour transformed for use in the app by citizen scientists
Overview of markers, classifications and expert vs. crowdsourcing scores
| Marker | Localisation | Total # cores | Total # classifications | Average # classifications per core | # Cores scored by experts (% of cores) | ‘Proportion stained’ category | H-score, Spearman correlation with experts (95% CI) | Proportion cancer cells stained, Spearman correlation with experts (95% CI) | Intensity of staining, quadratic-weighted kappa with experts (95% CI) |
|---|---|---|---|---|---|---|---|---|---|
| MRE11 original | Nuclear | 831 | 910,008 | 1095 | 88 (11%) | 1 | 0.67 (0.52, 0.80) | 0.44 (0.22, 0.63) | 0.47 (0.30, 0.61) |
| RAD50 | Nuclear | 786 | 455,429 | 579 | 106 (13%) | 1 | 0.81 (0.71, 0.88) | 0.39 (0.19, 0.57) | 0.73 (0.56, 0.85) |
| p21 | Nuclear | 845 | 1,040,933 | 1232 | 95 (11%) | 1 | 0.90 (0.84, 0.93) | 0.87 (0.80, 0.91) | 0.57 (0.39, 0.72) |
| 53BP1 | Nuclear | 841 | 201,390 | 239 | 85 (10%) | 1 | 0.70 (0.55, 0.80) | 0.53 (0.34, 0.68) | 0.67 (0.52, 0.79) |
| p53 | Nuclear | 849 | 265,334 | 313 | 96 (11%) | 2 | 0.92 (0.86, 0.95) | 0.85 (0.77, 0.91) | 0.69 (0.57, 0.79) |
| CK5/6 | Membranous | 814 | 86,469 | 106 | 104 (13%) | 3 | 0.82 (0.71, 0.89) | 0.66 (0.50, 0.78) | 0.86 (0.77, 0.92) |
| CK20 | Membranous | 806 | 78,613 | 98 | 93 (12%) | 3 | 0.88 (0.83, 0.92) | 0.83 (0.75, 0.88) | 0.87 (0.79, 0.92) |
| TIP60 | Nuclear | 850 | 90,850 | 107 | 91 (11%) | 1 | 0.66 (0.52, 0.76) | 0.17 (−0.07, 0.39) | 0.59 (0.46, 0.70) |
| MRE11 new | Nuclear | 524 | 49,857 | 95 | 84 (16%) | 1 | 0.65 (0.49, 0.78) | 0.61 (0.44, 0.75) | 0.58 (0.41, 0.72) |
| MRE11 c-terminal | Nuclear | 376 | 102,417 | 272 | 79 (21%) | 1 | 0.79 (0.66, 0.86) | 0.66 (0.49, 0.78) | 0.70 (0.55, 0.80) |
| Ki67 | Nuclear | 775 | 68,637 | 89 | 86 (11%) | 2 | 0.80 (0.72, 0.86) | 0.80 (0.71, 0.87) | 0.19 (−0.01, 0.37) |
Classifications were performed on each segment of a core, with between 5 and 25 ratings per segment. These classifications were aggregated across segments to arrive at a single score per core. A subset of cores was scored by experts. The final three columns show correspondence between user and expert scores, presented as Spearman correlation (for H-score and proportion of cancer cells stained) or as quadratic-weighted kappa (for intensity of staining), with bootstrapped 95% CI. Proportion stained category 1: 0, 1–25, 25–50, 50–75, 75–95, 95–100%; category 2: 0, 1–10, 10–25, 25–50, 50–75, 75–100%; category 3: 0, 1–10, 10–25, 25–65, 65–95, 95–100%
Fig. 2Plot of user participation over time. a Number of classifications per week. b Cumulative percentage of all classifications as a function of time
Fig. 3a Scatter plots for individual IHC stains ranked in order of H-score Spearman rho. X-axes represent the expert scores and y-axes the citizen score. Diagonal line represents a perfect score whereby the expert score is identical to the crowdsourced score; b The relationship between number of classifications and accuracy. The y-axis represents the H-score Spearman rho between expert and crowdsourced scores, and the x-axis represents the number of classifications used for a core. The accuracy is estimated through bootstrapping with 1000 samples. The error bars represent the bootstrapped 95% confidence interval (2.5 and 97.5 percentile of bootstrapped samples)
Fig. 4Kaplan Meier survival curves for disease-specific survival. (a) MRE11, (b) RAD50 and (c) CK20 for 1995-9 radiotherapy cohort, and (d) CK20 for cystectomy cohort
Fig. 5Kaplan Meier survival curves for disease-specific survival. (a) MRE11, (b) RAD50 and (c) CK20 for 2002-5 radiotherapy cohort