| Literature DB >> 26288840 |
Francisco J Candido Dos Reis1, Stuart Lynn2, H Raza Ali3, Diana Eccles4, Andrew Hanby5, Elena Provenzano6, Carlos Caldas3, William J Howat3, Leigh-Anne McDuffus3, Bin Liu3, Frances Daley7, Penny Coulson7, Rupesh J Vyas8, Leslie M Harris8, Joanna M Owens8, Amy F M Carton8, Janette P McQuillan8, Andy M Paterson8, Zohra Hirji8, Sarah K Christie8, Amber R Holmes8, Marjanka K Schmidt9, Montserrat Garcia-Closas7, Douglas F Easton10, Manjeet K Bolla11, Qin Wang11, Javier Benitez12, Roger L Milne13, Arto Mannermaa14, Fergus Couch15, Peter Devilee16, Robert A E M Tollenaar17, Caroline Seynaeve18, Angela Cox19, Simon S Cross20, Fiona M Blows21, Joyce Sanders9, Renate de Groot9, Jonine Figueroa22, Mark Sherman22, Maartje Hooning18, Hermann Brenner23, Bernd Holleczek24, Christa Stegmaier24, Chris Lintott2, Paul D P Pharoah10.
Abstract
BACKGROUND: Citizen science, scientific research conducted by non-specialists, has the potential to facilitate biomedical research using available large-scale data, however validating the results is challenging. The Cell Slider is a citizen science project that intends to share images from tumors with the general public, enabling them to score tumor markers independently through an internet-based interface.Entities:
Keywords: Breast cancer; Citizen science; Crowd science; Crowdsourcing
Mesh:
Substances:
Year: 2015 PMID: 26288840 PMCID: PMC4534635 DOI: 10.1016/j.ebiom.2015.05.009
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1Flowchart of study design showing details of score calculation (sub-image, image and tumor), comparison with pathologist evaluation and survival analysis.
Fig. 2Engagement in Cell Slider by Citizen Scientists. (A) Number of classifications by day since project launch; (B) histogram of number of classifications done by each Citizen Scientist.
Fig. 3Histogram of user performance final scores.
Fig. 4Accuracy of Citizen Scientist classifications. (A) Receiver operator characteristic curve for identification of cancer cells by Citizen Scientists in 3038 tumors from the SEARCH study compared to classification by a pathologist. (B) Frequency weighted scatterplot of Citizen Scientist pseudo-Allred score against the pathologist assigned Allred score for 2121 tumors from the SEARCH study. (C) Receiver operator characteristic curve for classification of ER status based on Citizen Scientist pseudo-Allred score against the pathologist classification for 2121 tumors from the SEARCH study; (D) receiver operator characteristic curve for classification of ER status based on Citizen Scientist pseudo-Allred score against the ER status recorded in the Breast Cancer Association Consortium data base from a variety of sources for 10,679 tumors from ten studies.
Distribution of pseudo Allred scores according to ER status.
| ER status | Pseudo Allred score | |||
|---|---|---|---|---|
| < 2 | 2–3 | 4–5 | > 5 | |
| SEARCH TMAs (2121 tumors) | ||||
| ER negative | 481 (22.68%) | 26 (1.23%) | 2 (0.09%) | 1 (0.05%) |
| ER positive | 101 (4.76%) | 228 (10.75%) | 386 (18.20%) | 896 (42.24%) |
| BCAC TMAs (2842 tumors) | ||||
| ER negative | 265 (9.32%) | 177 (6.23%) | 127 (4.47%) | 200 (7.04%) |
| ER positive | 69 (2.43%) | 132 (4.64%) | 284 (9.99% | 1588 (58.88%) |
Performance of Citizen Scientists to identify cancer cells and classify ER staining.
| Identification of cancer cells in SEARCH study | Obs. | ROC area | 95% CI |
|---|---|---|---|
| All original scores | 3082 | 0.951 | 0.943 to 0.960 |
| Scores from CS with 5 or more scores | 3082 | 0.951 | 0.942 to 0.960 |
| All original scores with UPS-weighting | 3082 | 0.951 | 0.942 to 0.959 |
| Classification of ER in SEARCH study | Obs. | ROC area | 95% CI |
| All original scores | 2121 | 0.968 | 0.961 to 0.974 |
| Scores from CS with 5 or more scores | 2121 | 0.967 | 0.960 to 0.974 |
| All original scores with UPS-weighting | 2121 | 0.965 | 0.958 to 0.972 |
| Correlation with the pathologist in SEARCH study | Obs. | Spearman rho | 95% CI |
| All original scores | 2121 | 0.898 | 0.890 to 0.906 |
| Scores from CS with 5 or more scores | 2121 | 0.896 | 0.888 to 0.904 |
| All original scores with UPS-weighting | 2121 | 0.894 | 0.885 to 0.902 |
| Classification of ER in BCAC | Obs. | ROC area | 95% CI |
| All original scores | 2842 | 0.822 | 0.804 to 0.840 |
| Scores from CS with 5 or more scores | 2842 | 0.821 | 0.803 to 0.839 |
| All original scores with UPS-weighting | 2842 | 0.820 | 0.802 to 0.838 |
CS: Citizen Scientists.
UPS: final user performance score.
Fig. 5Kaplan–Meier estimates of cumulative survival of 4947 patients. (A) ER status classified by Citizen Scientists. (B) ER status as recorded in BCAC database.
Estimated hazard ratios (HR) for all-cause mortality in 4947 breast cancer patients from multi-variable Cox proportional hazards model after multiple imputations of missing data for stage and grade.
| Variable | Cox model with ER evaluated by Citizen Scientists | Cox model with ER reported in BCAC data base | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI | p-Value | HR | 95% CI | p-Value | |
| Age | 1.00 | 0.99–1.00 | 0.998 | 1.00 | 0.99–1.00 | 0.875 |
| Stage | ||||||
| I | Ref | Ref | ||||
| II | 2.33 | 1.91–2.84 | < 0.001 | 2.32 | 1.90–2.83 | < 0.001 |
| III | 4.97 | 3.76–6.57 | < 0.001 | 4.97 | 3.76–6.56 | < 0.001 |
| IV | 18.84 | 12.36–28.73 | < 0.001 | 18.39 | 12.12–27.89 | < 0.001 |
| Grade | ||||||
| 1 | Ref | Ref | ||||
| 2 | 1.58 | 1.24–2.01 | < 0.001 | 1.55 | 1.22–1.98 | < 0.001 |
| 3 | 2.45 | 1.91–3.13 | < 0.001 | 2.30 | 1.79–2.97 | < 0.001 |
| ER positive | 0.26 | 0.18–0.37 | < 0.001 | 0.24 | 0.18–0.33 | < 0.001 |
| ER positive TVC | 1.23 | 1.15–1.31 | < 0.001 | 1.23 | 1.17–1.30 | < 0.001 |
Time varying covariate.