| Literature DB >> 11914137 |
Stuart G Baker1, Barnett S Kramer, Sudhir Srivastava.
Abstract
BACKGROUND: Recently many long-term prospective studies have involved serial collection and storage of blood or tissue specimens. This has spurred nested case-control studies that involve testing some specimens for various markers that might predict cancer. Until now there has been little guidance in statistical design and analysis of these studies.Entities:
Mesh:
Substances:
Year: 2002 PMID: 11914137 PMCID: PMC100327 DOI: 10.1186/1471-2288-2-4
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Hypothetical data for a binary marker from a nested case-control study
| Marker - | Marker + | Total | |
| No cancer | 990 | 10 | 1000 |
| Cancer | 20 | 80 | 100 |
True positive rate (TPR) = 80/100 = .80 = sensitivity False positive rate (FPR) = 10/1000 = .01 = 1 – specificity
Hypothetical Data for an Ordered Marker From a Nested Case-Control Design
| 1 | 2 | 3 | 4 | 5 | ||
| No cancer | 960 | 20 | 10 | 8 | 2 | 1000 |
| Cancer | 0 | 10 | 10 | 20 | 60 | 100 |
For cutpoint 4, the true positive rate is (20+60)/100 = .80 For cutpoint 4, the false positive rate is (8 + 2)/1000 = .01
Figure 1ROC curve derived from hypothetical data in Table 2. Because acceptable cancer screening requires a very small false positive rate (FPR) and a reasonable true positive rate (TPR), only the four points farthest to the left are relevant for biomarker evaluation.
Hypothetical data for two ordered markers, A and B
| Subjects without cancer (N = 1000) | Subjects with cancer (N = 100) | |||||||
| A = 1 | A = 2 | A = 3 | A = 1 | A = 2 | A = 3 | |||
| B = 1 | 640 | 80 | 80 | 800 | 20 | 0 | 0 | 20 |
| B = 2 | 80 | 0 | ||||||
| B = 3 | 80 | 0 | ||||||
| 800 | 20 | |||||||
For A and B combined; aFalse positive rate (FPR) of indicated region (10+10+10+10)/1000=. 04; bTrue positive rate (TPR) of indicated region = (20 +0+0+60)/100=. 80 For A only; cFalse positive rate (FPR) of indicated region =(100+100)/1000=. 20; dTrue positive rate (TPR) of indicated region = (20+60)/100=. 80 For B only; eFalse positive rate (FPR) of indicated region =(100+100)/1000= .20; fTrue positive rate (TPR) of indicated region = (20+60)/100=.80