| Literature DB >> 22258521 |
Naomi R Wray1, Sang Hong Lee, Kenneth S Kendler.
Abstract
Disorders that share genetic risk factors often are placed in closely related diagnostic categories and treated similarly. Until recently, evidence for shared genetic etiology derived from classical research strategies--coaggregation in family and twin studies. Accumulating sufficient numbers of families was often problematic. However, in the era of genome-wide genotyping, we can now directly estimate the degree of sharing of genetic risk factors between disorders. This strategy is practical even for very rare disorders, where it is infeasible to ascertain informative families. Importantly, the estimates of genetic correlations from genome-wide genotypes are derived using such distant relatives that contamination by shared environmental factors seems unlikely. However, any method that seeks to quantify the shared etiology of disorders assumes they can be distinguished diagnostically from one another without error. Here we investigate the impact of misdiagnosis on estimates of genetic correlation both from traditional family data and from genome-wide genotypes of case-control samples from unrelated individuals. Our analyses show similar results for levels of misdiagnosis in both types of data. In both scenarios, genetic variances and heritabilities tend to be slightly underestimated but genetic correlations are overestimated, sometimes substantially so. For example, two genetically distinct but equally heritable disorders each with prevalence 1%, can generate false-positive estimates of genetic correlations of >0.2 in the presence of 10% reciprocal misdiagnosis. Strategies for minimizing the effects of misdiagnosis in cross-disorder genetic studies are discussed.Entities:
Mesh:
Year: 2012 PMID: 22258521 PMCID: PMC3355255 DOI: 10.1038/ejhg.2011.257
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Impact of misclassification between schizophrenia (disorder A) and bipolar disorder (disorder B) on estimation of genetic parameters from recurrence risks in first-degree relatives
| 0 | 0 | 1.00 | 1.00 | 76 | 76 | 0 | 76 | 76 | 25 | 76 | 76 | 50 |
| 5 | 5 | 1.00 | 1.00 | 72 | 72 | 21 | 73 | 73 | 39 | 74 | 74 | 59 |
| 10 | 10 | 1.00 | 1.00 | 68 | 68 | 37 | 69 | 69 | 51 | 71 | 71 | 67 |
| 15 | 15 | 1.00 | 1.00 | 65 | 65 | 50 | 66 | 66 | 61 | 69 | 69 | 74 |
| 20 | 20 | 1.00 | 1.00 | 62 | 62 | 62 | 64 | 64 | 70 | 67 | 67 | 80 |
| 30 | 30 | 1.00 | 1.00 | 56 | 56 | 82 | 59 | 59 | 86 | 63 | 63 | 91 |
| 40 | 40 | 1.00 | 1.00 | 52 | 52 | 95 | 56 | 56 | 96 | 61 | 61 | 98 |
| 50 | 50 | 1.00 | 1.00 | 51 | 51 | 100 | 55 | 55 | 100 | 60 | 60 | 100 |
| 0 | 5 | 1.05 | 0.95 | 74 | 75 | 11 | 74 | 75 | 32 | 75 | 75 | 54 |
| 0 | 10 | 1.10 | 0.90 | 71 | 74 | 20 | 72 | 74 | 38 | 74 | 74 | 58 |
| 0 | 15 | 1.15 | 0.85 | 69 | 73 | 28 | 71 | 73 | 44 | 73 | 73 | 62 |
| 0 | 20 | 1.20 | 0.80 | 67 | 71 | 34 | 69 | 71 | 48 | 72 | 71 | 65 |
| 0 | 30 | 1.30 | 0.70 | 65 | 69 | 45 | 67 | 69 | 57 | 71 | 69 | 70 |
Parameters follow those used in Table 3 of Kendler.[5] All values are expressed as percentages. The true disease prevalences are assumed to be 1% for both schizophrenia and bipolar disorder, K=K=1%. True recurrence risks to first-degree relatives are λ=λ=8.0. These parameters equate to true heritabilities on the liability scale of hTA2=hTB2=0.76. MTA is the proportion of true schizophrenia cases misclassified as bipolar disorder and MTB is the proportion of true bipolar disorder cases misclassified as schizophrenia. The true genetic correlation between the disorders is r=0, 0.25,0.5. The estimated parameters based on diagnosed prevalences and recurrences risks have subscript D.
Impact of misclassification between schizophrenia (disorder A) and brief psychotic disorder (disorder B) on estimation of genetic parameters from recurrence risks in first-degree relatives
| 0 | 0 | 1.00 | 1.00 | 76 | 21 | 0 | 76 | 21 | 25 | 76 | 21 | 50 |
| 5 | 5 | 1.00 | 1.00 | 72 | 20 | 25 | 72 | 21 | 44 | 73 | 21 | 63 |
| 10 | 10 | 1.00 | 1.00 | 68 | 19 | 45 | 69 | 20 | 59 | 69 | 22 | 73 |
| 15 | 15 | 1.00 | 1.00 | 64 | 19 | 62 | 65 | 21 | 72 | 66 | 23 | 82 |
| 20 | 20 | 1.00 | 1.00 | 60 | 20 | 75 | 61 | 22 | 81 | 62 | 25 | 88 |
| 30 | 30 | 1.00 | 1.00 | 51 | 23 | 91 | 53 | 26 | 93 | 55 | 29 | 96 |
| 0 | 5 | 1.05 | 0.95 | 73 | 21 | 3 | 74 | 21 | 27 | 74 | 21 | 51 |
| 0 | 10 | 1.10 | 0.90 | 71 | 21 | 6 | 71 | 21 | 29 | 72 | 21 | 52 |
| 0 | 15 | 1.15 | 0.85 | 68 | 20 | 9 | 69 | 20 | 31 | 70 | 20 | 54 |
| 0 | 20 | 1.20 | 0.80 | 66 | 20 | 12 | 67 | 20 | 33 | 68 | 20 | 55 |
| 0 | 30 | 1.30 | 0.70 | 62 | 19 | 17 | 63 | 19 | 37 | 65 | 19 | 57 |
| 5 | 0 | 0.95 | 1.05 | 75 | 20 | 22 | 75 | 21 | 41 | 75 | 22 | 61 |
| 10 | 0 | 0.90 | 1.10 | 74 | 20 | 38 | 74 | 21 | 54 | 74 | 22 | 70 |
| 15 | 0 | 0.85 | 1.15 | 73 | 20 | 52 | 73 | 22 | 64 | 73 | 24 | 77 |
| 20 | 0 | 0.80 | 1.20 | 71 | 20 | 63 | 71 | 22 | 72 | 71 | 25 | 82 |
| 30 | 0 | 0.70 | 1.30 | 69 | 22 | 77 | 69 | 25 | 83 | 69 | 28 | 89 |
Parameters for the disorders follow those used in Table 5 of Kendler.[5] All values are expressed as percentages. The true disease prevalences are assumed to be 1% for both schizophrenia and brief psychotic disorder, KTA=KTB=1%. True recurrence risks to first-degree relatives are λTA,=8.0, λTB=2.0. These parameters equate to true heritabilities on the liability scale of h2TA=0.76 and h2TB=0.21. MTA is the proportion of true schizophrenia cases misclassified as brief psychotic disorder and MTB is the proportion of true brief psychotic disorder cases misclassified as schizophrenia. The true genetic correlation between the disorders is r=0, 0.25,0.5. The estimated parameters based on diagnosed prevalences and recurrences risks have subscript D.
Impact of misclassification between schizophrenia (disorder A)and delusional disorder (disorder B) on estimation of genetic parameters from recurrence risks in first-degree relatives
| 0 | 0 | 1.00 | 0.10 | 76 | 13 | 0 | 76 | 13 | 25 | 76 | 13 | 50 |
| 1 | 1 | 0.99 | 0.11 | 76 | 12 | 39 | 76 | 13 | 54 | 76 | 14 | 70 |
| 2 | 2 | 0.98 | 0.12 | 76 | 12 | 63 | 76 | 13 | 72 | 76 | 15 | 82 |
| 3 | 3 | 0.97 | 0.13 | 75 | 13 | 77 | 75 | 15 | 83 | 75 | 16 | 89 |
| 5 | 5 | 0.96 | 0.15 | 75 | 16 | 91 | 75 | 18 | 93 | 75 | 20 | 95 |
| 1 | 10 | 1.00 | 0.10 | 75 | 12 | 42 | 76 | 12 | 57 | 76 | 13 | 72 |
| 2 | 20 | 1.00 | 0.10 | 75 | 12 | 71 | 75 | 13 | 78 | 75 | 15 | 85 |
| 3 | 30 | 1.00 | 0.10 | 74 | 14 | 87 | 74 | 16 | 90 | 74 | 18 | 93 |
| 0 | 10 | 1.01 | 0.09 | 76 | 13 | 1 | 76 | 13 | 25 | 76 | 13 | 50 |
| 0 | 20 | 1.02 | 0.08 | 75 | 12 | 1 | 75 | 12 | 26 | 75 | 12 | 51 |
| 0 | 50 | 1.05 | 0.05 | 73 | 12 | 3 | 74 | 12 | 27 | 74 | 12 | 51 |
| 1 | 0 | 0.99 | 0.11 | 76 | 12 | 39 | 76 | 13 | 54 | 76 | 14 | 70 |
| 2 | 0 | 0.98 | 0.12 | 76 | 12 | 62 | 76 | 13 | 72 | 76 | 15 | 81 |
| 5 | 0 | 0.95 | 0.15 | 75 | 16 | 90 | 75 | 18 | 92 | 75 | 20 | 94 |
Parameters follow those used in Table 6 of Kendler.[5] All values are expressed as percentages. The true disease prevalences are assumed to be 1% for schizophrenia and 0.1% delusional disorder, KTA=1% and KTB0.1%. True recurrence risks to first-degree relatives are λTA,= 8.0, λTB=2.0. These parameters equate to true heritabilities on the liability scale of h2TA=0.76 and h2TB=0.13. MTA is the proportion of true schizophrenia cases misclassified as delusional disorder and MTB is the proportion of true delusional disorder cases misclassified as schizophrenia. The true genetic correlation between the disorders is r0, 0.25,0.5. The estimated parameters based on diagnosed prevalences and recurrences risks have subscript D.
The impact of misdiagnosis in estimating genetic parameter from genome-wide genotypes
| 0 | 0 | T | Estimated | 0.096 | 0.112 | 0.002 | 0.391 (0.089) | 0.470 (0.093) | 0.023 (0.155) |
| 0.1 | 0 | D | Estimated | 0.096 | 0.092 | 0.013 | 0.387 (0.029) | 0.393 (0.023) | 0.139 (0.055) |
| D | Calculated | 0.096 | 0.095 | 0.010 | 0.385 | 0.396 | 0.109 | ||
| 0.1 | 0.1 | D | Estimated | 0.075 | 0.093 | 0.024 | 0.304 (0.034) | 0.388 (0.035) | 0.296 (0.092) |
| D | Calculated | 0.079 | 0.093 | 0.021 | 0.316 | 0.383 | 0.244 | ||
Calculated using equations in text based on the estimates from the true disorders and misclassification rates.
MTA proportion of disorder A cases labelled as disorder B cases; M proportion of disorder B cases labelled as disorder A cases. Values in parentheses are the standard errors for the parameters estimated when MTA=MTB=0, but otherwise are the standard deviations over 100 replicates.
Figure 1Illustrations of the impact of misdiagnosis rate of true disorder A cases as disorder B (M) on parameters estimated by genome-wide SNPs: Proportion of variance in case–control status explained by SNPs for disorder A (solid line), disorder B (dashed line) and the genetic correlation between disorders A and B explained by SNPs (dotted line). (a) Proportion of variance that can be explained by SNPs for true disorders A and B=0.6, true genetic correlation 0, no misdiagnosis of true disorder B cases as disorder A, M0. (b) As (a) but proportion of variance that can be explained by SNPs for true disorders A and B=0.2. (c) As (b) but MM. (d) As (c) but true genetic correlation between disorders is 0.5. Note: the dashed line does not show when the values are the same as for the solid line.