| Literature DB >> 23272039 |
Abstract
Making only the assumption that twins are representative of the population from which they are drawn, we here develop a simple mathematical model (using widely available epidemiological information) that sheds considerable light on the pathogenesis of complex human diseases. Specifically, for the case of multiple sclerosis (MS), we demonstrate that the vast majority of patients (≥94%), possibly all, require genetic susceptibility in order to get MS. Nevertheless, only a tiny fraction of the population (≤2.2%) is actually susceptible to getting this disease; a finding which is highly consistent in all of the studied populations across both North America and Europe. Men are more likely to be susceptible than women although susceptible women are more than twice as likely to actually develop MS compared to susceptible men (i.e., they have a greater disease penetrance). This is because women are more responsive to the environmental factors involved in MS pathogenesis than men. These differences account for the current gender-ratio (3∶1, favoring women) and also for the increasing incidence of MS in women around the world. By contrast, the most important genetic marker for MS susceptibility (DRB1*1501) influences the likelihood of susceptibility but not the penetrance of the disease. Nevertheless, even for this major susceptibility allele, only a very small fraction of DRB1*1501carriers (<5%) are susceptible to getting MS and for only a minority of MS patients (∼41%) does this allele contribute to their susceptibility. Moreover, each copy of this allele seems to make an independent contribution to susceptibility. Finally, at least three environmental events are necessary for MS pathogenesis and, during the course of their lives, the large majority of the population (≥69%) experiences an environmental exposure, which is sufficient to produce MS in, at least, some susceptible genotypes. Also, susceptible men (compared to susceptible women) have a lower threshold, a greater hazard-rate, or both in response to the environmental factors involved in MS pathogenesis.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23272039 PMCID: PMC3525648 DOI: 10.1371/journal.pone.0047875
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Model definitions*
|
| = | The life-time probability of developing MS in the general population.[equated to the prevalence of the disease] |
|
| = | Sets of persons who either are |
|
| = | Two mutually exclusive subsets of |
|
| = | Mutually exclusive sets of genetically susceptible individuals who depend upon |
|
| = | Penetrance of the least penetrant genotype in the population |
|
| = | Penetrance of the ith genotype in the set |
|
| = | Expected penetrance of the set |
| σzi 2 | = | Penetrance Variance within the set |
|
| = |
|
|
| = | Sets of persons with a monozygotic |
|
| = | Sets of persons who share, with an MS-proband, either the same intra-uterine |
|
| = | Sets of persons who either do |
|
| = | The sets of first |
|
| = | The set of persons who either possess |
|
| = | The set of persons who either carry |
|
| = | The sets of persons who carry one |
|
| = | The set of persons who carry one copy of a non-DRB1 |
|
| = | Sets consisting of either women |
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
|
| = |
|
| p | = |
|
| g | = |
|
| g1 | = |
|
| g2 | = |
|
| A0 | = |
|
| A | = |
|
| MAF | = | Mean allelic frequency – defined as the frequency of an “allelic state”{e.g., the “ |
| HWE | = | Hardy-Weinberg Equilibrium |
See Appendix S1 (Section A) for additional model definitions.
Epidemiological data used in the model
| Population | Women | Men | |
|
| 0.0015 | 0.00204 | 0.00096 |
|
| 0.68 | ||
|
| 0.92 | ||
|
| 0.74 | ||
|
| > 0.82 | ||
| Raw MZ-twin Concordance = | 0.25 | 0.34 | 0.067 |
| Adjusted MZ-twin Concordance = | 0.134 | 0.183 | 0.036 |
| Raw DZ-twin Concordance = | 0.054 | 0.051 | 0.057 |
| Raw Sibling Concordance = | 0.029 | 0.039 | 0.019 |
| UCSF (#1) – | 0.56 | 0.57 | 0.52 |
| Canadian – | 0.55 | 0.60 | 0.52 |
| Canadian – | 0.24 | ∼ 0.24 | ∼ 0.24 |
| UCSF (#2) – | 0.46 | 0.49 | 0.39 |
| UCSF (#2) – | 0.20 | 0.18 | 0.22 |
HLA+ = carrier of ≥ 1 copy of the DRB1*1501 allele
From Canadian Data [21], based on a prevalence of 150 per 105 population and split into men and women according to [15]. Concordance rates presented as “proband-wise” rates [30].
Data unavailable on the 2 male patients [21]. The worst case is: 9/11 = 0.82
See: Prop. (1.4) of Appendix S1 (Section C)
Canadian HLA data: D Sadovnick (personal communication). Based on ∼ 3,000 cases and ∼ 400 Controls (% women not available). Control rates confirmed in a much larger transplant database.
UCSF Databases: J Oksenberg (personal communication) UCSF #1 (GeneMSA) - 485 cases (68% women) and 431 Controls (66% women) UCSF #2 (IMSGC) - 779 cases (76% women)
HLA data used in the model .
| 2HB+ | 1HB+ | HLA− | |||||
|
| |||||||
| Observed Frequency – Cases (HLA+ and HLA−) | 0.55 | 0.45 | |||||
| Observed Frequency – Controls (HLA+ and HLA−) | 0.24 | 0.76 | |||||
| OR – (2HB+ & 1HB+) vs. (HLA−) | 3.9 | ||||||
|
| |||||||
| Observed Frequency – Cases | 0.10 | 0.46 | 0.44 | ||||
| Predicted HWE frequencies – Cases | 0.11 | 0.45 | 0.44 | ||||
| Predicted Controls – HWE at: P(HLA+) = 0.24 | 0.016 | 0.224 | 0.76 | ||||
| OR – (2HB+) vs. (HLA−) & (1HB+) vs. (HLA−) | 10.4 | 3.6 | |||||
| OR – (2HB+ & 1HB+) vs. (HLA−) | 4.0 | ||||||
|
| |||||||
| Observed Frequency – Cases | 0.07 | 0.39 | 0.54 | ||||
| Predicted HWE frequencies – Cases | 0.07 | 0.39 | 0.54 | ||||
| Observed Frequency – Controls | 0.012 | 0.186 | 0.80 | ||||
| Predicted HWE frequencies – Controls | 0.011 | 0.186 | 0.80 | ||||
| OR – (2HB+) vs. (HLA−) & (1HB+) vs. (HLA−) | 9.3 | 3.1 | |||||
| OR – (2HB+ & 1HB+) vs. (HLA−) | 3.5 |
Numbers listed are genotype frequencies.
2HB+ = carrier of 2 copies of the DRB1*1501 allele (homozygous carrier).
1HB+ = carrier of 1 copies of the DRB1*1501 allele (heterozygous carrier).
HLA− = carrier of 0 DRB1*1501 alleles.
(HLA+) = (2HB+)+(1HB+).
Canadian HLA data: D Sadovnick (personal communication).
Based on ∼3,000 cases and ∼400 Controls (% women not available). Control rates confirmed in a much larger transplant database.
Odds ratio (OR) versus controls. Calculated as odds of genotype in cases divided by odds of the same genotype in controls.
UCSF Databases: J Oksenberg (personal communication).
UCSF #1 (IMSGC) – 779 cases (76% women); No observed controls.
UCSF #2 (GeneMSA) – 485 cases (68% women) and 431 Controls (66% women).
Hardy Weinberg Equilibrium (HWE) values predicted based on the observed P(2HB+) in Cases or Controls.
HLA data by gender used in the model .
| 2HB+ | 1HB+ | HLA− | ||
|
| ||||
| Observed Frequency – Cases | 0.11 | 0.46 | 0.43 | |
| Predicted HWE frequencies – Cases | 0.11 | 0.44 | 0.45 | |
| OR – (2HB+ & 1HB+) vs. (HLA−) | 4.3 | |||
|
| ||||
| Observed Frequency – Cases | 0.06 | 0.45 | 0.48 | |
| Predicted HWE frequencies – Cases | 0.09 | 0.42 | 0.48 | |
| OR – (2HB+ & 1HB+) vs. (HLA−) | 3.4 | |||
|
| ||||
| Observed Frequency – Cases | 0.08 | 0.41 | 0.51 | |
| Predicted HWE frequencies – Cases | 0.08 | 0.41 | 0.50 | |
| Observed Frequency – Controls | 0.01 | 0.17 | 0.82 | |
| Predicted HWE frequencies – Controls | 0.01 | 0.21 | 0.78 | |
| OR – (2HB+) vs. (HLA−) & (1HB+) vs. (HLA−) | 9.7 | 3.9 | ||
| OR – (2HB+ & 1HB+) vs. (HLA−) | 4.4 | |||
|
| ||||
| Observed Frequency – Cases | 0.05 | 0.35 | 0.61 | |
| Predicted HWE frequencies – Cases | 0.05 | 0.34 | 0.61 | |
| Observed Frequency – Controls | 0.01 | 0.22 | 0.78 | |
| Predicted HWE frequencies – Controls | 0.01 | 0.21 | 0.78 | |
| OR – (2HB+) vs. (HLA−) & (1HB+) vs. (HLA−) | 8.6 | 2.0 | ||
| OR – (2HB+ & 1HB+) vs. (HLA−) | 2.2 |
Numbers listed are genotype frequencies.
2HB+ = carrier of 2 copies of the DRB1*1501 allele (homozygous carrier).
1HB+ = carrier of 1 copies of the DRB1*1501 allele (heterozygous carrier).
HLA− = carrier of 0 DRB1*1501 alleles;
(HLA+) = (2HB+)+(1HB+).
UCSF Databases: J Oksenberg (personal communication).
UCSF #1 (IMSGC) – 779 cases (76% women).
UCSF #2 (GeneMSA) – 485 cases (68% women) and 431 Controls (66% women).
Hardy Weinberg Equilibrium (HWE) values predicted based on the observed P(2HB+) in Cases or Controls. Because of the small number of men in these samples, the number of males with 2 copies of HLA DRB1*1501 was tiny. Therefore, in men, HWE was estimated from the observed P(HLA−).
Odds ratio (OR) versus controls. Calculated as odds of genotype in cases divided by odds of the same genotype in controls.
MS concordance rates in monozygotic twins of DRB1*1501 carrier (HLA+) and DRB1*1501 non-carrier (HLA–) probands*.
| Monozygotic Twins of MS Probands | |||
|
|
|
| |
|
| 9 | 11 | 20 |
|
| 31 | 42 | 73 |
|
| 40 | 53 | 93 |
|
| (9/40) = 0.225 | (11/53) = 0.207 | 0.215 |
|
| 0.309 | 0.287 | 0.297 |
|
|
|
|
|
|
|
|
|
|
|
| 0.57 | ||
(HLA+) = carrier of ≥1 copy of the DRB1*1501 allele; Data from: Reference [21].
Pair-wise rates (Z) calculated as: ; see Reference [30]
Proband-wise concordance rates (Z) calculated as: ; adjusted [30] for the overall probability of doubly ascertaining concordant twin-pairs in the study of Willer, et al. [21]
For adjustment: See: Prop. (1.4a) (1.4b) of Appendix S1 (Section C)
Further adjusted to the requirement that: b = 0.134
Adjusted to the condition where:
MS concordance rates in monozygotic twins of female (F) and male (M) probands*.
| Monozygotic Twins of MS Probands | |||
|
|
| Totals | |
|
| 22 | 2 | 24 |
|
| 66 | 43 | 109 |
|
| 88 | 45 | 133 |
|
| (22/88) = 0.25 | (2/45) = 0.044 | 0.18 |
|
| 0.34 | 0.067 | 0.25 |
|
|
|
|
|
|
| 0.92 | ||
(F) = Women ; (M) = Men ; Data from: Reference [21]
Pair-wise rates (Z) calculated as: ; see Ref. [30]
Proband-wise concordance rates (Z) calculated as: ; adjusted [30] for the overall probability of doubly ascertaining concordant twin-pairs in the study of Willer, et al. [21]
For adjustment: See: Prop. (1.4a) & (1.4b) of Appendix S1 (Section C)
Adjusted to the condition where:
Summary of conclusions regarding MS pathogenesis derived from the model*
|
| |
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
| |
|
| |
|
|
|
|
|
|
|
|
|
|
| |
|
| |
|
| |
|
| |
|
|
|
|
|
|
|
| |
|
| |
See Table 1 for term definitions; In Table “Section” refers to Sections of Appendix S1
Estimated prevalence (probability) of genetic susceptibility in different geographic regions.
| MSPrevalence | MZ-TwinConcordance | % Susceptible | |
|
|
|
| |
|
| |||
| Canada [21] | 68 – 248 | 25.3% | 0.4 – 3.6% |
| Northern US [12] | 100 – 160 | 31.4% | 0.5 – 1.9% |
| Southern US [12] | 22 – 112 | 17.4% | 0.2 – 2.4% |
|
| |||
| Finland [31] | 52 – 93 | 46.2% | 0.2 – 0.7% |
| Denmark [32] | 110 | 24% | 0.7 – 1.7% |
| British Isles [13] | 74 – 193 | 40.0% | 0.3 – 1.8% |
| France [11] | 32 – 65 | 11.1% | 0.5 – 2.2% |
| Sardinia [16] | 144 – 152 | 22.2% | 1.1 – 2.5% |
| Italy [16] | 38 – 90 | 14.5% | 0.4 – 2.3% |
Per 105 population. The prevalence of MS in each region is from data provided in Reference [33]. A range is given because, often, a range of estimates are available for a particular region.
Studies [11]–[13] reported pair-wise MZ-twin concordance-rates, which have been converted into proband-wise rates assuming a random sampling of twin-pairs [30]. Study [12], however, reported no double ascertainment of twin-pairs and, therefore, almost certainly violates this assumption [30].
P(G) calculated according to Eq. (6); Prop. (4.2a); Appendix S1 (Section C); that:
This equation assumes that (g) for each geographic region is: ; Appendix S1; Section C; Prop. (5.2b)
Moreover, as noted in Prop. (4.2b), Eq. (6) also assumes that:
A narrower range-estimate could be provided by Eq. (13); Prop. (4.2b) of Appendix S1 (Section C) . However, regardless of which range-estimate for (z max) is used, this only impacts the lower-bound estimates for P(G). The upper-bound estimates remain the same.
Figure 1Response-curves for developing MS in susceptible men (M) and women (F) to an increasing likelihood of a “sufficient” environmental exposure (E).
Proportionate hazard is assumed for the two genders (see Appendix S1; Section F). The probability of developing MS – P(MS, E|G) – is shown on y-axis and the transformed environmental exposure (x) is shown on the x-axis {NB: (x) increases with (E) but not necessarily linearly – see Appendix S1; Section F}. The maximum y-axis excursions have been set to the high-point of the predicted ranges for P(MS|G, E, M) & P(MS|G, E, F) given by Eqs. () & ( ) – Appendix S1; Section E; Prop. (7.1c). The proportionality constants, (C) and (r), are taken to be 0.5 and 1, respectively. One “environmental unit” has been defined arbitrarily as the change in the level of a sufficient environmental exposure (E), which has taken place between the time-periods of (1941–1945) and (1976–1980). Based on the increase in the gender-ratio of MS patients over this interval, together with the proband-wise MZ-twin concordance-rates for MS in men and women from Canada [15], [21], two conclusions follow directly. First, there has been more than a 32% increase in the prevalence of MS in Canada between these two time-periods and second, compared to women, men begin to develop MS at a lower level of environmental exposure (x) or they have a greater hazard-rate (see Appendix S1; Section F). In either case, women are more responsive to the environmental changes that are taking place than men (regardless of what these changes actually are). Presumably, this explains the observation that the prevalence of MS is increasing, especially among women [4]. Each of these conclusions is apparent in the Figure. The response curve for men starts at a lower value of (x) than women but their response curve is almost at its plateau in (1941–1945). By contrast, women are nowhere near their (much higher) plateau in (1941–1945) and, compared to men, have a much steeper rise of P(MS|G, E) in response to the environmental changes, which have taken place during the interval. {NB: the x-axis is a time-axis. The x-axis represents increasing levels of environmental exposure (x) from whatever cause and over whatever period of time it has taken place.}
Estimated prevalence (probability) of genetic susceptibility in rheumatoid arthritis, ankylosing spondylitis, and systemic lupus erythematosus
| Prevalence | MZ-TwinConcordance | % Susceptible | |
|
|
|
| |
| Rheumatoid Arthritis | 1 – 2% | ∼ 35% | 5.7 – 11.4% |
| Ankylosing Spondylitis | 0.4 – 4% | ∼ 53% | 1.4 – 15% |
| Systemic Lupus Erythematosus | ∼ 0.025% | ∼ 39% | ∼ 0.13% |
The prevalence of diseases {P(D)} is from data provided in Reference [35].
Studies [36]-[38] report pair-wise MZ-twin concordance-rates. These have been converted into proband-wise rates {P(D|MZ} assuming a random sampling of twin-pairs [30]. Also, the IU environment has been assumed to have no impact on the disease. A violation of either of these assumptions will make the estimate of P(G) too low.