| Literature DB >> 26599540 |
Yoshiro Nagao1,2,3.
Abstract
"Missing heritability" in genome wide association studies, the failure to account for a considerable fraction of heritability by the variants detected, is a current puzzle in human genetics. For solving this puzzle the involvement of genetic variants like rare single nucleotide polymorphisms (SNPs) and copy number variations (CNVs) has been proposed. Many papers have published estimating the heritability of sets of polymorphisms, however, there has been no paper discussing the estimation of a heritability of a single polymorphism. Here I show a simple but rational method to calculate heritability of an individual polymorphism, hp(2). Using this method, I carried out a trial calculation of hp(2) of CNVs and SNPs using published data. It turned out that hp(2) of some CNVs is quite large. Noteworthy examples were that about 25% of the heritability of type 2 diabetes mellitus and about 15% of the heritability of schizophrenia could be accounted for by one CNV and by four CNVs, respectively. The results suggest that a large part of missing heritability could be accounted for by re-evaluating the CNVs which have been already found and by searching novel CNVs with large hp(2).Entities:
Mesh:
Year: 2015 PMID: 26599540 PMCID: PMC4657159 DOI: 10.1038/srep17156
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Probability of each genotype of a sibling and an offspring.
| Genotype | Probability |
|---|---|
| Sibling | |
| AA | {α(1 + p)2 + βp(1 + p) + γp2}/4 |
| Aa | {α(1 + p)q + β(1 + pq) + γp(1 + q)}/2 |
| aa | {αq2 +β(1 + q)q + γ(1 + q)2}/4 |
| AA | αp + βp/2 |
| Aa | αq + β/2 + γp |
| aa | βq/2 + γq |
A and a represent dominant and recessive allele, respectively. α, β and γ represent the genotype frequency of AA, Aa and aa of the proband, respectively.
Figure 1Schematic images of the prevalence of a disease in general population, P, and the incidence rate of the disease for first-degree relatives, Q, for a polymorphism.
(A) P is represented by a circle. The area where the population attributable risk of a polymorphism, P(1–v/q), covers is applied gray. (B) Q is represented by the area where either the circle or the gray oval covers. Q is bigger than P by P(1−v/q)(Y1/X1−1). q: allele frequency of the non-risk allele for the general population. v: allele frequency of the non-risk allele for the patient group. X: frequency of the risk genotype of the general population. Y: frequency of the risk genotype of the first-degree relatives.
Results of a trial to calculate hp 2 of CNVs and SNPs using published data.
| Disease | CNV (locus or description) or SNP (rs number or description) | Population or source | OR | P | p | hp2 |
|---|---|---|---|---|---|---|
| Autism | • | Americans | 100 | 0.006 | 0.00016 | 0.0679 |
| • CNV (16p11.2 dup) | Americans | 16 | 0.006 | 0.00034 | 0.0077 | |
| • SNP (rs4307059) | Americans | 1.19 | 0.0067 | 0.61 | 0.00049 | |
| • SNP (rs10513025) | Several sources | 0.55 | 0.015 | 0.063 | 0.0036 | |
| Depression | • CNV (3q13.33 dup) | Hungarians | 5.27 | 0.085 | 0.013 | 0.0327 |
| • SNP (rs2251219) | (Meta-analysis) | 0.87 | 0.20 | 0.40 | 0.0014 | |
| Schizophrenia | • CNV (16p11.2 dup) | Several sources | ∞ | 0.01 | 0.000039 | 0.0498 |
| • CNV (22q11.2 del) | Several sources | ∞ | 0.01 | 0.000035 | 0.0377 | |
| • CNV (NRXN1 del) | Several sources | ∞ | 0.01 | 0.000016 | 0.0213 | |
| • CNV (AS/PWS dup) | Several sources | ∞ | 0.01 | 0.000012 | 0.0161 | |
| • CNV (15q11.3 del) | Several sources | 8.27 | 0.01 | 0.00021 | 0.0021 | |
| • CNV (1p21.1 del) | Several sources | 11.03 | 0.01 | 0.000175 | 0.0019 | |
| • SNP (ADAMTSL3) | (HapMap) | 0.68 | 0.01 | 0.29 | 0.0046 | |
| • SNP (rs17504622) | Swedish | 1.24 | 0.01 | 0.05 | 0.00035 | |
| Obsessive-compulsive disorder | • CNV (13q14.2 del) | Swiss | 6.23 | 0.023 | 0.010 | 0.0405 |
| • SNP (rs6311, located on HTR2A promoter) | Swiss | 1.69 | 0.023 | 0.44 | 0.0087 | |
| Sporadic ALS | • CNV (10q15.3 dup) | Japanese | 5.49 | 0.0001 | 0.101 | 0.0625 |
| • SNP (rs10260404) | Dutch | 1.30 | 0.0001 | 0.27 | 0.00050 | |
| Type 2 diabetes mellitus | • CNV (4p16.3 del) | Japanese | 14.8 | 0.10 | 0.022 | 0.1594 |
| • SNP (missence variant of HNF1A gene) | Mexicans and US Latinos | 5.48 | 0.14 | 0.0060 | 0.0146 |
Odds ratio (OR), risk allele frequency (p), and prevalence of disease (P) of each polymorphism are cited from the literatures31018192021222324252627282930. P of schizophrenia is cited from a review31.
*de novo CNV.
Various categories of variants and the number of variants to explain heritability of 0.4.
Categories of variants are based on the classification by Pawitan et al.12. The background of the calculated results by the method in this study is applied gray.
An example of the calculation of genotype probabilities by Beye’s method when the genotype of the proband is AA.
| AA | Aa | aa | |
|---|---|---|---|
| Prior Probability | pp | 2pq | |
| Conditioned Probability (of transmitting allele A) | 1 | 0.5 | 0 |
| Joint Probability | pp | pq | 0 |
| Posterior Probability | pp/(pp + pq) = p | pq/(pp + pq) = q |
As a result the posterior probability equals to the frequency of another allele (A or a) of the transmitted one (A) in the general population.
The calculation procedure of the genotype probabilities for a sibling.
| Proband | F | Father | P1 | Mother | P2 | Sibling | P3 | Joint Probability |
|---|---|---|---|---|---|---|---|---|
| AA | α | AA | p | AA | p | AA | 1 | αpp |
| Aa | q | AA | 0.5 | 0.5αpq | ||||
| Aa | 0.5 | 0.5αpq | ||||||
| Aa | q | AA | p | AA | 0.5 | 0.5αpq | ||
| Aa | 0.5 | 0.5αpq | ||||||
| Aa | q | AA | 0.25 | 0.25αqq | ||||
| Aa | 0.5 | 0.5αqq | ||||||
| aa | 0.25 | 0.25αqq | ||||||
| Aa | β | AA | pp | Aa | p | AA | 0.5 | 0.5βppp |
| Aa | 0.5 | 0.5βppp | ||||||
| aa | q | Aa | 1 | βppq | ||||
| Aa* | pq | Aa | p | AA | 0.25 | 0.25βppq | ||
| Aa | 0.5 | 0.5βppq | ||||||
| aa | 0.25 | 0.25βppq | ||||||
| aa | q | Aa | 0.5 | 0.5βpqq | ||||
| aa | 0.5 | 0.5βpqq | ||||||
| Aa† | pq | AA | p | AA | 0.5 | 0.5βppq | ||
| Aa | 0.5 | 0.5βppq | ||||||
| Aa | q | AA | 0.25 | 0.25βpqq | ||||
| Aa | 0.5 | 0.5βpqq | ||||||
| aa | 0.25 | 0.25βpqq | ||||||
| aa | AA | p | Aa | 1 | βpqq | |||
| Aa | q | Aa | 0.5 | 0.5βqqq | ||||
| aa | 0.5 | 0.5βqqq | ||||||
| aa | γ | Aa | p | Aa | p | AA | 0.25 | 0.25γpp |
| Aa | 0.5 | 0.5γpp | ||||||
| aa | 0.25 | 0.25γpp | ||||||
| aa | q | Aa | 0.5 | 0.5γpq | ||||
| aa | 0.5 | 0.5γpq | ||||||
| aa | q | Aa | p | Aa | 0.5 | 0.5γpq | ||
| aa | 0.5 | 0.5γpq | ||||||
| aa | q | aa | 1 | γqq |
*Allele A is derived from the father.
†Allele a is derived from the father.
F; a frequency of genotype of the proband.
P1; a posterior probability of genotype of father.
P2; a posterior probability of genotype of mother.
P3; a conditioned probability of genotype of sibling.
The calculation procedure of the genotype probabilities for an offspring.
| Proband | F | Spouse | P1 | Offspring | P2 | Joint Probability |
|---|---|---|---|---|---|---|
| AA | α | AA | pp | AA | 1 | αpp |
| Aa | 2pq | AA | 0.5 | αpq | ||
| Aa | 0.5 | αpq | ||||
| aa | Aa | 1 | αqq | |||
| Aa | β | AA | pp | AA | 0.5 | 0.5βpp |
| Aa | 0.5 | 0.5βpp | ||||
| Aa | 2pq | AA | 0.25 | 0.5βpq | ||
| Aa | 0.5 | βpq | ||||
| aa | 0.25 | 0.5βpq | ||||
| aa | Aa | 0.5 | 0.5βqq | |||
| aa | 0.5 | 0.5βqq | ||||
| aa | γ | AA | pp | Aa | 1 | γpp |
| Aa | 2pq | Aa | 0.5 | γpq | ||
| aa | 0.5 | γpq | ||||
| aa | aa | 1 | γqq |
F; a frequency of genotype of the proband.
P1; a probability of genotype of spouse.
P2; a conditioned probability of genotype of offspring.