| Literature DB >> 31266448 |
Ke Xu1,2,3,4, Roman Kosoy5,6, Khader Shameer5,6,7,8, Sudhir Kumar9,10,11, Li Liu12, Ben Readhead5,6,7,13, Gillian M Belbin5,6,14, Hao-Chih Lee5,6,7, Rong Chen5,6, Joel T Dudley15,16,17.
Abstract
BACKGROUND: Genetic diversity is known to confer survival advantage in many species across the tree of life. Here, we hypothesize that such pattern applies to humans as well and could be a result of higher fitness in individuals with higher genomic heterozygosity.Entities:
Keywords: Balancing selection; Electronic health record; Fitness; Healthy aging; Heterozygote advantage; Human diseases
Mesh:
Year: 2019 PMID: 31266448 PMCID: PMC6604157 DOI: 10.1186/s12863-019-0758-4
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Fig. 1Comparisons of noncoding SNPs between Biobank (orange) and Wellderly (green). a Mean minor allele frequency. b Mean observed heterozygosity. c Mean excess of heterozygosity. d Number of SNPs showing higher ratio of Dd/DD (D: minor allele; d: major allele) in Biobank or Wellderly under different nominal P value cutoffs from Fisher’s Exact Test (FET). e Number of SNPs showing higher ratio of Dd/dd in Biobank or Wellderly under different nominal P value cutoffs from FET. f Number of SNPs showing higher ratio of D/d in Biobank (Orange) or Wellderly (Green) under different nominal P value cutoffs from FET. The error bars represent standard errors. P values shown are raw values but with FDR < 0.05
Fig. 2Stratified comparisons of noncoding SNPs between Biobank (orange) and Wellderly (green) for minor allele frequency (MAF) (a and d), observed heterozygosity (b and e), and excess of heterozygosity (c and f). All the bars and error bars represent sample means and their standard errors. The stratification is based on combined MAF (a, b, c) and associated phenotypes (d, e, f) separately. Numbers at the bottom of bars of panel A are the SNP numbers in different MAF bins. Numbers at the bottom of bars of panel D are the numbers of SNPs associated with selected complex diseases (Diseases), selected complex traits (Traits), and all the complex diseases and traits combined (All). P values shown are raw values but with FDR < 0.05
Fig. 3Individual heterozygosity rate. a Individual heterozygosity rates of Biobank (orange) and Wellderly (green). b Average individual heterozygosity rate between Biobank (orange arrow), Wellderly (green arrow), and 10,000 permutations. c Comparison of individual heterozygosity rate between group of individuals (51–60 years old) with < 10% estimated 10-year survival probability and group of individuals (51–60 years old) with > 90% estimated 10-year survival probability in men and women of the Biobank cohort, separately. We restrict the comparison to the 51–60 year olds because no individual older than 60 years has estimated 10-year survival probability > 90% in the Biobank cohort
Coefficient (and P value) of each predictor in the multiple linear regression model of 10-year survival probability of Biobank individuals
| All individuals | Males only | Females only | |
|---|---|---|---|
| Heterozygosity rate | 11.92 (0.06) | 21.78 (0.01) | 3.50 (0.70) |
| Age | −0.02 (1.06e-16) | −0.02 (8.93e-10) | − 0.02 (3.34e-08) |
| Disease score | −0.03 (1.02e-28) | − 0.03 (9.25e-22) | −0.02 (2.92e-11) |
| Gender | −0.03 (0.35) | NA | NA |
| PC1 | 0.002 (0.28) | 0.01 (0.04) | −0.0005 (0.86) |
| PC2 | −0.001 (0.57) | −0.001 (0.80) | − 0.002 (0.45) |
| PC3 | −0.002 (0.26) | − 0.003 (0.23) | −0.002 (0.57) |
| PC4 | −0.001 (0.58) | 0.003 (0.21) | −0.005 (0.11) |
| PC5 | −0.001 (0.47) | 0.001 (0.77) | −0.004 (0.21) |
P values from paired Mann-Whitney U Test between Biobank and Wellderly for different subsets of nonsynonymous SNPs
| All genes | OMIM recessive genes | OMIM dominant genes | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP# | MAF | HETO | Fa | SNP# | MAF | HETO | Fa | SNP# | MAF | HETO | Fa | |
| 0.01 ≤ MAF < 0.05 | 3731 | 0.107 | 0.201 | 0.750 | 219 | 0.053 | 0.056 | 0.648 | 87 |
|
| 0.07 |
| 0.05 ≤ MAF < 0.1 | 1243 | 0.638 | 0.459 | 0.095 | 81 | 0.089 | 0.688 | 0.189 | 30 | 0.155 | 0.430 | 0.626 |
| 0.1 ≤ MAF < 0.25 | 1548 | 0.049 |
| 0.101 | 66 | 0.645 | 0.710 | 0.954 | 35 | 0.276 | 0.30 | 0.518 |
| 0.25 ≤ MAF < 0.5 | 1175 | 0.126 | 0.133 | 0.359 | 61 | 0.401 |
| 0.015b | 20 | 0.896 | 0.856 | 0.588 |
| Total | 7697 | 0.30 | 0.165 | 0.608 | 427 | 0.612 | 0.042b | 0.112 | 172 | 0.053 | 0.035c | 0.133 |
P values in bold pass multiple testing correction (FDR adjusted P < 0.1)
aF: excess of heterozygosity
bDirection: Biobank < Wellderly
cDirection: Biobank > Wellderly
Fig. 4Analyses of nonsynonymous SNPs. a Excess of heterozygosity comparison between Biobank (orange) and Wellderly (green) for SNPs with top 10% excess of heterozygosity in Biobank (left shaded area) and SNPs with top 10% excess of heterozygosity in Wellderly (right shaded area). **** P < 1e-4; ***** P < 1e-10, Mann-Whitney U test. b Evolutionary probability comparison between the top SNPs in Biobank (orange) and the tops SNPs in Wellderly (green). c CADD score comparison between the top SNPs in Biobank (orange) and the top SNPs in Wellderly (green). d Significantly enriched gene sets for genes involving the top SNPs in Biobank and genes involving the top SNPs in Wellderly