| Literature DB >> 31481654 |
Loic Yengo1, Naomi R Wray2,3, Peter M Visscher4,5.
Abstract
In most human societies, there are taboos and laws banning mating between first- and second-degree relatives, but actual prevalence and effects on health and fitness are poorly quantified. Here, we leverage a large observational study of ~450,000 participants of European ancestry from the UK Biobank (UKB) to quantify extreme inbreeding (EI) and its consequences. We use genotyped SNPs to detect large runs of homozygosity (ROH) and call EI when >10% of an individual's genome comprise ROHs. We estimate a prevalence of EI of ~0.03%, i.e., ~1/3652. EI cases have phenotypic means between 0.3 and 0.7 standard deviation below the population mean for 7 traits, including stature and cognitive ability, consistent with inbreeding depression estimated from individuals with low levels of inbreeding. Our study provides DNA-based quantification of the prevalence of EI in a European ancestry sample from the UK and measures its effects on health and fitness traits.Entities:
Mesh:
Year: 2019 PMID: 31481654 PMCID: PMC6722066 DOI: 10.1038/s41467-019-11724-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Predictive performances of FROH to discriminate different types of inbreeding: mating type 1 (MT1: parent-offspring or fullsibs mating), mating type 2 (MT2: halfsibs, avuncular, grandparent-grandchild or double-first cousins mating) or mating type 3 (MT3: first-cousins mating). Panels a and c correspond to the comparison of MT1 and MT2, while panels b and d correspond to the comparison of MT1 and MT2 on the one hand and MT3 on the other hand. Predictive statistics assessed are the area under the receiver characteristics operating curve (AUC), the sensitivity to detect MT1 over MT2 (true positive rate) and specificity to distinguish MT1 from MT2 (true negative rate). FROH>0.17 yields a sensitivity and specificity >0.92 to discriminate MT1 from MT2; and FROH>0.087 yields a sensitivity of ~0.94 and a specificity of ~0.79 to discriminate MT1 or MT2 from MT3
Mean number and length of runs of homozygosity (ROHs) detected in participants from the UK Biobank (UKB), including extreme inbreeding (EI) cases (defined as FROH > 0.1) and unrelated EI controls (defined as FROH < 0.01). We also report the mean and length of ROHs in simulated data under various mating types
| Sample size | Mean number of ROHs per individual (SD) | Mean length of ROHs in Mb (SD) | Mean | |
|---|---|---|---|---|
| Observed ROHs in UKB participantsa | ||||
| EI cases ( | 125 | 33.6 (10.3) | 14.8 (15.6) | 0.172 (.07) |
| Unrelated EI controls ( | 345,276 | 4.9 (2.2) | 2.1 (0.8) | 0.003 (.002) |
| ROHs from simulated data under various mating types | ||||
| Parent–offspring mating (PO) | 19,062 | 38.1 (5.9) | 17.6 (18.5) | 0.253 (.054) |
| Fullsibs mating (FS) | 19,065 | 45.2 (5.9) | 14.9 (15.4) | 0.254 (.046) |
| Halfsibs mating (HS) | 19,048 | 25.0 (5.5) | 13.6 (15.1) | 0.128 (.040) |
| Uncle/aunt–niece/nephew mating (AV) | 19,108 | 28.3 (5.6) | 12.0 (13.0) | 0.127 (.036) |
| Grandparent–grandchild mating (GP) | 19,025 | 24.9 (5.4) | 13.6 (15.1) | 0.128 (.039) |
| Double first cousins mating (DC) | 19,080 | 31.6 (5.7) | 10.8 (11.3) | 0.128 (.032) |
| First cousins mating (FC) | 19,061 | 18.1 (4.7) | 9.6 (10.9) | 0.065 (.025) |
| Mating between unrelated parents (UN) | 18,912 | 4.8 (2.1) | 2.0 (0.7) | 0.004 (.002) |
| Mating type 1 (MT1: PO or FS) | 38,127 | 41.7 (6.9) | 16.1 (16.9) | 0.254 (.050) |
| Mating type 2 (MT2: HS or AV or GP or DC) | 76,261 | 27.4 (6.1) | 12.4 (13.6) | 0.128 (.037) |
| Mixture of MT1 and MT2 mating (Mixture proportion: 54/125) | 139,310 | 33.6 (9.6) | 14.0 (15.2) | 0.182 (.075) |
SD standard deviation
aMean differences between EI cases and controls are statistically significant at p < 10−10
Fig. 2Histogram of the lengths of 4,196 runs of homozygosity (ROHs) detected in 125 EI cases (FROH > 0.1). Each length was subtracted 1.5 Mb (i.e., minimum length used to detect ROHs) before mixture distribution was fitted. A 84:16 mixture of two exponential distributions with means ~15.7 Mb (rate = 1/15.7 ~0.06) and ~0.72 Mb (rate = 1/0.72 ~1.4), respectively was found to best fit the observed length distribution (dotted line)
Fig. 3Chromosomal and positional distribution of runs of homozygosity (ROHs) detected in 125 EI cases (FROH > 0.1). Each row, with possibly multiple segments, represents a unique participant. Segments are groups by autosomal chromosomes from chromosome 1 (bottom of each panel) to chromosome 22 (top of each panel) ROHs are grouped in 6 length categories: between 1.5 and 5 Mb (a), between 5 and 10 Mb (b), between 10 and 20 Mb (c), between 20 and 50 Mb (d), between 50 and 100 Mb (e), and above 100 Mb (f). f also show inbreeding coefficients of individuals harbouring the largest ROHs
Parameters of mixtures of exponential distributions estimated from observed length distributions of homozygous-by-descent (HBD) genomic segments and runs of homozygosity (ROH)
| Simulated true HBD segments | Mating type | Number of segments (individuals) | Mean (SD) number of segments per individual |
|
|
|
| 1/ | 1/ |
|---|---|---|---|---|---|---|---|---|---|
| PO | 620,017 (19,062) | 32.5 (5.1) | 0.75 | 0.25 | 0.038 | 0.165 | 26.6 | 6.1 | |
| FS | 791,483 (19,065) | 41.6 (5.4) | 0.71 | 0.29 | 0.047 | 0.186 | 21.4 | 5.4 | |
| HS | 395,500 (19,096) | 20.8 (4.7) | 0.71 | 0.29 | 0.047 | 0.182 | 21.5 | 5.5 | |
| AV | 481,752 (19,129) | 25.2 (5.1) | 0.69 | 0.31 | 0.056 | 0.208 | 17.9 | 4.8 | |
| GP | 393,584 (19,081) | 20.8 (4.8) | 0.71 | 0.29 | 0.047 | 0.186 | 21.3 | 5.4 | |
| DC | 566,552 (19,082) | 29.8 (5.3) | 0.65 | 0.35 | 0.064 | 0.225 | 15.6 | 4.4 | |
| FC | 282,635 (19,430) | 14.9 (4.3) | 0.65 | 0.35 | 0.064 | 0.224 | 15.5 | 4.5 | |
| Simulated ROHs | PO | 726,247 (19,062) | 38.1 (5.9) | 0.88 | 0.12 | 0.055 | 1.309 | 18.3 | 0.8 |
| FS | 862440 (19,065) | 45.2 (5.9) | 0.87 | 0.13 | 0.066 | 1.015 | 15.2 | 1.0 | |
| HS | 476,464 (19,048) | 25.0 (5.5) | 0.81 | 0.19 | 0.068 | 1.797 | 14.7 | 0.6 | |
| AV | 540,866 (19,108) | 28.3 (5.6) | 0.82 | 0.18 | 0.079 | 1.656 | 12.6 | 0.6 | |
| GP | 474,085 (19,025) | 24.9 (5.4) | 0.81 | 0.19 | 0.068 | 1.802 | 14.7 | 0.6 | |
| DC | 602,297 (19,080) | 31.6 (5.7) | 0.83 | 0.17 | 0.091 | 1.530 | 11.0 | 0.7 | |
| FC | 345,251 (19,061) | 18.1 (4.7) | 0.73 | 0.27 | 0.093 | 1.946 | 10.8 | 0.5 | |
| UN | 90,794 (18,912) | 4.8 (2.1) | 0.72 | 0.28 | 2.895 | 0.984 | 0.30 | 1.0 | |
| ROHs detected in EI cases | Mixture | 4196 (125) | 38.9 (12.9) | 0.84 | 0.16 | 0.064 | 1.385 | 15.7 | 0.7 |
Estimated parameters, include mixture proportions (π1 and π2 = 1 − π1) and rates (λ1 and λ2 in Mb−1) of the two exponential distributions composing the mixture. Mixture of exponential distributions were fitted to the length distribution of simulated true HBD segments, of simulated ROHs and of ROH detected in 125 cases of extreme inbreeding (EI). Simulations were performed under various mating types: parent–offspring mating (PO), full-sib mating (FS), half-sib mating (HS), avuncular mating (AV), grandparent–grandchild mating (GP), double first cousins (DC), first-cousins mating (FC) and mating between unrelated individuals (UN). SD standard deviation
Association between extreme inbreeding (EI) and multiple traits measured in UK Biobank participants (125 EI cases vs. 345,276 EI controls)
| Traits (unit: trait SD) | Mean in EI cases | Mean in controls | Effect size (unit: trait SD) | Extrapolated effect size (unit: trait SD for 100% inbreeding) | Standard error (s.e.) | |
|---|---|---|---|---|---|---|
| PEF | −0.651 | 0.005 | −0.656 | −3.88 | 0.099 | 2.8 × 10−11 |
| Height | −0.404 | 0.012 | −0.417 | −2.46 | 0.090 | 3.2 × 10−6 |
| HGS | −0.395 | 0.004 | −0.441 | −2.35 | 0.091 | 1.2 × 10−5 |
| FIS | −0.570 | 0.010 | −0.581 | −3.43 | 0.152 | 1.4 × 10−4 |
| MTCIM | −0.334 | 0.003 | −0.337 | −1.99 | 0.091 | 2.0 × 10−4 |
| AA | −0.557 | 0.002 | −0.559 | −3.31 | 0.164 | 6.7 × 10−4 |
| EA | −0.260 | 0.023 | −0.283 | −1.67 | 0.089 | 1.5 × 10−3 |
| VA | 0.370 | 0.003 | −0.373 | −2.21 | 0.179 | 0.037 |
| NCh | −0.230 | −0.009 | −0.221 | −1.31 | 0.089 | 0.013 |
| HWR | −0.640 | 0.005 | −0.170 | −1.01 | 0.090 | 0.058 |
| Polygenic predictor of EA* | −0.259 | −0.262 | 4.9 × 10−4 | N/A | 0.018 | 0.978 |
| RR [log(RR)] | s.e. of log(RR) | P-value | ||||
| NCh | 1.54 | 1.78 | 0.23 [−1.46] | N/A | 0.302 | 1.3 × 10−6 |
| NDIS | 12.2 | 6.90 | 1.44 [0.36] | N/A | 0.089 | 3.6 × 10−5 |
| NDIS* | 13.9 | 8.90 | 1.34 [0.29] | N/A | 0.083 | 4.4 × 10−4 |
| NDIS parents | 2.16 | 2.24 | 0.96 [−0.04] | N/A | 0.065 | 0.507 |
Mean FROH is ~0.172 (SD: 0.067) in EI cases and ~0.003 (SD: 0.002) in EI controls. Effect sizes were estimated using either linear regression or overdispersed Poisson regression (latter for the following traits NCh, NDIS, NDIS* and NDIS parents) of the trait on the EI binary status. Extrapolated effect sizes in trait SD for 100% inbreeding were obtained by dividing estimated effect size by the difference in mean FROH between EI cases and EI controls, i.e., ~0.17. Traits analysed, include peak expiratory flow (PEF), standing height, handgrip strength (HGS), fluid intelligence score (FIS), mean time to correctly identify matches (MTCIM), auditory acuity (AA), number of years of education or educational attainment (EA), visual acuity (VA), number of children (NCh), hip-to-waist ratio (HWR), number of diseases diagnosed (NDIS) based on the International Classification of Diseases, Tenth Revision (ICD10). NDIS* refers to NDIS in individuals with at least one disease diagnosed. We also compared the number of disease-groups UKB participants reported their parents to be affected with (NDIS parents). RR relative risk, SD standard deviation. Estimates were adjusted for age at recruitment, recruitment centre (treated as a categorical factor), sex, year of birth (treated as a continuous variable), genotyping batch (treated as a factor), socioeconomic status measured by the Townsend deprivation index and population structure measured by ten genetic principal components estimated from HM3 SNPs. Polygenic predictor of EA was calculated using summary statistics from the Lee et al. (2018) study (excluding the UKB) and also adjusted for ten principal components. Inbreeding load for NCh was estimated as B = −log(RR) = 1.46 (95% confidence interval: [0.87–2.05]).
Fig. 4Phenotypic reduction (in trait standard deviation; SD) observed in 125 extreme inbreeding (EI: FROH > 0.1) cases compared to 345,276 unrelated EI controls (FROH < 0.01). Observed means for EI cases and controls are reported in Table 3. Phenotypic reduction was assessed for ten traits: auditory acuity (AA), fluid intelligence score (FIS), peak expiratory volume (PEF), hip-to-waist ratio (HWR), visual acuity (VA), height, cognitive ability measured as the mean time to correctly identify matches (MTCIM), handgrip strength (HGS), number of children (NCh) and educational attainment (EA) measured as the number of years of education. Traits were adjusted for age at recruitment, sex, recruitment centre, year of birth, genotyping batch, socioeconomic status measured by the Townsend deprivation index and population structure measured by 10 genetic principal components estimated from HM3 SNPs. Inbreeding depression was estimated within unrelated EI controls using two inbreeding measures: FUNI and FROH. Resulting estimates were used to linearly predict the reduction in EI cases. Vertical bars around predictions corresponds to 99.5% confidence interval as the significance was defined here at p < 0.05/10