| Literature DB >> 26908616 |
Samuel Lessard1, Alisa K Manning2, Cécile Low-Kam1, Paul L Auer3, Ayush Giri4, Mariaelisa Graff5, Claudia Schurmann6, Hanieh Yaghootkar7, Jian'an Luan8, Tonu Esko9, Tugce Karaderi10, Erwin P Bottinger11, Yingchang Lu11, Chris Carlson12, Mark Caulfield13, Marie-Pierre Dubé1, Rebecca D Jackson14, Charles Kooperberg12, Barbara McKnight15, Ian Mongrain16, Ulrike Peters12, Alex P Reiner12, David Rhainds16, Nona Sotoodehnia17, Joel N Hirschhorn18, Robert A Scott8, Patricia B Munroe13, Timothy M Frayling7, Ruth J F Loos6, Kari E North5, Todd L Edwards19, Jean-Claude Tardif1, Cecilia M Lindgren20, Guillaume Lettre21.
Abstract
Although the role of complete gene inactivation by two loss-of-function mutations inherited in trans is well-established in recessive Mendelian diseases, we have not yet explored how such gene knockouts (KOs) could influence complex human phenotypes. Here, we developed a statistical framework to test the association between gene KOs and quantitative human traits. Our method is flexible, publicly available, and compatible with common genotype format files (e.g. PLINK and vcf). We characterized gene KOs in 4498 participants from the NHLBI Exome Sequence Project (ESP) sequenced at high coverage (>100×), 1976 French Canadians from the Montreal Heart Institute Biobank sequenced at low coverage (5.7×), and >100 000 participants from the Genetic Investigation of ANthropometric Traits (GIANT) Consortium genotyped on an exome array. We tested associations between gene KOs and three anthropometric traits: body mass index (BMI), height and BMI-adjusted waist-to-hip ratio (WHR). Despite our large sample size and multiple datasets available, we could not detect robust associations between specific gene KOs and quantitative anthropometric traits. Our results highlight several limitations and challenges for future gene KO studies in humans, in particular when there is no prior knowledge on the phenotypes that might be affected by the tested gene KOs. They also suggest that gene KOs identified with current DNA sequencing methodologies probably do not strongly influence normal variation in BMI, height, and WHR in the general human population.Entities:
Mesh:
Year: 2016 PMID: 26908616 PMCID: PMC5062577 DOI: 10.1093/hmg/ddw055
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Number and frequency of predicted gene knockouts (KO) in 1727 African Americans and 2772 European Americans from the NHLBI Exome Sequence Project (ESP)
| Variants/individuals | Variants/gene | Not phased | Phased | |||||
|---|---|---|---|---|---|---|---|---|
| Gene KOs/individuals | Number of KO genes | Gene KOs/individuals | Number of KO genes | |||||
| African Americans | All LoF ( | 237 | 0.92 | 33.7 | 2530 | 25.9 | 2429 | |
| Homozygotes | 23.2 | 2384 | 23.2 | 2384 | ||||
| Compound heterozygotes | 10.4 | 601 | 2.6 | 334 | ||||
| Rare LoF ( | 65 | 0.89 | 4.2 | 2174 | 2.5 | 2071 | ||
| Homozygotes | 2.3 | 2028 | 2.3 | 2028 | ||||
| Compound heterozygotes | 1.9 | 381 | 0.2 | 155 | ||||
| European Americans | All LoF ( | 197 | 1.12 | 28.8 | 1844 | 23.2 | 1741 | |
| Homozygotes | 21.3 | 1694 | 21.3 | 1694 | ||||
| Compound heterozygotes | 7.6 | 487 | 1.9 | 247 | ||||
| Rare LoF ( | 39 | 1.09 | 1.8 | 1538 | 1.1 | 1433 | ||
| Homozygotes | 1 | 1390 | 1.01 | 1390 | ||||
| Compound heterozygotes | 0.8 | 318 | 0.09 | 124 | ||||
For this loss-of-function (LoF) variant analysis, we consider autosomal nonsense, stop-loss and splice site variants, as well as frameshift insertion-deletions (indels). Rare LoF variants have a minor allele frequency <5%. In the absence of phasing information, we assume that rare LoF are inherited in trans. As expected, considering phased genotype information significantly impacts the number of gene KOs that we can detect due to compound heterozygosity.
Figure 1.Distributions of the number of NHLBI Exome Sequence Project (ESP) participants with predicted gene knockouts (KOs). We present distributions in African Americans (A and B) and European Americans (C and D). We include all loss-of-function (LoF: nonsense, stop-loss, splice site, frameshift indel) variants in (A) and (C), whereas only rare/low-frequency LoF variants (minor allele frequency <5%) are included in (B) and (D). Homo., gene KO due to homozygosity; Comp. het., gene KO due to compound heterozygosity; Both, genes with homozygous and compound heterozygous LoF variants.
Figure 2.Schematic representation of the method to detect association between gene knockouts (KOs) and human quantitative variation. This example depicts a fictive gene with three exons (GENE1) that contains several SNPs. Our analytical framework only considers loss-of-function (LoF) variants (shown in red). GENE1 KOs are individuals who are either compound heterozygous of homozygous for LoF variants (individual 1 and 2). The histogram shows the distribution of a normalized human quantitative trait. Our method tests whether individuals that are KOs for a given gene (red arrows) have on average more extreme phenotypes than the rest of the individuals.
Figure 3.Quantile-quantile (QQ) plots of association results between predicted gene knockouts (KOs) and anthropometric traits in the (A–C) NHLBI Exome Sequence Project (ESP) and (D–F) GIANT ExomeChip datasets. In these datasets, we only considered loss of function (LoF) variants (nonsense, stop-loss, splice site, frameshift indels (ESP only)) with a minor allele frequency (MAF) <5%. We analyzed three anthropometric traits: (A) body mass index (BMI) (Nparticipants = 4475), (B) height (Nparticipants = 4423) and (C) waist-to-hip ratio (WHR) (Nparticipants = 2973). We performed these analyses stratified by ethnicity, and then combined the European American and African American results using meta-analysis methodology. We analyzed the same traits in the GIANT dataset: (D) BMI (Nparticipants = 103 838), (E) height (Nparticipants = 102 775) and (F) WHR (Nparticipants = 62 355). Results are not corrected for the genomic inflation factor. The dash lines correspond to the 95% confidence interval. λGC, genomic inflation factor; Ngene, number of genes with at least one participant that carries two LoF alleles.
Association of gene knockouts (KOs) with anthropometric traits in the Exome Sequence Project (ESP) and Montreal Heart Institute (MHI) Biobank DNA sequencing datasets
| Trait | Gene | ESP | MHI | Combined | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean EA (real units) | Mean AA (real units) | Mean (real units) | Weighted average (real units) | ||||||||
| BMI | 0.7 (+3.2 kg/m2) | 11 | 0.5 (+2.3 kg/m2) | 6 | 0.009 | 1.6 (+7.2 kg/m2) | 3 | 0.009 | 0.8 (+3.6 kg/m2) | 0.0002 | |
| 2.7 (+12.2 kg/m2) | 1 | 3.1 (+14.0 kg/m2) | 1 | 5 × 10−5 | −0.2 (−0.9 kg/m2) | 2 | 0.67 | 1.4 (+6.3 kg/m2) | 0.002 | ||
| Height | NA | 0 | −1.1 (−7.0 cm) | 4 | 0.03 | −1.6 (−10.2 cm) | 2 | 0.02 | −1.3 (−8.3 cm) | 0.002 | |
| 3.6 (23.0 cm) | 1 | 1.5 (9.6 cm) | 2 | 5 × 10−5 | −0.4 (−2.6 cm) | 2 | 0.56 | 1.2 (+7.7 cm) | 0.002 | ||
| −1.6 (−10.2 cm) | 2 | NA | 0 | 0.02 | −1.9 (−12.2 cm) | 1 | 0.06 | −1.7 (−10.9 cm) | 0.003 | ||
| WHR | 0.6 (+0.04) | 1 | 1.8 (+0.13) | 2 | 0.04 | 1.5 (+0.11) | 2 | 0.03 | 1.4 (0.10) | 0.003 | |
We attempted to replicate gene KO associations from the ESP whole-exome DNA sequencing dataset in the MHI Biobank whole-genome DNA sequencing dataset. We tested for replication genes with P < 0.05 and at least two KO individuals in the ESP dataset. We report genes with a combined P < 0.005. We provide the mean gene KO effect size in standard deviation (SD) and metric units, assuming that 1 SD corresponds to 4.5 kg/m2, 6.4 cm, and 0.07 for BMI, height, and WHR respectively. NKO: number of individuals that are KO for the given gene.
EA: European-ancestry; AA: African-ancestry.
Top association results between anthropometric traits and predicted gene knockouts (KOs) identified using ExomeChip data from 22 studies participating in the GIANT Consortium
| Trait | Gene | Weighted mean (SD) | |||
|---|---|---|---|---|---|
| BMI | 100 | 15 | −0.35 | 0.001 | |
| 2 | 2 | −1.90 | 0.002 | ||
| 7 | 5 | −0.75 | 0.002 | ||
| 4 | 3 | 1.05 | 0.003 | ||
| 147 | 16 | 0.23 | 0.003 | ||
| 191 | 6 | 0.15 | 0.004 | ||
| 9 | 2 | −1.02 | 0.004 | ||
| 3 | 2 | −1.64 | 0.004 | ||
| 2 | 2 | −2.04 | 0.005 | ||
| Height | 2 | 2 | −2.28 | 0.0001 | |
| 365 | 10 | −0.12 | 0.0003 | ||
| 3 | 2 | −2.01 | 0.0003 | ||
| 21 | 3 | −0.60 | 0.0008 | ||
| 2 | 2 | −2.04 | 0.0009 | ||
| 2 | 2 | 2.35 | 0.001 | ||
| 4 | 2 | 1.70 | 0.001 | ||
| 13 | 3 | −0.78 | 0.002 | ||
| 6 | 2 | −1.25 | 0.002 | ||
| 45 | 9 | 0.35 | 0.002 | ||
| 27 | 4 | 0.61 | 0.003 | ||
| 9 | 1 | −1.00 | 0.003 | ||
| 2 | 1 | 2.00 | 0.004 | ||
| 10 | 4 | −0.72 | 0.004 | ||
| 6 | 5 | −0.96 | 0.005 | ||
| WHR | 7 | 1 | 1.39 | 0.0002 | |
| 3 | 2 | −1.78 | 0.001 | ||
| 6 | 3 | 1.27 | 0.002 | ||
| 13 | 1 | 0.83 | 0.002 | ||
| 3 | 2 | −1.52 | 0.004 | ||
| 2 | 2 | 1.86 | 0.004 | ||
| 191 | 11 | 0.15 | 0.005 |
We only report genes with P < 0.005 and at least two KO individuals. The weighted mean corresponds to the average phenotype (in standard deviation units) of individuals that are KO for this gene. NKO: number of individuals with a KO gene; Nstudy: number of studies with at least one KO individual for a given gene.
Figure 4.Quantile-quantile (QQ) plots of association results between predicted gene knockouts (KOs) in candidate-genes and anthropometric traits. We restricted these analyses to OMIM disease-causing genes (green), genes with Residual Variation Intolerance Score (RVIS) score <15% of RVIS scores for all genes in the human genome (red), or genes with a probability of being loss-of-function intolerant (pLI) score >0.9 (blue). We report results for three anthropometric traits in the NHLBI Exome Sequence Project (ESP): (A) body mass index (BMI) (Nparticipants = 4475), (B) height (Nparticipants = 4423) and (C) waist-to-hip ratio (WHR) (Nparticipants = 2973). We also report results for the same traits in the GIANT ExomeChip datasets: (D) BMI (Nparticipants = 103 838), (E) height (Nparticipants = 102 775), and (F) WHR (Nparticipants = 62 355). Results are not corrected for the genomic inflation factor. The dash lines correspond to the 95% confidence interval. λGC, genomic inflation factor; Ngene, number of genes with at least one participant that carries two LoF alleles.