| Literature DB >> 25116239 |
Catarina D Campbell1, Kiana Mohajeri1, Maika Malig1, Fereydoun Hormozdiari1, Benjamin Nelson1, Gaixin Du2, Kristen M Patterson2, Celeste Eng3, Dara G Torgerson3, Donglei Hu3, Catherine Herman2, Jessica X Chong2, Arthur Ko1, Brian J O'Roak1, Niklas Krumm1, Laura Vives1, Choli Lee1, Lindsey A Roth3, William Rodriguez-Cintron4, Jose Rodriguez-Santana5, Emerita Brigino-Buenaventura6, Adam Davis7, Kelley Meade7, Michael A LeNoir8, Shannon Thyne9, Daniel J Jackson10, James E Gern10, Robert F Lemanske11, Jay Shendure1, Mark Abney2, Esteban G Burchard12, Carole Ober2, Evan E Eichler13.
Abstract
Asthma is a complex genetic disease caused by a combination of genetic and environmental risk factors. We sought to test classes of genetic variants largely missed by genome-wide association studies (GWAS), including copy number variants (CNVs) and low-frequency variants, by performing whole-genome sequencing (WGS) on 16 individuals from asthma-enriched and asthma-depleted families. The samples were obtained from an extended 13-generation Hutterite pedigree with reduced genetic heterogeneity due to a small founding gene pool and reduced environmental heterogeneity as a result of a communal lifestyle. We sequenced each individual to an average depth of 13-fold, generated a comprehensive catalog of genetic variants, and tested the most severe mutations for association with asthma. We identified and validated 1960 CNVs, 19 nonsense or splice-site single nucleotide variants (SNVs), and 18 insertions or deletions that were out of frame. As follow-up, we performed targeted sequencing of 16 genes in 837 cases and 540 controls of Puerto Rican ancestry and found that controls carry a significantly higher burden of mutations in IL27RA (2.0% of controls; 0.23% of cases; nominal p = 0.004; Bonferroni p = 0.21). We also genotyped 593 CNVs in 1199 Hutterite individuals. We identified a nominally significant association (p = 0.03; Odds ratio (OR) = 3.13) between a 6 kbp deletion in an intron of NEDD4L and increased risk of asthma. We genotyped this deletion in an additional 4787 non-Hutterite individuals (nominal p = 0.056; OR = 1.69). NEDD4L is expressed in bronchial epithelial cells, and conditional knockout of this gene in the lung in mice leads to severe inflammation and mucus accumulation. Our study represents one of the early instances of applying WGS to complex disease with a large environmental component and demonstrates how WGS can identify risk variants, including CNVs and low-frequency variants, largely untested in GWAS.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25116239 PMCID: PMC4130548 DOI: 10.1371/journal.pone.0104396
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of whole-genome sequencing.
| Phenotype | Sequence (Gb) | Coverage | SNVs (millions) | Small CNVs | Larger CNVs | |
|
| asthma | 53 | 13.06 | 2.7 | 2683 | 445 |
| father1 | asthma | 43 | 11.84 | 2.7 | 2737 | 466 |
| mother1 | asthma | 47 | 12.47 | 2.8 | 2662 | 412 |
|
| control | 59 | 17.11 | 2.7 | 1886 | 459 |
| father2 | control | 57 | 12.06 | 2.7 | 2556 | 454 |
| mother2 | symptoms | 45 | 9.72 | 2.8 | 2534 | 425 |
|
| control | 63 | 16.28 | 2.8 | 1717 | 485 |
| father3 | control | 51 | 10.51 | 2.7 | 2614 | 490 |
| mother3 | control | 54 | 12.21 | 2.8 | 2716 | 407 |
|
| asthma | 69 | 15.25 | 2.8 | 3094 | NA |
| father4 | asthma | 46 | 11.90 | 2.7 | 2672 | 351 |
| mother4 | asthma | 36 | 10.72 | 2.7 | 2562 | 344 |
|
| asthma | 67 | 13.1 | 2.8 | 3267 | 383 |
| father5 | BHR | 39 | 13.28 | 2.8 | 2044 | 480 |
| mother5 | asthma | 46 | 14.31 | 2.8 | 2727 | 415 |
|
| control | 58 | 14.40 | 2.8 | 3202 | 417 |
|
| - | 833 | 208.22 | 5.4 | 5852 | 1064 |
|
| - | 52 | 13.01 | 2.8 | 2605 | 429 |
*Mean coverage of the genome (NCBI build 36) based on mapped reads.
Identified from read-pair mappings.
Identified with read-depth.
Used as “reference” sample in the read-depth approach.
Bronchial hyperresponsiveness (BHR).
Figure 1Overview of sequenced genomes.
Ideograms of the autosomes and X chromosome are shown. Red bars represent segments of autozygosity observed in five individuals with asthma (parents of the sequenced trios) and yellow triangles represent asthma-specific gene-disruptive SNVs or indels. The regions of autozygosity in multiple cases on chromosomes 1q and 5q were also observed to be autozygous in at least one control individual.
Figure 2Comparison of CNV allele frequency between Hutterites and CEU.
A histogram of the difference in allele frequency for the non-reference allele between Hutterites and the CEU individuals from the 1000 Genomes Project is shown. On the x-axis are bins for the absolute value of the non-reference allele frequency in the Hutterites minus the allele frequency in CEU. The y-axis represents the number of CNVs in each bin.
Figure 3Association results of 593 CNVs to asthma.
CNVs for association testing in the full Hutterite pedigree were identified in the 16 sequenced genomes. The points for these CNVs are colored based on the results of the whole-genome sequencing to represent whether the variant was observed in cases only (red), control individuals only (blue), or in both case and control individuals (gray). The genomic position is represented on the x-axis and the −log10(p-value) of the nominal association of each CNV to asthma in the full Hutterite pedigree is on the y-axis.
Figure 4Identification of an intronic deletion in NEDD4L associated with asthma.
(A) A representation of the location of the 6 kbp deletion. This deletion occurs in an intron shared by all reported transcripts of this gene (blue). The deletion was identified from paired-end sequence reads and validated by array CGH as shown for one of the sequenced trios. The log2 ratios of the probes in this region are shown as vertical bars with a log2 ratio of zero represented by the horizontal line. The red vertical bars in the child and mother indicate negative log2 ratios and confirm the deletion. (B) The frequency of this deletion in multiple populations is shown in the bar graph with the deletion allele frequency on the y-axis. The error bars represent the standard error on the allele frequency based on the binomial distribution. The Hutterite case (black) and control (gray) frequencies were determined by array CGH; the frequencies for the other populations are as reported by the 1000 Genomes Project.
SNVs and indels nominally associated with asthma in the Hutterites.
| chr | Pos | dbSNP | gene | mutation | CEU freq | Hutterite freq | p | OR |
| 3 | 187944218 | rs76438938 | KNG1 | R376X | 0.024 | 0.05 | 0.019 | 2.34 |
| 6 | 154609555 | rs34427887 | OPRM1 | R401X | 0.000 | 0.05 | 0.022 | 1.66 |
| 15 | 63733326 | . | SLC24A1 | L1053fs | 0.000 | 0.04 | 0.013 | 2.38 |
| 18 | 50134887 | rs17292725 | STARD6 | R19X | 0.024 | 0.04 | 0.041 | 1.78 |
| 19 | 43072247 | . | WDR87 | E1263X | 0.000 | 0.01 | 0.036 | 2.03 |
*Position in NCBI build 36.
Based on the 1000 Genomes Project Phase 1 [50].
Corrected for relatedness as previously described [55].
Odds ratio (OR).
Association of intronic deletion in NEDD4L with asthma in additional populations.
| Study | Ancestry | N Cases; N Controls | Case Freq | Control Freq | p | OR |
| Chicago | European American | 177; 219 | 0.014 | 0.000 | 0.051 | Inf |
| Freiburg | German | 370; 364 | 0.015 | 0.015 | 0.97 | 0.98 |
| COAST | European American | 189; 172 | 0.019 | 0.015 | 0.64 | 1.27 |
| GALAII | Puerto Rican | 755; 742 | 0.001 | 0.000 | 0.50 | Inf |
| SAGE | African American | 1052; 747 | 0.004 | 0.003 | 0.22 | 1.6 |
|
|
|
|
|
*Odds ratio (OR).
Combined association statistics were obtained using the Cochran–Mantel–Haenszel test.