| Literature DB >> 22373363 |
Gang Shi1, Jeannette Simino, Dabeeru C Rao.
Abstract
Genome-wide association studies have been successful in identifying common variants for common complex traits in recent years. However, common variants have generally failed to explain substantial proportions of the trait heritabilities. Rare variants, structural variations, and gene-gene and gene-environment interactions, among others, have been suggested as potential sources of the so-called missing heritability. With the advent of exome-wide and whole-genome next-generation sequencing technologies, finding rare variants in functionally important sites (e.g., protein-coding regions) becomes feasible. We investigate the role of linkage information to select families enriched for rare variants using the simulated Genetic Analysis Workshop 17 data. In each replicate of simulated phenotypes Q1 and Q2 on 697 subjects in 8 extended pedigrees, we select one pedigree with the largest family-specific LOD score. Across all 200 replications, we compare the probability that rare causal alleles will be carried in the selected pedigree versus a randomly chosen pedigree. One example of successful enrichment was exhibited for gene VEGFC. The causal variant had minor allele frequency of 0.0717% in the simulated unrelated individuals and explained about 0.1% of the phenotypic variance. However, it explained 7.9% of the phenotypic variance in the eight simulated pedigrees and 23.8% in the family that carried the minor allele. The carrier's family was selected in all 200 replications. Thus our results show that family-specific linkage information is useful for selecting families for sequencing, thus ensuring that rare functional variants are segregating in the sequencing samples.Entities:
Year: 2011 PMID: 22373363 PMCID: PMC3287923 DOI: 10.1186/1753-6561-5-S9-S82
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Distribution of causal minor alleles of Q1 and Q2 in the eight simulated pedigrees
| Gene | Family | Founder | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | Yes | No | |
| Q1 | ||||||||||
| 3 | 2 | 0 | 0 | 6 | 0 | 0 | 0 | 4 | 7 | |
| 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | |
| 0 | 0 | 5 | 0 | 28 | 16 | 13 | 12 | 28 | 46 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | |
| 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 1 | 2 | |
| 14 | 37 | 43 | 31 | 6 | 0 | 93 | 0 | 55 | 169 | |
| 4 | 20 | 0 | 0 | 0 | 0 | 22 | 0 | 3 | 43 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 31 | 0 | 1 | 30 | |
| Q2 | ||||||||||
| 4 | 0 | 0 | 0 | 1 | 0 | 2 | 2 | 5 | 4 | |
| 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 7 | 20 | 2 | 1 | 0 | 1 | 23 | 7 | 15 | 46 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 1 | |
| 0 | 1 | 2 | 0 | 0 | 3 | 0 | 0 | 4 | 2 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 3 | 0 | 0 | 4 | 21 | 0 | 4 | 24 | |
| 16 | 21 | 0 | 5 | 9 | 12 | 8 | 0 | 13 | 58 | |
| 2 | 0 | 0 | 0 | 11 | 0 | 6 | 0 | 4 | 15 | |
| 35 | 60 | 27 | 15 | 24 | 17 | 77 | 0 | 54 | 201 | |
| 24 | 17 | 27 | 0 | 21 | 0 | 42 | 42 | 46 | 127 | |
| 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 | 0 | |
Asterisks indicate a single nonmonomorphic causal variant in the gene.
Results of selecting one family using the family-specific LOD score for Q1 and Q2
| Gene | LOD | LODmax | Expected | Observed | |
|---|---|---|---|---|---|
| Q1 | |||||
| 0.07 | 0.09 | 0.375 | 0.390 | 0.33 | |
| 0.09 | 0.10 | 0.125 | 0.070 | 0.99 | |
| 0.12 | 0.18 | 0.625 | 0.850 | 2.47 × 10−11 | |
| 0.24 | 0.24 | 0 | 0 | NA | |
| 0.03 | 0.04 | 0.125 | 0.090 | 0.93 | |
| 0.65 | 0.75 | 0.125 | 0.010 | 1.00 | |
| 0.71 | 0.76 | 0.75 | 0.965 | 1.09 × 10−12 | |
| 3.32 | 2.40 | 0.375 | 1.000 | <10−12 | |
| 3.87 | 4.44 | 0.125 | 1.000 | <10−12 | |
| Q2 | |||||
| 0.12 | 0.13 | 0.5 | 0.555 | 0.06 | |
| 0.08 | 0.09 | 0.125 | 0.100 | 0.86 | |
| 0.09 | 0.11 | 0 | 0 | NA | |
| 0.16 | 0.15 | 0.875 | 0.925 | 0.02 | |
| 0.12 | 0.12 | 0.125 | 0.205 | 3.12 × 10−4 | |
| 0.23 | 0.20 | 0.375 | 0.395 | 0.28 | |
| 0.09 | 0.11 | 0 | 0 | NA | |
| 0.15 | 0.15 | 0.375 | 0.485 | 6.56 × 10−4 | |
| 0.28 | 0.27 | 0.75 | 0.875 | 2.23 × 10−5 | |
| 0.12 | 0.13 | 0.375 | 0.480 | 1.08 × 10−3 | |
| 0.28 | 0.23 | 0.875 | 0.785 | 1.00 | |
| 0.28 | 0.23 | 0.75 | 0.795 | 0.07 | |
| 0.09 | 0.10 | 0.125 | 0.105 | 0.80 |
Asterisks indicate a single nonmonomorphic causal variant in the gene. LOD is the average LOD score of linkage analysis in 200 replications. LODmax is the average maximum family-specific LOD score in 200 replications. “Expected” is the chance of selecting a family carrying rare alleles from random selection; “Observed” is the frequency of selecting a family carrying rare alleles by selecting the family with the LODmax score.
Percentage of phenotypic variance explained by causal genes of Q1 and Q2 in the eight simulated pedigrees
| Gene | Family | Total samples | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | ||
| Q1 | |||||||||
| 1.3 | 0.1 | 0 | 0 | 2.0 | 0 | 0 | 0 | 0.3 | |
| 0 | 0 | 0.8 | 0 | 0 | 0 | 0 | 0 | 0.08 | |
| 0 | 0 | 1.4 | 0 | 22.2 | 11.9 | 2.3 | 14.6 | 5.8 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0.07 | 0 | 0 | 0 | 0.006 | |
| 0 | 0 | 0 | 0 | 0.4 | 0 | 0 | 0 | 0.04 | |
| 0.8 | 1.6 | 0.9 | 1.8 | 0.5 | 0 | 1.6 | 0 | 1.2 | |
| 7.9 | 24.6 | 0 | 0 | 0 | 0 | 14.6 | 0 | 9.0 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 23.8 | 0 | 7.9 | |
| Q2 | |||||||||
| 2.4 | 0 | 0 | 0 | 0.07 | 0 | 0.06 | 0.2 | 0.3 | |
| 0.3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.04 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 1.8 | 4.1 | 0.6 | 0.4 | 0 | 0.3 | 4.3 | 2.2 | 2.1 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0.5 | 0 | 0.1 | |
| 0 | 0.02 | 0.7 | 0 | 0 | 0.08 | 0 | 0 | 0.1 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 1.0 | 0 | 0 | 4.2 | 3.4 | 0 | 1.3 | |
| 3.8 | 2.2 | 0 | 1.2 | 2.1 | 3.0 | 1.2 | 0 | 1.7 | |
| 0.6 | 0 | 0 | 0 | 4.6 | 0 | 3.1 | 0 | 1.2 | |
| 1.6 | 2.8 | 1.8 | 1.4 | 1.7 | 1.3 | 2.1 | 0 | 1.9 | |
| 1.7 | 1.2 | 1.9 | 0 | 1.6 | 0 | 2.1 | 7.8 | 2.2 | |
| 0 | 0 | 0 | 1.7 | 0 | 0 | 0 | 0 | 0.2 | |
Asterisks indicate a single nonmonomorphic causal variant in the gene.