| Literature DB >> 22373536 |
Ying Liu1, Chien Hsun Huang, Inchi Hu, Shaw-Hwa Lo, Tian Zheng.
Abstract
Both common variants and rare variants are involved in the etiology of most complex diseases in humans. Developments in sequencing technology have led to the identification of a high density of rare variant single-nucleotide polymorphisms (SNPs) on the genome, each of which affects only at most 1% of the population. Genotypes derived from these SNPs allow one to study the involvement of rare variants in common human disorders. Here, we propose an association screening approach that treats genes as units of analysis. SNPs within a gene are used to create partitions of individuals, and inverse-probability weighting is used to overweight genotypic differences observed on rare variants. Association between a phenotype trait and the constructed partition is then evaluated. We consider three association tests (one-way ANOVA, chi-square test, and the partition retention method) and compare these strategies using the simulated data from the Genetic Analysis Workshop 17. Several genes that contain causal SNPs were identified by the proposed method as top genes.Entities:
Year: 2011 PMID: 22373536 PMCID: PMC3287829 DOI: 10.1186/1753-6561-5-S9-S106
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Inverse probability similarity measure: allelic similarity scores
| Individual 2 | Individual 1 | |
|---|---|---|
p is the population frequency of minor allele a.
Inverse probability similarity measure: genotypic similarity scores
| Individual 2 | Individual 1 | ||
|---|---|---|---|
p is the population frequency of minor allele a.
Figure 1Clustering of individuals using nonsynonymous SNPs for Each row is a SNP, and each column is an individual. Green vertical bars indicate case subjects. Genotype aA is plotted in blue, and genotype AA is plotted in white (a is the minor allele); the genotype aa was not observed. The partitions of the 697 individuals are indicated by dotted lines. Partition element 2 is driven by similarity on SNP C13S431 but not on the more common SNPs C13S522 and C13S523.
Figure 2Top ten genes identified by each of the methods and for each of . Ninety-one genes are shown, displayed by chromosome. Genes with causal SNPs are highlighted (yellow for Q1 and blue for Q2).
Association between a consistent false-positive gene (OR2T3) and a causal SNP at C13S523
| C13S523 genotype ( | Partition based on SNPs of | ||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | |
| 1 | 41 | 29 | 3 | 9 | 11 |
| 2 | 525 | 59 | 5 | 8 | 7 |
Figure 3Power to identify a causal gene versus effect size. For each trait, we plot the power to detect using the best performing method against the effect size used in the simulation model. That is, we plot the one-way ANOVA with Bonferroni correction for Q1 and Y, and the I from the partition retention method for Q2. The gene-wise effect size is defined as the sum of SNP-wise MAF × causal SNP effect in the simulation model.