| Literature DB >> 18466463 |
Yuanjia Wang1, Yixin Fang, Shuang Wang.
Abstract
When the number of phenotypes in a genetic study is on the scale of thousands, such as in studies concerning thousands of gene expression levels, the single-trait analysis is computationally intensive, and heavy adjustment of multiple comparisons is required. Traditional multivariate genetic linkage analysis for quantitative traits focuses on mapping only a few phenotypes and is not feasible for a large number of traits. To cope with high-dimensional phenotype data, clustering analysis and principal-component analysis (PCA) are proposed to reduce the data dimensionality and to map shared genetic contributions for multiple traits. However, standard clustering analysis and PCA are applicable for independent observations. In most genetic studies, where family data are collected, these standard analyses can only be applied to founders and can lead to the loss of information. Here, we proposed a clustering method that can exploit family structure information and applied the method to 29 gene expression levels mapped to a reported hot spot on chromosome 14. We then used a PCA approach based on heritability applicable to small number of traits to combine phenotypes in the clusters. Lastly, we used a penalized PCA approach based on heritability applicable to arbitrary number of traits to combine 150 gene expression levels with the highest heritability. Genome-wide multipoint linkage analysis was carried out on the individual traits and on the combined traits. Two previously reported peaks on chromosomes 14 and 20 were identified. Linkage evidence was stronger for traits derived from methods that incorporate family structure information.Entities:
Year: 2007 PMID: 18466463 PMCID: PMC2367519 DOI: 10.1186/1753-6561-1-s1-s121
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1A, standard clustering; B, proposed clustering.
Principal components of heritability and linkage analysis
| Principal components analysis | Linkage analysis | |||||||
| First 3 components in each cluster | Clustering method | Cluster | No. members | Cumulative % variation explained | Genome-wide peak | Genome-wide peak | SNP | Peak location |
| A.1 | Classical | A | 10 | 36% | 3.7 | 1.25 × 10-4 | rs1333050 | chr 9 (43.9 cM) |
| A.2 | Classical | A | 10 | 58% | 3.01 | 1.00 × 10-3 | rs983795 | chr 14 (77.3 cM) |
| A.3 | Classical | A | 10 | 72% | 3.59 | 1.90 × 10-4 | rs332364 | chr 3 (93.3 cM) |
| B.1 | Proposed | B | 8 | 37% | 4.31 | 1.06 × 10-5 | rs941838 | chr 14 (113 cM) |
| B.2 | Proposed | B | 8 | 62% | 4.82 | 1.02 × 10-6 | rs1955897 | chr 14 (110.5 cM) |
| B.3 | Proposed | B | 8 | 77% | 4.06 | 2.93 × 10-5 | rs297675 | chr 20 (11.9 cM) |
| C.1 | Proposed | C | 11 | 41% | 4.46 | 5.31 × 10-6 | rs1892302 | chr 10 (27 cM) |
| C.2 | Proposed | C | 11 | 58% | 4.99 | 4.62 × 10-7 | rs2206185 | chr 6 (111.2 cM) |
| C.3 | Proposed | C | 11 | 68% | 5.36 | 7.35 × 10-8 | rs1950475 | chr 14 (109 cM) |
| D.1 | Proposed | D | 10 | 38% | 4.61 | 2.78 × 10-6 | rs1950475 | chr 14 (109.6 cM) |
| D.2 | Proposed | D | 10 | 60% | 3.32 | 4.95 × 10-4 | rs739495 | chr 19 (58.5 cM) |
| D.3 | Proposed | D | 10 | 74% | 3.34 | 4.62 × 10-4 | rs1955897 | chr 14 (110.5 cM) |
Figure 2Linkage analysis of the principal components obtained from standard method (Cluster A) and proposed method (Cluster D).
Figure 3Linkage analysis of the penalized PCH approach applied to the 150 traits with highest heritability.