| Literature DB >> 30562943 |
Wengang Zhang1, Xue Gao2, Xinping Shi3,4, Bo Zhu5, Zezhao Wang6, Huijiang Gao7, Lingyang Xu8, Lupei Zhang9, Junya Li10, Yan Chen11.
Abstract
Principal component analysis (PCA) is a potential approach that can be applied in multiple-trait genome-wide association studies (GWAS) to explore pleiotropy, as well as increase the power of quantitative trait loci (QTL) detection. In this study, the relationship of test single nucleotide polymorphisms (SNPs) was determined between single-trait GWAS and PCA-based GWAS. We found that the estimated pleiotropic quantitative trait nucleotides (QTNs) β * ^ were in most cases larger than the single-trait model estimations ( β 1 ^ and β 2 ^ ). Analysis using the simulated data showed that PCA-based multiple-trait GWAS has improved statistical power for detecting QTL compared to single-trait GWAS. For the minor allele frequency (MAF), when the MAF of QTNs was greater than 0.2, the PCA-based model had a significant advantage in detecting the pleiotropic QTNs, but when its MAF was reduced from 0.2 to 0, the advantage began to disappear. In addition, as the linkage disequilibrium (LD) of the pleiotropic QTNs decreased, its detection ability declined in the co-localization effect model. Furthermore, on the real data of 1141 Simmental cattle, we applied the PCA model to the multiple-trait GWAS analysis and identified a QTL that was consistent with a candidate gene, MCHR2, which was associated with presoma muscle development in cattle. In summary, PCA-based multiple-trait GWAS is an efficient model for exploring pleiotropic QTNs in quantitative traits.Entities:
Keywords: MCHR2; genome-wide association study; multiple-trait; pleiotropy; principal component analysis
Year: 2018 PMID: 30562943 PMCID: PMC6316348 DOI: 10.3390/ani8120239
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 2.752
Figure 1Layout of principal component analysis (PCA)-based multiple-trait genome-wide association studies (GWAS) versus single-trait GWAS. (a) Single causal variant model. Provided that a casual single nucleotide polymorphism (SNP) (red spot) has an effect on trait 1 (cattle size) and trait 2 (cattle color) with β1 and β2, the process of estimation of β1 and β2 using trait 1 and trait 2 is called single-trait GWAS. According to components decomposition, pseudo traits are formed and the process of estimation of β is called PCA-based multiple-trait GWAS. The yellow marker represents genotyped SNP in beadchip. (b) Colocalizing effect model. Two different genetic variants in high linkage disequilibrium that affect different traits. In both situations, we compared the relationships among β1, β2, and β.
Positions, effects, and p-Values of ten quantitative trait nucleotides (QTNs) based on simulated data without environmental correlation.
| Chr a | Pos (bp) | Trait 1 eff | Trait 2 eff | Single-Trait GWAS | Multiple-Trait GWAS | ||||
|---|---|---|---|---|---|---|---|---|---|
| −log( | se eff | −log( | se eff | −log( | se eff | ||||
| 1 | 5167453 | 1.18 | 1.66 | 3.63 | 0.06 | 1.87 | 0.09 | 3.19 | 0.01 |
| 1 | 126001364 | 1.34 | 1.93 | 4.38 | 0.03 | 3.45 | 0.04 | 4.65 | 0.01 |
| 1 | 128776905 | 1.83 | 2.51 | 1.13 | 0.13 | 1.17 | 0.18 | 1.33 | 0.03 |
| 1 | 132347489 | 1.21 | 1.91 | 4.57 | 0.13 | 5.85 | 0.18 | 6.16 | 0.03 |
| 1 | 135921964 | 0.89 | 1.43 | 1.73 | 0.06 | 4.70 | 0.08 | 3.53 | 0.01 |
| 4 | 28841329 | 0.93 | 1.47 | 1.10 | 0.04 | 3.68 | 0.05 | 2.54 | 0.01 |
| 4 | 65810279 | 1.82 | 2.38 | 5.24 | 0.11 | 5.22 | 0.16 | 6.24 | 0.02 |
| 4 | 80902019 | 3.41 | 5.71 | 17.55 | 0.06 | 30.18 | 0.08 | 28.08 | 0.01 |
| 4 | 115266053 | 2.20 | 3.94 | 10.05 | 0.06 | 16.65 | 0.08 | 15.70 | 0.01 |
| 5 | 6270944 | 0.84 | 0.94 | 2.48 | 0.04 | 0.87 | 0.05 | 1.87 | 0.01 |
Note: a One of the simulated data results. Pleiotropic traits were simulated based on 10 QTNs. If the significant threshold was a p-Value < 10−6, only two QTNs (chr4: 80902019 and chr4: 115266053) could be identified based on single-trait GWAS results. Meanwhile, four QTNs (chr1: 132347489, chr4: 65810279, chr4: 80902019, and chr4: 115266053) could be identified based on PCA-based GWAS results. Shaded QTNs are causal variants only found in PCA-based GWAS. GWAS, Genome-Wide Association Study. Chr, Chromosome. Pos, Position. Eff, effective. Se eff, Standard error of estimated effects.
Phenotypic variance and heritability explained by each principle component.
| Scenario | Heritability | Environmental Correlation | PC1 | PC2 | ||
|---|---|---|---|---|---|---|
| Phenotypic Variance (SD a) | Heritability Explained (SD) | Phenotypic Variance (SD) | Heritability Explained (SD) | |||
| 1 | 0.5 | 0 | 75.98 (25.12) | 0.534 (0.04) | 14.96 (4.34) | 0.271 (0.03) |
| 2 | 0.05 | 0 | 56.78 (17.22) | 0.052 (0.01) | 39.81 (10.23) | 0.035 (0.01) |
| 3 | 0.5 | 0.25 | 89.12 (30.09) | 0.580 (0.04) | 9.80 (2.11) | 0.130 (0.07) |
Note: a SD: Standard Deviation.
Figure 2Comparison of power and false discovery rate (FDR) between multiple-trait GWAS and single-trait GWAS. We simulated three situations including medium heritability (a,d), low heritability (b,e), and environmental correlation (c,f). (a–c) Power under different significant levels. (d–f) FDR under different significant levels.
Figure 3Comparison of detection power between multiple-trait GWAS and single-trait GWAS in different minor allele frequencies. MAF: minor allele frequency. Upper left figure reveals a histogram of the minor allele frequency in the simulated data.
Figure 4Comparison of power/ False Discover Rate (FDR) in different levels of linkage disequilibrium in the colocalizing effect model.
Statistical summary and genetic parameters of three phenotypes.
| Trait | Number of Samples | Mean (Kg) (SD) | Heritability | CW | FSW | HMSW |
|---|---|---|---|---|---|---|
| Clod weight (CW) | 1111 | 5.06 (0.88) | 0.57 | 1 | 0.82 a | 0.79 |
| Fore shank weight (FSW) | 1111 | 17.03 (3.15) | 0.56 | 0.90 b | 1 | 0.76 |
| Heel muscle shank weight (HMSW) | 1111 | 1.07 (0.19) | 0.62 | 0.93 | 0.94 | 1 |
Note: a phenotype correlation. b genetic correlation.
Figure 5Manhattan plot of the association study results of real cattle data. The three phenotypes are clod weight (CW), fore shank weight (FSW), and heel muscle shank weight (HMSW). The significant level is 10−7, represented by the red line, and the suggestive significant level is 10−5, represented by the pink line.