| Literature DB >> 26818242 |
Huili Yuan1, Zhenye Li2, Nelson L S Tang3, Minghua Deng4,5,6.
Abstract
BACKGROUND: Expression quantitative trait locus (eQTL) analysis has been widely used to understand how genetic variations affect gene expressions in the biological systems. Traditional eQTL is investigated in a pair-wise manner in which one SNP affects the expression of one gene. In this way, some associated markers found in GWAS have been related to disease mechanism by eQTL study. However, in real life, biological process is usually performed by a group of genes. Although some methods have been proposed to identify a group of SNPs that affect the mean of gene expressions in the network, the change of co-expression pattern has not been considered. So we propose a process and algorithm to identify the marker which affects the co-expression pattern of a pathway. Considering two genes may have different correlations under different isoforms which is hard to detect by the linear test, we also consider the nonlinear test.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26818242 PMCID: PMC4895706 DOI: 10.1186/s12918-015-0245-0
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1Flowchart of our strategy. Flowchart of our strategy for detecting pathway-associated SNP. We first perform Lasso to adjust the effects of SNPs on the means of gene expression. Then we use covariance test and kernel covariance test to select candidate pathway-SNP modules
Fig. 2Comparison between linear method and kernel method. Simulations under different setups. Setup of the first column is under model 1, the second column is under model 2 and the third column is under model 3. First row: (p, n 1,n 2,θ) = (40, 60, 60, 0.2); Second row: (p, n 1,n 2,θ) = (40, 60, 60, 0.3); Third row: (p, n 1,n 2,θ) = (80, 120, 120, 0.2); Fourth row: (p, n 1,n 2,θ) = (80,120,120, 0.3)
Fig. 3Comparison between Chen’s linear method and other method. Topleft: The two covariance matrices have eight different elements, each with a magnitude generated from U n i f(0,4)∗ max1≤j≤p σ ; Topright: The two covariance matrices have eight different elements, each with a magnitude generated from U n i f(0,400)∗ max1≤j≤p σ ; Bottomleft: The two covariance matrices have 500t different elements, each with a magnitude generated from U n i f(0,4)∗ max1≤j≤p σ ; Bottomright: The two covariance matrices have 500 different elements, each with a magnitude generated from U n i f(0,400)∗ max1≤j≤p σ
Fig. 4Different modules detected under different conditions by different method. a Detected modules under ethanol condition. We found 36 modules by covariance test, 9 modules by kernel covariance test with parameter 1 and 51 modules by kernel covariance test with parameter 10; b Detected modules under glucose condition. We found 86 modules by covariance test, 3 modules by kernel covariance test with parameter 10 and 12 modules by kernel covariance test with parameter 1
New associated pathways and SNPs under ethanol condition
| Pathways | Associated markers |
|---|---|
| Glycolysis/Gluconeogenesis |
|
| Synthesis and degradation of ketone bodies |
|
| Steroid biosynthesis |
|
|
| |
|
| |
| Valine, leucine and isoleucine degradation |
|
|
| |
| Valine, leucine and isoleucine |
|
| biosynthesis |
|
| Histidine metabolism |
|
| Tyrosine metabolism |
|
| Phenylalanine metabolism |
|
| beta-Alanine metabolism |
|
| Taurine and hypotaurine metabolism |
|
| Selenocompound metabolism |
|
|
| |
| Purine metabolism |
|
| Cyanoamino acid metabolism |
|
| Arachidonic acid metabolism |
|
| Linoleic acid metabolism |
|
| Glyoxylate and dicarboxylate metabolism |
|
| Porphyrin and chlorophyll metabolism |
|
| Sphingolipid metabolism |
|
| Pantothenate and CoA biosynthesis |
|
| Terpenoid backbone biosynthesis |
|
|
| |
| Sesquiterpenoid and triterpenoid |
|
| biosynthesis | |
|
| |
|
| |
| Metabolic pathways |
|
|
| |
|
| |
|
| |
|
| |
| Biosynthesis of secondary metabolites |
|
| Carbon metabolism |
|
| 2-Oxocarboxylic acid metabolism |
|
|
| |
|
| |
| mRNA surveillance pathway |
|
| Mismatch repair |
|
| Non-homologous end |
|
| Biosynthesis of amino acids |
|
| MAPK signaling pathway |
|
L means detected by covariance test, (1) means detected by kernel covariance test with parameter 1 and (10) means detected by kernel covariance test with parameter 10. The FDR of the covariance test, kernel covariance test with parameter 1 and kernel covariance test with parameter 10 are 0.25, 0.33 and 0.25 respectively. The FWER of the test by Tony Cai is 0.2
New associated pathways and SNPs under glucose condition
| Pathways | Associated markers |
|---|---|
| Synthesis and degradation of ketone bodies |
|
| Inositol phosphate metabolism |
|
| Riboflavin metabolism |
|
| Fatty acid degradation |
|
| Cysteine and methionine metabolism |
|
| Valine, leucine and isoleucine biosynthesis |
|
|
| |
|
| |
|
| |
| Phenylalanine metabolism |
|
|
| |
|
| |
|
| |
| beta-Alanine metabolism |
|
|
| |
| Arachidonic acid metabolism |
|
| Vitamin B6 metabolism |
|
| Porphyrin and chlorophyll metabolism |
|
| Degradation of aromatic compounds |
|
| ABC transporters |
|
| Glycolysis/Gluconeogenesis |
|
| Pentose phosphate pathway |
|
|
| |
|
| |
|
| |
| Pentose and glucuronate interconversions |
|
| Purine metabolism |
|
| Pyrimidine metabolism |
|
| Glycine, serine and threonine metabolism |
|
| Lysine biosynthesis |
|
| Histidine metabolism |
|
|
| |
|
| |
|
| |
| Tyrosine metabolism |
|
|
| |
| Cyanoamino acid metabolism |
|
| Starch and sucrose metabolism |
|
| Linoleic acid metabolism |
|
| Butanoate metabolism |
|
| Pantothenate and CoA biosynthesis |
|
| Lipoic acid metabolism |
|
| Folate biosynthesis |
|
| Sesquiterpenoid and triterpenoid biosynthesis |
|
| Aminoacyl-tRNA biosynthesis |
|
|
| |
| Biosynthesis of unsaturated fatty acids |
|
| Metabolic pathways |
|
| Biosynthesis of secondary metabolites |
|
| Biosynthesis of amino acids |
|
|
| |
|
| |
|
| |
| Ribosome |
|
| RNA transport |
|
| RNA polymerase |
|
| Proteasome |
|
| Phosphatidylinositol signaling system |
|
| Meiosis - yeast |
|
L means detected by covariance test, (1) means detected by kernel covariance test with parameter 1 and (10) means detected by kernel covariance test with parameter 10. The FDR of the covariance test, kernel covariance test with parameter 1 and kernel covariance test with parameter 10 are 0.20, 0.24 and 0.33 respectively. The FWER of the test by Tony Cai is 0.2
Fig. 5Isoform-specific structure change. Left: Scatter plot of gene YER086W and YCL064C under two different genotypes of YCL023C; Right: Scatter plot of gene YLR355C and YCL064C under two different genotypes of YCL023C. We found the associated pathway-SNP modules only by kernel covariance test. The scatter figures show that YER086W-YCL064C and YLR355C-YCL064C were nonlinear correlated under genotypes of marker YCL023C
Fig. 6Failure to detect correlation between single gene expression level and genotype of YFL029C. Left: Boxplot of expression level of YFL029C; middle, Boxplot of expression level of YKR089C; right: Boxplot of expression level of YJR155W. We can see that the means of YKR089C and YJR155W expressions do not change significantly
Fig. 7Three examples of differential coexpression patterns of 2 genes due to genotype of YFL029C. Left: the co-expression patterns between the two genes YKR089C and YJR155W depend on the genotype of YFL029C. For samples with genotype 1, the co-expression correlation is different from the other samples. Middle: the co-expression patterns between the two genes YKR089C and YPR086W depend on the genotype of YFL029C. Right: the co-expression patterns between the two genes YJR155W and YIL131C depend on the genotype of YFL029C
Fig. 8The possible regulatory mechanism of the marker YFL029C to Linoleic acid metabolism pathway. a KEGG pathway of Linoleic acid metabolism pathway; b The potential regulatory relationship between marker YFL029C and Linoleic acid metabolism pathway