Literature DB >> 27462363

A Canonical Correlation Analysis of AIDS Restriction Genes and Metabolic Pathways Identifies Purine Metabolism as a Key Cooperator.

Hanhui Ye1, Jinjin Yuan1, Zhengwu Wang1, Aiqiong Huang1, Xiaolong Liu1, Xiao Han2, Yahong Chen1.   

Abstract

Human immunodeficiency virus causes a severe disease in humans, referred to as immune deficiency syndrome. Studies on the interaction between host genetic factors and the virus have revealed dozens of genes that impact diverse processes in the AIDS disease. To resolve more genetic factors related to AIDS, a canonical correlation analysis was used to determine the correlation between AIDS restriction and metabolic pathway gene expression. The results show that HIV-1 postentry cellular viral cofactors from AIDS restriction genes are coexpressed in human transcriptome microarray datasets. Further, the purine metabolism pathway comprises novel host factors that are coexpressed with AIDS restriction genes. Using a canonical correlation analysis for expression is a reliable approach to exploring the mechanism underlying AIDS.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27462363      PMCID: PMC4947641          DOI: 10.1155/2016/2460184

Source DB:  PubMed          Journal:  Comput Math Methods Med        ISSN: 1748-670X            Impact factor:   2.238


1. Introduction

Human immunodeficiency virus (HIV) is the basis for acquired immune deficiency syndrome (AIDS) pathogenesis and destroys the lymphoid system with prodigious replicates, which reduces a patient's ability to survive. Since HIV was identified in the 1980s, this pathogen has taken more than 10 million people's lives throughout the world. Researchers have developed considerable information on HIV involving immunology, virology, host genetics, and treatment over the past few decades. Human genetics research involving the infectious disease HIV has progressed considerably after initiation of the human genome project (HGP), which is sequencing the entire human genome, both physically and functionally [1]. Many host genetic factors that influence AIDS epidemiological heterogeneity have been characterized [2-4]. From the HIV entry receptor on lymphoid cells to oncogenes in human glioblastomas, AIDS restriction genes (ARGs) are widely involved in biological pathways, and nearly 40 ARGs have been studied in depth through functional analyses [5-12]. Host genomic analysis is a key approach to studying AIDS epidemiology [13]. Further, genome, transcriptome, proteome, and metabolome biodatasets related to HIV have grown exponentially due to advanced sequencing technology. However, an integrative study on these datasets is limited in terms of understanding the complicated biological network. Recent studies have revealed that metabolic pathways exert certain effects on the control of AIDS disease progression [14]. For example, the oxygen concentration can modulate T-cell differentiation through controlling metabolic status [15]. Metabolizing ATP to adenosine inhibits HIV-specific effector cells. Further, HIV infection is affected by dNTP hydrolysis. Efficient HIV-1 infection of CD4(+) lymphocytes requires sufficient glucose uptake via the Glut1 glucose transporter [16]. Tryptophan and phenylalanine metabolism also play an important role in HIV because HIV pathophysiology is associated with inflammatory stress due to dysregulated amino acid metabolism [17]. The HIV protein NEF impacts lipid-related metabolism through impairing cholesterol metabolism in both infected and bystander cells [18, 19]. This evidence suggests that cross talk between AIDS and the host metabolism is an important research topic that is necessary to resolve the disease mechanism and aid in therapy. Integrating biodatasets with an in-depth analysis of host AIDS restriction genes and metabolic pathways is imperative. In the transcriptome, gene coexpression is a model for understanding how individual genes are correlated in certain conditions [20, 21]. Based on advances in this field, researchers hypothesize that the coexpression of genes in certain pathways indicates an integrative correlation between the two molecular pathways. Full genes in metabolic pathway are available for the human genome. Identifying correlations between a group of metabolic pathway genes and ARGs is a more comprehensive means for understanding integrative biodatasets. However, traditional methods using a Pearson or partial correlation are only suitable for a single gene. A canonical correlation analysis (CCA) is an efficient and powerful approach for measuring coexpression between two sets of genes. A Childhood Asthma Management Program (CAMP) study using a CCA successfully detected genetic regulatory variants [22]. Using the CCA, the glioblastoma transcriptomes of 45 patients were thoroughly analyzed to identify the glioma pathway genes [23]. In this paper, we used a CCA to analyze coexpression between ARGs and metabolic pathways from KEGG. We discuss the most important metabolic pathways coexpressed with the ARGs, which may imply strategies for AIDS diagnosis and therapy.

2. Methods

2.1. Datasets

Human genome expression datasets were downloaded from the website COPRESDB (http://coxpresdb.jp/), which contains approximately 4000 experiments and expression data on 20,000 human genes. Metabolic pathway genes were downloaded from KEGG (http://www.kegg.jp/); this dataset includes 129 typical metabolic pathways with predicted genes. The ARGs were collected from published literature. Two expression datasets were generated to include metabolic pathway gene and ARG expression data, respectively (Tables 1 and 2).
Table 1

HIV host genetic factor genes.

Gene symbolGene IDEffect
APOBEC3B9582 Increase infection
APOBEC3G60489Accelerates AIDS
CCL116356
CCL176361
CCL186362
CCL26347
CCL46351
CCL56352
CUL58065Accelerates CD4 loss
CXCR13577
CXCR610663Accelerates AIDS
DC-SIGN30835Decreases infection
DEFB11672
GML2765
HCP510866HIV set point
HLA-A3105Delays AIDS
HLA-B3106Delays AIDS
HLA-C3107Delays AIDS
IDH13417Prevents infection
IFENG3458Accelerates AIDS
IL103586Accelerates AIDS
IL43565
IRF13659
KIR2669Delays AIDS
LY6D8581
MYH94627End stage renal disease
NCOR29612 Increase infection
PECI/ECI210455Accelerates AIDS
PPIA/CypA5478Accelerates AIDS
PROX15629Delays AIDS progression
SDF1/CXCL126387Delays AIDS
Slurp157152
Slurp2/Ly66004
TLR47099
TLR851311
TLR954106
TRIM5a85363 Increase infection
TSG1017251Accelerates AIDS
ZNRD130834
Table 2

Human metabolism pathway for KEGG.

Pathway nameKEGG IDClass of metabolism pathwayGene number
Glycolysis/gluconeogenesis10Carbohydrate metabolism67
Citrate cycle (TCA cycle)20Carbohydrate metabolism31
Pentose phosphate pathway30Carbohydrate metabolism29
Pentose and glucuronate interconversions40Carbohydrate metabolism34
Fructose and mannose metabolism51Carbohydrate metabolism36
Galactose metabolism52Carbohydrate metabolism30
Ascorbate and aldarate metabolism53Carbohydrate metabolism27
Starch and sucrose metabolism500Carbohydrate metabolism56
Amino sugar and nucleotide sugar 520Carbohydrate metabolism49
Pyruvate metabolism620Carbohydrate metabolism42
Glyoxylate and dicarboxylate metabolism630Carbohydrate metabolism24
Propanoate metabolism640Carbohydrate metabolism32
Butanoate metabolism650Carbohydrate metabolism29
Inositol phosphate metabolism562Carbohydrate metabolism61
Oxidative phosphorylation190Energy metabolism133
Nitrogen metabolism910Energy metabolism27
Sulfur metabolism920Energy metabolism18
Fatty acid biosynthesis61Lipid metabolism6
Fatty acid elongation62Lipid metabolism23
Fatty acid metabolism71Lipid metabolism44
Ketone bodies72Lipid metabolism9
Steroid biosynthesis100Lipid metabolism18
Primary bile acid biosynthesis120Lipid metabolism17
Steroid hormone biosynthesis140Lipid metabolism56
Glycerolipid metabolism561Lipid metabolism55
Glycerophospholipid metabolism564Lipid metabolism91
Ether lipid metabolism565Lipid metabolism42
Sphingolipid metabolism600Lipid metabolism47
Arachidonic acid metabolism590Lipid metabolism68
Linoleic acid metabolism591Lipid metabolism33
Alpha-linolenic acid metabolism592Lipid metabolism25
Biosynthesis of unsaturated fatty acids1040Lipid metabolism21
Purine metabolism230Nucleotide metabolism173
Pyrimidine metabolism240Nucleotide metabolism107
Alanine, aspartate, and glutamate metabolism250Amino acid metabolism32
Glycine, serine, and threonine metabolism260Amino acid metabolism37
Cysteine and methionine metabolism270Amino acid metabolism34
Valine, leucine, and isoleucine degradation280Amino acid metabolism44
Valine, leucine, and isoleucine biosynthesis290Amino acid metabolism2
Lysine biosynthesis300Amino acid metabolism2
Lysine degradation310Amino acid metabolism49
Arginine and proline metabolism330Amino acid metabolism57
Histidine metabolism340Amino acid metabolism28
Tyrosine metabolism350Amino acid metabolism39
Phenylalanine metabolism360Amino acid metabolism18
Tryptophan metabolism380Amino acid metabolism40
Phenylalanine, tyrosine, and tryptophan biosynthesis400Amino acid metabolism5
Beta-alanine metabolism410Metabolism of other amino acids29
Taurine and hypotaurine metabolism430Metabolism of other amino acids10
Selenocompound metabolism450Metabolism of other amino acids17
Cyanoamino acid metabolism460Metabolism of other amino acids7
D-Glutamine and D-glutamate metabolism471Metabolism of other amino acids4
D-Arginine and D-ornithine metabolism472Metabolism of other amino acids1
Glutathione metabolism480Metabolism of other amino acids51
N-Glycan biosynthesis510Glycan biosynthesis and metabolism49
Mucin type O-glycan biosynthesis512Glycan biosynthesis and metabolism31
Other types of O-glycan biosynthesis514Glycan biosynthesis and metabolism30
Glycosaminoglycan biosynthesis, chondroitin sulfate/dermatan sulfate532Glycan biosynthesis and metabolism20
Glycosaminoglycan biosynthesis, heparan sulfate/heparin534Glycan biosynthesis and metabolism24
Glycosaminoglycan biosynthesis, keratan sulfate533Glycan biosynthesis and metabolism15
Glycosaminoglycan degradation531Glycan biosynthesis and metabolism19
Glycosylphosphatidylinositol- (GPI-) anchor biosynthesis563Glycan biosynthesis and metabolism25
Glycosphingolipid biosynthesis, lacto- and neolactoseries601Glycan biosynthesis and metabolism26
Glycosphingolipid biosynthesis, globoseries603Glycan biosynthesis and metabolism14
Glycosphingolipid biosynthesis, ganglioseries604Glycan biosynthesis and metabolism15
Other glycan degradation511Glycan biosynthesis and metabolism18
Thiamine metabolism730Metabolism of cofactors and vitamins4
Riboflavin metabolism740Metabolism of cofactors and vitamins13
Vitamin B6 metabolism750Metabolism of cofactors and vitamins6
Nicotinate and nicotinamide metabolism760Metabolism of cofactors and vitamins28
Pantothenate and CoA biosynthesis770Metabolism of cofactors and vitamins17
Biotin metabolism780Metabolism of cofactors and vitamins3
Lipoic acid metabolism785Metabolism of cofactors and vitamins3
Folate biosynthesis790Metabolism of cofactors and vitamins14
One carbon pool by folate670Metabolism of cofactors and vitamins20
Retinol metabolism830Metabolism of cofactors and vitamins68
Porphyrin and chlorophyll metabolism860Metabolism of cofactors and vitamins43
Ubiquinone and other terpenoid-quinone biosynthesis130Metabolism of cofactors and vitamins10
Terpenoid backbone biosynthesis900Metabolism of terpenoids and polyketides21
Caffeine metabolism232Biosynthesis of other secondary metabolites7
Butirosin and neomycin biosynthesis524Biosynthesis of other secondary metabolites5
Metabolism of xenobiotics by cytochrome P450980Xenobiotics biodegradation and metabolism80
Drug metabolism, cytochrome P450982Xenobiotics biodegradation and metabolism74
Drug metabolism, other enzymes983Xenobiotics biodegradation and metabolism51

2.2. Canonical Correlation Analysis

To analyze the correlations between ARG and metabolic pathway gene expression, we used a CCA, which integrates multiple correlations into a few significant correlations. This statistical method calculates the correlation between two sets of variables and generates statistically independent pairs of new variables, which are referred to as canonical variables. The linear combination of the variables creates a component of the canonical variable pair in each group of the original variables. In this study, these variables were defined at each flag as follows: ARG expression described by M genes in the vector c = (c 1, c 2,…, c ) and metabolic pathway gene expression described by N genes in the vector k = (k 1, k 2,…, k ). The respective sets of canonical variables s = (s 1, s 2,…, s ) and p = (p 1, p 2,…, p ) are results from the linear combination of ARG and metabolic pathway gene expression. The ARG expression canonical variables are included in the vector s, which is the result of the linear combination comprising the c vector (original ARGs expression) and the canonical coefficients vector as s = A′c. The vector contains the canonical variables for metabolic pathway gene expression, which result from the linear combination of the vector (original metabolic pathway genes expression) and canonical coefficient vector. The ARG and metabolic pathway gene variance-covariance matrices can be used to estimate the canonical correlation coefficients. The magnitude of the correlation between each pair of canonical variables is described by the vector k eigenvalues. The canonical coefficients exist in the eigenvectors and can be used to estimate the canonical variables. The variance-covariance matrices contain the variances and covariances within the groups for the ARGs and metabolic pathway genes, respectively. The covariances between variables were calculated from the variance-covariance matrices.

2.3. The Study Design and Software Tools

The canonical correlation analysis was performed using the R platform (http://www.r-project.org/). After the canonical variables were generated from the expression datasets composed of ARGs and metabolic pathway genes, we set the absolute value 0.15 as the threshold for selecting ARGs correlated with canonical variables. To select metabolic pathway genes correlated with canonical variables, we sorted the genes using the absolute value, and the top 50 were selected for further enrichment analyses. Functional annotations were generated and enrichment analyses were performed for the metabolic pathway genes using the web-based DAVID tool (http://david.abcc.ncifcrf.gov/). For the pathway enrichment analyses, the “KEGG_PATHWAY” was selected. The pathways with a P value < 0.01 were considered significant.

3. Results

3.1. The ARGs and Metabolic Pathway Genes

3.1.1. The General CCA Results

Eight significant (P < 0.01, Wilk's Lambda, r > 0.95) canonical correlations were discerned between the ARG and metabolic pathway gene transcriptomes using the CCA. 60% of the total ARG expression variance was explained by the ARGs canonical variables. Significant metabolic pathway canonical variables explained 38% of the metabolic gene transcriptome variation. Thus, ARG-metabolic pathway associations were involved in a substantial proportion of the total variance. The first pair of canonical variables had a correlation of 0.99, while the second pair of canonical variables had a correlation of 0.98.

3.2. Relationships between the Canonical Variables and Original Genes

3.2.1. Pair 1 (C1, P1)

As shown in Table 3, the canonical variable C1 explains 2.4% of the variability in the original ARGs expression variables. We observed positive correlations (absolute value > 0.15) with all ARGs, including PPIA (0.42), ZNRD1 (0.37), MYH9 (0.36), TSG101 (0.31), IDH1 (0.28), TRIM5a (0.17), and CUL5 (0.15), but not GML (−0.17) and NCOR2 (−0.31). The greatest positive correlation was observed between C1 and PPIA. In contrast, the greatest negative correlation was observed between C1 and NCOR2. Among seven ARGs with positive correlations, the four ARGs, PPIA, TSG101, TRIM5a, and CUL5, are postentry cellular viral cofactors.
Table 3

Cross-correlation of Hf genes with canonical variate.

Gene symbol C1 C2 C3 C4 C5 C6 C7 C8
DEFB10.010.010.020.15 0.06−0.02−0.14−0.04
KIR0.08−0.02−0.05−0.01−0.140.08 0.17 −0.12
GML0.17 0.16 0.12−0.06 0.21 −0.060.030.07
HLA-A0.10−0.14−0.010.00−0.030.22 −0.13−0.09
HLA-B0.100.090.120.120.41 0.25 0.070.31
HLA-C0.070.26 −0.04−0.08 0.21 0.33 0.22 0.21
IDH1 0.28 0.17 0.22 0.17 0.60 0.20 1.12 0.63
IFENG0.000.030.080.070.05−0.09−0.12−0.07
IL4−0.120.18 0.080.050.01−0.100.15 0.30
CXCR1−0.070.25 0.20 0.00 0.17 0.40 −0.140.08
IL10−0.02−0.050.020.130.05−0.04−0.05−0.01
IRF10.07−0.090.080.10 0.23 0.24 −0.140.24
MYH9 0.36 0.17 0.21 −0.140.49 0.50 0.140.19
PPIA/CypA 0.42 0.92 1.88 0.58 0.54 1.11 0.00 1.12
PROX1−0.140.070.03 0.16 0.23 0.02 0.55 0.66
Slurp2/Ly6−0.04−0.030.000.10−0.130.120.090.02
CCL20.02−0.03−0.040.00−0.050.120.080.00
CCL40.02−0.060.020.090.020.050.04−0.06
CCL50.03−0.060.010.030.000.02−0.140.05
CCL110.02−0.050.010.020.09−0.01 0.25 −0.05
CCL170.00−0.060.050.070.06−0.02−0.19−0.06
CCL180.03−0.040.000.060.050.140.090.05
SDF1/CXCL120.09−0.100.17 0.18 0.26 0.140.09 0.22
TLR40.06−0.05−0.02 0.24 0.15 0.26 0.000.02
TSG101 0.31 0.48 0.25 −0.05 0.17 0.49 0.54 1.03
CUL5 0.15 0.51 0.87 0.23 0.19 0.40 0.80 −0.04
LY6D0.01−0.050.100.24 0.100.130.01−0.08
APOBEC3B0.040.030.040.03 0.15 −0.120.020.14
NCOR20.31 0.28 0.37 0.27 1.10 0.24 0.38 0.52
PECI/ECI20.09 0.15 0.24 0.01−0.100.060.35 0.25
CXCR6−0.06−0.100.02−0.060.03 0.18 −0.070.09
HCP5−0.040.02−0.01−0.010.320.010.02−0.01
ZNRD1 0.37 0.14 0.28 −0.03 0.32 0.91 −0.040.10
DC-SIGN−0.040.29 0.130.000.17 0.03 0.22 0.33
TLR80.130.36 −0.03 0.30 0.33 0.70 −0.15 0.17
TLR9−0.030.18 0.11−0.06−0.020.17 −0.24 0.36
Slurp1−0.03−0.10 0.19 0.61 0.21 0.32 0.04−0.06
APOBEC3G0.060.17 −0.140.110.190.070.44 0.39
TRIM5a 0.17 −0.130.15 0.00 0.26 0.30 0.22 0.53
As shown in Table 4, the canonical variable P1 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that correlated with variable P1 were enriched for purine metabolism; these genes include phosphodiesterase 4C (5143), polymerase (RNA) III (DNA directed) polypeptide K (51728), and primase (5558).
Table 4

Cross-correlation of genes enriched in metabolic pathways with canonical variate.

ComponentTermCountPop hits P valueGenes
P1+Purine metabolism31534.87E − 025143, 51728, 5558
P3+Glycolysis/gluconeogenesis3601.14E − 025223, 2597, 57818
P3+Pyrimidine metabolism3952.72E − 025425, 51727, 7372
P4−Purine metabolism41537.62E − 031716, 51728, 55703, 5313
P4+Purine metabolism51536.23E − 0455811, 5147, 5425, 5432, 8654
P5−Inositol phosphate metabolism3546.82E − 038871, 5330, 3707
P6−Pyrimidine metabolism3952.72E − 025435, 51727, 84172
P6+Pyruvate metabolism3403.17E − 035162, 4191, 38
P6+Terpenoid backbone biosynthesis2153.20E − 022224, 38
P7−Pyrimidine metabolism3952.72E − 0254963, 5435, 5430
P7+Methane metabolism261.52E − 02128, 4524

3.2.2. Pair 2 (C2, P2)

As shown in Table 3, the canonical variable C2 explains 5.3% of the variability in the original ARG expression variables. This variable highly correlated with the ARGs PPIA (0.92), CUL5 (0.51), TSG101 (0.48), IDH1 (0.17), and PECI (0.15), but not GML (−0.16), APOBEC3G (−0.17), MYH9 (−0.17), IL4 (−0.18), TLR9 (−0.18), CXCR1 (−0.25), HLA-C (−0.26), NCOR2 (−0.28), DC-SIGN (−0.29), and TLR8 (−0.36). The greatest positive correlation was observed between C2 and PPIA. However, the greatest negative correlation was observed between C2 and DC-SIGN. Among the ARGs with large correlations, PPIA, TSG101, CUL5, and APOBEC3G are postentry cellular viral cofactors. Among the ARGs with negative correlations, CXCR1 and IL4 are related to cytokines. DC-SIGN is involved in chemokines, which play important role in HIV entry through chemokine receptors. As shown in Table 4, the canonical variable P2 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlate with the variable P2 are not enriched in a certain pathway.

3.2.3. Pair 3 (C3, P3)

As shown in Table 3, the canonical variable C3 explains 12.7% of the variability on the original ARG expression variables. This variable positively correlated (absolute value > 0.15) with PPIA (1.88), NCOR2 (0.37), ZNRD1 (0.28), MYH9 (0.21), CXCR1 (0.20), and Slurp1 (0.19); in contrast, it negatively correlated with TRIM5a (−0.15), SDF1 (−0.17), IDH1 (−0.22), PECI (−0.24), TSG101 (−0.25), and CUL5 (−0.87). The greatest positive correlation was observed between C1 and PPIA. However, the greatest negative correlation was observed between C3 and CUL5. Among the ARGs that highly correlated with C3, PPIA, TSG101, TRIM5a, and CUL5 are postentry cellular viral cofactors. However, only PPIA positively correlated with C3. As shown in Table 4, the canonical variable P3 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlated with the variable P3 are enriched in glycolysis and pyrimidine metabolism. The glycolysis genes include phosphoglycerate mutase 1 (5223), glyceraldehyde-3-phosphate dehydrogenase (2597), and glucose-6-phosphatase (57818). The pyrimidine metabolism genes include polymerase (DNA directed), delta 2 (5425), cytidine monophosphate (UMP-CMP) kinase 1 (51727), and uridine monophosphate synthetase (7372).

3.2.4. Pair 4 (C4, P4)

As shown in Table 3, the canonical variable C4 explains 3.3% of the variability in the original ARG expression variables. This variable highly correlated (absolute value > 0.15) with PPIA (0.58), TLR8 (0.30), TLR4 (0.24), and PROX1 (0.16), but not DEFB1 (−0.15), IDH1 (−0.17), SDF1 (−0.18), CUL5 (−0.23), LY6D (−0.24), NCOR2 (−0.27), and Slurp1 (−0.61). The greatest positive correlation was observed between C4 and PPIA. However, the greatest negative correlation was observed between C4 and Slurp1. Among the ARGs that highly correlated with C3, only PPIA and CUL5 are postentry cellular viral cofactors. As shown in Table 4, the canonical variable P4 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlated with the variable P4 are enriched in purine metabolism. These genes include deoxyguanosine kinase (1716), polymerase (RNA) III (DNA directed) polypeptide K (51728), polymerase (RNA) III (DNA directed) polypeptide B (55703), pyruvate kinase (5313), adenylate cyclase 10 (55811), phosphodiesterase 6D (5147), polymerase (DNA directed), delta 2 (5425), polymerase (RNA) II (DNA directed) polypeptide C (5432), and phosphodiesterase 5A (8654).

3.2.5. Pair 5 (C5, P5)

As shown in Table 3, the canonical variable C5 explains 8.3% of the variability in the original ARG expression variables. This variable highly correlated (absolute value > 0.15) with IDH1 (0.60), TLR8 (0.33), ZNRD1 (0.32), TRIM5a (0.26), IRF1 (0.23), PROX1 (0.23), Slurp1 (0.21), HLA-C (0.21), GML (0.21), CUL5 (0.19), CXCR1 (0.17), TSG101 (0.17), APOBEC3B (0.15), TLR4 (−0.15), DC-SIGN (−0.17), SDF1 (−0.26), HLA-B (−0.41), MYH9 (−0.49), PPIA (−0.54), and NCOR2 (−1.10). The greatest positive correlations were observed between C5 and IDH1. However, the greatest negative correlations were observed between C5 and NCOR2. Among the ARGs that highly correlated with C5, PPIA, TSG101, APOBEC3B, TRIM5a, and CUL5 are postentry cellular viral cofactors. HLA-C and HLA-B are members of the HLA system. DC-SIGN and SDF1 are related to chemokines. CXCR1 is related to the cytokines pathway. As shown in Table 4, the canonical variable P5 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlated with the variable P5 are enriched in inositol phosphate metabolism; these genes include synaptojanin 2 (8871), phospholipase C beta 2 (5330), and inositol-trisphosphate 3-kinase B (3707).

3.2.6. Pair 6 (C6, P6)

As shown in Table 3, the canonical variable C6 explains 10.8% of the variability in the original ARG expression variables. This variable highly correlated (absolute value > 0.15) with PPIA (1.11), TLR8 (0.70), TSG101 (0.49), CUL5 (0.40), Slurp1 (0.32), TLR4 (0.26), HLA-B (0.25), CXCR6 (0.18), TLR9 (−0.17), IDH1 (−0.20), HLA-A (−0.22), IRF1 (−0.24), NCOR2 (−0.24), TRIM5a (−0.30), HLA-C (−0.33), CXCR1 (−0.40), MYH9 (−0.50), and ZNRD1 (−0.91). The greatest positive correlation was observed between C6 and PPIA. However, the greatest negative correlation was observed between C6 and ZNRD1. Among the ARGs that highly correlated with C6, PPIA, TSG101, TRIM5a, and CUL5 are postentry cellular viral cofactors. HLA-A, HLA-C, and HLA-B are members of the HLA system. CXCR6 is related to chemokine receptors. IRF1 and CXCR1 are related to cytokines. As shown in Table 4, the canonical variable P6 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlated with variable P6 are enriched in pyrimidine metabolism and terpenoid backbone biosynthesis. These genes include polymerase (RNA) II (DNA directed) polypeptide F (5435), cytidine monophosphate (UMP-CMP) kinase 1 (51727), polymerase (RNA) I polypeptide B (84172), farnesyl diphosphate synthase (2224), and acetyl-CoA acetyltransferase 1 (38).

3.2.7. Pair 7 (C7, P7)

As shown in Table 3, the canonical variable C7 explains 9% of the variability in the original ARG expression variables. This variable highly correlated (absolute value > 0.15) with IDH1 (1.12), PROX1 (0.55), CCL11 (0.25), DC-SIGN (0.22), TRIM5a (0.22), KIR (0.17), IL4 (−0.15), TLR8 (−0.15), HLA-C (−0.22), TLR9 (−0.24), PECI (−0.35), NCOR2 (−0.38), APOBEC3G (−0.44), TSG101 (−0.54), and CUL5 (−0.80). The greatest positive correlation was observed between C7 and IDH1. However, the greatest negative correlation was observed between C7 and CUL5. Among the ARGs that highly correlated with C7, TSG101, APOBEC3G, TRIM5a, and CUL5 are postentry cellular viral cofactors. KIR and HLA-C are in the HLA system. DC-SIGN and CCL11 are related to chemokine receptors. IL4 is related to cytokines. As shown in Table 4, the canonical variable P7 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlated with variable P7 are enriched in pyrimidine metabolism and methane metabolism. These genes include uridine-cytidine kinase 1-like 1 (54963), polymerase (RNA) II (DNA directed) polypeptide F (5435), polymerase (RNA) II (DNA directed) polypeptide A (5430), alcohol dehydrogenase 5 (class III) (128), and methylenetetrahydrofolate reductase (4524).

3.2.8. Pair 8 (C8, P8)

As shown in Table 3, the canonical variable C8 explains 12% of the variability in the original ARG expression variables. This variable highly correlated (absolute value > 0.15) with PPIA (1.12), IDH1 (0.63), TRIM5a (0.53), NCOR2 (0.52), APOBEC3G (0.39), TLR9 (0.36), DC-SIGN (0.33), IL4 (0.30), PECI (0.25), SDF1 (0.22), TLR8 (0.17), MYH9 (−0.19), HLA-C (−0.21), IRF1 (−0.24), HLA-B (−0.31), PROX1 (−0.66), and TSG101 (−1.03). The greatest positive correlation was observed between C8 and PPIA. However, the greatest negative correlation was observed between C8 and TSG101. Among the ARGs that highly correlated with C8, TSG101, APOBEC3G, TRIM5a, and PPIA are postentry cellular viral cofactors. KIR and HLA-C are in the HLA system. DC-SIGN and SDF1 are related to chemokine receptors. IL4 and IRF1 are related to cytokines. HLA-C and HLA-B are in the HLA system. As shown in Table 4, the canonical variable P8 accounts for the variability in the original metabolic pathway gene expression data. The metabolic pathway genes that highly correlated with variable P8 are not enriched in a metabolic pathway.

4. Discussion

Researchers have used numerous approaches to identify host genes related to AIDS [5-13]. Most studies use genomic information but not integration of the genome and transcriptome. However, most SNPs at ARGs impact AIDS through changing host gene transcription [7-10]. This study features novel experiments that focus on ARG cooperation at the transcription level and extends the correlation between ARGs and metabolic pathway genes to discover novel host genes related to AIDS. For each variable in the canonical correlation analysis, HIV-1 postentry cellular viral cofactors highly cooperated at the transcription level. PPIA, TSG101, TRIM5a, APOBEC3G, and CUL5 frequently appeared together to correlate with the canonical variables. PPIA functions in cyclosporin A-mediated immunosuppression by encoding a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family [24]. Formation of HIV virions requires an interaction between PPIA and HIV viral proteins. TSG101 negatively regulates cell growth and differentiation by producing a protein that interacts with stathmin [25]. TRIM5a is an E3 ubiquitin-ligase, and its ubiquitination function is involved in retroviral restriction [26]. These genes encode HIV-1 postentry cellular viral cofactors involved in different biological processes. Thus, the high correlation between these genes and canonical variables demonstrates that these genes are coordinated at the transcriptional level. These data suggest that a potential transcriptional regulator for these genes may be a key host factor related to AIDS. The high-frequency ARGs that correlated with canonical variables include PPIA, TSG101, CUL5, NCOR2, IDH1, and MYH9. PPIA, TSG101, and CUL5 are discussed above. NCOR2 with histone deacetylases is a nuclear receptor corepressor [27]. IDH1 encodes isocitrate dehydrogenases involved in cytoplasmic NADPH production and pyruvate metabolism [28]. MYH9 aids in maintaining cell shape, cell motility, and cytokinesis as a conventional nonmuscle myosin [29]. These ARGs are not enriched in a certain biological process. However, many host genetic factors have not been studied. The low-frequency ARGs that correlated with canonical variables include DEFB1 with C4, KIR with C7, HLA-A with C5, CCL11 with C7, LY6D with C4, APOBEC3B with C5, and CXCR6 with C6. DEFB1 is a defensin and is implicated in cystic fibrosis pathogenesis [30]. HLA-A is a major histocompatibility complex class I heavy chain paralogue; these paralogues are expressed in nearly all cells [31]. CCL11 is chemokine (C-C motif) ligand 11 and is implicated in immunoregulatory and inflammatory processes [32]. CXCR6 is chemokine (C-X-C motif) receptor [33]. LY6D is a member of the lymphocyte antigen 6 complex [34]. APOBEC3B is a member of the cytidine deaminase gene family. Recent studies have revealed that these ARGs may be RNA-editing enzymes that control the cell cycle [35]. Further, these genes only correlated with one canonical variable, which suggests that the specificity of the correlation may determine the canonical variable correlated with a certain metabolic pathway. The most significant metabolic pathway in our analysis is purine metabolism, which featured correlations with two canonical variables and the lowest P values. Recent studies analyzed purine codon patterns in variable and constant regions of HIV-1 and showed that HIV-1 RNA exhibits extreme enrichment in the purine A compared with most organisms [36]. These data suggest that a potential therapeutic agent against HIV-1 may involve novel purine derivatives [37]. Studies have elucidated twenty-four purine derivatives that act as HIV-1 Tat TAR interaction inhibitors [38]. More recently, research revealed that host cells with a modified purine biosynthesis pathway exhibit increased activity by tenofovir against sensitive and drug resistant HIV-1 [39]. In this study, we show a high correlation between ARG and purine metabolism gene expression. These data imply that purine metabolism genes are significant candidates for studying the host genomic or transcriptome influence on AIDS.

5. Conclusions

In this study, we used a CCA to analyze the correlations between ARG and metabolic pathway gene expression. The results show that HIV-1 postentry cellular viral cofactors are highly coexpressed, which suggests that regulating this group of host genes may be a key factor in studies to understand the AIDS-host interaction mechanism. Furthermore, we show that purine metabolism pathway genes coordinate with ARGs; this novel discovery supports future studies on AIDS therapy using purine derivatives. Both coexpressed ARGs and metabolic pathway genes also provide a new marker for AIDS diagnosis.
  39 in total

1.  Design and SAR of new substituted purines bearing aryl groups at N9 position as HIV-1 Tat-TAR interaction inhibitors.

Authors:  Ruifang Pang; Chunlei Zhang; Dekai Yuan; Ming Yang
Journal:  Bioorg Med Chem       Date:  2008-07-20       Impact factor: 3.641

2.  HLA and HIV-1: heterozygote advantage and B*35-Cw*04 disadvantage.

Authors:  M Carrington; G W Nelson; M P Martin; T Kissner; D Vlahov; J J Goedert; R Kaslow; S Buchbinder; K Hoots; S J O'Brien
Journal:  Science       Date:  1999-03-12       Impact factor: 47.728

3.  Epistatic interaction between KIR3DS1 and HLA-B delays the progression to AIDS.

Authors:  Maureen P Martin; Xiaojiang Gao; Jeong-Hee Lee; George W Nelson; Roger Detels; James J Goedert; Susan Buchbinder; Keith Hoots; David Vlahov; John Trowsdale; Michael Wilson; Stephen J O'Brien; Mary Carrington
Journal:  Nat Genet       Date:  2002-07-22       Impact factor: 38.330

Review 4.  Lipid metabolism and cardiovascular risk in HIV infection: new perspectives and the role of nevirapine.

Authors:  Daniel Podzamczer
Journal:  AIDS Rev       Date:  2013 Oct-Dec       Impact factor: 2.500

5.  DEFB1 5'UTR polymorphisms modulate the risk of HIV-1 infection in Mexican women.

Authors:  J A Estrada-Aguirre; I Osuna-Ramírez; E Prado Montes de Oca; L A Ochoa-Ramirez; M Ramirez; L G Magallon-Zazueta; M S Gonzalez-Beltran; S G Cazarez-Salazar; H Rangel-Villalobos; J S Velarde-Felix
Journal:  Curr HIV Res       Date:  2014       Impact factor: 1.581

6.  Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene. Hemophilia Growth and Development Study, Multicenter AIDS Cohort Study, Multicenter Hemophilia Cohort Study, San Francisco City Cohort, ALIVE Study.

Authors:  M Dean; M Carrington; C Winkler; G A Huttley; M W Smith; R Allikmets; J J Goedert; S P Buchbinder; E Vittinghoff; E Gomperts; S Donfield; D Vlahov; R Kaslow; A Saah; C Rinaldo; R Detels; S J O'Brien
Journal:  Science       Date:  1996-09-27       Impact factor: 47.728

Review 7.  The influence of HLA genotype on AIDS.

Authors:  Mary Carrington; Stephen J O'Brien
Journal:  Annu Rev Med       Date:  2001-12-03       Impact factor: 13.739

8.  Lipid metabolism in patients infected with Nef-deficient HIV-1 strain.

Authors:  Hann Low; Lesley Cheng; Maria-Silvana Di Yacovo; Melissa J Churchill; Peter Meikle; Michael Bukrinsky; Andrew F Hill; Dmitri Sviridov
Journal:  Atherosclerosis       Date:  2015-10-30       Impact factor: 5.162

9.  Fates of retroviral core components during unrestricted and TRIM5-restricted infection.

Authors:  Sebla B Kutluay; David Perez-Caballero; Paul D Bieniasz
Journal:  PLoS Pathog       Date:  2013-03-07       Impact factor: 6.823

Review 10.  Disturbed Amino Acid Metabolism in HIV: Association with Neuropsychiatric Symptoms.

Authors:  Johanna M Gostner; Kathrin Becker; Katharina Kurz; Dietmar Fuchs
Journal:  Front Psychiatry       Date:  2015-07-14       Impact factor: 4.157

View more
  1 in total

1.  Metabolic Pathway Genes Associated with Susceptibility Genes to Coronary Artery Disease.

Authors:  Heng Lu; Yi Chen; Linlin Li
Journal:  Int J Genomics       Date:  2018-02-11       Impact factor: 2.326

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.