| Literature DB >> 24261899 |
Suying Bao, Xueya Zhou, Liangcai Zhang, Jie Zhou, Kelvin Kai-Wang To, Binbin Wang, Liqiu Wang, Xuegong Zhang, You-Qiang Song1.
Abstract
BACKGROUND: The genetic make-up of humans and other mammals (such as mice) affects their resistance to influenza virus infection. Considering the complexity and moral issues associated with experiments on human subjects, we have only acquired partial knowledge regarding the underlying molecular mechanisms. Although influenza resistance in inbred mice has been mapped to several quantitative trait loci (QTLs), which have greatly narrowed down the search for host resistance genes, only few underlying genes have been identified.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24261899 PMCID: PMC4046670 DOI: 10.1186/1471-2164-14-816
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overview of the network approaches based on seed genes and differential expression. The gene network is constructed from STRING database and represented by an undirected graph consisting of nodes (genes) and weighed edges (links between gene pairs with associated scores). (a) For the seed-based strategy, the score vector for all seeds and other genes within the genome is initialized with the entries corresponding to the seed genes assigned with equal scores whose sum is equal to 1. The vector is iteratively updated by a random walk process over the network until it reaches convergence. Candidate genes are ranked by their scores in the converged vector, which can be interpreted as the steady-state probabilities of staying at the nodes representing the candidate genes. A high probability for the candidate corresponds to a higher similarity to the seeds. As a computationally efficient alternative, a heat kernel diffusion matrix can be used to approximate the distances between all pairs of genes. The candidate genes are then scored according to their average distances to the seeds based on the kernel matrix. (b) The DE-based method does not rely on the definition of seeds but uses a trait-related microarray expression profile to obtain the DE levels of the genes. DE levels were then mapped onto the network. For each candidate gene, the score is calculated as a weighted average of the DE levels of the gene and its network neighbors with the weights derived from the network distances between genes. In this study, the candidate genes within each QTL were scored using two different strategies, and the top 10% ranked by each method was chosen as prioritized genes (winners).
The collection of 14 known host resistance genes
| Entrez ID | Gene symbol | Gene description | Mouse ortholog | Cytoband | Supporting evidence |
|---|---|---|---|---|---|
| 4599 |
| myxovirus (influenza virus) resistance 1 |
| 21q22.3 | Mouse strains homozygous for |
| 9437 |
| natural cytotoxicity triggering receptor 1 |
| 19q13.42 |
|
| 1234 |
| chemokine (C-C motif) receptor 5 |
| 3p21.31 | Deaths among |
| 114548 |
| NLR family, pyrin domain containing 3 |
| 1q44 | Mice lacking |
| 3105 |
| major histocompatibility complex, class I, A |
| 6p21.3 | The magnitude and specificity of influenza A virus-specific cytotoxic T-lymphocyte responses in humans is associated with the HLA-A and -B phenotypes
[ |
| 3106 |
| major histocompatibility complex, class I, B | |||
| 2212 |
| Fc fragment of IgG, low affinity IIa, receptor (CD32) |
| 1q23 | rs1801274 on |
| 84268 |
| RPA interacting protein |
| 17p13.2 | rs8070740 on |
| 3456 |
| interferon, beta 1, fibroblast |
| 9p21 |
|
| 3586 |
| interleukin 10 |
| 1q31-q32 | A promoter polymorphism conferred a significantly decreased risk of adverse response to inactivated influenza vaccine
[ |
| 708 |
| complement component 1, q subcomponent binding protein |
| 17p13.3 | rs3786054 on |
| 3811 |
| killer cell immunoglobulin-like receptor, three domains, long cytoplasmic tail, 1 |
| 19q13.4 | KIR3DL1/S1 and 2DL1 ligand-negative pairs were enriched among H1N1 ICU cases
[ |
| 3803 |
| killer cell immunoglobulin-like receptor, two domains, long cytoplasmic tail, 2 |
| 19q13.4 | KIR2DL2/L3 ligand-positive pairs were enriched among H1N1 ICU cases
[ |
| 10410 |
| interferon induced transmembrane protein 3 |
| 11p15.5 | Mice lacking |
Figure 2Performance evaluation of seed-based network strategy. The ROC curves of the seed-based methods in LOOCV test on known host resistance genes. Four different methods (DIR, SAR, RWR, and sHKDR) as described in the main text were compared. The prioritization performance can be measured as AUC presented next to each method.
Figure 3Empirical survey and performance evaluation of DE-based network strategy. (a) The known influenza host resistance genes are surrounded by differentially expressed genes between resistant and susceptible mouse strains. To visualize the gene expression levels within a network context, a sub-network consisting of only the seed genes and their directly linked neighbors in the STRING database was extracted and visualized using Cytoscape [34] under the edge-weighed spring embedded layout. The distances between seeds and their neighbors were set proportional to their interaction scores. Differential expression levels between resistance and susceptible mouse strains are mapped to the size and color shade of each node. The significant differentially expressed genes were highlighted by unifying the colors of genes with DE levels that ranked at the top 5% (DE level ≥ 0.32) among the whole genome in red and the genes with DE levels that ranked at the bottom 5% (DE level ≤ -0.15) in blue (as illustrated in the inset). All seed genes are highlighted using the same node size and bold fonts of their names. (b) The ROC curves of DE-based methods in the validation test on known host resistance genes. Three methods (DER, DNR, and deHKDR) as described in the main text were compared. The performance measured as AUC is shown next to the name of each method.
QTL studies for candidate genes collection
| Study* | QTL regions† | Influenza virus | Mouse strains |
|---|---|---|---|
| Toth et al., 1999
[ | chr6:48676555-75397704 | H3N2 | CXB |
| Boon et al., 2009
[ | chr2:33–52 Mb; | H5N1 | BXD |
| chr7:107–121 Mb; | |||
| chr11:101–107 Mb; | |||
| chr15:51–57 Mb; | |||
| chr17:68–84 Mb | |||
| Nedelko et al., 2012
[ | chr2:56–68 Mb; | H1N1 | BXD |
| chr5:140–153 Mb; | |||
| chr16:64–78 Mb; | |||
| chr17:30–44 Mb; | |||
| chr19:37–45 Mb | |||
| Boivin et al., 2012
[ | chr2:24–38 Mb; | H3N2 | AcB |
| chr17:37–48 Mb | |||
| Ferris et al., 2013
[ | chr1:21767867–29085401; | H1N1 | preCC |
| chr16:97500418–98213493; | |||
| chr7:89130587–96764352; | |||
| chr15:77427235-86625488 |
*The QTL regions were collected from genome-wide scans of phenotypes related to the outcome of influenza virus infection in inbred mouse.
†The genomic positions are based on the coordinates of NCBI build 37.
Figure 4An overview of the prioritized genes from mouse QTLs. (a) A total of 258 genes (winners) were ranked at the top 10% in each QTL region by the seed- (RWR, sHKDR) or DE-based method (deHKDR). The numbers of winners identified by one, two, or all three methods are shown in a Venn diagram. The winners identified by at least one of the seed-based methods and by the DE-based method were termed 2-strategy winners. The remaining winners (identified by the seed-based methods only or by DE-based method only) were termed single-strategy winners. (b) 2-strategy winners are better supported by the genetic or functional evidence compared with single-strategy winners. Each set of winners(2-strategy winners, DE-only winners, seed-only winners) was annotated by genetic evidence and functional evidence. The proportion of winners supported by one class of evidence or both was plotted as a stacked cylinder. One-tailed hypergeometric test was used to determine the enrichment significance of the supported winners (either supported by genetic or functional evidences) in a winner set, given all prioritized winners as background. P values were annotated above the corresponding cylinders.
Prioritized genes supported by multiple types of studies
| Gene symbol | Gene description | Prioritization method | Supporting source* | Functional annotation and/or literature support | ||||
|---|---|---|---|---|---|---|---|---|
| Seed-based | DE-based | Genet-Assoc | QTL | RNAi | Expr | |||
|
| interferon-induced protein 35 | + | + | + | + |
| ||
|
| eukaryotic translation initiation factor 2-alpha kinase 2 | + | + | + | + | + | + | The encoded protein is a serine/threonine protein kinase that is activated after binding to dsRNA during the course of a viral infection. Mice lacking this gene displayed increased susceptibility to influenza virus infection
[ |
|
| tumor necrosis factor (TNF superfamily, member 2) | + | + | + | + | The encoded protein is a multifunctional proinflammatory cytokine, involved in the regulation of a wide spectrum of biological processes including apoptosis. It harbored polymorphisms associated with the severity of the clinical behavior after infection by the pandemic influenza A/H1N1
[ | ||
|
| tripartite motif-containing 26 | + | + | + | The encoded protein is a member of the tripartite motif (TRIM) family. | |||
|
| interferon induced with helicase C domain 1 | + | + | + | + | Innate immune receptor acting as a cytoplasmic sensor of viral nucleic acids and plays a major role in the activation of a cascade of antiviral responses including the induction of type I interferons and proinflammatory cytokines. The | ||
|
| transporter 2, ATP-binding cassette, sub-family B (MDR/TAP) | + | + | + | Involved in antigen processing and presentation. | |||
|
| folate hydrolase (prostate-specific membrane antigen) 1 | + | + | + | ||||
|
| major histocompatibility complex, class I, E | + | + | + | HLA class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. | |||
|
| leukocyte specific transcript 1 | + | + | + | The protein encoded by this gene is a membrane protein that can inhibit the proliferation of lymphocytes. In humans, | |||
|
| + | + | + | |||||
|
| phospholipase A2, group VII (platelet-activating factor acetylhydrolase, plasma) | + | + | + | The encoded protein a secreted enzyme that catalyzes the degradation of platelet-activating factor to biologically inactive products. It harbored genetic polymorphisms associated with imflammatory diseases like atopy and asthma in humans
[ | |||
|
| TAP binding protein (tapasin) | + | + | + | + | Involved in the association of MHC class I with TAP and in the assembly of MHC class I with peptide. | ||
|
| proteasome (prosome, macropain) subunit, beta type, 9 (large multifunctional peptidase 2, LMP2) | + | + | + | + | + | The proteasome is a multicatalytic proteinase complex. The encoded subunit is involved in antigen processing to generate class I binding peptides. The | |
|
| interleukin 1 receptor antagonist | + | + | + | + | The encoded protein inhibits the activities of interleukin 1 and modulates a variety of interleukin 1 related immune and inflammatory responses. It harbors genetic polymorphisms significantly related to humoral immune response to inactivated seasonal influenza vaccine
[ | ||
|
| complement component 5 | + | + | + | The encoded protein is the fifth component of complement, which plays an important role in inflammatory and cell killing processes. The | |||
|
| death-domain associated protein | + | + | + | The encoded protein may function to regulate apoptosis. Influenza virus can escape the repressional function of Daxx during infection by binding matrix protein 1 with Daxx
[ | |||
|
| major histocompatibility complex, class II, DQ beta 1; similar to major histocompatibility complex, class II, DQ beta 1 | + | + | + | HLA-DR7/4,DQB1*0302genotype was significantly associated (OR = 5.15; 95%CI = 1.94, 13.67; p = 0.001) with clinical hyporesponsiveness after trivalent inactivated influenza vaccine
[ | |||
|
| myxovirus (influenza virus) resistance 1, interferon-inducible protein p78 (mouse) | + | + | + | + | + | Mice susceptible to influenza infection harbor large exonic deletions or nonsense mutations in the | |
|
| major histocompatibility complex, class I, A | + | + | + | The magnitude and specificity of influenza A virus-specific cytotoxic T-lymphocyte responses in humans is related to HLA-A and -B phenotype
[ | |||
|
| major histocompatibility complex, class I, B | + | + | + | + | + | + | |
*The following sources of supporting evidence were collected for each prioritized gene. Genet-Assoc: literature supporting for the gene’s genetic association with host resistance to influenza infection. QTL: candidate genes identified in the original QTL study with independent evidence (harboring founder variants that were associated with the phenotype; co-localization with a cis-eQTL; etc.). RNAi: host genes important for influenza life circle identified through high-throughput RNAi screens. Expr: host genes robustly up- or down- regulated after influenza virus infection identified from multiple microarray experiments. Detailed supporting evidence for each gene was listed in Additional file 2: Table S4. For more details of QTL, RNAi and expression studies, see Additional file 2: Table S5.
Figure 5Pathways enriched by the prioritized genes. Pathways (KEGG, BioCarta, Reactome) significantly enriched (p < 0.01 and FDR < 0.25) by the winners of each method (RWR, sHKDR, deHKDR) or by 2-strategy winners are shown as a heatmap. The color intensity of each cell represents the fold enrichment of the corresponding winner group for each pathway. Only the significantly enriched pathways for each winner group are shown.
Figure 6Prioritized genes in apoptosis and TNFR2 signaling pathways. The graphical representation of the pathways is generated by the ingenuity pathway analysis (IPA) tool. The prioritized genes were highlighted by red dotted circles. The apoptosis and TNFR2 signaling pathways were extracted from the “canonical pathway” mappings. Genes are color coded by their differential expression levels between resistance and susceptible mouse strains. In particular, the genes with higher expression in susceptible mice than in resistance mice were colored red; whereas those having lower expression in susceptible than resistance mice were shown in green. The symbols used to represent molecules and relationships were illustrated in the legend.