| Literature DB >> 30867481 |
Priyanka Mishra1, Jessica Ngo1, Jahanshah Ashkani1, Frederic Pio2.
Abstract
The genetic model organism, Caenorhabditis elegans (C. elegans), shares many genes with humans and is the best-annotated of the eukaryotic genome. Therefore, the identification of new genes and pathways is unlikely. Nevertheless, host-pathogen interaction studies from viruses, recently discovered in the environment, has created new opportunity to discover these pathways. For example, the exogenous RNAi response in C. elegans by the Orsay virus as seen in plants and other eukaryotes is not systemic and transgenerational, suggesting different RNAi pathways between these organisms. Using a bioinformatics meta-analysis approach, we show that the top 17 genes differentially-expressed during C. elegans infection by Orsay virus are functionally uncharacterized genes. Furthermore, functional annotation using similarity search and comparative modeling, was able to predict folds correctly, but could not assign easily function to the majority. However, we could identify gene expression studies that showed a similar pattern of gene expression related to toxicity, stress and immune response. Those results were strengthened using protein-protein interaction network analysis. This study shows that novel molecular pathway components, of viral innate immune response, can be identified and provides models that can be further used as a framework for experimental studies. Whether these features are reminiscent of an ancient mechanism evolutionarily conserved, or part of a novel pathway, remain to be established. These results reaffirm the tremendous value of this approach to broaden our understanding of viral immunity in C. elegans.Entities:
Mesh:
Year: 2019 PMID: 30867481 PMCID: PMC6416287 DOI: 10.1038/s41598-019-40762-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Enrichment factors of the GO categories (#Number of genes that were annotated in each category) determined by GSEA for the 250 genes differentially expressed after Orsay virus infection.
| Panther GO | # | Expected | Fold Enrichment | ± |
|---|---|---|---|---|
| DNA repair | 7 | 1.09 | 6.40 | +2.58E-02 |
| Stress | 17 | 3.52 | 4.83 | +2.48E-05 |
| Catabolism | 13 | 2.74 | 4.74 | +9.44E-04 |
| Cell cycle | 24 | 5.89 | 4.07 | +1.45E-06 |
| Catalytic activity | 15 | 3.86 | 3.88 | +1.90E-03 |
| Regulation | 15 | 3.88 | 3.86 | +2.02E-03 |
| Translation | 13 | 3.45 | 3.77 | +1.00E-02 |
| Nitrogen compound metabolism | 20 | 5.61 | 3.56 | +2.23E-04 |
| Phosphorylation | 14 | 4.41 | 3.18 | +3.13E-02 |
The p-values were adjusted by Bonferroni correction for multiple testing.
The 17 top ranking genes differentially-expressed based on the lowest p-value that have no known function.
| Gene symbol | |
|---|---|
| B0507.8 | 9.82E-11 |
| F26F2.4 | 1.26E-09 |
| F26F2.5 | 3.57E-09 |
| B0507.10 | 4.69E-09 |
| CELE_T26F2.3 | 7.51E-09 |
| CELE_C43D7.4 | 2.01E-08 |
| CELE_C17H1.6 | 4.54E-08 |
| CELE_C17H1.7 | 6.22E-08 |
| CELE_Y75B8A.39 | 1.81E-07 |
| F26F2.2 | 3.79E-07 |
| CELE_C43D7.7 | 4.07E-07 |
| F26F2.3 | 4.08E-07 |
| F26F2.1 | 4.81E-07 |
| C49C8.2 | 1.73E-06 |
| CELE_B0284.4 | 1.90E-06 |
| F42C5.3 | 7.59E-06 |
| sdz-6 | 1.21E-05 |
Structural prediction and annotation using PHYRE2, SWISS-MODEL and IntFOLD3 of the top 17 unknown and highly differentially-expressed genes.
| Gene symbol | PHYRE-2 | SWISS-MODEL | INTFOLD3 | Annotation based on the template function |
|---|---|---|---|---|
| B0507.8 | 0 (26) | 1 (25) | 1 (98) | Transportin 3 – Protein import to nucleus (α -helix) |
| F26F2.4 isoform a | 1 (66) | 1 (32) | 1 (82) | Synaptonemal complex 1 (α-helix) |
| F26F2.4 isoform b | 1 (55) | 5 (90) | Similar to F26F2.4 isoform a | |
| F26F2.5 | 1 (14) | 1 (32) | 4 (96) | Zinc finger domain (α-helix) |
| B0507.10 | 0 (26) | 0 (32) | 1(80) | Spectrin (α-helix bundle) |
|
|
|
|
| |
| CELE_C43D7.4 | 1 (21) | 0 (25) | 1 (90) |
|
| CELE_C17H1.6 | 1 (36) | 0 (31) | 1 (87) | Reticulocyte binding protein 5 signaling (α-helix bundle) |
| CELE_C17H1.7 | 0 (26) | 1 (73) | 1 (55) | Helical repeat domain (α-helix) |
| CELE_Y75B8A.39 | 0 (22) | 0 (23) | 1 (100) | De novo three helix bundle |
| F26F2.2 | 0 (17) | 1 (46) | 5 (87) | Adhesin coiled-coil (α-helix) |
| CELE_C43D7.7 | 0 (17) | 0 (59) | 1 (93) | Virb DNA binding domain (Helix-Turn-Helix) |
| F26F2.3 | 0 (18) | 0 (15) | 1 (95) | alpha/alpha toroid |
| F26F2.1 | 0 (1) | 0 (6) | N.T. (0) |
|
| C49C8.2 | 0 (5) | 0 (35) | 1 (90) | Apolipophorin-iii lipid transport (5 helix bundle) |
| CELE_B0284.4 | 0 (31) | 1 (28) | 1 (90) | Ezra cytoplasmic domain (3 helix bundle repeat) |
| F42C5.3 isoform a | 0 (35) | 0 (17) | 1 (91) | Reticulocyte binding protein 5 signaling (α-helix bundle) |
| F42C5.3 isoform b | 0 (4) | 0 (22) | 1 (86) | Hof1p f-bar domain protein binding (coiled-coil) |
| sdz-6 | 0 (14) | 0 (26) | 1 (82) | Ectodomain of protein eff-1 (α-helix) |
The root mean squared deviation (RMSD) between the best template and model were computed as well as the percent coverage (In brackets) of the C. elegans protein sequence by the model. (N.T.: No Template found).
Figure 1Protein-Protein interaction Network of the linkage between the 17 uncharacterized genes and their associated partners in STRING database (https://string-db.org). The circle represents groups of genes or clusters identified by text-mining and/or co-expression in one single paper.
Figure 2Word cloud analysis of all abstracts in GEO database referring to the 17 uncharacterized genes with a differential-expression filter on. Words were increased in size the more times they were mentioned in text. Stress, metal toxicity, development, and E2F transcription factor are seen to be enriched through this text mining approach. It should be noted that words that were important, but extensively repeated due to the subject of their papers, were removed. These words include: heme, HRG, LIN, cell, cadmium, pocket and transcription.
Figure 3Pipeline of the meta-analysis approach.