| Literature DB >> 23706066 |
Mileidy W Gonzalez1, John L Spouge.
Abstract
BACKGROUND: In the coevolution of viruses and their hosts, viruses often capture host genes, gaining advantageous functions (e.g. immune system control). Identifying functional similarities shared by viruses and their hosts can help decipher mechanisms of pathogenesis and accelerate virus-targeted drug and vaccine development. Cellular homologs in viruses are usually documented using pairwise-sequence comparison methods. Yet, pairwise-sequence searches have limited sensitivity resulting in poor identification of divergent homologies.Entities:
Mesh:
Year: 2013 PMID: 23706066 PMCID: PMC3672079 DOI: 10.1186/1756-0500-6-209
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1DASH’s computational pipeline. DASH compares a pathogen genome against the HMM profiles in Pfam using HMMER. As illustrated in the left-hand example, DASH compares all pathogen proteins against all PfamA profiles, annotating the domain function on the sequences and assigning E-value scores for the comparisons. All similarities with E-values under the user-defined threshold are reported as functional annotations in the pathogen genome. Likewise, DASH functionally annotates the host genome in parallel. With fully annotated genomes, DASH can distinguish the protein functions exclusive to the pathogen, functions exclusive to the host, and functions shared by both organisms.
Figure 2DASH’s web interface. A. DASH’s web interface currently allows the analysis of fifty-two host-virus systems, accepting user-defined scoring parameters or providing statistical guidance for choosing appropriate scoring thresholds. B. DASH results distinguish pathogen-exclusive from shared functions. The web interface permits rows corresponding to shared functions within the virus to be expanded to display the host sequences with the same function.
Functional similarities between HHV-8 and human confirmed by DASH
| ORF9 | 139472809 | 2.12e-141 | 0.99 | 14250670 | 4.95e-134 | 0.97 | DNA polymerase | PF00136, PF03104 | M | Russo |
| ORF61 | 139472880 | 6.11e-124 | 1.00 | 194389080 | 7.11e-211 | 1.00 | Ribonucleotide reductase, large chain | PF02867, PF00317 | M | Russo |
| ORF70 | 139472812 | 3.29e-109 | 1.00 | 4507751 | 2.69e-113 | 1.00 | Thymidylate synthase | PF00303 | M | Russo |
| ORF60 | 139472853 | 2.23e-87 | 0.97 | 4557845 | 1.79e-116 | 1.00 | Ribonucleotide reductase, small chain | PF00268 | M | Russo |
| ORF75 | 139472887 | 1.76369e-42 | 0.99 | 38197270 | 9.58835e-88 | 1.00 | CobB/CobQ-like glutamine amidotransferase, AIR synthase | PF13507, PF02769, PF12818 | M | Russo |
| ORF2 | 139472863 | 5.06e-22 | 0.98 | 4503323 | 6.19e-25 | 0.99 | Dihydrofolate reductase | PF00186 | M | Russo |
| ORF72 | 139472885 | 3.41e-21 | 0.98 | 4502613 | 3.71e-39 | 0.99 | Cyclin | PF00134, PF09080 | I | Russo |
| K9 | 139472878 | 2.00e-16 | 0.74 | 119604110 | 2.26e-59 | 0.99 | Interferon regulatory factor | PF10401, PF00605 | I | Russo |
| ORF46 | 139472838 | 6.58e-15 | 0.97 | 194387970 | 8.97e-20 | 0.97 | Uracil DNA glycosylase | PF03167 | M | Russo |
| ORF74 | 139472805 | 4.12e-14 | 1.00 | 197692661 | 5.57e-94 | 1.00 | 7 transmembrane receptor | PF00001 | M | Russo |
| K4 | 139472814 | 1.18e-13 | 0.95 | 2905626 | 3.40e-15 | 0.95 | Small cytokine, interleukin-8 like | PF00048 | I | Russo |
| K6 | 139472866 | 1.88e-13 | 0.94 | 2905626 | 3.40e-15 | 0.95 | Small cytokine, interleukin-8 like | PF00048 | I | Russo |
| ORF56 | 139472847 | 1.07e-12 | 0.97 | 119625077 | 1.42e-07 | 0.99 | Herpesviridae UL52/UL70 DNA primase | PF03121 | M | Russo |
| ORF71/K13 | 139472802 | 3.17e-12 | 0.93 | 194386924 | 1.55e-18 | 0.99 | CASP8 and FADD-like apoptosis regulator | PF01335 | I | Russo |
| K2 | 139472811 | 8.35e-11 | 0.96 | 23835 | 1.70e-57 | 1.00 | Interleukin-6/G-CSF/MGF family | PF00489 | I | Russo |
| ORF54 | 139472845 | 9.88e-08 | 0.78 | 158256462 | 7.73e-40 | 0.99 | dUTPase | PF00692 | M | Russo |
| K5 | 139472815 | 6.46686e-07 | 1.00 | 221043596 | 1.14442e-06 | 1.00 | Membrane-Associated RING-CH (MARCH) | PF12906 | I | Bartee |
| K3 | 139472864 | 2.58674e-06 | 1.00 | 221043596 | 1.14442e-06 | 1.00 | Membrane-Associated RING-CH (MARCH) | PF12906 | I | Bartee |
| K4.1 | 139472813 | 3.06e-06 | 0.91 | 2905626 | 3.40e-15 | 0.95 | Small cytokine, interleukin-8 like | PF00048 | I | Neipel |
| ORF16 | 139472817 | 3.64e-05 | 0.95 | 10279702 | 3.71e-24 | 1.00 | Apoptosis regulator, Bcl-2 family | PF00452 | I | Russo |
| vIRF-3/K10.5/K10.6 | 139472850 | 1.13e-03 | 0.68 | 119604110 | 2.26e-59 | 0.99 | Interferon regulatory factor | PF10401 | I | Russo |
| KCP/ORF4 | 139472860 | 1.29e-02 | 0.91 | 80478231 | 1.42e-04 | 1.00 | Complement control protein | PF00084 | I | Russo |
| vIRF-2/K11/K11.1 | 139472851 | 6.35e-01 | 0.68 | 119604110 | 2.26e-59 | 0.99 | Interferon regulatory factor | PF10401 | I | Russo |
| vIRF-4/K10/K10.1 | 139472849 | 4.94e+00 | 0.76 | 119604110 | 2.26e-59 | 0.99 | Interferon regulatory factor | PF10401 | I | Russo |
Abbreviation: (Dom Cov.) Domain coverage—fraction of Pfam HMM model coverage by viral/human sequence alignment. Function types: (I) Immunomodulatory, and (M) metabolic.
Exclusively-viral functions identified by DASH in the HHV-8 genome
| ORF25 | 139472823 | 0.0 | 1.00 | Herpes virus major capsid protein | PF03122 | S/M | Russo |
| ORF44 | 139472836 | 0.0 | 1.00 | Helicase | PF02689 | S/M | Russo |
| ORF6 | 139472807 | 0.0 | 0.98 | ssDNA binding protein | PF00747 | S/M | Russo |
| ORF8 | 139472808 | 5.6e-272 | 0.99 | Glycoprotein B | PF00606 | S/M | Russo |
| ORF63 | 139472855 | 1.4e-271 | 1.00 | Herpes virus tegument protein U30 | PF04523 | S/M | Russo |
| K15 | 139472806 | 1.2e-258 | 1.00 | Herpesvirus Latent membrane protein 2 | PF06126 | S/M | Choi |
| ORF24 | 139472822 | 2.6e-208 | 1.00 | Herpesvirus UL87 family | PF03043 | C | DASH |
| ORF43 | 139472835 | 1.1e-200 | 1.00 | Herpesvirus UL6 like | PF01763 | C | DASH |
| ORF22 | 139472821 | 9.1e-197 | 1.00 | Herpesvirus glycoprotein H | PF02489 | S/M | Russo |
| ORF50 | 139472840 | 1.9e-193 | 1.00 | Transcription activation factor (transactivator) | PF03326 | S/M | Russo |
| ORF7 | 139472861 | 7.5e-193 | 1.00 | Herpesvirus processing and transport protein | PF01366 | S/M | Russo |
| ORF19 | 139472819 | 7.2e-182 | 1.00 | Herpesvirus UL25 family | PF01499 | C | DASH |
| K8 | 139472841 | 1.3e-170 | 1.00 | Transcriptional activator | PF07188 | S/M | Wu |
| ORF37 | 139472831 | 3.0e-164 | 0.99 | Alkaline exonuclease | PF01771 | S/M | Russo |
| ORF29 | 139472824 | 1.1e-154 | 1.00 | Probable DNA packing protein | PF02499, PF02500 | S/M | Russo |
| ORF59 | 139472852 | 2.2e-139 | 1.00 | Herpes DNA replication accessory factor | PF04929 | S/M | Russo |
| ORF39 | 139472832 | 6.3e-136 | 0.96 | Herpesvirus glycoprotein M | PF01528 | S/M | Russo |
| ORF68 | 139472883 | 7.6e-133 | 0.99 | Herpesvirus putative major envelope glycoprotein | PF01673 | S/M | Russo |
| ORF58 | 139472879 | 5.2e-128 | 1.00 | Herpesvirus BMRF2 protein | PF04633 | S/M | Russo |
| ORF49 | 139472877 | 3.3e-108 | 1.00 | BRRF1-like protein | PF04793 | S/M | Russo |
| ORF34 | 139472829 | 1.6e-106 | 0.99 | UL95 family | PF03038 | C | DASH |
| ORF26 | 139472871 | 3.1e-103 | 1.00 | Herpesvirus VP23 like capsid protein | PF01802 | S/M | Russo |
| ORF18 | 139472818 | 8.6e-99 | 1.00 | UL79 family | PF03049 | C | DASH |
| ORF23 | 139472870 | 1.1e-98 | 1.00 | Herpesvirus BTRF1 protein conserved region | PF04682 | C | DASH |
| ORF10 | 139472862 | 6.3e-94 | 0.99 | Herpesvirus dUTPase protein | PF04797 | S/M | McGeehan |
| ORF11 | 139472810 | 5.2e-93 | 0.99 | Herpesvirus dUTPase protein | PF04797 | S/M | McGeehan |
| ORF69 | 139472884 | 3.5e-91 | 0.97 | Herpesvirus UL31-like protein | PF02718 | C | DASH |
| ORF55 | 139472846 | 3.9e-90 | 0.96 | Herpes virus U44 protein | PF04533 | C | DASH |
| ORF32 | 139472827 | 4.2e-84 | 1.00 | Herpesvirus UL17 protein | PF04559 | C | DASH |
| ORF66 | 139472857 | 3.9e-82 | 0.99 | UL49 family | PF03117 | C | DASH |
| ORF62 | 139472854 | 4.7e-81 | 1.00 | Herpesvirus capsid shell protein VP19C | PF03327 | S/M | Russo |
| ORF48 | 139472839 | 4.4e-78 | 1.00 | Herpesvirus protein of unknown function | PF05734 | C | DASH |
| ORF17 | 139472867 | 2.2e-77 | 0.65 | Assemblin (Peptidase family S21) | PF00716 | S/M | Unal |
| ORF31 | 139472825 | 8.0e-71 | 0.98 | UL92 family | PF03048 | C | DASH |
| K1 | 139472859 | 1.5e-69 | 1.00 | Glycoprotein | PF02960, PF11049 | S/M | Lee |
| ORF67 | 139472856 | 1.7e-69 | 0.97 | Herpesvirus virion protein U34 | PF04541 | C | DASH |
| ORF33 | 139472828 | 2.4e-69 | 0.99 | Herpesvirus UL16/UL94 family | PF03044 | C | DASH |
| ORF64 | 139472881 | 1.3e-61 | 1.00 | Herpesvirus tegument protein | PF04843 | S/M | Russo |
| ORF20 | 139472820 | 2.1e-61 | 0.99 | Herpes virus protein UL24 | PF01646 | C | DASH |
| ORF65 | 139472858 | 2.2e-59 | 1.00 | Gammaherpesvirus capsid protein | PF06112 | S/M | Russo |
| ORF42 | 139472834 | 5.3e-56 | 1.00 | Herpesvirus UL7 like | PF01677 | C | DASH |
| ORF57 | 139472848 | 1.4e-54 | 0.99 | Herpesvirus transcriptional regulator | PF05459 | S/M | Kirshner |
| ORF35 | 139472830 | 1.6e-47 | 0.99 | Gammaherpesvirus protein of unknown function | PF05852 | C | DASH |
| ORF40 | 139472833 | 4.2e-46 | 1.00 | DNA helicase/primase | PF05774, PF03324 | S/M | Russo |
| ORF47 | 139472876 | 6.3e-36 | 0.95 | Viral glycoprotein L | PF11108 | S/M | Russo |
| ORF52 | 139472843 | 5.1e-35 | 1.00 | Herpesvirus BLRF2 protein | PF05812 | S/M | Russo |
| ORF53 | 139472844 | 1.7e-28 | 0.99 | UL73 viral envelope glycoprotein | PF03554 | C | DASH |
Abbreviation: (Dom Cov.) Domain coverage—fraction of Pfam HMM model coverage by viral/human sequence alignment. Function types: (S/M) Structural/Metabolic, (C) Conserved unknown.
A genome-wide comparative survey of human viruses of disparate sizes and transcription strategies
| Human herpesvirus 8 | DNA | Non-retro-transcribing | 86 | 25 | 29.1 |
| Human parvovirus B19 | DNA | Non-retro-transcribing | 3 | 0 | 0.0 |
| Human herpesvirus 4 | DNA | Non-retro-transcribing | 94 | 12 | 12.8 |
| Human herpesvirus 5 | DNA | Non-retro-transcribing | 165 | 14 | 8.5 |
| Hepatitis B virus | DNA | Retro-transcribing | 7 | 1 | 14.3 |
| Human T-lymphotropic virus 1 | RNA | Retro-transcribing | 11 | 6 | 54.5 |
| Human immunodeficiency virus 1 | RNA | Retro-transcribing | 27 | 11 | 40.7 |
| Human T-lymphotropic virus 2 | RNA | Retro-transcribing | 10 | 3 | 30.0 |
| Zaire ebolavirus | RNA | Non-retro-transcribing | 9 | 0 | 0.0 |
| Rotavirus A | RNA | Non-retro-transcribing | 12 | 0 | 0.0 |
| Influenza A virus (A/Puerto Rico/8/34(H1N1)) | RNA | Non-retro-transcribing | 13 | 0 | 0.0 |
Figure 3Functional domains shared by humans and forty-one RNA non-retro-transcribing viruses. A species tree of the order Mononegavirales from the NCBI taxonomy. Colored dots on the tree show the mononegaviruses with homologous domains in human. The lower left-hand side of the diagram maps colors to functional domains in the form of [color]|[PfamA accession]|[functional domain description]|[number of mononegaviruses showing given function].
Prevalence of homology to human genes in the non-retro-transcribing RNA viral order mononegavirales
| Hendra virus | 8 | 6 | 75.0 |
| Human parainfluenza virus 2 | 7 | 5 | 71.4 |
| Simian virus 41 | 7 | 5 | 71.4 |
| Goose paramyxovirus SF02 | 6 | 4 | 66.7 |
| Newcastle disease virus B1 | 6 | 4 | 66.7 |
| Mossman virus | 8 | 5 | 62.5 |
| Nipah virus | 8 | 5 | 62.5 |
| Tupaia paramyxovirus | 8 | 5 | 62.5 |
| Avian paramyxovirus 6 | 7 | 4 | 57.1 |
| Canine distemper virus | 7 | 4 | 57.1 |
| Dolphin morbillivirus | 7 | 4 | 57.1 |
| Menangle virus | 7 | 4 | 57.1 |
| Rinderpest virus (strain Kabete O) | 7 | 4 | 57.1 |
| Beilong virus | 11 | 6 | 54.5 |
| Bovine parainfluenza virus 3 | 6 | 3 | 50.0 |
| Measles virus | 8 | 4 | 50.0 |
| Mumps virus | 8 | 4 | 50.0 |
| Parainfluenza virus 5 | 8 | 4 | 50.0 |
| Peste-des-petits-ruminants virus | 8 | 4 | 50.0 |
| Tioman virus | 8 | 4 | 50.0 |
| Mapuera virus | 9 | 4 | 44.4 |
| Porcine rubulavirus | 9 | 4 | 44.4 |
| Fer-de-Lance paramyxovirus | 8 | 3 | 37.5 |
| Human parainfluenza virus 3 | 8 | 3 | 37.5 |
| J-virus | 11 | 4 | 36.4 |
| Human parainfluenza virus 1 | 10 | 3 | 30.0 |
| Sendai virus | 10 | 3 | 30.0 |
| Avian metapneumovirus | 9 | 2 | 22.2 |
| Human metapneumovirus | 9 | 2 | 22.2 |
| Australian bat lyssavirus | 5 | 1 | 20.0 |
| European bat lyssavirus 1 | 5 | 1 | 20.0 |
| European bat lyssavirus 2 | 5 | 1 | 20.0 |
| Mokola virus | 5 | 1 | 20.0 |
| Rabies virus | 5 | 1 | 20.0 |
| Borna disease virus | 6 | 1 | 16.7 |
| Sudan ebolavirus | 8 | 1 | 12.5 |
| Bovine respiratory syncytial virus | 11 | 1 | 9.1 |
| Human respiratory syncytial virus | 11 | 1 | 9.1 |
| Pneumonia virus of mice J3666 | 12 | 1 | 8.3 |
| Bundibugyo ebolavirus | 9 | 0 | 0.0 |
| Reston ebolavirus | 8 | 0 | 0.0 |
a Refer to http://tinyurl.com/spouge-dash for full genome annotations for each virus including lists of viral-exclusive and shared genes.
Summary of homologous domains between humans and forty-one mononegaviruses
| PF06407 | Borna disease virus P40 | Target for MHC class I-restricted cytotoxic T-cell response | 2.4e-266 - 2.4e-266 | 1.00 | 4.6e-7 - 2.5e-77 | 0.52 | 6 | Endogenous, Bornavirus-like nucleoprotein 1, |
| PF01728 | FtsJ-like methyltransferase | Methyltransferase | 7.8e-4 - 1.8e-6 | 0.52 | 6.6e-6 - 2.4e-66 | 0.65 | 43 | Many c |
| PF14314 | Virus-capping methyltransferase | Virus-capping methyltransferase | 4.1e-84 - 2.9e-106 | 0.98 | 5.1e-04 - 5.1e-04 | 0.10 | 1 | Hypothetical protein |
| PF00642 | Zinc finger | Mediates binding specificity | 4.5e-05 - 4.5e-05 | 0.89 | 1.0e-3 - 6.5e-13 | 0.93 | 294 | Many c |
| PF12803 a | mRNA methyltransferase | Cap methylation of mRNA. | 9.0e-93 - 4.6e-117 | 1.00 | 1.4e-4 - 2.5e-4 | 0.37 | 3 | Putative ADP-ribosylation factor-like protein 5C, hCG31412, isoform CRA_a, hCG31412, isoform CRA_b |
| PF00429 a | ENV polyprotein (coat polyprotein) | Coat polyprotein | 1.0e-03 - 1.0e-03 | 0.17 | 9e-4 - 1.6e-68 | 0.27 | 51 | Many c |
| PF00523b | Fusion glycoprotein (F0) | Viral attachement | 7.4e-96 – 8.8e-189 | 1.00 | 1.4e-64 - 1.4e-64 | 0.44 | 1 | Angrgm-52 |
| PF00661 b | Viral matrix protein (M) | Viral assembly | 4.1e-84 – 2.3e-146 | 0.99 | 2.9e-106 - 2.9e-106 | 0.37 | 1 | Angrem52 |
| PF14313 b | N-terminal Paramyxovirinae phosphoprotein (P) | Phosphoprotein | 4e-4 - 2e-26 | 0.95 | 1.4e-21 - 1.4e-21 | 0.98 | 1 | AngRem104 |
| PF13825 b | N-terminal Paramyxovirus structural protein (V/P) | Structural protein | 7.6e-4 - 5.4e-153 | 0.75 | 2.2e-4 - 2.2e-4 | 0.42 | 1 | AngRem104 |
a Similarities with weaker evidence (i.e. high E-values, low domain coverage, numerous human sequence hits).
b Similarities likely resulting from mis-annotation of human sequences.
c Refer to http://tinyurl.com/spouge-dash for detailed lists of the similarity hits for individual viruses.