| Literature DB >> 35269907 |
Francisco J Mancebo1, Marcos Parras-Moltó2,3,4, Estéfani García-Ríos1,5, Pilar Pérez-Romero1.
Abstract
CMV is a major cause of morbidity and mortality in immunocompromised individuals that will benefit from the availability of a vaccine. Despite the efforts made during the last decade, no CMV vaccine is available. An ideal CMV vaccine should elicit a broad immune response against multiple viral antigens including proteins involved in virus-cell interaction and entry. However, the therapeutic use of neutralizing antibodies targeting glycoproteins involved in viral entry achieved only partial protection against infection. In this scenario, a better understanding of the CMV proteome potentially involved in viral entry may provide novel candidates to include in new potential vaccine design. In this study, we aimed to explore the CMV genome to identify proteins with putative transmembrane domains to identify new potential viral envelope proteins. We have performed in silico analysis using the genome sequences of nine different CMV strains to predict the transmembrane domains of the encoded proteins. We have identified 77 proteins with transmembrane domains, 39 of which were present in all the strains and were highly conserved. Among the core proteins, 17 of them such as UL10, UL139 or US33A have no ascribed function and may be good candidates for further mechanistic studies.Entities:
Keywords: cytomegalovirus; pangenome; proteome; transmembrane
Mesh:
Substances:
Year: 2022 PMID: 35269907 PMCID: PMC8911422 DOI: 10.3390/ijms23052768
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Characteristics of CMV strains used in this study and their corresponding accession number at nucleotide database.
| CMV Strains | Isolation Source | Number of Culture Passages | Accession Number |
|---|---|---|---|
| AD169 | Adenoids of a 7-year-old girl | Many times in human fibroblasts | FJ527563.1 |
| Towne | Urine of a 2-month-old infant with microcephaly and hepatosplenomegaly | Many times in human fibroblasts | FJ616285.1 |
| Toledo | Urine from a congenitally infected infant | Several times in human fibroblasts | GU937742.2 |
| TR | Vitreous humor from eye of HIV-positive male | Several times in human fibroblasts | KF021605.1 |
| VR7863 | Urine samples of a congenitally infected neonate and cultured in endothelial and epithelial cells | Cultured in endothelial and epithelial cells | KX544838.1 |
| TB40-E_UNC | Throat swab of a bone marrow transplant patient | Cultured adapted | KX544839.1 |
| HANSCTR4 | Blood from stem cell transplant recipient (D-R+) | Sequenced directly from clinical material via target enrichment | KY123653.1 |
| AD169-BAC20 | - | - | MN920393.1 |
| Merlin | Urine from a congenitally infected child | 3 times in human fibroblasts | NC006273.2 |
Figure 1Schematic representation of the applied workflow. Fasta format protein sequences from nine CMV genomes were analyzed in parallel to predict transmembrane domains and to create an entire set of genes from all strains (pangenome). Transmembrane topology was studied following three different approaches: PureseqTM, Phobius and TMHMM, under default parameters. Predicted transmembrane proteins were compared with orthologous proteins identified by BLAST with the whole Mantis database for the prediction of functional annotation. Proteins that were common to all nine genome datasets formed the core protein set, and functions were annotated accordingly for each transmembrane protein.
Figure 2Predicted transmembrane proteins for the studied CMV genomes. Proteins with at least one predicted transmembrane domain with one of three tested methods were annotated as transmembrane proteins. The number of transmembrane domains for each protein was represented using the indicated chromatic scale ranging from zero to eight regions for each of the three methods used (PureseqTM, Phobius and TMHMM). For each strain, the presence of the gene was represented with the filled red circles and absent genes with empty red circles.
CMV predicted transmembrane proteins indicating the cellular localization based on biotool Uniprop, the ascribed functions based on a bibliographic search and the number of predicted domains using the three different tools. (*) indicates unknown or non-verified function.
| Gene | Localization | Function | Number of TM Domains | References |
|---|---|---|---|---|
| UL1 * | VM | Unknown. pUL1 could modulate CMV host cell tropism. | 1–2 | [ |
| UL2 * | HM | Unknown. | 1 | - |
| UL5 * | V | Unknown. It is suggested to be involved in efficient viral assembly, propagation and replication. | 1–3 | [ |
| UL6 * | HM | Unknown. | 1–2 | - |
| UL7 | UL7 is involved in immunomodulation. | 2–3 | [ | |
| UL8 * | HM | UL8 decreases the release of a large number of pro-inflammatory factors later after infection of THP-1 myeloid cells. UL8 may exert an immunosuppressive role key for CMV survival in the host. | 1–2 | [ |
| UL9 * | HM | Unknown function. | 1–3 | [ |
| UL10 * | M | Unknown. Potential role in immunomodulation. | 1–2 | [ |
| UL11 | HM, ERM | pUL11 interacts with CD45 phosphatase on T cells, inducing the IL-10 secretion. | 1–2 | [ |
| UL14 * | HM | Unknown. | 0–1 | - |
| UL15A * | HM | Unknown. | 1 | - |
| UL16 | HM | Immunoevasion and inhibition of the activation of NK cells. | 1 | [ |
| UL18 | HM | Immunomodulation and immunoevasion. | 1–2 | [ |
| UL20 * | ERM | Unknown. UL20 could be destined to sequester cellular proteinases not known to date for degradation in lysosomes. | 1–2 | [ |
| UL33 | HM | UL33 has homology with GPCR which activates different ligand-independent signalling pathways and also involved in virus dissemination. | 6–7 | [ |
| UL37 | ERM, GM, MM | Viral replication. | 2–3 | [ |
| UL40 | HM | Immunomodulation. | 0–2 | [ |
| UL41A * | VM | Unknown. UL41A not to code for proteins. | 1 | [ |
| UL42 * | HM, C | Unknown. Potential role in immunoevasion. | 1 | [ |
| UL50 | HNM | Assembly, maturation and egress of virions. | 1 | [ |
| UL55 | VM, HM, GM | Glycoprotein B participates in viral entry. | 1–3 | [ |
| UL73 | VM, HM, GM | Glycoprotein N is involved in the binding of the virus to the host cell, viral spread and virion morphogenesis. | 1 | [ |
| UL74A * | VM | Unknown | 1 | - |
| UL75 | HM, VM | Glycoprotein H participates in viral entry. | 1 | [ |
| UL78 * | HM, ERM | Unknown. UL78 is a G protein-coupled receptor. | 6–7 | [ |
| UL100 | HM, VM | Envelope glycoprotein M participates in viral entry. | 8 | [ |
| UL119 | VM | Immunoevasion. | 1 | [ |
| UL120 * | HM | Unknown. | 1–2 | - |
| UL121 * | HM | Unknown. | 1–2 | - |
| UL124 * | HM | Potential role in latency. | 0–1 | [ |
| UL132 | VM | Essential for CMV assembly compartment formation and the efficient production of infectious particles. | 1–2 | [ |
| UL133 | GM | UL133 forms a complex with UL138 and UL136. It is involved in the establishment of CMV latency. | 2 | [ |
| UL135 | HM, GM | Immunomodulation. Post entry Tropism in Endothelial Cells. | 0–1 | [ |
| UL136 | HM | Replication, latency, and dissemination. Post entry Tropism in Endothelial Cells. | 1 | [ |
| UL138 | GM | Latency and DNA replication. | 1 | [ |
| UL139 * | HM | Unknown. Potential role in immunomodulation. | 1–2 | [ |
| UL140 * | HM | Unknown. | 1 | - |
| UL141 | ERM | Immunomodulation and DNA replication. | 1 | [ |
| UL142 | ERM | Immunomodulation. | 0–1 | [ |
| UL144 | HM | Inhibition of T-cell activation and latency. | 1 | [ |
| UL147 * | EXR | Unknown. Potential role in immunomodulation. | 0–1 | [ |
| UL147A | HM | Immunomodulation. | 0–1 | [ |
| UL148 | ERM | Viral ER-resident glycoprotein that interacts with UL116 promoting the incorporation of gH/gL complexes into virions. | 1 | [ |
| UL148A | HM | Immunoevasion of NK cells. | 1–2 | [ |
| UL148B * | HM | Unknown. | 1 | [ |
| UL148C * | HM | Unknown. | 0–3 | [ |
| UL148D * | HM | Unknown. | 1 | [ |
| US2 | ERM | Immunomodulation. | 1–2 | [ |
| US3 | ERM | Immunoevasion. | 1 | [ |
| US6 | ERM | Immunomodulation. | 1 | [ |
| US7 | ERM | Immunoevasion. | 1 | [ |
| US8 | ERM, GM | Immunomodulation. | 1 | [ |
| US9 | ERM, GM, CK | Glycoprotein US9 is an antagonist of IFN signalling to persistently evade host innate antiviral responses. | 0–1 | [ |
| US10 | ERM | Inhibition of the host immune response. | 1–2 | [ |
| US11 | ERM | Inhibition of the host immune response. | 0–1 | [ |
| US12 | HM | Inmunomodulation of NK cells activation. | 6–7 | [ |
| US13 * | HM | Unknown. | 7 | - |
| US14 | HM | Inmunomodulation of NK cells activation. Potential role in virions maturation and egress. | 5–7 | [ |
| US15 * | HM | Unknown. | 7 | - |
| US16 | HM, C | Tropism in endothelial and epithelial cells. | 6–7 | [ |
| US17 | HM | Immunomodulation. | 7 | [ |
| US18 | HM. | Immunoevasion of NK cell. | 7–8 | [ |
| US19 * | HM | Unknown. Its delection affect NK cell activation. | 6–7 | [ |
| US20 | M | Inhibition NK cell activation. Also participates in the viral replication process in endothelial cells. | 7 | [ |
| US21 | HM | Viroporin that modulates calcium homeostasis and protects cells against apoptosis. | 7–8 | [ |
| US27 | V, HM | Immunomodulation. Also is required for efficient viral spread by the extracellular route. | 7 | [ |
| US28 | HM | Immunomodulation. Lytic and latent CMV infection. Possible role in regulation of the actin cytoskeleton or cytoskeletal remodelling. | 7 | [ |
| US29 * | HM | Unknown. | 0–2 | - |
| US30 * | HM | Unknown. | 1–2 | - |
| US33A * | - | Unknown. | 0–1 | [ |
| US34A * | HM. | Unknown. Potential target of SUMO complex. | 1–2 | [ |
| RL8A * | HM | Unknown. | 1 | - |
| RL9A * | HM | Unknown. | 1 | - |
| RL10 * | VM | Unknown. | 1–2 | - |
| RL11 | HM | Immunomodulation. RL11 is a type I transmembrane glycoproteins which bind immunoglobulin G Fc. I | 1–2 | [ |
| RL12 * | VM | Unknown. RL12 is a Fc binding protein. | 1–2 | [ |
| RL13 * | VM | Unknown. Potential role in replication, immunoevasión and viral spread by cell-free or cell-to-cell mechanisms. | 1 | [ |
* indicates unknown or non-verified function. CK: Cytoskeleton C: Cytoplasm, ERM: Host endoplasmic reticulum membrane, EXR: Extracellular region, GM: Golgi reticulum membrane, HM: host membrane, HMN: Host nucleus membrane, M: Membrane, MM: Mitochondrion membrane, V: Virion, VM: Virion membrane.
Figure 3Functional analysis of the 39 core proteins. (A) Pie chart of the proteins found in all the studied strains were grouped based on their functions. For each group, the number of predicted transmembrane domains is also indicated. When the number of transmembrane domains predicted was different using the three methods, a range of values is shown. The number of proteins in each section is marked in blue and the percentage between brackets. (B) Genomic location of the 17 proteins with non-described function.
Figure 4Percentage identity heatmap among the 39 core proteins from all the studied strains using AD169 as a reference strain. Color scales range from red (0% identity) to white (100% identity). Strains and genes were clustered following a hierarchical clustering method (HCL). Genes were clustered from top to bottom of the figure based on their similarity within the indicated strains. Core proteins with unknown functions are highlighted in red.