| Literature DB >> 28545254 |
Chantal Hulo1, Patrick Masson2, Ariane Toussaint3, David Osumi-Sutherland4, Edouard de Castro5, Andrea H Auchincloss6, Sylvain Poux7, Lydie Bougueleret8, Ioannis Xenarios9, Philippe Le Mercier10.
Abstract
Bacterial viruses, also called bacteriophages, display a great genetic diversity and utilize unique processes for infecting and reproducing within a host cell. All these processes were investigated and indexed in the ViralZone knowledge base. To facilitate standardizing data, a simple ontology of viral life-cycle terms was developed to provide a common vocabulary for annotating data sets. New terminology was developed to address unique viral replication cycle processes, and existing terminology was modified and adapted. Classically, the viral life-cycle is described by schematic pictures. Using this ontology, it can be represented by a combination of successive events: entry, latency, transcription/replication, host-virus interactions and virus release. Each of these parts is broken down into discrete steps. For example enterobacteria phage lambda entry is broken down in: viral attachment to host adhesion receptor, viral attachment to host entry receptor, viral genome ejection and viral genome circularization. To demonstrate the utility of a standard ontology for virus biology, this work was completed by annotating virus data in the ViralZone, UniProtKB and Gene Ontology databases.Entities:
Keywords: life-cycle; ontology; phage
Mesh:
Year: 2017 PMID: 28545254 PMCID: PMC5490803 DOI: 10.3390/v9060126
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Bacterial virus vocabulary. The table lists the 68 terms of the bacterial virus vocabulary as cited in the text. New terms created during this work in the three databases are indicated by a grey background. The accession numbers are indicated for GO terms GO:XXXXXXX, UniProtKB Keywords KW-XXX, and ViralZone pages VZ-XXX. The other columns indicate the number of annotations assigned to this vocabulary/ontology. The UniProtKB column displays the number of annotations made using the corresponding KW in UniProtKB bacterial virus entries (as of release 2017_04). An asterisk after a UniProtKB KW indicates a term that is also used for eukaryotic virus annotation. GO annotation lists the total number of annotation using the corresponding GO term. Terms in italics are children of the terms above them in the table.
| UniProt Keywords | GO Terms | UniProt KW | ViralZone Pages | UniProt Entries | |
|---|---|---|---|---|---|
| Tailed Bacterial virus | VZ-4076 | NA | |||
| Capsid protein | GO:0046728 | KW-0167 | * | 166 | |
| | GO:0098021 | KW-1232 | 24 | ||
| Viral tail protein | GO:0098015 | KW-1227 | 138 | ||
| | GO:0098027 | KW-1229 | 11 | ||
| | GO:0098026 | KW-1228 | 21 | ||
| | GO:0098025 | KW-1226 | 33 | ||
| Viral tail fiber protein | GO:0098024 | KW-1230 | 42 | ||
| Capsid inner membrane protein | GO:0039641 | KW-1231 | 15 | ||
| Viral attachment to host cell | GO:0019062 | KW-1161 | * | VZ-956 | 72 |
| | GO:0098671 | KW-1233 | * | VZ-3943 | 29 |
| | GO:0098670 | KW-1234 | * | VZ-3942 | 16 |
| | GO:0039666 | KW-1175 | VZ-981 | 15 | |
| | GO:0098931 | KW-1240 | VZ-3949 | 0 | |
| Degradation of host cell envelope components during virus entry | GO:0098994 | KW-1235 | VZ-3938 | 29 | |
| | GO:0098932 | KW-1236 | VZ-3940 | 19 | |
| | GO:0098995 | KW-1237 | VZ-3939 | 3 | |
| | GO:0098996 | KW-1238 | VZ-3896 | 4 | |
| Viral penetration into host cytoplasm | GO:0046718 | KW-1162 | * | VZ-4016 | 161 |
| | GO:0098997 | KW-1239 | VZ-3941 | 1 | |
| | GO:0044694 | KW-1172 | * | VZ-979 | 7 |
| | GO:0039678 | KW-1171 | VZ-986 | 130 | |
| | GO:0099000 | KW-1242 | VZ-3950 | 30 | |
| | GO:0099001 | KW-1243 | VZ-3952 | 41 | |
| | GO:0099002 | KW-1244 | VZ-3954 | 32 | |
| | GO:0039667 | KW-1241 | VZ-3953 | 17 | |
| | GO:0099008 | KW-1173 | * | VZ-985 | 0 |
| Viral genome circularization | GO:0099009 | KW-1253 | VZ-3968 | 8 | |
| Viral genome integration | GO:0044826 | KW-1179 | * | VZ-980 | 14 |
| Viral receptor tropism switching | GO:0098678 | KW-1264 | VZ-4498 | 10 | |
| Latency-replication decision | GO:0098689 | KW-1252 | VZ-3964 | 4 | |
| Viral reactivation from latency | GO:0019046 | KW-1272 | 35 | ||
| Host defense evasion | GO:0044413 | 0 | |||
| | GO:0099018 | KW-1258 | VZ-3966 | 16 | |
| CRISPR-Cas system evasion by virus | GO:0098672 | KW-1257 | VZ-3962 | 3 | |
| | GO:0099016 | KW-1256 | VZ-3963 | 6 | |
| | KW-1259 | VZ-3961 | 3 | ||
| | VZ-4077 | 0 | |||
| Host gene expression shutoff by virus | GO:0039657 | KW-1190 | * | 12 | |
| Bacterial host gene expression shutoff by virus | KW-1261 | VZ-4496 | 12 | ||
| Bacterial host transcription shutoff by virus | KW-1263 | VZ-4497 | 4 | ||
| Degradation of host chromosome by virus | GO:0099015 | KW-1247 | VZ-3947 | 8 | |
| Inhibition of host DNA replication by virus | GO:0098673 | KW-1248 | VZ-3948 | 9 | |
| Modulation of host virulence by virus | GO:0098676 | KW-1254 | * | VZ-3965 | 9 |
| | KW-1255 | VZ-3967 | 9 | ||
| Superinfection exclusion | GO:0098669 | KW-1260 | VZ-3971 | 3 | |
| Viral DNA replication | GO:0039693 | KW-0235 | * | 65 | |
| | * | VZ-1939 | 0 | ||
| | * | VZ-2676 | 0 | ||
| | * | VZ-1940 | 0 | ||
| | * | VZ-4017 | 0 | ||
| Viral RNA replication | GO:0039694 | KW-0693 | * | 8 | |
| Viral genome packaging | GO:0019072 | KW-1231 | * | VZ-3944 | 15 |
| Host cell lysis by virus | GO:0044659 | KW-0578 | * | VZ-1077 | 79 |
| | GO:0039640 | VZ-4296 | 0 | ||
| | GO:0044660 | * | 223 | ||
| | VZ-4056 | 0 | |||
| Viral extrusion | GO:0099045 | KW-1249 | VZ-3951 | 22 | |
| Viral genome excision | GO:0032359 | KW-1250 | VZ-3969 | 20 | |
| Viral capsid assembly | GO:0019069 | KW-0118 | * | VZ-1950 | 85 |
| | GO:0046797 | KW-1273 | 0 | ||
| Viral budding | GO:0046755 | KW-1198 | * | VZ-1947 | 0 |
| Viral tail assembly | GO:0098003 | KW-1245 | VZ-3955 | 90 | |
| | GO:0098004 | KW-1246 | VZ-3956 | 9 | |
Figure 1The ontology of viral release parent-child relationships. This tree consists of terms used to annotate the steps of viral release. ViralZone pages (VZ), UniProtKB keyword (KW) or GO terms accession numbers (GO:) are indicated. The hierarchy is shared by GO and KW except for budding for which the GO hierarchy is indicated with dotted lines. Boxes are colored blue for new UniProtKB KW, pink for old KW and white when the term is not related to a KW. The dotted line represents an inconsistency that will be corrected in future releases between GO and UniProt KW hierarchy: GO “virus budding” is not yet child to the “virus release from host cell” term.
Figure 2Structure of bacterial virus particles. The picture displays the different virion structures classified under three categories: icosahedral naked capsid, filamentous or enveloped particle. A representative viron structure is represented for each of the nine bacteria virus families.
Figure 3Entry pathways of bacterial viruses. This picture represents the principal ViralZone controlled vocabularies for virus entry. The representation of viral entry is chronological. The virus genome which is encapsuled in a virion on the top and left of the figure will follow alternative pathways until initiating transcription/replication processes or latency.
Figure 4Virus crossing of the bacterial envelope. Schematic representation of different routes of envelope crossing used by bacterial viruses. The different envelopes of Mollicutes, Gram-positive and Gram-negative bacteria are indicated with their associated routes of entry.
Figure 5Bacterial host-virus interactions. This picture represents ViralZone controlled vocabularies for bacterial host-virus interactions. A red arrow indicates a process induced by the virus, a red line ended by a “stop” point out a process inhibited by viruses, and the up and down arrow in a yellow diamond-shape signals a process modulated up or down by viruses.
Figure 6Release pathways of bacterial viruses. This picture represents the ViralZone controlled vocabulary describing the bacterial virus release pathway. The representation is chronological: The virus genome begins at the bottom of the picture after the transcription/replication processes and will follow alternative pathways until exiting the host cell at the top of picture.