| Literature DB >> 30487777 |
Amina Cherif Louazani1, Emeline Baptiste1, Anthony Levasseur1, Philippe Colson1, Bernard La Scola1.
Abstract
Faustoviruses are the first giant viruses of amoebae isolated on Vermamoeba vermiformis. They are distantly related to African swine fever virus, the causative agent of lethal hemorrhagic fever in domestic pigs. Structural studies have shown the presence of a double protein layer encapsidating the double-stranded DNA genome of Faustovirus E12, the prototype strain. The major capsid protein (MCP) forming the external layer has been shown to be 645-amino acid-long. Unexpectedly, its encoding sequence has been found to be scattered along a 17 kbp-large genomic region. Using RNA-seq, we studied expression of Faustovirus E12 genes at nine time points over its entire replicative cycle. Paired-end 250 bp-long read sequencing on MiSeq instrument and double-round spliced alignment enabled the identification of 26 different splice-junctions. Reads corresponding to junctions represented 2% of mapped reads and mostly matched with the predicted MCP encoding sequences. Moreover, our study enabled describing a 1,939 bp-long transcript that corresponds to the MCP, delineating 13 exons. At least two types of introns coexist in the MCP gene: group I introns that can self-splice (n = 5) and spliceosome-like introns with non-canonical splice sites (n = 7). All splice-sites were non-canonical with five types of donor/acceptor splice-sites among which AA/TG was the most frequent association.Entities:
Keywords: capsid; faustovirus; giant virus; splicing; transcriptome
Year: 2018 PMID: 30487777 PMCID: PMC6247863 DOI: 10.3389/fmicb.2018.02534
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Flowchart illustrating the workflow of this study. This flowchart shows the general pipeline of this RNA-seq study, starting from sample preparation and RNA extraction to cDNA sequencing and data analyses. The biological interpretation of expression count was possible through the functional categories clustering of expressed genes.
FIGURE 2Map of Faustovirus E12 genome coverage during the replication cycle. The predicted protein coding sequences are represented on the external circle in red and blue boxes for the forward and reverse strand, respectively. The single base resolution coverage for each time point is reported in the colored concentric circles from t = 0 to 20 h for the complete replication cycle samples set. Position 0 is at the 12 o’clock position.
FIGURE 3Functional categories distribution of expressed genes during the replication cycle of Faustovirus E12. Faustovirus E12 ORFs were assigned a functional category based on sequence homology with the Nucleo-Cytoplasmic Virus Orthologous Groups (NCVOG) proteins database. The ratio of expression per functional category is reported for each time point.
ASFV virion-forming proteins with homologs detected in the transcriptome of Faustovirus E12.
| ASFV gene | ASFV protein | Function | Faustovirus E12 homolog protein | T8H | T11H | T17H | T20H |
|---|---|---|---|---|---|---|---|
| B646L | Major capsid protein p72 | Morphogenesis | MCP | + | + | + | + |
| CP2475L | Polyprotein pp220 | Morphogenesis | AIB52024 | + | + | + | + |
| CP530R | Polyprotein pp62 | Morphogenesis | AIB52025 | + | + | + | + |
| S273R | Polyprotein processing protease | Morphogenesis | AIB52094 | + | + | + | + |
| O174L | DNA polymerase X | DNA integrity | AIB52077 | + | |||
| E296R | AP endonuclease | DNA integrity | AIB52085 | + | + | + | + |
| E165R | dUTPase | DNA integrity | AIB51748 | + | + | + | + |
| NP419L | DNA ligase | DNA integrity | AIB52048 | + | + | + | + |
| NP1450L | RNA polymerase subunit 1 | Transcription | AIB52040 | + | + | + | + |
| EP1242L | RNA polymerase subunit 2 | Transcription | AIB51752 | + | + | + | + |
| H359L | RNA polymerase subunit 3-11 | Transcription | AIB52132 | + | + | + | + |
| D205R | RNA polymerase subunit 5 | Transcription | AIB52137 | + | + | + | |
| C147L | RNA polymerase subunit 6 | Transcription | AIB51823 | + | |||
| D339L | RNA polymerase subunit 7 | Transcription | AIB52143 | + | + | + | + |
| Q706L | VACV D11-like helicase | Transcription | AIB52129 | + | + | + | + |
| B962L | VACV I8-like RNA helicase | Transcription | AIB51810 | + | + | + | + |
| D1133L | VACV D6-like RNA helicase | Transcription | AIB52142 | + | + | + | + |
| G1340L | VACV A7 early transcription factor large subunit-like | Transcription | AIB52005 | + | + | + | + |
| NP868R | mRNA-capping enzyme | Transcription | AIB52055 | + | + | + | + |
| C475L | Poly(A) polymerase | Transcription | AIB51816 | + | |||
| EP424R | Putative RNA methyltransferase | Transcription | AIB52114 | + | + | + | |
| EP152R | Protein EP152R | AIB51785 | + | ||||
| B169L | Uncharacterized protein | AIB51862 | + | ||||
| H339R | α-NAC binding protein pH339R | AIB52116 | + | + | + | + | |
| M1249L | Uncharacterized protein | AIB51842 | + | + | + | + | |
| C129R | Uncharacterized protein | AIB51831 | + | + | |||
| K421R | Uncharacterized protein | AIB51770 | + | + | + | ||
| H240R | Uncharacterized protein | AIB52125 | + | + | + | ||
| QP383R | Uncharacterized protein | AIB52104 | + | + | + | + | |
| C122R | Uncharacterized protein | AIB51826 | + | ||||
| M448R | Uncharacterized protein | AIB51841 | + | + | + | + |
FIGURE 4Genome-wide map of splicing events in Faustovirus E12 across its replication cycle. Predicted and curated splice junctions resulting from the second-round mapping of RNA-seq reads against the genome with HISAT2 are reported in colored boxes corresponding to the dataset where they were detected. When in close genomic coordinates, junctions appear in two layers for display purpose. The annotated protein coding sequences are represented on the external circle in blue wedges.
FIGURE 5Faustovirus E12 major capsid protein gene structure. The MCP gene contains 13 exons and 12 introns among which five are group I self-spliced introns and two contain a GIY-YIG homing endonuclease inserted ORF. The donor and acceptor splice sites are represented for potentially spliceosome-processed introns (A). The association of donor/acceptor splice site in these introns shows the high abundance of the donor site AA and its frequent association with TG acceptor site (B).