| Literature DB >> 32390970 |
Rodrigo Araújo Lima Rodrigues1,2, Amina Cherif Louazani2, Agnello Picorelli3, Graziele Pereira Oliveira1,2, Francisco Pereira Lobo3, Philippe Colson2,4, Bernard La Scola2,4, Jônatas Santos Abrahão1.
Abstract
Marseilleviruses comprise a family of large double-stranded DNA viruses belonging to the proposed order "Megavirales." These viruses have a circular genome of ∼370 kbp, coding hundreds of genes. Over a half of their genes are associated with AT-rich putative promoter motifs, which have been demonstrated to be important for gene regulation. However, the transcriptional profile of Marseilleviruses is currently unknown. Here we used RNA sequencing technology to get a general transcriptional profile of Marseilleviruses. Eight million 75-bp-long nucleotide sequences were robustly mapped to all 457 genes initially predicted for Marseillevirus isolate T19, the prototype strain of the family, and we were able to assemble 359 viral contigs using a genome-guided approach with stringent parameters. These reads were differentially mapped to the genes according to the replicative cycle time point from which they were obtained. Cluster analysis indicated the existence of three main temporal categories of gene expression, early, intermediate and late, which were validated by quantitative reverse transcription polymerase chain reaction assays targeting several genes. Genes belonging to different functional groups exhibited distinct expression levels throughout the infection cycle. We observed that the previously predicted promoter motif, AAATATTT, as well as new predicted motifs, were not specifically related to any of the temporal or functional classes of genes, suggesting that other components are involved in temporally regulating virus transcription. Moreover, the host transcription machinery is heavily altered, and many genes are down regulated, including those related to translation process. This study provides an overview of the transcriptional landscape of Marseilleviruses.Entities:
Keywords: RNA-seq; gene expression; giant virus; marseillevirus; promoter motifs; transcriptome
Year: 2020 PMID: 32390970 PMCID: PMC7192143 DOI: 10.3389/fmicb.2020.00651
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
The top 20 most expressed annotated genes.
| Gene | Presence of promoter | Expression pattern | Predicted function | 0 h | 1 h | 2 h | 4 h | 5 h | 6 h | 8 h | 10 h | 12 h | Total |
| MAR_ORF300 | N | L | – | 1349 | 31 | 69 | 36587 | 49038 | 50252 | 55730 | 67099 | 58015 | 318169 |
| MAR_ORF390 | Y | L | multiple zinc ribbon protein | 1776 | 9869 | 27205 | 51234 | 49629 | 43289 | 39411 | 45095 | 50282 | 317790 |
| MAR_ORF317 | N | L | – | 1354 | 170 | 40 | 36306 | 39576 | 39271 | 54230 | 35973 | 68786 | 275705 |
| MAR_ORF342 | Y | L | major capsid protein | 609 | 48 | 26 | 61325 | 45875 | 40048 | 66725 | 23909 | 30871 | 269436 |
| MAR_ORF384 | N | L | – | 899 | 134 | 23 | 38560 | 35602 | 42222 | 50020 | 54510 | 34821 | 256790 |
| MAR_ORF370 | N | L | – | 1317 | 317 | 948 | 17200 | 28825 | 27160 | 39832 | 89992 | 36077 | 241669 |
| MAR_ORF219 | Y | L | – | 793 | 106 | 12744 | 84839 | 47325 | 22149 | 20213 | 17578 | 19583 | 225329 |
| MAR_ORF029 | N | I | – | 1843 | 24117 | 60761 | 27563 | 19259 | 17693 | 24533 | 14184 | 22469 | 212422 |
| MAR_ORF305 | Y | L | – | 379 | 32 | 28 | 35828 | 39222 | 43383 | 28499 | 15090 | 25099 | 187560 |
| MAR_ORF413 | Y | L | histone H3 | 520 | 44 | 66 | 16092 | 35578 | 41292 | 29851 | 22200 | 25556 | 171199 |
| MAR_ORF109 | Y | L | – | 926 | 260 | 24676 | 29570 | 17876 | 9340 | 5808 | 5875 | 10734 | 105065 |
| MAR_ORF421 | N | L | – | 210 | 62 | 1986 | 17949 | 17338 | 21450 | 21452 | 9317 | 15270 | 105033 |
| MAR_ORF412 | N | E | – | 54414 | 40209 | 5914 | 197 | 101 | 196 | 130 | 202 | 192 | 101555 |
| MAR_ORF026 | Y | L | – | 402 | 179 | 173 | 6830 | 15641 | 19574 | 15741 | 14843 | 27536 | 100920 |
| MAR_ORF193 | Y | L | – | 385 | 74 | 2351 | 13831 | 16722 | 14791 | 16327 | 15747 | 17850 | 98078 |
| MAR_ORF021 | Y | L | zinc finger protein | 720 | 77 | 16 | 8775 | 16500 | 19814 | 19344 | 15478 | 17237 | 97962 |
| MAR_ORF389 | Y | E | – | 27733 | 43921 | 21364 | 514 | 409 | 534 | 396 | 744 | 438 | 96053 |
| MAR_ORF250 | N | L | – | 202 | 227 | 94 | 15714 | 20837 | 21501 | 16581 | 9799 | 10734 | 95689 |
| MAR_ORF014 | N | E | – | 66374 | 22286 | 2559 | 98 | 84 | 17 | 59 | 53 | 62 | 91592 |
| MAR_ORF147 | N | E | – | 44432 | 36392 | 2414 | 141 | 94 | 117 | 85 | 77 | 58 | 83810 |
FIGURE 1Marseillevirus gene expression classes. (A) Heat map of Marseillevirus gene expression profiles. Rows correspond to all 457 putative genes and columns to the 9 infection time points (0 h corresponds to 30 min of infection due to virus adsorption period). Expression profiles are clustered using hierarchical clustering. A dendrogram of the clustering is shown on the left. (B) Heat map of the same expression profiles partitioned into three main classes, “early” (top), “intermediate” (center), and “late” (bottom), with the k-means clustering algorithm. Expression levels are displayed from green (low expression) to red (high expression). Gray lines indicate absence of data for the gene at a particular infection time point. Both clustering methods were performed by applying Euclidean distance as a similarity metric. Among the 457 MRSV-encoded genes, 83 (18%) are expressed early, 218 (48%) are intermediate genes, and 156 (34%) are late genes. The function of each gene related to their temporal expression pattern can be found in Supplementary Data Sheet S1.
FIGURE 2Genome map of Marseillevirus. Genome map highlighting the different categories of genes. Rings starting from outer to innermost correspond to (i) genome coordinates in kilobases; (ii) predicted protein-coding genes oriented forward or reverse on DNA strands, with different colors corresponding to distinct functional gene categories; (iii) genes followed by the predicted core AAATATTT promoter motif; (iv) distribution of genes from different temporal classes, early, intermediate, and late. A color legend is provided to the right of the figure.
FIGURE 3Functional category analysis and correlation to the temporal expression profile of viral genes. (A) Pie-chart representing the number of Marseillevirus genes distributed into different functional categories, according to the NCVOG clusters; (B) Distribution of genes from different functional categories into temporal gene expression classes. Only genes with known functions are included in the graph.
FIGURE 4Promoter motifs association to temporal and functional classes of genes. (A) Distribution of four different predicted motifs according to genes belonging to different temporal categories; (B) Association of promoter motifs to genes belonging to different functional categories. Only genes followed by intergenic regions containing the motifs are represented. Color legend for each figure is provided in the image.
FIGURE 5Distribution of RNA-seq reads according to their origin across the infection cycle. The relative frequency of reads aligning to Marseillevirus genome and A. castellanii strain Neff nuclear and mitochondrial genomes during the infection cycle.
FIGURE 6Functional distribution of amoebal genes with altered expression during the infection by Marseillevirus. Downregulated and upregulated genes correspond to genes with expression reduced or increased by at least 1 log2-fold-change and False Discovery Rate of 0.05 in the pairwise comparison of grouped samples from early, intermediate and late infection time points.