| Literature DB >> 29308275 |
Avi Shukla1, Anirvan Chatterjee1, Kiran Kondabagil1.
Abstract
Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than the number of proteins it encodes or geometric constraints. With their large genome and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near-linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment including RDCPs that might have helped in host adaption.Entities:
Keywords: genome expansion; genome plasticity; giant virus; repeat domain-containing proteins
Year: 2018 PMID: 29308275 PMCID: PMC5753266 DOI: 10.1093/ve/vex039
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
The basic composition, structure, and functions of different repeat domain proteins in diverse forms of life excluding Megavirales
| Multigene repeat families | Composition | Structural unit | Tertiary structure | Participates in | Commonly found in | References |
|---|---|---|---|---|---|---|
| Ankyrin repeats (ANK) | 33 aa | Two antiparallel α-helices joined by β-hairpin at 90° forming L-shaped structure | Cupped hand shape solvent accessible groove formed by repeating protomers | Cell cycle regulation, cytoskeletal binding, protein trafficking across membrane, acquired resistance. | Prokaryotes and eukaryotes | |
| Leucine rich repeats (LRR) | 20 to 29 aa | A β-sheet and an α-helix arranged in an anti-parallel manner | Multiple repeats are oriented parallel to the axis forming horse-shoe like structure | Protein -protein interaction, signal transduction and formation of protein complexes | Prokaryotes and eukaryotes | |
| FNIP/IP22 repeats | 22 aa | A β-sheet and an α-helix arranged in an anti-parallel manner | Horse shoe like structure (like LRR) | Interaction of calmodulin binding proteins, increases cell motility and chemotaxis | Dictyostelium and NCLDV | |
| Tetratricopeptide repeats (TPR) | 34 aa | Multiple array of α-helix turn α-helix unit packaged in parallel | A right-handed super-helix that provide concave groove for molecule binding | Cell cycle regulation, chaperone functioning, protein translocation, bacterial pathogenesis, and biogenesis of multi-functional pilli | Prokaryotes and eukaryotes including humans | |
| Sel1 repeats | 33 to 44 aa | Multiple array of α-helix turn α-helix unit packaged in parallel | A right-handed super-helix | ER-associated protein ubiquitination, regulation of mitosis and septum formation, host-pathogen interaction | Bacteria and eukaryotes | |
| WD 40 repeats | 40 aa | Four anti-parallel β-sheet arranged radially with flanking dipeptide | Propeller like structure | Gene regulation, chromatin modelling, transmembrane signalling, mRNA modification, vesicle fusion and adhesion complex of malarial parasites | Eukaryotes | |
| Kelch repeats | 44 to 56 aa | Four anti-parallel β-sheet arranged radially with flanking dipeptide | Propeller like structure | Actin binding, manipulates cell organization and morphology | Prokaryotes, eukaryotes and viruses | |
| MORN repeats | 23 aa | Not known | Not known | Parasites' budding, protein translocation, flagellum biogenesis, form junctional complex between plasma membrane to endoplasmic reticulum, promotes phagocytosis of bacterium | Prokaryotes and eukaryotes |
Figure 1.A near–linear relationship between the genome size and the number of genes encoding RDCPs in AVs. Core genes and RDCPs were manually curated from 13 published genomes. Core function definitions were chosen as per the previous reports (Raoult et al. 2004; Yutin, et al. 2009; Yutin, Raoult, and Koonin 2013; Yutin et al. 2013). These included genes encoding DNA replication, recombination and repair, transcription and RNA processing, translation, and post-translation modifications, nucleotide metabolism, virion packaging, and morphogenesis. Genes encoding these functions in the 13 representative NCLDV families were retrieved as per annotations in the public databases. Genes for which annotations was not updated, but yielded significant alignment matches in Interpro, CDD, Pfam and Smart servers, were also included. (A) Scatterplot of the number of repeat protein families plotted against genome size. A high correlation between number of repeat protein families and genomes size (r2 = 0.87) was observed. (B) Scatterplot of the number of genes encoding core viral functions plotted against genome size, which shows a poor association between the two (r2 = 0.11). In (A) and (B), the shaded area indicates the standard error as per a linear regression model. The size of the data label (solid dot) representing genomes is proportional to the genome size. Number alongside the data label corresponds to the number inside the ideograms shown in Figure 1C. (C) Circos-generated ideograms of giant viral genomes. Outer concentric represents the clusters of repeat domain proteins/multigene families that include proteins containing ANK repeats = red, FNIP repeats = green, MORN repeats = blue, sel1 repeats = yellow, TPR = purple, WD40 repeats = black, LRR and kelch repeat = gray, and the inner concentric denotes the core genome. APMV, Acanthamoeba polyphaga mimivirus; APMoV, Acanthamoeba polyphaga momouvirus; PBCV 1, Paramecium bursaria chlorella virus 1; ASFV, African swine fever virus.
Figure 2.A speculative hypothesis on the RDCP driven lineage-specific genome expansion in AVs. (A) Genome size distribution and B family DNA polymerase phylogeny. ML Tree of B family DNA polymerase amino acid sequence was constructed using FastTree with default settings using a representative sequence from 13 NCLDV families. A large red circle on the internal node of the AV lineage indicates a more recent ancestor from which we believe genome expansion has ensued, especially in the amoebal milieu. Smaller red circles indicate a much recent ancestor from which independent genome expansion strategy might have led to larger genomes in Faustoviruses and Pithovirus. Black and purple circles indicate ancestors of unknown genome size and nature. More genome sequences are needed to resolve the genome size distribution pattern and its evolutionary link to the nature of the ancestor in large DNA viruses. (B) Circos ideogram of Mimivirus genome. Three concentrics, labeled as 1, 2, and 3 represent RDCPs, core and hypothetical genes, and mobile elements, respectively. The bipartite AV genome consists of a conserved core region derived from a common ancestor, and the RDCPs that are clustered in the genomic termini of the AVs. In addition to aiding in genome expansion, the RDCPs may also help in survival in the competitive environment (see Fig. 3 for details). In an allopatric condition, most of these RDCPs are lost causing a reduced genome size (Boyer et al., 2011; Colson and Raoult 2012).
Figure 3.Putative roles of various RDCPs in the AV infection cycle. Giant capsid mimics the size of bacteria for promoting phagocytosis in a sympatric environment prohibiting the host encystment. Once inside, it suppresses the host immune system by interfering with host defense mechanisms by interacting with various host proteins via repeat domain-containing protein (that also mimic some of the host proteins) or/and deviating them to ubiquitination. The distinct phases of the intra-amoebal life cycle of a virus involve: (1) Particle size plays an important role in the mode entry on viruses. As seen in other viruses (Cui et al. 2014), the large particle size may be driven by genome expansion, caused by accumulation of RDCPs. (2) Once phagocytosed, the encystment of the trophozoite is arrested and the fusion of the phagosome to the lysosome is inhibited by ankyrin, TPR, WD40, and Sel1 repeat domains proteins, as has been reported in intra-amoebal parasitic bacteria (Shchelkunov, Blinov, and Sandakhchiev 1993; Newton et al. 2007; Cerveny et al. 2013; Nguyen, Liu, and Thomas 2014). Some of the RDCPs have been reported to be packaged in the virion indicating their role in the initiation of the viral replication cycle (Renesto et al. 2006). (3) The viral genome is released into the cytoplasm from the phagosome and the formation of a replication center is initiated by the recruitment of various cytoplasmic membranes, mitochondria, and cytoskeletal components. This formation requires a number of complex interactions and signaling pathways, that are probably mediated by FNIP, ANK repeats, Sel1, WD40, or/and MORN repeats domain proteins. (4) During infection, RDCPs such as LRR, FNIP, IP22, WD40, ANK repeats, and F-box proteins might interfere with host defense mechanisms. They have been shown to modify/regulate the host gene expression and subvert the host proteins to ubiquitination or mimics some of the inhibitory molecules to suppress the immune pathways (Sharma and Pandey 2015). (5) During the infection cycle, host cell morphology changes to avoid superinfection. This morphological change is brought about by MORN, Kelch, FNIP and ANK repeat domain proteins (Table 1). In addition, MORN repeat containing protein might also promote the degradation of other internalized microorganisms. (6) Unlike AVs, bacteria are unable to interfere with the formation of the phagolysosome, and are consequently digested by the hydrolytic enzymes in the lysosome (Cosson and Soldati 2008; Akya, Pointon and Thomas 2009). Although phagocytosis of AVs and bacteria is primarily driven by particle size, they have distinct fates. The RDCPs emerge as crucial drivers of both, the particle size and a successful viral life cycle.