| Literature DB >> 32246084 |
Nisrine Chelkha1,2, Issam Hasni1,2,3, Amina Cherif Louazani1,2, Anthony Levasseur1,2, Bernard La Scola4,5, Philippe Colson6,7.
Abstract
Vermamoeba vermiformis is a predominant free-living amoeba in human environments and amongst the most common amoebae that can cause severe infections in humans. It is a niche for numerous amoeba-resisting microorganisms such as bacteria and giant viruses. Differences in the susceptibility to these giant viruses have been observed. V. vermiformis and amoeba-resisting microorganisms share a sympatric lifestyle that can promote exchanges of genetic material. This work analyzed the first draft genome sequence of a V. vermiformis strain (CDC-19) through comparative genomic, transcriptomic and phylogenetic analyses. The genome of V. vermiformis is 59.5 megabase pairs in size, and 22,483 genes were predicted. A high proportion (10% (n = 2,295)) of putative genes encoded proteins showed the highest sequence homology with a bacterial sequence. The expression of these genes was demonstrated for some bacterial homologous genes. In addition, for 30 genes, we detected best BLAST hits with members of the Candidate Phyla Radiation. Moreover, 185 genes (0.8%) best matched with giant viruses, mostly those related to the subfamily Klosneuvirinae (101 genes), in particular Bodo saltans virus (69 genes). Lateral sequence transfers between V. vermiformis and amoeba-resisting microorganisms were strengthened by Sanger sequencing, transcriptomic and phylogenetic analyses. This work provides important insights and genetic data for further studies about this amoeba and its interactions with microorganisms.Entities:
Mesh:
Year: 2020 PMID: 32246084 PMCID: PMC7125106 DOI: 10.1038/s41598-020-62836-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genomic composition and gene repertoire of Vermamoeba vermiformis CDC-19.
| Feature | |
|---|---|
| Genome size (bp) | 59,550,895 |
| GC content (%) | 41.7 |
| DNA scaffolds | 14,852 |
| Maximum scaffold length (bp) | 432,427 |
| Minimum scaffold length (bp) | 500 |
| N50 (bp) | 7,608 |
| Total number of genes | 22,483 |
| Proportion of genes with a size ≥300 bp | 20,299 |
| Non-ORFan genes | 15,263 |
| ORFan genes | 7,220 |
| Genes assigned to COGs | 12,593 |
COG, clusters of orthologous groups of proteins; N50, sequence length of the shortest contig at 50% of the total genome length.
Figure 1Taxonomical distribution of the V. vermiformis CDC-19 predicted proteins.
Figure 2Phylogenetic diversity and number of reads generated from the V. vermiformis CDC-19 DNA that best matched with bacteria.
Examples of highly expressed genes best matching with bacterial genes.
| Gene id* | Best hit | Function | Organism | Reads count |
|---|---|---|---|---|
| g6416 | WP_014678436.1 | Tandem-95 repeat protein | 349 | |
| g2762 | ANE82214.1 | Hypothetical protein A7U43_25800 | 147 | |
| g4799 | EQB29884.1 | Arylsulfatase regulatory protein | 45 | |
| g11285 | WP_054281538.1 | NADPH dehydrogenase NamA | 37 | |
| g4808 | ACF68028.1 | Transposase | 31 |
*In V. vermiformis .
Figure 3Phylogenetic reconstructions for four examples of putative lateral sequence transfers implicating V. vermiformis and bacteria. Lateral sequence transfer was inferred from the comparison of V. vermiformis predicted sequences with their best BLAST hits. (a,b) Trees based on two proteins with sequence similarity with a non-CPR bacterial homolog. (c,d) Trees based on two proteins with sequence similarity with a CPR homolog. In dark yellow: V. vermiformis genes. Colors of branches are related to bootstrap values.
Figure 4Taxonomical origins of predicted genes with a giant virus as best hit.
Figure 5Phylogenetic reconstructions for two examples (a,b) of putative lateral sequence transfers implicating V. vermiformis and giant viruses. Lateral sequence transfer was inferred from the comparison of V. vermiformis predicted sequences with their best BLAST hits. In dark yellow: V. vermiformis genes. Colors of branches are related to bootstrap values.
Nine examples of expressed genes best matching with viral sequences, including Orpheovirus IHUMI-LCC2 and Kaumoebavirus.
| Gene id* | Best hit | Function | Organism |
|---|---|---|---|
| g4093 | YP_009449258.1 | Domain of unknown function (DUF4114) | Orpheovirus IHUMI-LCC2 |
| g4206 | YP_009449258.1 | Domain of unknown function (DUF4114) | Orpheovirus IHUMI-LCC2 |
| g10030 | YP_009449258.1 | Domain of unknown function (DUF4114) | Orpheovirus IHUMI-LCC2 |
| g12378 | YP_009448979.1 | Cytochrome P450-like protein E-class | Orpheovirus IHUMI-LCC2 |
| g13490 | YP_009449258.1 | Domain of unknown function (DUF4114) | Orpheovirus IHUMI-LCC2 |
| g13679 | YP_009449258.1 | Domain of unknown function (DUF4114) | Orpheovirus IHUMI-LCC2 |
| g14004 | YP_009448979.1 | Cytochrome P450-like protein E-class | Orpheovirus IHUMI-LCC2 |
| g15726 | YP_009352567.1 | Peroxinectin | Kaumoebavirus |
| g18288 | YP_009449258.1 | Domain of unknown function (DUF4114) | Orpheovirus IHUMI-LCC2 |
*In V. vermiformis.
Figure 6Rhizomes representation of the proteins best matching with giant viruses. Taxonomical distribution of V. vermiformis CDC-19 predicted proteins for which best BLAST hits were members of family Mimiviridae (a) and other giant viruses (b).