| Literature DB >> 30279542 |
Marco Severgnini1, Barbara Lazzari2,3, Emanuele Capra3, Stefania Chessa3, Mario Luini4, Roberta Bordoni1, Bianca Castiglioni3, Matteo Ricchi5, Paola Cremonesi3.
Abstract
Prototheca zopfii (P. zopfii, class Trebouxiophyceae, order Chlorellales, family Chlorellaceae), a non-photosynthetic predominantly free-living unicellular alga, is one of the few pathogens belonging to the plant kingdom. This alga can affect many vertebrate hosts, sustaining systemic infections and diseases such as mastitis in cows. The aim of our work was to sequence and assemble the P. zopfii genotype 1 and genotype 2 mitochondrial and plastid genomes. Remarkably, the P. zopfii mitochondrial (38 Kb) and plastid (28 Kb) genomes are models of compaction and the smallest known in the Trebouxiophyceae. As expected, the P. zopfii genotype 1 and 2 plastid genomes lack all the genes involved in photosynthesis, but, surprisingly, they also lack those coding for RNA polymerases. Our results showed that plastid genes are actively transcribed in P. zopfii, which suggests that the missing RNA polymerases are substituted by nuclear-encoded paralogs. The simplified architecture and highly-reduced gene complement of the P. zopfii mitochondrial and plastid genomes are closer to those of P. stagnora and the achlorophyllous obligate parasite Helicosporidium than to those of P. wickerhamii or P. cutis. This similarity is also supported by maximum likelihood phylogenetic analyses inferences. Overall, the P. zopfii sequences reported here, which include nuclear genome drafts for both genotypes, will help provide both a deeper understanding of the evolution of Prototheca spp. and insights into the corresponding host/pathogen interactions.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30279542 PMCID: PMC6168571 DOI: 10.1038/s41598-018-32992-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Organelles and nuclear genome annotation statistics.
| Size (nt) | Number Scaffolds | %GC | Total features | CDS | tRNA | rRNA | Number introns (size) | GenBank Accession | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Mitochondrion | 78,500 | 1 | 28.2 | 62 | 32 | 27 | 3 | 6 (5,482) | NC_025413.1 | |
| 57,274 | 1 | 28.7 | 70 | 39(+2c) | 26 | 3 | 7 (6,589) | NC_026009.1 | ||
| 49,343 | 1 | 25.6 | 65 | 37 | 25 | 3 | 2 (8,208) | NC_017841.1 | ||
| 55,328 | 1 | 25.8 | 65 | 36 | 26 | 3 | 5 (4,709) | NC_001613.1 | ||
| 38,164 | 1 | 28.7 | 62 | 33 | 26 | 3 | 0 (0) | MF197533.1 | ||
| 39,222 | 1 | 28.7 | 63 | 34 | 26 | 3 | 1 (776) | MF197534.1 | ||
| Plastid | 124,793 | 1 | 34.0 | 112 | 79 | 30 | 3 | 3 (1,657) | NC_015359.1 | |
| 84,576 | 1 | 30.8 | 109 | 76 | 30 | 3 | 0 (0) | NC_023775.1 | ||
| 37,454 | 1 | 26.9 | 54 | 26 | 25 | 3 | 1 (486) | NC_008100.1 | ||
| 48,188 | 1 | 25.7 | 56 | 25 (+3d) | 25 | 3 | 0 (0) | AP018372.1 | ||
| 51,673 | 1 | 29.7 | 72 | 40 | 29 | 3 | 0 (0) | AP018373.1 | ||
| 55,636 | 1 | 31.1 | 70 | 40 | 27 | 3 | 0 (0) | KJ001761.1 | ||
| 28,698 | 1 | 27.0 | 47 | 19 | 25 | 3 | 0 (0) | MF197535.1 | ||
| 28,638 | 1 | 26.8 | 47 | 19 | 25 | 3 | 0 (0) | MF197536.1 | ||
| Nuclear | 42.2 M | 414 | 67.1 | 9,780 | 9,780 | — | — | n.a. | ADIC00000000.1 | |
| 22.9 M–32.7 Ma | 113 | 62.8 | 7,016 | 7,014 | — | 2 | n.a. | APJO00000000.1 | ||
| 12.4 M | 5,666 | 61.7 | 6,033 | 6,033 | — | — | n.a. | AYPS00000000.1 | ||
| 16.9 M | 27 | 71.4 | 7,041 | 7,041 | — | — | n.a. | BCJY00000000.1 | ||
| 20.0 M | 29 | 60.3 | 6,884 | 6,884 | — | — | n.a. | BCIH01000000.1 | ||
| ~29.0 Mb | 2,860b | n.a. | n.a. | n.a. | n.a. | n.a. | n.a. | — | ||
| ~26.5 M | 6,956 | 67.3 | 6,884 | 6,884 | — | — | n.a. | PEIA01000000 | ||
| ~24.7M | 4,555 | 73.5 | 6,381 | 6,381 | — | — | n.a. | PGFX00000000 |
For each organism, the nuclear or organellar DNA size, the number of scaffolds in the assembly, the percentage of GC content (%GC), the number of introns and size, the number of genes (Total features), subdivided into: coding sequences (CDS), the number of transfer RNAs (tRNA), and of ribosomal RNAs (rRNA) are reported; last column reports NCBI GenBank Accession number for the assemblies.
aOriginal WGS sequencing (BioProject PRJNA182710, year: 2014) covers 21,856,191 bp, whereas a more recent sequencing (PRJNA362700, year: 2017) estimates a total length of 32,730,026 bp.
bAs reported in[51].
cA. protothecoides mitochondrion also includes 2 pseudogenes.
dP. stagnora plastid also includes 3 ORFs of unknown function.
Figure 1P. zopfii genotype 2 mitochondrion and plastid circular plot. Circular plots depicting the annotation of P. zopfii genotype 2 mitochondrion (A) and plastid (B). Gene annotation is reported on the outermost circle of the plot; CDS are in blue, tRNA are in green and rRNA are in red. Innermost circles represent gene orientation, GC content and skew. Other rings report the extent and the % identity of the plastid features with those of proximal organisms (C. variabilis, A. protothecoides, Helicosporidium sp., P. wickerhamii plus P. cutis and P. stagnora for plastid only) and with P. zopfii genotype 1. Transparency is proportional to the degree of identity between P. zopfii genotype 2 and each reference genome; no transparency indicates 100% identity. % identity was calculated on the basis of BLASTn (for tRNA and rRNA) and BLASTp (for CDS) matches with the corresponding features on the reference plastid genome.
Figure 2Multi alignment of P. zopfii mitochondrion and plastid sequences. Gene order comparison between the mitochondrial (A) and plastid (B) sequences of P. zopfii and other members of the Trebouxiophyceae class. Organisms are ordered by descending organelle size.
Figure 3Phylogenetic analysis of P. zopfii. (A) Maximum Likelihood (ML) tree inferred from the super-alignment of 9 plastid ribosomal proteins (i.e.: RPL2, RPL5, RPL14, RPL16, RPL20, RPS8, RPS11, RPS12, and RPS14). Bootstrap values are indicated above the lines. Core Trebouxiophyceae was used as outgroup. Among Chlorellales, (dark green vertical bar) non-photosynthetic species are highlighted in blue; tree lengths for this group are not drawn to scale. (B) Heatmap representing the pairwise average percentage of identity between CDS of 8 organisms belonging to Chlorellales (P. zopfii and its closest relatives).
Comparison of genes of the C. variabilis chloroplast with genes of the P. zopfii and P. stagnora plastids.
| Functiona | Presentb | Absentc | |
|---|---|---|---|
| Transcription (plastid-encoded RNA polymerase) | — | ||
| Translation | — | ||
| Ribosomal proteins | Small subunit | rps3, rps4, rps7, rps8, rps9, rps11, rps12, rps14, rps19 | rps2, rps18 |
| Large subunit | rpl2, rpl5, rpl14, rpl16, rpl20, | rpl12, | |
| Photosynthesis | ATP synthase | — | atpA, atpB, atpE, atpF, atpH, atpI |
| Photosystem I | — | psaA, psaB, psaC, psaI, psaJ, psaM | |
| Photosystem II | — | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | |
| Cytochromecomplex | — | petA, petB, petD, petG, petL | |
| Metabolism | accD, cysT | rbcL, cysA | |
| Chlorophyll biosynthesis | — | chlB, chlI, chlL, chlN | |
| Protein quality control | clpP | ||
| Assembly, membrane insertion | — | ccsA, secG | |
Genes are grouped according to function. aFunction and names of chloroplast genes follow the categorization and nomenclature reported in[52].
bGenes present in P. zopfii. Genes present in P. zopfii genotype 1 and 2 but absent in P. stagnora plastid are in boldface.
cGenes absent in both P. zopfii and P. stagnora. Genes present in P. stagnora but absent in P. zopfii genotype 1 and 2 plastid are in boldface.