| Literature DB >> 34548559 |
Matteo Mozzicafreddo1, Sandra Pucciarelli2, Estienne C Swart3, Angela Piersanti2, Christiane Emmerich3, Giovanna Migliorelli2, Patrizia Ballarini2, Cristina Miceli2.
Abstract
The macronuclear (MAC) genomes of ciliates belonging to the genus Euplotes species are comprised of numerous small DNA molecules, nanochromosomes, each typically encoding a single gene. These genomes are responsible for all gene expression during vegetative cell growth. Here, we report the analysis of the MAC genome from the Antarctic psychrophile Euplotes focardii. Nanochromosomes containing bacterial sequences were not found, suggesting that phenomena of horizontal gene transfer did not occur recently, even though this ciliate species has a substantial associated bacterial consortium. As in other euplotid species, E. focardii MAC genes are characterized by a high frequency of translational frameshifting. Furthermore, in order to characterize differences that may be consequent to cold adaptation and defense to oxidative stress, the main constraints of the Antarctic marine microorganisms, we compared E. focardii MAC genome with those available from mesophilic Euplotes species. We focussed mainly on the comparison of tubulin, antioxidant enzymes and heat shock protein (HSP) 70 families, molecules which possess peculiar characteristic correlated with cold adaptation in E. focardii. We found that α-tubulin genes and those encoding SODs and CATs antioxidant enzymes are more numerous than in the mesophilic Euplotes species. Furthermore, the phylogenetic trees showed that these molecules are divergent in the Antarctic species. In contrast, there are fewer hsp70 genes in E. focardii compared to mesophilic Euplotes and these genes do not respond to thermal stress but only to oxidative stress. Our results suggest that molecular adaptation to cold and oxidative stress in the Antarctic environment may not only be due to particular amino acid substitutions but also due to duplication and divergence of paralogous genes.Entities:
Mesh:
Year: 2021 PMID: 34548559 PMCID: PMC8455672 DOI: 10.1038/s41598-021-98168-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
E. focardii macronuclear genome assembly statistics in comparison with E. vannus[79] and E. octocarinatus[38].
| Assembler name and version | SPAdes 3.10.1 | SPAdes 3.7.1 | Mira 4.0 / SPAdes 2.5.0 |
| Assembly size (Mb) | 49.33 | 85.1 | 88.9 |
| %GC | 31.51 | 37.0 | 28.2 |
| Contigs (n) | 31,114 | 38,245 | 41,980 |
| Telomeres (n) | 87,301 | 156,713 | 132,894 |
| N50 (bp) | 2047 | 2714 | 2947 |
| Mean contig length (bp) | 1583 | 2225 | 2117 |
| Max contig length (bp) | 50,409 | 40,045 | 53,269 |
| 2-telomeres contigs (n) | 17,798 | 25,871 | 30,058 |
| Mean 2-telomere contig length (bp) | 1954 | 2369 | 2474 |
| 1-telomeres contigs (n) | 6136 | 7637 | 4368 |
| Mean 1-telomere contig length (bp) | 1280 | 2039 | 2114 |
| 0-telomeres contigs (n) | 7180 | 4737 | 7554 |
| Mean 0-telomers contig length (bp) | 927 | 1736 | 700 |
| % Contigs (with telomere) | 76.9 | 87.2 | 81.9 |
E. focardii Gene prediction features in comparison with E. vannus[79] and E. octocarinatus[38].
| Nanochromosomes with only 1 gene (n) | 23,224 | / | / |
| Nanochromosomes with more genes (n) | 2650 | / | / |
| Total genes (n) | 28,881 | 43,040 | 29,076 |
| Exons (n) | 81,715 | 175,735 | 96,843 |
| Introns (n) | 52,832 | / | / |
| Average intron length (bp) | 69.02 | / | / |
| Average number of introns per gene (n) | 1.83 | / | / |
| Average CDS length (bp) | 1085 | 1460 | 1178 |
| Number of genes with assigned function | 15,357 (53.17%) | / | / |
| Number of predicted enzymes | 3306 (11.45%) | / | / |
CEGs missing from alignment results using default CEGMA criteria.
| CEG name | |||
|---|---|---|---|
| DNA-directed RNA polymerase, subunit RPB10 | KOG3497 | KOG3497 | |
| Mitochondrial F1F0-ATP synthase, subunit delta/A TP16 | KOG1758 | KOG1758 | KOG1758 |
| 6-Phosphogluconate dehydrogenase | KOG2653 | KOG2653 | KOG2653# |
| Sugar (pentulose and hexulose) kinases | KOG2531 | KOG2531 | KOG2531 |
| Predicted snRNP core protein | KOG3448 | ||
| Glucose-6-phosphate 1-dehydrogenase | KOG0563 | KOG0563 | KOG0563 |
| OTU (ovarian tumor)-like cysteine protease | KOG2606 | ||
| 6-phosphogluconolactonase- like protein | KOG3147 | KOG3147 | KOG3147 |
| Spindle assembly checkpoint protein | KOG3285 | KOG3285 | KOG3285 |
| RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 | KOG2807 | ||
| UDP-glucose pyrophosphorylase | KOG2638 | KOG2638 | KOG2638 |
| Mitochondrial import inner membrane translocase, subunit TIM13 | KOG1733 | KOG1733 | KOG1733 |
| Translation initiation factor 3, subunit g (eIF-3 g) | KOG0122 | KOG0122 | |
| Uncharacterized conserved protein | KOG2967 | ||
| Mitochondrial import inner membrane translocase, subunit TIM9 | KOG3479 | KOG3479 | |
| Predicted translation initiation factor related to eIF-2B alpha/beta/delta subunits (CIG2/IDI2) | KOG1468 | KOG1468 | KOG1468 |
| Ubiquitin fusion-degradation protein | KOG1816 | ||
| Uncharacterized conserved protein | KOG3237 | ||
| Molecular chaperone Prefoldin, subunit 4 | KOG1760 | ||
| Small nuclear ribonucleoprotein (snRNP) SMF | KOG3482 | KOG3482* | |
| 60S ribosomal protein L38 | KOG3499 | KOG3499 | |
| 60S ribosomal protein L39 | KOG0002 | ||
| Multifunctional methyltransferase subunit TRM112-like protein isoform 1 | KOG1088 | ||
| Retention in endoplasmic reticulum 1 | KOG1688 | ||
| Translationally-controlled tumor protein | KOG1727 | ||
| Dolichyl-P-Glc:Man9GlcNAc2-PP-dolichyl glucosyltransferase | KOG2575 | ||
| Ubiquitin-like protein 5 | KOG3493* | ||
| H/ACA ribonucleoprotein complex subunit 3 | KOG3503 | ||
| Total CEGs not found | 14 | 17 | 20 (17) |
#Found into a contig without telomeres set (with %GC > 45).
*Found into the assembly set but not into the genes predicted set.
Figure 1(A) Secondary and tertiary structures of a potential stop-suppressor tRNA of UAG. (B) alignment of predicted tRNA-Glu genes and candidate suppressor. (C) Frequencies of amino acids aligned to TGA = Cysteine, TAA/TAG and CAA/CAG = Glutamine in tblastn matches (amino acid counts were obtained from BLAST highest scoring pairs with e-value < 1e−20, a match criterion of 21 bp amino acids centered on the codon of interest and > = 40% amino acid identity over 10 amino acids either side of subject codon – example provided below graph). “n” indicates the number of aligned codons analyzed for each bar. Query O. trifallax proteins were obtained from oxy.ciliate.org.
Tubulins comparison between E. focardii, E. crassus and E. octocarinatus. *NODE_87922_length_309_cov_44.0984 (fragment of 309 nt without telomeres only present in the total non-cleaned assembly).
| Tubulin | Protein assembly code | ||
|---|---|---|---|
| α1 | Protein_13754° | Protein_27015° | 18,065.g7566.t1° § |
| α2 | Protein_13962 (94.88% of identity vs α1) | Protein_27022° | |
| α3 | Protein_14463 (85%) | Protein_28219° | |
| α4 | Protein_14635 (59%) | Protein_29453 | |
| α5 | Protein_14449 (57%) | Protein_30084 | |
| α6 | Protein_15234 (57%) | ||
| α7 | Protein_13592 (54%) | ||
| β1 | Protein_13059° | Protein_28250 | 8213.g27318.t1° |
| β2 | Protein_14529° | Protein_23030 | 5984.g25146.t1° § |
| β3 | Protein_14159° | Protein_27936° | 4,144,198.g23194.t1 |
| β4 | Protein_28002 | 5259.g24445.t1 | |
| β5 | Protein_13114 | Protein_20290 | 6445.g25572.t1 |
| γ1 | Protein_13695° | Protein_26311° | |
| γ2 | * ° | Protein_25799° | 11,092.g1034.t1° |
| δ1 | Protein_16384 | 24,288.g13101.t1 | |
| δ2 | Protein_06776 | 30,256.g19015.t1 | |
| ε | Protein_13848 | 17,307.g6847.t1 | |
| 2182.g10828.t1 | |||
| 29,123.g17894.t1 | |||
| 29,737.g18500.t1 | |||
| 414,497.g23602.t1 | |||
| α-like | Protein_09756 (similar to Protein_13962) Protein_11578 (similar to Protein_15234) Protein_12280 (similar to Protein_15234) Protein_12357 (similar to Protein_13962) Protein_13335 (similar to Protein_15234) Protein_14210 (similar to Protein_15234 frag) Protein_16674 (similar to Protein_13754) | 6 proteins | |
| β-like | Protein_11303 (similar to Protein_14529) Protein_15589 (similar to Protein_14159) | 6 proteins |
Protein sequence already deposited in UniProtKB (the ID numbers of E. crassus proteins are: Q8MM87 for TBα1, Q8MU38 for TBα2, Q8MU39 for TBα3, P20365 for TBβ3, P54403 for TBγ1 and P54404 for TBγ2; the ID numbers of E. octocarinatus proteins are: Q08114 for TBα1, Q08115 for TBβ1, Q0GGY3 for TBβ2 and P90548 for TBγ2).
§Contig contains a further sequence portion (in comparison with that deposited in UniProtKB) most likely derived by an assembling or gene prediction error.
Figure 2Phylogenetic trees of tubulin super families: (A) α-tubulins, (B) β-tubulins and (C) γ-tubulins. Only bootstrap percentage values higher than 50%, indicated near nodes, are shown.
Figure 3(A) trajectories data as backbone RMSDs for E. focardii α-tubulin isotype 2, for the first 10 ns, at 27 °C (red) and at 4 °C (blue). (B) the bar plot shows ΔRMSD values for each α-tubulin of E. focardii (blue), E. crassus (green), E. octocarinatus (yellow) and T. thermophila (6U0H, orange; ATU1 in Fig. 2) calculated as the difference between the RMSD at 27 °C and the RMSD at 4 °C (see "Materials and methods" section).
Antioxidant enzyme comparison between E. focardii, E. crassus and E. octocarinatus.
| Anti-oxidant enzyme | Protein assembly code | ||
|---|---|---|---|
| SOD1a | Protein_25445 § ° | Protein_49381 + | 2168.g10728.t1 § |
| SOD1b | Protein_25349 § ° | Protein_49686 § | 20,871.g10083.t1 § |
| SOD1c | 32,362.g21016.t1 + | ||
| SOD1d | Protein_26998 + | Protein_46068 § | |
| SOD1e | Protein_23236 § | Protein_46028 § | 19,489.g8844.t1 + |
| SOD2a | Protein_23056 # ° | Protein_44659 # | 20,227.g9513.t1 # |
| SOD3a | Protein_23639 + | Protein_44373 + | |
| SOD3b | Protein_23528 + | 33,273.g21824.t1 + | |
| CAT1 | Protein_12515 | Protein_24239 | 7699.g26798.t1 |
| CAT2 | Protein_12995 | Protein_24753 | 32,233.g20904.t1 |
| CAT3 | Protein_25541 | Protein_24714 | |
| CAT4 | Protein_25374 | 32,233.g20903.t1 | |
| CAT5 | Protein_27502 | ||
| CAT6 | Protein_28050 | ||
| CAT7 | Protein_23587 | ||
| PRX1 | Protein_22541 | Protein_45540 | 7627.g26732.t1 |
| PRX2 | Protein_22695 | Protein_40156 | 7810.g26912.t1 |
| PRX3 | Protein_23342 | Protein_45912 | 8218.g27323.t1 |
| PRX4 | Protein_23707 | Protein_44467 | 6416.g25544.t1 |
| PRX5a | Protein_28419 | ||
| PRX5b | Protein_44989 | ||
| PRX6 | Protein_39913 | ||
| TRXR1 | Protein_09887 | Protein_21154° | 9 proteins |
| TRXR2 | Protein_20916° | ||
| TRXR3 | Protein_10344 | Protein_19652 | |
| GR1 | Protein_14684 | Protein_27646° | |
| GR2 | Protein_14104 | Protein_25768 | |
| GPx1 | Protein_21251 | Protein_44685° | 11,975.g1890.t1 |
| GPx2 | Protein_22198 | Protein_44915° | 13,463.g3154.t1 |
| GPx3 | Protein_23095 | Protein_44916° | 32,168.g20842.t1 |
| GPx4 | Protein_25138 | Protein_43343 | |
| GPx5 | Protein_22265 | Protein_44948 | 8754.g27852.t1 |
| GPx6 | Protein_48111 | ||
| GPx7 | Protein_43209 | ||
| GS1 | Protein_14094 | Protein_25486 | 9010.g28082.t1 |
| GS2 | Protein_14269 | Protein_37527 | |
| GS3 | Protein_14270 | Protein_27010 | |
| GST | 69 proteins | 60 proteins | 63 proteins |
°Protein sequence already deposited in UniProtKB (the ID numbers of E.crassus proteins are: B8XTW3 for TRXR1, B8XTW4 for TRXR2, J9SMC8 for GR1, B8XTW9 for GPx1, B8XTX0 for GPx2 and J9T5F4 for GPx3).
§Protein having Cu/Zn superoxide dismutase signature (Prosite pattern ID: PS00087 and PS00332).
#Fe/Mn superoxide dismutase signature (Prosite pattern ID: PS00088).
+Protein annotated as Cu/Zn superoxide dismutase but not have any known signature.
Figure 4Phylogenetic trees of SODs (A) and CATs (B). Only bootstrap percentage values higher than 50%, indicated near nodes, are shown. Copper/zinc SODs are highlighted in green, iron/manganese SODs in blue, and nickel SODs in red.
Number of antioxidant enzymes predicted in ciliate organisms phylogenetically close to E. focardii. These values were obtained consulting the annotated proteins files.
| SOD | CAT | PRX | TRXR | GR | GPx | GS | GST | TGR | Total | |
|---|---|---|---|---|---|---|---|---|---|---|
| 7 | 6 | 5 | 2 | 2 | 5 | 3 | 69 | 0 | 99 | |
| 6 | 4 | 6 | 3 | 2 | 7 | 3 | 60 | 0 | 91 | |
| 6 | 5 | 4 | 2 | 2 | 3 | 0 | 31 | 0 | 53 | |
| 6 | 3 | 4 | 9 | 0 | 4 | 1 | 63 | 3 | 93 | |
| 11 | 4 | 0 | 1 | 0 | 10 | 2 | 58 | 0 | 86 | |
| 7 | 4 | 1 | 1 | 1 | 11 | 1 | 23 | 0 | 49 | |
| 4 | 1 | 0 | 0 | 0 | 12 | 1 | 75 | 6 | 99 |
Figure 5Phylogenetic tree of heat-shock protein Hsp70 super families. Only bootstrap percentage values higher than 50%, indicated near nodes, are shown.
Figure 6Hsp70 transcript abundance in stressed E. focardii cells normalized to the abundance in control cells. mRNA levels were determined by qPCR of the Hsp70 isoforms (reported as protein assembly code) using the comparative threshold method. Mean ± SD (n = 4). Total RNA was purified from cells grown at their standard temperature of 4° C (i.e., not shocked), from cells which were exposed to 18 °C (orange) via a heat ramp (see "Materials and methods" section), or from cells exposed to 100 µM of hydrogen peroxide (blue); control and experimental treatments were for 30 min. Data are reported as the mean of three experiments.