Ewa Górecka, Romain Gastineau, Nikolai A Davidovich, Olga I Davidovich, Matt P Ashworth, Jamal S M Sabir, Claude Lemieux, Monique Turmel, Andrzej Witkowski.
Abstract
We provide for the first time the complete plastid and mitochondrial genomes of a monoraphid diatom: Schizostauron trachyderma. The mitogenome is 41,957 bp in size and displays two group II introns in the cox1 gene. The 187,029 bp plastid genome features the typical quadripartite architecture of diatom genomes. It contains a group II intron in the petB gene that overlaps the large single-copy and the inverted repeat region. There is also a group IB4 intron encoding a putative LAGLIDADG homing endonuclease in the rnl gene. The multigene phylogenies conducted provide more evidence of the proximity between S. trachyderma and fistula-bearing species of biraphid diatoms.
Keywords: LAGLIDADG; biraphid; diatoms; genome; monoraphid; multigene; organellar; phylogeny
Year: 2021 PMID: 34681800 PMCID: PMC8541233 DOI: 10.3390/ijms222011139
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1. Light microscopy (LM) and scanning electron microscopy (SEM) images of S. trachyderma. (a–c): live specimen with plastids. (a,b): valve view. (c): girdle view. (d–g): in LM. (d,e): raphe valve. (f,g): sternum valve. (h–k): in SEM. (h): sternum valve. (i): raphe valve.
Figure 2. Map of the mitochondrial genome of S. trachyderma.
Figure 3. MAUVE alignment of the mitochondrial genomes of Berkeleya fennica (KM886611), Didymosphenia geminata (KX889125), Fistulifera saprophila (MT383640), Fistulifera solaris (KT363689), Proschkinia sp. (MH800316), and Schizostauron trachyderma (MZ520767). The legend below shows the gene content of the blocks of synteny (conserved protein-coding genes and rRNA only).
Figure 4. Map of the plastid genome of S. trachyderma.
List of the ORFs found in the mitochondrial and plastid genomes of Schizostauron trachyderma, with the name, position, putative conserved domains, and best blastp result.
| Name | Position | Putative Conserved Domain | Best Blastp Result |
|---|---|---|---|
| orf143a | Plastid genome (LSC) | None | CAA45586.1 from |
| orf227a | Plastid genome (LSC) | Putative serine recombinase | AZJ16760.1 from |
| orf166a | Plastid genome (LSC) | None | AZJ16664.1 from |
| orf123b | Plastid genome (LSC) | None | WP_101018572.1 from |
| orf148b | Plastid genome (LSC) | None | AXF37983.1 from |
| orf406a | Plastid genome (LSC) | None | CAA45586.1 from |
| orf152b | Plastid genome (LSC) | None | YP_009028997.1 from |
| orf418a | Plastid genome (LSC) | Putative integrase recombinase | YP_009686230.1 from |
| orf175a | Plastid genome (LSC) | None | QUW40432.1 from |
| orf147a | Plastid genome (LSC) | None | None significant |
| orf134a | Plastid genome (LSC) | Putative integrase recombinase (not complete, contains His-289, Arg-292, and Tyr-324) | YP_009029005.1 from |
| orf104a | Plastid genome (LSC) | Putative integrase recombinase (not complete) | YP_009686252.1 from |
| orf187b | Plastid genome (LSC) | None | HAC63963.1 from |
| orf110a | Plastid genome (LSC) | Putative serine recombinase (not complete in C terminal, lacks the DNA binding site) | QGW12742.1 from |
| orf238b | Plastid genome (LSC) | None | QUS63763.1 from |
| orf417b | Plastid genome (LSC) | None | AXF37982.1 from |
| orf224a | Plastid genome (LSC) | Putative serine recombinase | YP_009496149.1 from |
| orf127a | Plastid genome (LSC) | None | None significant |
| orf103a | Plastid genome (LSC) | None | YP_009028999.1 from |
| orf158b | Plastid genome (LSC) | None | YP_009029000.1 from |
| orf299a | Plastid genome (LSC) | Putative integrase recombinase | YP_009497021.1 from |
| orf305a | Plastid genome (LSC) | None | AZJ16668.1 from |
| orf494a | Plastid genome (IR) | Putative reverse transcriptase | QGW12739.1 from |
| orf119 | Plastid genome (IR) | Putative serine recombinase (not complete in N terminal, lacks most of the presynaptic site 1 dimer interface) | QGW12742.1 from |
| orf139 | Plastid genome (IR) | Putative LAGLIDADG | AAL34315.1 from |
| orf100 | Plastid genome (IR) | None | AZJ16668.1 from |
| orf190a | Plastid genome (SSC) | None | YP_009308934.1 from |
| orf144a | Plastid genome (SSC) | None | None significant |
| orf132b | Plastid genome (SSC) | None | AZJ16659.1 from |
| orf391a | Plastid genome (SSC) | None | CAA45582.1 from |
| orf112a | Plastid genome (SSC) | Putative integrase recombinase (not complete in C terminal, displays only the first conserved Arg-173) | YP_009028833.1 from |
| orf117c | Plastid genome (SSC) | None | CAA45586.1 from |
| orf152a | Plastid genome (SSC) | None | HAC63961.1 from |
| orf317a | Plastid genome (SSC) | Putative integrase recombinase | NNF85095.1 from |
| orf290b | Plastid genome (SSC) | None | YP_009028832.1 from |
| orf188a | Plastid genome (SSC) | None | AZJ16664.1 from |
| orf234a | Plastid genome (SSC) | Putative serine recombinase | QWM93463.1 from |
| orf148a | Plastid genome (SSC) | None | YP_009059189.1 from |
| orf313a | Plastid genome (SSC) | None | QUS63950.1 from |
| orf120b | Plastid genome (SSC) | None | None significant |
| orf294a | Plastid genome (SSC) | Putative integrase recombinase | YP_009495909.1 from |
| orf516a | Plastid genome (SSC) | None | CAA45586.1 from |
| orf645 | Mitogenome | Putative reverse transcriptase | YP_009317775.1 from |
| orf690 | Mitogenome | Putative reverse transcriptase | QUS63794.1 from |
Figure 5. Sequence motifs obtained by aligning LAGLIDADG proteins from Schizostauron trachyderma and 13 sequences from various species of Chlorophyceae corresponding to IB4-L8 LAGLIDADG proteins. The alignment displayed as a LOGO shows the presence of two conserved motifs, QWIVGFVDG and PFFE.
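As a rough illustration of how the two motifs named in this caption could be located in a protein sequence, here is a minimal Python sketch. It is not the method used by the authors: it looks only for exact matches, whereas a LOGO summarizes position-wise conservation that may tolerate substitutions. The function name `find_motifs` and the toy sequence are hypothetical.

```python
# Conserved motifs as written in the Figure 5 caption.
MOTIFS = ("QWIVGFVDG", "PFFE")


def find_motifs(protein, motifs=MOTIFS):
    """Return {motif: [0-based start positions]} for exact matches.

    Assumption: exact string matching only; degenerate positions that a
    sequence LOGO would show are not modelled here.
    """
    seq = protein.upper()
    hits = {}
    for motif in motifs:
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            # Resume one residue later so overlapping matches are also found.
            start = seq.find(motif, start + 1)
        hits[motif] = positions
    return hits


# Hypothetical toy sequence for illustration, not a protein from the paper.
toy_sequence = "MKQWIVGFVDGEGSFAAPFFEKL"
motif_hits = find_motifs(toy_sequence)
print(motif_hits)
```

For real data, the sequences would typically be read from a FASTA file and each record passed to the same function.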
Figure 6. Comparison of the gene composition for the plastid genomes of all the species used in the phylogeny below. A 1/blue indicates the presence of the gene, a 0/white its lack, and a Ψ/green a pseudogene version of the gene.
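The 1/0/Ψ encoding described in this caption can be sketched as a small presence/absence matrix. The following Python example is only an illustration under stated assumptions: the species and gene names are placeholders, the status labels and the function `build_matrix` are hypothetical, and treating an unlisted gene as absent is a choice made for this sketch rather than something stated in the source.

```python
# Symbols follow the Figure 6 legend: 1 = present, 0 = absent, Ψ = pseudogene.
SYMBOLS = {"present": "1", "absent": "0", "pseudogene": "Ψ"}


def build_matrix(annotations, genes):
    """Map each species to one symbol per gene, in the order given by `genes`.

    annotations: {species: {gene: status}} with status a key of SYMBOLS.
    Assumption: a gene with no entry for a species is treated as absent.
    """
    return {
        species: [SYMBOLS[calls.get(gene, "absent")] for gene in genes]
        for species, calls in annotations.items()
    }


# Placeholder data, not taken from the paper.
annotations = {
    "species_A": {"gene1": "present", "gene2": "pseudogene"},
    "species_B": {"gene1": "present"},
}
genes = ["gene1", "gene2", "gene3"]

matrix = build_matrix(annotations, genes)
for species, row in matrix.items():
    print(species, " ".join(row))
```

Keeping the gene order in a separate list makes the columns comparable across species, which is what a figure of this kind relies on.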
Figure 7. Maximum likelihood phylogeny inferred from an alignment of concatenated protein-coding genes from 37 mitochondrial genomes of diatoms, including that of S. trachyderma. The araphid species Ulnaria acus served as an outgroup in this analysis. The best scoring RAxML tree (log likelihood = −414,469.369802) is presented with bootstrap support values denoted on the nodes. The scale bar indicates the number of substitutions per site.
Figure 8. Maximum likelihood phylogeny inferred from an alignment of concatenated protein-coding genes from 26 plastid genomes of diatoms, including that of S. trachyderma. The araphid species Ulnaria acus served as an outgroup in this analysis. The best scoring RAxML tree (log likelihood = −878,702.878358) is presented with bootstrap support values denoted on the nodes. The scale bar indicates the number of substitutions per site.