| Literature DB >> 27811175 |
Alison S Jacob1,2, Lee O'Brien Andersen3, Paulina Pavinski Bitar4, Vincent P Richards4,5, Sarah Shah6, Michael J Stanhope4, C Rune Stensvold1,3, C Graham Clark7.
Abstract
Complete mitochondrion-related organelle (MRO) genomes of several subtypes (STs) of the unicellular stramenopile Blastocystis are presented. Complete conservation of gene content and synteny in gene order is observed across all MRO genomes, comprising 27 protein coding genes, 2 ribosomal RNA genes, and 16 transfer RNA (tRNA) genes. Despite the synteny, differences in the degree of overlap between genes were observed between subtypes and also between isolates within the same subtype. Other notable features include unusual base-pairing mismatches in the predicted secondary structures of some tRNAs. Intriguingly, the rps4 gene in some MRO genomes is missing a start codon and, based on phylogenetic relationships among STs, this loss has happened twice independently. One unidentified open reading frame (orf160) is present in all MRO genomes. However, with the exception of ST4 where the feature has been lost secondarily, orf160 contains variously one or two in-frame stop codons. The overall evidence suggests that both the orf160 and rps4 genes are functional in all STs, but how they are expressed remains unclear.Entities:
Keywords: mitochondrion; phylogeny; stramenopile; tRNA
Mesh:
Substances:
Year: 2016 PMID: 27811175 PMCID: PMC5203790 DOI: 10.1093/gbe/evw255
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
MRO Genome Characteristics of Blastocystis sp. Subtypes 1–4 and 6–9
| Genome Characteristics | ST 1 | ST 2 | ST 3 | ST 3 | ST 3 | ST 4 | ST 4 | ST 6 | ST 7 | ST8 | ST9 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NandII | Flemming | DMP/08-326 | DMP/08-1043 | DMP/IH:478 | DMP/02-328 | DMP/10-212 | SSI:754 | B | DMP/08-128 | F5323 | |
| GenBank Accession number | EF494740 | KU900235 | HQ909886 | HQ909887 | HQ909888 | EF494739 | KU900236 | KU900237 | CU914152 | KU900238 | KU900239 |
| Genome size (bp) | 28,385 | 28,305 | 28,243 | 28,268 | 28,242 | 27,718 | 27,817 | 28,806 | 29,270 | 27,958 | 28,788 |
| Coding density (%) | 77.5 | 78.0 | 77.5 | 77.6 | 77.7 | 77.1 | 77.4 | 77.0 | 77.1 | 77.0 | 77.3 |
| Intergenic spacer (IGS) content (%) | 4.1 | 3.7 | 3.8 | 3.9 | 4.1 | 3.5 | 3.6 | 4.1 | 4.8 | 3.7 | 4.1 |
| Sum of IGS sizes (bp) | 1,165 | 1,044 | 1,081 | 1,102 | 1,160 | 964 | 996 | 1,167 | 1,399 | 1,043 | 1,179 |
| Average size of IGS (bp) | 33.3 | 30.7 | 30.0 | 30.6 | 33.1 | 36.1 | 28.5 | 35.3 | 37.8 | 29.8 | 38.0 |
| Overlapped genes | 6 | 8 | 7 | 7 | 7 | 8 | 9 | 11 | 7 | 9 | 11 |
| G + C content (%) | 19.9 | 19.7 | 21.6 | 21.4 | 21.6 | 21.9 | 21.6 | 18.9 | 20.1 | 22.7 | 18.8 |
aIncludes rps4 and orf160.
Selected Intergenic Region Data from the MRO Genomes of Blastocystis
| ST1 | ST2 | ST3 | ST3 | ST3 | ST4 | ST4 | ST6 | ST7 | ST8 | ST9 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NandII | Flemming | DMP/08-326 | DMP/IH:478 | DMP/08-1043 | DMP/02-328 | DMP/10-212 | SSI:754 | B | DMP/08-128 | F5323 | |
| Accession number | EF494740 | KU900235 | HQ909886 | HQ909888 | HQ909887 | EF494739 | KU900236 | KU900237 | CU914152 | KU900238 | KU900239 |
| 86 | 85 | 86 | 86 | 88 | 95 | 88 | 94 | 86 | 94 | ||
| 54 | 63 | 62 | 62 | 61 | 55 | 59 | 67 | 59 | 69 | ||
| 21 | 21 | 19 | 41 | 90 | 15 | ||||||
| 10 | 9 | 21 | 21 | 20 | 29 | 22 | 11 | 28 |
aDenotes intergenic regions where the direction of transcription changes.
bDenotes other regions in which the intergenic distance exceeds 100 bp (bold text) in at least some STs.
. 1.—Coding regions of a representative MRO genome of Blastocystis sp. Black blocks represent genes that are predicted to be transcribed clockwise (outer ring) or anti-clockwise (inner ring). Grey blocks represent tRNA genes, which are identified by their amino acid using the standard single letter code. Mf and Me1/Me2 are initiator and elongator methionyl tRNAs, respectively. The inner circle shows the approximate size scale. The map is modified from that in Pérez-Brocal and Clark (2008).
. 2.—Phylogenetic relationships among the stramenopiles based on concatenated nad genes. Stramenopile species are grouped according to their second highest rank in the taxonomic classification of Adl et al. (2012). The tree shown is that inferred using IQ-Tree analysis as described in the methods section. The Bayesian analysis tree was identical except for the branch order of Fucus and Pylaiella. Some of the branches have been collapsed to simplify the topology; the number alongside the genus indicates the number of taxa represented. Bootstrap support and posterior probabilities are adjacent to each node in the order Maximum Likelihood/IQ-Tree/Bayesian Analysis. Where boot strap support was 100% and Posterior Probabilities were 1.0, the node is indicated by a filled circle. Bootstrap support lower than 50% and posterior probability values of < 0.5 are indicated by asterisks. GenBank accession numbers of the sequences used are listed in parentheses.
Overlapping Genes and Length of Overlap (in bp) for the Eight MRO Genomes of Blastocystis
| Isolate | ST1 | ST2 | ST3 | ST3 | ST3 | ST4 | ST4 | ST6 | ST7 | ST8 | ST9 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NandII | Flemming | DMP/08-326 | DMP/IH:478 | DMP/08-1043 | DMP/02-328 | DMP/10-212 | SSI:754 | B | DMP/08-128 | F5323 | |
| GenBank Accession | EF494740 | KU900235 | HQ909886 | HQ909888 | HQ909887 | EF494739 | KU900236 | KU900237 | CU914152 | KU900238 | KU900239 |
| 8 | 8 | 11 | 11 | 11 | 8 | 8 | 8 | 11 | 11 | 8 | |
| – | – | – | – | – | 4 | 4 | 4 | – | – | 4 | |
| – | – | – | – | – | – | – | – | – | 10 | – | |
| 56 | 56 | 56 | 55 | 56 | 56 | 56 | 56 | 56 | 56 | 56 | |
| – | – | – | – | – | – | – | 10 | – | – | 10 | |
| – | 7 | – | – | – | – | 11 | – | – | – | – | |
| – | – | – | – | – | – | – | 8 | – | – | 8 | |
| 11 | 11 | 11 | 11 | 11 | 11 | 11 | 8 | 11 | 11 | 8 | |
| – | – | – | – | – | – | – | – | – | 17 | – | |
| – | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | |
| 23 | 23 | 8 | 8 | 8 | 20 | 20 | 23 | 26 | 23 | 20 | |
| – | – | – | – | – | – | – | 1 | – | – | 1 | |
| 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | |
| 7 | 38 | 7 | 38 | 38 | 7 | 7 | 38 | 69 | 38 | 69 | |
| Total overlap | 115 | 163 | 113 | 143 | 144 | 126 | 137 | 176 | 193 | 237 | 204 |
aThe triplet and the single letter amino acid letter designation is given.
bThe absence in some STs of an initiation codon in rps4 and the presence of in-frame stop codons in orf160 complicates the calculation of the overlap lengths involving these genes as it is unclear where the true start of translation is. Accordingly, we have annotated the start of rps4 as the first triplet after the stop codon of the preceding nad4 gene and the start of orf160 as the conserved ATG upstream of the in-frame stop codon(s); the overlap lengths involving these two genes must be viewed as tentative.
. 3.—Multiple sequence alignment of the conceptual translation of the N terminus of rps4. Amino acids corresponding to sites of potential alternative initiation codons (TTG/CTG/GTG/ATT) are highlighted as white letters on a grey background. # and * represent the stop codons TAA and TAG respectively. A 25 amino acid sequence in frame has been deleted in ST2 for presentation reasons. No potential alternative initiation codons are found in the deleted residues.
. 4.—Multiple sequence alignment corresponding to the first 19 amino acids of orf160. The region presented corresponds to the 56 bp overlap of orf160 with nad7. @ = TGA and * = TAG stop codons respectively. The arrows indicate the only positions in which stop codons are seen.
. 5.—Schematic representation of the proposed gain and loss of stop codons in orf 160 (A) and loss of the start codon in rps4 (B). The cladogram shown is based on Figure 2, with the positions of ST5 and ST10, for which no MRO genome sequence exists, being based on information from SSU rDNA-based trees. Arrows indicate the proposed approximate position of the events under discussion.