| Literature DB >> 33297939 |
C Kay1, T A Williams2, W Gibson2.
Abstract
BACKGROUND: Trypanosomes are single-celled eukaryotic parasites characterised by the unique biology of their mitochondrial DNA. African livestock trypanosomes impose a major burden on agriculture across sub-Saharan Africa, but are poorly understood compared to those that cause sleeping sickness and Chagas disease in humans. Here we explore the potential of the maxicircle, a component of trypanosome mitochondrial DNA to study the evolutionary history of trypanosomes.Entities:
Keywords: Kinetoplast; Maxicircle; Mitochondrial DNA; Phylogeny; RNA editing; Trypanosome
Year: 2020 PMID: 33297939 PMCID: PMC7724854 DOI: 10.1186/s12862-020-01701-9
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Global overview of maxicircle gene coding region. Top: alignment of the maxicircle mitochondrial DNA gene coding regions from 27 isolates (top to bottom, Trypanosoma brucei H866, 1829 ALJO, Lister 427, TREU 927, TSW 55, J10, LF1; T. congolense IL3000, WG81, GAM2, IL3900, IL3578, ERA D1; T. simiae ERA C2; T. godfreyi KEN7, ERA F1; T. vivax Liem 176, Y486, Tv2323, T. cruzi, CL Brener, Esmeraldo; T. lewisi, T. conorhini, T. copemani, T. grayi, T. theileri, Leishmania tarentolae). Tracts of poly-T or poly-A are shown coloured turquoise or red respectively. An approximate scale is shown. Segmental gene deletions in the alignment are indicated by arrows and are also shown below as coloured bars; the deletion from T. equiperdum STIB 818 is also shown for comparison. Bottom: gene order in the maxicircle gene coding region. Non-edited genes in are shown in white, minimally edited genes in light grey and extensively edited (pan-edited) genes in grey. Editing categories are on the basis of T. vivax [14]
Characteristics of pre-edited and non-edited mitochondrial maxicircle genes in trypanosomatids
Left: Variation in coding region size relates to the sequence contribution of pre-edited genes. Trypanosome gene coding regions (WCR) are shorter than for related trypanosomatids Leishmania and Crithidia. These size differences reflect the contraction of pre-edited regions (Σedited); the remaining gene coding region (WCR-Σedited) is relatively invariant in length. All numbers present the base length of ungapped sequences. Right: Non-edited genes show trends of GC loss which suggest vulnerability to loss of function. Universally non-edited genes COI, ND4 and the first 500 codons of ND5 (ND5*) were analysed for codon usage (% AU codons) and possible composition changes (reducible GC%), which is the percentage of alternate synonymous codons with reduced GC content. Coding regions which are [ ] or have [ ] that impact calculations are highlighted. Numbers have been shaded by value order on a column by column basis. Trypanosomes below the solid line, with non-salivarian above and salivarian below the dashed line.
Individual maxicircle genes show different trends for GC composition and T:C ratios
From an alignment of gene coding regions, aligned sequence regions were extracted and analysed in the reading direction for GC% (left) and the ratio of T:C (right). Extensively edited genes ([ ]) have greater GC% than lightly ([ ]) or non-edited (white) genes. Likewise, the T:C ratio is very high in some edited genes e.g. ND9, CR3, CR4. Shading as Table 1.
Fig. 2Correlation between sequence length and GC% for six pre-edited maxicircle genes. There is an inverse correlation between the length and GC% of pre-edited genes in trypanosome maxicircles, where salivarian trypanosomes (red filled circle) generally have shorter, more GC-rich pre-edited genes than non-salivarian trypanosomes (black filled circle).This relationship also holds true for other trypanosomatid species (Additional file 6: Figure S4)
Fig. 3Time resolved phylogenies of T. congolense and T. brucei. The savannah (a) and pan-African (b) clades are expanded below their respective trees. The coloured boxes correspond to countries of origin on the map of Africa (inset). STRUCTURE and hierBAPS groups are indicated by the white boxes. Timelines are in years before present and node values are posterior probabilities < 1. Arrowed nodes 1 and 2 are discussed in the text. The putative position of T. equiperdum STIB818 inferred from an independent ML tree (Additional file 10: Figure S6) is indicated in b