| Literature DB >> 23345457 |
João M P Alves1, Myrna G Serrano, Flávia Maia da Silva, Logan J Voegtly, Andrey V Matveyev, Marta M G Teixeira, Erney P Camargo, Gregory A Buck.
Abstract
It has been long known that insect-infecting trypanosomatid flagellates from the genera Angomonas and Strigomonas harbor bacterial endosymbionts (Candidatus Kinetoplastibacterium or TPE [trypanosomatid proteobacterial endosymbiont]) that supplement the host metabolism. Based on previous analyses of other bacterial endosymbiont genomes from other lineages, a stereotypical path of genome evolution in such bacteria over the duration of their association with the eukaryotic host has been characterized. In this work, we sequence and analyze the genomes of five TPEs, perform their metabolic reconstruction, do an extensive phylogenomic analyses with all available Betaproteobacteria, and compare the TPEs with their nearest betaproteobacterial relatives. We also identify a number of housekeeping and central metabolism genes that seem to have undergone positive selection. Our genome structure analyses show total synteny among the five TPEs despite millions of years of divergence, and that this lineage follows the common path of genome evolution observed in other endosymbionts of diverse ancestries. As previously suggested by cell biology and biochemistry experiments, Ca. Kinetoplastibacterium spp. preferentially maintain those genes necessary for the biosynthesis of compounds needed by their hosts. We have also shown that metabolic and informational genes related to the cooperation with the host are overrepresented amongst genes shown to be under positive selection. Finally, our phylogenomic analysis shows that, while being in the Alcaligenaceae family of Betaproteobacteria, the closest relatives of these endosymbionts are not in the genus Bordetella as previously reported, but more likely in the Taylorella genus.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23345457 PMCID: PMC3590767 DOI: 10.1093/gbe/evt012
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FGenomic atlas of Candidatus Kinetoplastibacterium crithidii (A) and Ca. K. blastocrithidii (B). Tracks represent, from the outside to inside: genomic coordinates, in kbp; genes in the plus strand (defined as the strand where dnaA is located, blue); genes in the minus strand (red); rRNA genes in the plus strand; rRNA genes in the minus strand; tRNA genes in the plus strand; tRNA genes in the minus strand; GC content above or below average; GC skew.
FMaximum likelihood supermatrix phylogeny (233 concatenated orthologs) of the Betaproteobacteria class, with Alpha- and Gammaproteobacteria as outgroups. For the Betaproteobacteria, families are indicated on the right of each clade. Numbers on branches represent bootstrap support (only support of 50 or greater shown). Black circles with white numbers mark the different betaproteobacterial orders: 1, Neisseriales; 2, Nitrosomonadales; 3, Gallionellales; 4, Methylophilales; 5, Hydrogenophilales; 6, Burkholderiales; 7, Rhodocyclales. The five endosymbionts sequenced in this work are represented in bold.
Comparative Genome Statistics
| CKbla | CKcri | CKdes | CKgal | CKonc | Tequi | Axylo | |
|---|---|---|---|---|---|---|---|
| Genome length | 820,029 | 821,932 | 833,125 | 822,140 | 810,172 | 1,695,860 | 7,359,146 |
| Overall GC % | 32.55 | 30.96 | 30.17 | 32.36 | 31.23 | 37.42 | 65.78 |
| Protein-coding genes | 723 | 730 | 742 | 726 | 693 | 1,556 | 6,815 |
| % of genome in genes | 92.03 | 91.90 | 91.68 | 91.85 | 90.92 | 93.45 | 91.57 |
| rRNA genes | 9 | 9 | 9 | 9 | 9 | 9 | 10 |
| tRNA genes | 43 | 44 | 43 | 43 | 43 | 38 | 60 |
| Other noncoding RNAs | 7 | 7 | 6 | 7 | 7 | ND | ND |
| Pseudogenes | 7 | 1 | 1 | 4 | 20 | 0 | 0 |
| Average gene length | 1,004 | 1,010 | 1,007 | 1,012 | 1,017 | 1,008 | 986 |
| Average intergenic length | 84 | 85 | 87 | 86 | 96 | 82 | 94 |
Note.—CKbla, Candidatus Kinetoplastibacterium blastocrithidii; CKcri, Ca. K. crithidii; CKdes, Ca. K. desouzaii; CKonc, Ca. K. oncopeltii; Tequi, Taylorella equigenitalis; Axylo, Achromobacter xylosoxidans; ND, not determined.
aAchromobacter xylosoxidans numbers include plasmid-located sequences.
bIncludes protein-coding and RNA genes, and pseudogenes.
cArranged in three identical clusters of the 16S, 23S, and 5S rRNA genes (except for A. xylosoxidans, where one of the clusters has a duplicated 5S gene).
dOnly protein-coding genes.
FOrthologous group presence/absence display for the five Candidatus Kinetoplastibacterium genomes, in order of occurrence in the sequence after the dnaA gene. Boxes in the same column represent putative orthologs as identified by OrthoMCL. Organisms are denoted as: CKdes for Ca. K. desouzaii; CKcri for Ca. K. crithidii; CKbla for Ca. K. blastocrithidii; CKonc for Ca. K. oncopeltii; and CKgal for Ca. K. galatii. Box sizes are not proportional to gene lengths in the genome, and blocks are grouped in sets of 10 for aesthetic reasons only. Blue colour box denotes that ortholog is present in more than one organism; green, ortholog is present in one organism only; red, ortholog is present as a pseudogene; white, ortholog is absent.
Genes Under Positive Selection in Ca. Kinetoplastibacterium
| Annotation | COG Category Class |
|---|---|
| Shikimate kinase | |
| Glutamine amidotransferase | |
| 3-Isopropylmalate dehydrogenase small subunit | |
| Leucyl aminopeptidase | |
| Small-conductance mechanosensitive ion channel protein (MscS) | Cell envelope biogenesis, outer membrane |
| Lipoprotein NlpD | Cell envelope biogenesis, outer membrane |
| Uroporphyrin-III C-methyltransferase (hemX) | Coenzyme metabolism |
| Chromosomal replication initiator protein (dnaA) | |
| DNA polymerase III subunit delta | |
| TatD DNAse family protein | |
| ATP-dependent RNA helicase (rhlE) | |
| F-type H+-transporting ATPase subunit gamma | Energy production and conversion |
| F-type H+-transporting ATPase subunit epsilon | Energy production and conversion |
| Methyltransferase | General function prediction only |
| Fe/S cluster insertion protein ErpA | Posttranslational modification, protein turnover, and chaperones |
| Predicted ATPase of the DUF815 family and AAA + superfamily | Posttranslational modification, protein turnover, and chaperones |
| Small subunit ribosomal protein S14 | Translation, ribosomal structure, and biogenesis |
| Large subunit ribosomal protein L2 | Translation, ribosomal structure, and biogenesis |
| Large subunit ribosomal protein L9 | Translation, ribosomal structure, and biogenesis |
| Glutaminyl-tRNA synthetase | Translation, ribosomal structure, and biogenesis |
aCOG category classes in bold typeface are overrepresented (see text) in relation to the genome; those in normal typeface have representation not significantly different from the genome.
bConfidence of the positive selection inference, as calculated by PAML, is greater than 0.99 (all others are between 0.95 and 0.99).
cGenes identified both by M1a–M2a and M7–M8 tests.
dTwo amino acids were detected as being under positive selection (all other had one).