| Literature DB >> 28074180 |
Ryo Nakao1, Takashi Abe2, Shunsuke Funayama2, Chihiro Sugimoto3.
Abstract
Tsetse flies (Glossina spp.) are the primary vectors of trypanosomes, which can cause human and animal African trypanosomiasis in Sub-Saharan African countries. The objective of this study was to explore the genome of Glossina morsitans morsitans for evidence of horizontal gene transfer (HGT) from microorganisms. We employed an alignment-free clustering method, that is, batch learning self-organising map (BLSOM), in which sequence fragments are clustered based on the similarity of oligonucleotide frequencies independently of sequence homology. After an initial scan of HGT events using BLSOM, we identified 3.8% of the tsetse fly genome as HGT candidates. The predicted donors of these HGT candidates included known symbionts, such as Wolbachia, as well as bacteria that have not previously been associated with the tsetse fly. We detected HGT candidates from diverse bacteria such as Bacillus and Flavobacteria, suggesting a past association between these taxa. Functional annotation revealed that the HGT candidates encoded loci in various functional pathways, such as metabolic and antibiotic biosynthesis pathways. These findings provide a basis for understanding the coevolutionary history of the tsetse fly and its microbes and establish the effectiveness of BLSOM for the detection of HGT events.Entities:
Mesh:
Year: 2016 PMID: 28074180 PMCID: PMC5198169 DOI: 10.1155/2016/3164624
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Workflow for data processing and BLSOM analysis.
Figure 2Tsetse+Prokaryotes-BLSOM. (a) BLSOM using the degenerate tetranucleotide set for the tsetse fly plus 5,600 identified prokaryotes. Lattice points that include the sequences from the tsetse fly are indicated in green, those that contain no genomic sequences are indicated in white, and those containing sequences from a prokaryote are indicated in blue. Lattice points that include both tsetse fly- and prokaryote-sequences are shown in black. (b) Distribution of tsetse fly genome sequences. Only green lattice points are shown.
Origins of HGT candidates at the phylum level.
| Phylum | Number of contigs |
|---|---|
| Actinobacteria | 10 |
| Alphaproteobacteria | 90 |
| Aquificae | 2 |
| Bacteroidetes | 370 |
| Betaproteobacteria | 4 |
| Crenarchaeota | 1 |
| Cyanobacteria | 1 |
| Epsilonproteobacteria | 16 |
| Euryarchaeota | 12 |
| Firmicutes | 758 |
| Fusobacteria | 1 |
| Gammaproteobacteria | 23 |
| Spirochetes | 1 |
| Unassigned | 1,671 |
|
| |
| Total | 2,960 |
Prediction was obtained using Tsetse+Prokaryotes-BLSOM.
Figure 3Genus-BLSOM. (a) Alphaproteobacteria. (b) Bacteroidetes. (c) Epsilonproteobacteria. (d) Euryarchaeota. (e) Firmicutes. (f) Gammaproteobacteria. Lattice points that include sequences from more than one genus are indicated in black, those including no sequences are indicated in white, and those including sequences from a single genus are indicated in individual color.
Origins of HGT candidates at the genus level.
| Phylum | Genus1 | Number of contigs |
|---|---|---|
| Alphaproteobacteria |
| 4 |
|
| 8 | |
|
| 1 | |
|
| 8 | |
|
| 5 | |
|
| 56 | |
| Unassigned | 8 | |
|
| ||
| Bacteroidetes |
| 4 |
|
| 1 | |
|
| 1 | |
| Flavobacteria2 | 187 | |
|
| 12 | |
|
| 18 | |
|
| 3 | |
|
| 1 | |
|
| 9 | |
|
| 1 | |
|
| 37 | |
|
| 10 | |
|
| 4 | |
|
| 4 | |
| Unassigned | 78 | |
|
| ||
| Firmicutes |
| 4 |
|
| 239 | |
|
| 83 | |
|
| 4 | |
|
| 1 | |
|
| 3 | |
|
| 4 | |
|
| 1 | |
|
| 17 | |
|
| 31 | |
|
| 7 | |
|
| 1 | |
|
| 1 | |
|
| 1 | |
|
| 134 | |
|
| 9 | |
|
| 3 | |
|
| 2 | |
|
| 3 | |
| Unassigned | 210 | |
|
| ||
| Gammaproteobacteria |
| 4 |
|
| 6 | |
|
| 1 | |
|
| 1 | |
|
| 1 | |
|
| 1 | |
|
| 1 | |
| Unassigned | 8 | |
1Prediction was obtained using Genus-BLSOM. 2Classification to the class level was obtained.