| Literature DB >> 30991952 |
Kishor Dhaygude1, Abhilash Nair2, Helena Johansson2, Yannick Wurm3, Liselotte Sundström2,4.
Abstract
BACKGROUND: Adapting to changes in the environment is the foundation of species survival, and is usually thought to be a gradual process. However, transposable elements (TEs), epigenetic modifications, and/or genetic material acquired from other organisms by means of horizontal gene transfer (HGTs), can also lead to novel adaptive traits. Social insects form dense societies, which attract and maintain extra- and intracellular accessory inhabitants, which may facilitate gene transfer between species. The wood ant Formica exsecta (Formicidae; Hymenoptera), is a common ant species throughout the Palearctic region. The species is a well-established model for studies of ecological characteristics and evolutionary conflict.Entities:
Keywords: Endosymbionts; Formica exsecta; Genome; Horizontal gene transfer; Transposons; Wolbachia
Mesh:
Year: 2019 PMID: 30991952 PMCID: PMC6469114 DOI: 10.1186/s12864-019-5665-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1De novo genome assembly of F. exsecta genome, summarized by the following metrics: a) Overall assembly length, b) Number of scaffolds/contigs, c) Length of the longest scaffold/contig, d) Scaffold/contig N50 and N90, e) Percentage GCs and percentage Ns, f) BUSCO completeness, g) Scaffold/contig length/count distribution
Genome assembly statistics for F. exsecta and its Wolbachia endosymbiont
| Genome Assembly Stats | FE | |
|---|---|---|
| Total length | 277,719,392 (277 MB) | 3,096,460 (3.09 MB) |
| Total contigs | 14,617 | 69 |
| Contigs (> = 1000 bp) | 3136 (98.24% genome) | 68 (99.97% genome) |
| Contigs (> = 50,000 bp) | 545 (89.59% genome) | 22 (75.48% genome) |
| N50: | 997,654 bp | 104,167 bp |
| N75: | 318,356 bp | 54,296 bp |
| L50: | 73 | 11 |
| L75: | 185 | 22 |
| GC (%) | 36.00 | 35.13 |
BUSCO quality metrics for the genome assemblies of F. exsecta and the Wolbachia endosymbiont of F. exsecta (wFex)
| BUSCO metric | Eukaryota | Insecta | Arthropoda | Hymenoptera | Bacteria | Proteobacteria |
|---|---|---|---|---|---|---|
| Complete | 299 (98.7%) | 1634 (98.5%) | 2549 (95.29%) | 4249 (96.2%) | 107 (72.30%) | 158 (71.49%) |
| Complete and single copy | 283 (93.4%) | 1572 (94.8%) | 2446 (91.44%) | 4151 (94.0%) | 35 (23.65%) | 55 (24.88) |
| Complete and duplicated | 16 (5.3%) | 62 (3.7%) | 103 (3.86%) | 98 (2.2%) | 72 (48.65%) | 103 (46.60%) |
| Fragmented | 1 (0.3%) | 15 (0.9%) | 195 (7.29%) | 123 (2.8%) | 9 (6.08%) | 11 (4.97%) |
| Missing | 3 (1.0%) | 9 (0.6%) | 34 (1.27%) | 43 (1.0%) | 32 (21.62%) | 52 (23.52%) |
| Total | 303 (100%) | 1658 (100%) | 2675 (100%) | 4415 (100%) | 148 (100%) | 221 (100%) |
Fig. 2Taxonomic distribution of the best BLASTp hits of F. exsecta proteins to the non-redundant (nr) protein database (E < 10− 5)
Fig. 3Venn diagram showing the distribution of gene families (orthologous clusters) among five ant species including three closely related members of the subfamily Formicinae (F. exsecta, Camponotus floridanus, Lasius niger), and two distinctly related ants (Solenopsis invicta and Cerapachys biroi)
Fig. 4Phylogeny of the Wolbachia supergroups a, b, and c strains with the newly assembled wFex genome. The phylogenetic reconstructions are based on individual analyses of 12 single copy core genes of 25 Wolbachia strains. The support values on the branch labels indicate Bayesian posterior probabilities. The letters a-c indicate the separate supergroups
Fig. 5Venn diagram displaying the overlap in orthologous genes among four Wolbachia species including the newly assembled wFex strain and the wDac, wRi, wMel strains reported previously
HGT inserts from Wolbachia present in the genome of F. exsecta with details of its length and position in the F. exsecta genome. The presence of similar insert regions in other eukaryote genomes is also shown
| HGT region in | Length HGT (bp) | Transposon region near HGT | Transposon Name | Observed in other species | Other Host Species name with position of similar insertion | |
|---|---|---|---|---|---|---|
| Transposase | scaffold83: 2271642–2,272,117 | 475 | scaffold83: 2271642–2,272,117 | transposase | Complete |
|
| ABC transporter ATP-binding protein, porphobilinogen deaminase, D-alanine--D-alanine ligase, DNA processing protein DprA, triose-phosphate isomerase | scaffold233: 1712452–1,725,498 | 13,046 | scaffold233: 1714122–1,714,241 | transposase | Partial (few gene region) |
|
| DNA repair protein RadC, transposase, DNA ligase, ABC transporter permease, ATP-dependent protease La | scaffold574: 102007–116,197 | 14,190 | scaffold574: 105963–106,483 | transposase | Partial (few gene region) | |
| probable carboxypeptidase, type IV secretion system,conjugal transfer protein TrbL, lysyl-tRNA synthetase, UDP-N-acetylmuramoylalanine-D-glutamate ligase | scaffold707: 1–38,814 | 38,813 | scaffold707: 35826–36,154 | Mariner Mos1 transposase (Ant origin) | Partial (few gene region) | |
| DNA methylase, Ankyrin repeat domain protein, regulatory protein RepA, site-specific recombinase, cytochrome b-like | scaffold741: 1–47,265 | 47,264 | scaffold741: 54020–54,482, scaffold741: 52587–52,910 | IS110 family transposase, Integrase | Partial (few gene region) |
Summary statistics for the raw sequencing data, before and after filtering reads. “Coverage depth” was calculated based on the estimated assembled genome size (300 Mb)
| Insert Size | Pair reads Length (bp) | Raw | After Filter | ||
|---|---|---|---|---|---|
| Total Data (G) | Sequence coverage (X) | Total Data (G) | Sequence coverage (X) | ||
| 170 bp | 100 bp | 22.68 | 45.36 | 20.96 | 41.93 |
| 500 bp | 100 bp | 8.54 | 17.08 | 7.34 | 14.69 |
| 800 bp | 100 bp | 8.84 | 17.69 | 5.14 | 10.29 |
| 2 kb | 100 bp | 13.23 | 26.46 | 7.05 | 14.10 |
| 5 kb | 100 bp | 14.51 | 29.02 | 4.74 | 9.49 |
| 10 kb | 100 bp | 11.77 | 23.53 | 5.51 | 11.02 |
| 20 kb | 100 bp | 20.40 | 40.81 | 2.91 | 5.81 |
| Total | – | 99.97 | 199.95 | 53.66 | 107.32 |