| Literature DB >> 32605600 |
Emilie Lefoulon1,2, Travis Clark1, Fanni Borveto3, Marco Perriat-Sanguinet3, Catherine Moulia3, Barton E Slatko4, Laurent Gavotte3.
Abstract
BACKGROUND: Wolbachia are the most widely spread endosymbiotic bacteria, present in a wide variety of insects and two families of nematodes. As of now, however, relatively little genomic data has been available. The Wolbachia symbiont can be parasitic, as described for many arthropod systems, an obligate mutualist, as in filarial nematodes or a combination of both in some organisms. They are currently classified into 16 monophyletic lineage groups ("supergroups"). Although the nature of these symbioses remains largely unknown, expanded Wolbachia genomic data will contribute to understanding their diverse symbiotic mechanisms and evolution.Entities:
Keywords: Biotin; Genomics; Pseudoscorpion; Symbiosis; Target enrichment; Wolbachia
Mesh:
Substances:
Year: 2020 PMID: 32605600 PMCID: PMC7325362 DOI: 10.1186/s12866-020-01863-y
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Fig. 1Phylogenetic tree of pseudoscorpions based on the COI (mitochondrial cytochrome oxidase I) gene. The total length of datasets is 659 bp. The topology was inferred using Maximum Likelihood (ML) inference using IQTREE [39]. The Best-fit model calculated using ModelFinder according to BIC index was TVM + R7. Nodes are associated with Bootstrap values based on 1000 replicates, only bootstrap values superior to 70 are indicated
Information for the wApol genome sequences
| Before assembly | Illumina reads without enrichment | 76,487,892 | 8,349,854 |
| Illumina reads with enrichment | 8,714,832 | 7,422,205 | |
| CCS PacBio reads without enrichment | 13,467 | 2091 | |
| CCS PacBio reads with enrichment | 2032 | 13,070 | |
| Selection of reads mapped to the assembly | Illumina reads without enrichment | 109,309 (0.4%) | 166,537 (1.99%) |
| Illumina reads with enrichment | 1,912,222 (18%) | 1,428,179 (19.24%) | |
| CCS PacBio reads without enrichment | 110 (0.8%) | 35 (1.6%) | |
| CCS PacBio reads with enrichment | 885 (40.9%) | 3343 (25.57%) | |
| Draft genome assembly | Number of contigs | 373 | 200 |
| Size of the largest contig | 25,286 | 40,755 | |
| Total length (bp) | 1,445,964 | 1,404,177 | |
| Contigs > = 10,000 bp | 28 | 43 | |
| N50 | 5741 | 10,346 | |
| L50 | 73 | 40 | |
| GC% | 35.61 | 35.49 | |
| Number of Coding Sequences | 1746 | 2215 | |
| Number of RNAs | 39 | 45 | |
| BUSCO analysis | Complete BUSCOs % | 70.2% | 69% |
| Fragmented BUSCOs % | 7.2% | 8.2% | |
| Missing BUSCOs % | 22.6% | 22% |
Summary of de novo assembly processed in the current study using Unicycler: statistics using QUAST [41]; annotation using RAST [42] pipeline; assessment of the draft using BUSCO v3 [43]
Fig. 2Summary of ANI calculations for Wolbachia genomes. The Average Nucleotide Identity (ANI) between the wApol draft genomes and 13 complete genomes of Wolbachia evaluated using the ANI Calculator [45]
Fig. 3Organization and molecular analysis of the biotin operons of Wolbachia. a Organization of the biotin operon. The operons among the Wolbachia and four outgroups were identified and then aligned using MAFFT [46]. b Phylogeny of the biotin operon (concatenation of contig containing bioA and bioD with the contig containing bioH, bioF and bioB). The length of datasets is 4603 bp. The topology was inferred using Maximum Likelihood (ML) inference using IQ-TREE [39]. The Best-fit model calculated using ModelFinder according to AIC index was TVM + I + G4. Nodes are associated with Bootstrap values based on 1000 replicates. The Wolbachia supergroups are indicated by the color: purple for supergroup F; dark blue for supergroup A; orange for supergroup A and green for supergroup S
Fig. 4Unrooted phylogenetic trees of Wolbachia based on 2 and 6 markers by Maximum Likelihood. a Analysis based on concatenation of ftsZ and coxA; the total length of datasets is 1217 bp. The topology was inferred using Maximum Likelihood (ML) inference using IQ-TREE [39]. The Best-fit model calculated using ModelFinder according to BIC index was TIM3 + I + G4. b Analysis based on concatenation of 16S rDNA, dnaA, ftsZ, coxA, fbpA and gatB; the total length of datasets is 3481 bp. The Best-fit model calculated using ModelFinder according to BIC index was GTR + I + G4. Nodes are associated with Bootstrap values based on 1000 replicates, only bootstrap value superior to 70 are indicated. The Wolbachia supergroups (A–S) are indicated. The bolded names indicate data produced in this study and in the case of wApol, the asterisk (*) indicates sequences produced by PCR amplification
Fig. 5Phylogenomic analyses of Wolbachia. a Analysis based on concatenation of 320 single-copy orthogroups representing an 89,847 amino-acid matrix. The topology was inferred using Maximum Likelihood (ML) inference using IQTREE [39]. The Best-fit model calculated using ModelFinder according to BIC index was JTT + F + I + G4. b Analysis based on concatenation of 167 single-copy orthogroups representing 40,488 amino-acid matrix. The Best-fit model calculated using ModelFinder according to BIC index was JTT + F + I + G4. Nodes are associated with Bootstrap values based on 1000 replicates, only bootstrap value superior to 70 are indicated. The Wolbachia supergroups (A–S) are indicated