| Literature DB >> 27547523 |
Julie M Allen1, J Gordon Burleigh2, Jessica E Light3, David L Reed4.
Abstract
Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthirapatera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain.Entities:
Keywords: 16S rDNA; Anoplura; Endosymbiont; Endosymbiosis; Gammaproteobacteria; Phylogenetics; Sucking lice
Year: 2016 PMID: 27547523 PMCID: PMC4958000 DOI: 10.7717/peerj.2187
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Table of Anoplura endosymbiont sequences.
Family and species of sucking lice (Phthiraptera: Anoplura) from which endosymbionts were targeted. Also indicated are the collection locality, louse taxon label (for use in the laboratory), mammalian host, presence of putative endosymbiont (where the superscript “b” indicates that Bartonella, a louse pathogen, was sequenced), percent AT content, if the top hit from a BLAST search was an endosymbiont, and finally, if the top hit from the BLAST search was an endosymbiont from a sucking louse.
| Louse family and species Country and State | Taxon label | Host (Order: Family) Museum Voucher (if known) | Endosymbiont present | %AT | BLAST endosymbiont | BLAST Anoplura |
|---|---|---|---|---|---|---|
| Echin3.17.09.2 | Yes | 45% | Yes | No | ||
| Hpsu7.14.09.4 | Yes | 52% | Yes | Yes | ||
| Axcro4.26.09.1 | Yes | 50% | Yes | No | ||
| Axsp7.14.09.5 | Yes (2) | 49%, 45% | Yes, Yes | No, No | ||
| Hofer7.14.09.8 | No | – | ||||
| Hosp4.17.09.7 | No | – | ||||
| Hoony8.27.08.6 | No | – | ||||
| Hosp7.14.09.6 | No | – | ||||
| Hosic7.14.09.9 | No | – | ||||
| Linog6.22.09.1 | Yes | 52% | Yes | No | ||
| Qnpic3.31.08.1 | Yes | 54% | Yes | Yes | ||
| Qnpic6.30.09.2 | Yes | 53% | Yes | Yes | ||
| Qnsp3.31.08.3 | Yes | 54% | Yes | Yes | ||
| Pdcap9.20.05.2NW | Yes | 51% | Yes | Yes | ||
| Pdhum5.19.05.2 | Yes | 51% | Yes | Yes | ||
| Fzehr8.20.08.1 | Yes | 52% | Yes | No | ||
| Fzehr6.30.09.4 | Yes | 51% | Yes | No | ||
| Lnlae6.30.09.3 | Yes | 54% | Yes | No | ||
| Lesp4.26.09.2 | Yes | 53% | Yes | No | ||
| Nescp6.30.09.5 | Yes | 53% | Yes | No | ||
| Neneo8.20.08.2 | No | – | No | |||
| Sathrax4.26.09.3 | Yes | 45% | Yes | No | ||
| Ptgor9.14.08.1 | Yes | 53% | Yes | Yes |
Notes.
USA, United States (AK, Alaska; AZ, Arizona; CA, California; CO, Colorado; FL, Florida; MD, Maryland; OR, Oregon; TX, Texas); MX, Mexico.
MLZ, Moore Laboratory of Zoology; LSUMNZ, Louisiana State University Museum of Natural Science; NMMNH, New Mexico Museum of Natural History.
Figure 1Subset of large phylogenetic tree showing placement and close relatives of endosymbiotic bacteria in Anoplura.
A subtree of the full 42,266 Gammaproteobacteria tree showing 9 of the 10 endosymbiont lineages from sucking lice (red). For all louse endosymbionts, the louse host genus or group is indicted. All of these sequences cluster together either within or near other known endosymbiont lineages (green) and Arsenophonus, a clade of insect bacterial endosymbionts; the arrow points to the Most Recent Common Ancestor (MRCA) of this clade. The 10th lineage of endosymbiont clusters with the genus Legionella, which is not shown due to space constraints.
Figure 2Box Plots showing number of endoysmbiont lineages in differently sampled datasets.
The number of sucking louse endosymbiont lineages inferred from phylogenetic trees with different sampling. The number of taxa in each alignment is plotted on a log10 scale. Boxplots represent the number of endosymbionts calculated from either the 200 bootstrap replicates for the phylogenetically sampled data sets (in black), across the 100 randomly sampled data sets (red) or the simulated data sets (blue). Boxes represent 50% of the data; whiskers extend to 1.5 times the interquartile range representing 95% of the data, and * shows the average. “X” corresponds to the number of lineages calculated from the ML tree for each data set.
Figure 3Box plots with number of lineages for reduced phylogenetic trees.
The number of sucking louse endosymbiont lineages found for reduced phylogenetic trees. Boxplots represent the number of endosymbiont lineages calculated from 200 bootstrap replicates for the data sets. The 200 bootstrap trees for each data set were then pruned to the taxa found in the smaller data sets and the number of endosymbiont lineages counted. The original data sets are plotted in black. The reduced full data sets are in green, reduced 95% data sets are in red, reduced 90% data sets in blue, and reduced 85% data sets in brown. Boxes represent 50% of the data and whiskers extend to 1.5 times the interquartile range, representing 95% of the data.