| Literature DB >> 26459325 |
Sara Ruane1, Christopher J Raxworthy2, Alan R Lemmon3, Emily Moriarty Lemmon4, Frank T Burbrink5,6.
Abstract
BACKGROUND: Using molecular data generated by high throughput next generation sequencing (NGS) platforms to infer phylogeny is becoming common as costs go down and the ability to capture loci from across the genome goes up. While there is a general consensus that greater numbers of independent loci should result in more robust phylogenetic estimates, few studies have compared phylogenies resulting from smaller datasets for commonly used genetic markers with the large datasets captured using NGS. Here, we determine how a 5-locus Sanger dataset compares with a 377-locus anchored genomics dataset for understanding the evolutionary history of the pseudoxyrhophiine snake radiation centered in Madagascar. The Pseudoxyrhophiinae comprise ~86 % of Madagascar's serpent diversity, yet they are poorly known with respect to ecology, behavior, and systematics. Using the 377-locus NGS dataset and the summary statistics species-tree methods STAR and MP-EST, we estimated a well-supported species tree that provides new insights concerning intergeneric relationships for the pseudoxyrhophiines. We also compared how these and other methods performed with respect to estimating tree topology using datasets with varying numbers of loci.Entities:
Mesh:
Year: 2015 PMID: 26459325 PMCID: PMC4603904 DOI: 10.1186/s12862-015-0503-1
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Species tree for the pseudoxyrhophiines using STAR and MP-EST (topologies identical) based on the 377-locus NGS data set. Bootstrap values are 100 for all nodes except those labeled with the STAR value first, followed by the MP-EST value. Photos CJR & SR
Robinson-Foulds (RF) distances as % difference between trees using the Sanger (SS) and NGS datasets
| Sanger Trees | SS MP-EST 5 Loci | SS STAR 5 Loci | SS *BEAST 5 Loci | 377 MP-EST/STAR/Concatenated |
|---|---|---|---|---|
| 5 Loci MP-EST | ------ | RF = 10 % | RF = 14 % | RF = 19 % |
| 5 Loci STAR | RF = 10 % | ------ | RF = 14 % | RF = 19 % |
| 5 Loci *BEAST | RF = 14 % | RF = 14 % | ------ | RF = 10 % |
| 5 Loci Concatenated | RF = 19 % | RF = 19 % | RF = 14 % | RF = 19 % |
| 4 Loci (no | RF = 10 % | RF = 5 % | RF = 19 % | RF = 24 % |
| 4 Loci (no | RF = 10 % | RF = 10 % | RF = 24 % | RF = 33 % |
| 4 Loci (no | RF = 14 % | RF = 19 % | RF = 29 % | RF = 29 % |
| 4 Loci (no | RF = 10 % | RF = 0 % | RF = 14 % | RF = 19 % |
| 3 Loci (nucDNA) MP-EST | RF = 19 % | RF = 14 % | RF = 29 % | RF = 33 % |
| 3 Loci (nucDNA) STAR | RF = 19 % | RF = 14 % | RF = 29 % | RF = 33 % |
Note that the MP-EST, STAR, and concatenated trees for the 377 loci NGS datasets were identical and so are in one column
Fig. 2NGS dataset mean Robinson-Foulds (RF) % distances for each species tree (5 replicates with random loci subsets) with respect to the number of NGS loci used (3, 4, 5, 10, 25, 50, 100, 200) compared to the 377-locus species tree and the mean bootstrap values for each species tree (5 replicates with random loci subsets) with respect to the number of loci used (3, 4, 5, 10, 25, 50, 100, 200, 377)
Fig. 3Metatree visualization showing similarity of the species tree topology (as shown in Fig. 1) with that of the 5 Sanger loci gene trees (left) and 50 NGS loci gene trees (right). The species tree is labeled for each. Sanger loci gene trees are indicated by locus name, NGS loci gene trees are indicated by number. In both data sets no gene trees were identical to the species tree