| Literature DB >> 25880916 |
Matthew C Brandley1,2, Jason G Bragg3, Sonal Singhal4,5, David G Chapple6, Charlotte K Jennings7,8, Alan R Lemmon9, Emily Moriarty Lemmon10, Michael B Thompson11, Craig Moritz12,13.
Abstract
BACKGROUND: High-throughput sequencing using targeted enrichment and transcriptomic methods enables rapid construction of phylogenomic data sets incorporating hundreds to thousands of loci. These advances have enabled access to an unprecedented amount of nucleotide sequence data, but they also pose new questions. Given that the loci targeted for enrichment are often highly conserved, how informative are they at different taxonomic scales, especially at the intraspecific/phylogeographic scale? We investigate this question using Australian scincid lizards in the Eugongylus group (Squamata: Scincidae). We sequenced 415 anchored hybrid enriched (AHE) loci for 43 individuals and mined 1650 exons (1648 loci) from transcriptomes (transcriptome mining) from 11 individuals, including multiple phylogeographic lineages within several species of Carlia, Lampropholis, and Saproscincus skinks. We assessed the phylogenetic information content of these loci at the intergeneric, interspecific, and phylogeographic scales. As a further test of the utility at the phylogeographic scale, we used the anchor hybrid enriched loci to infer lineage divergence parameters using coalescent models of isolation with migration.Entities:
Mesh:
Year: 2015 PMID: 25880916 PMCID: PMC4434831 DOI: 10.1186/s12862-015-0318-0
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1The distribution of parsimony-informative characters (PICs) for anchored hybrid enrichment (AHE) markers. a) The distribution of parsimony-informative characters (PICs) across the length of the 415 AHE loci for the full-taxon, Lampropholis, and Saproscincus data sets. Colored dots indicate how many total loci had at least one PIC at that nucleotide position. The zero position is the center of the anchor sequence for all loci. Grey dots indicate the distribution of locus length by plotting number of loci (right y-axis) with any nucleotide at that position. b) The distribution of PICs across the length of the 415 AHE loci in the full-taxon data set “normalized” by dividing the number of PICs at each nucleotide position by number of alignments of that length.
Summary statistics for the anchored-enriched (AHE) loci and transcriptome mining (TM) exons at different taxonomic levels and excluding non-informative loci
|
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| |
| Anchored-enriched (AHE) | |||||||||||
| All samples | 44 | 415 | 1458 | 250 | 534 | 152 | 6 | 52.1 | 99.6 | 92.1 | 96.5 |
|
| 5 | 264 | 1458 | 250 | 541 | 36 | 1 | 2.3 | 99.0 | 94.1 | 99.0 |
|
| 4 | 253 | 1458 | 250 | 540 | 36 | 1 | 2.2 | 99.9 | 94.5 | 99.4 |
|
| 13 | 414 | 1458 | 250 | 534 | 44 | 1 | 15.0 | 99.7 | 95.0 | 98.0 |
|
| 6 | 314 | 1458 | 250 | 543 | 13 | 1 | 3.1 | 99.9 | 97.8 | 99.4 |
|
| 5 | 207 | 1458 | 250 | 538 | 8 | 1 | 1.6 | 99.7 | 96.7 | 98.9 |
|
| 9 | 412 | 1458 | 250 | 535 | 45 | 1 | 6.7 | 99.9 | 90.1 | 98.4 |
|
| 5 | 221 | 1458 | 250 | 555 | 13 | 1 | 2.1 | 99.9 | 95.6 | 99.5 |
| Transcriptome mining (TM) | |||||||||||
| All taxa | 11 | 1650 | 3249 | 101 | 337 | 132 | 1 | 19.2 | 99.7 | 90.1 | 96.3 |
| TM data pruned to number of AHE loci (highest PICs) | |||||||||||
| All taxa | 11 | 415 | 3249 | 163 | 617 | 152 | 23 | 41.3 | 99.1 | 95.3 | 90.1 |
| TM data pruned to number of AHE loci (randomly sampled) | |||||||||||
| All taxa | 11 | 415 | 2480 | 103 | 337 | 120 | 1 | 19.2 | 99.6 | 90.8 | 96.3 |
| AHE data pruned to TM taxon sampling | |||||||||||
| All taxa | 11 | 415 | 1458 | 250 | 534 | 96 | 1 | 25.4 | 99.7 | 91.8 | 96.3 |
1Phylogenetic analyses were not performed on these data sets.
Figure 2Results of the (a) RAxML maximum likelihood and (b) STEAC species tree analysis of 415 anchored enriched (AHE) loci for all taxa. Numbers above or below nodes indicate bootstrap proportions from 1000 pseudoreplicates. Outgroup not shown.
Figure 3Results of the (a) RAxML and (b) STEAC analyses of all 1650 loci selected by our transcriptome mining (TM) analysis and the 415 anchored enriched (AHE) loci with taxa pruned the 10 ingroup and one outgroup taxa present in the TM data set. Numbers above the nodes in the RAxML tree indicate bootstrap proportions from 1000 pseudoreplicates. Clade support was identical between the TM and pruned AHE analyses, with the exception of the Lampropholis + Saproscincus clade that was supported by a bootstrap proportion of 100 and 96, respectively. We used PHYML maximum likelihood analyses to infer trees from 1000 bootstrap pseudoreplicates per locus. We used 1000 bootstrapped trees per locus (1,650,000 total trees) as input trees for the STEAC analysis, and numbers above or below the nodes indicate the proportion of times STEAC inferred that clade. Clade support was identical between the TM and pruned AHE analyses. Outgroup not shown.
Figure 4Correlations between different metrics for divergence, shown with one-tailed p-values. a) Correlation between nucleotide diversity at anchored enrichment (AHE) loci and transcriptome (TM) loci. b) Correlation between tau estimated by AHE loci using 3s and nucleotide diversity at AHE loci. c) Correlation between tau estimated by AHE loci using 3s and tau as previously estimated from population genomic data [31].