| Literature DB >> 27642619 |
Benjamin W Frable1, Bruno F Melo2, Brian L Sidlauskas3, Kendra Hoekzema4, Richard P Vari5, Claudio Oliveira6.
Abstract
The data presented herein support the article "Molecular phylogenetics of the Neotropical fish family Prochilodontidae (Teleostei: Characiformes)" (B.F. Melo, B.L. Sidlauskas, B.W. Frable, K. Hoekzema, R.P. Vari, C. Oliveira, 2016) [1], which inferred phylogenetic relationships of the prochilodontids from an alignment of three mitochondrial and three nuclear loci (5279 bp) for all 21 recognized prochilodontid species and 22 related species. Herein, we provide primer sequences, museum voucher information and GenBank accession numbers. Additionally, we more fully describe the maximum-likelihood and Bayesian phylogenetic analyses of the concatenated dataset, detail the Bayesian species tree analysis, and provide the maximum likelihood topologies congruent with prior morphological hypotheses that were compared with the unconstrained tree using Shimodaira-Hasegawa tests.Entities:
Keywords: *BEAST; Characiformes; Phylogenetics; Prochilodontidae; Shimodaira–Hasegawa test
Year: 2016 PMID: 27642619 PMCID: PMC5018087 DOI: 10.1016/j.dib.2016.08.015
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Information content and nucleotide frequencies of each locus.
| Locus | Bp after alignment | PCR | Primer sequence (5′–3′) | ΠA | ΠC | ΠG | ΠT | Reference |
|---|---|---|---|---|---|---|---|---|
| 16S | 510 bp | 1 PCR | 16Sa-L – ACGCCTGTTTATCAAAAACAT | 0.296 | 0.239 | 0.236 | 0.229 | |
| 16Sb-H – CCGGTCTGAACTCAGATCACGT | ||||||||
| COI | 656 bp | 1 PCR | L6252-Asn – AAGCCGGGGAAAGCCCCGGCAG | 0.242 | 0.278 | 0.187 | 0.293 | |
| H7271-COXI – TCCTATGTAGCCGAATGGTTCTTTT | ||||||||
| Cytb | 990 bp | 1 PCR | LNF – GACTTGAAAAACCAYCGTTGT | 0.269 | 0.310 | 0.146 | 0.275 | |
| H08R2–GCTTTGGGAGTTAGDGGTGGGAGTTAGAATC | ||||||||
| Myh6 | 710 bp | 1st PCR | F329 – CCGCMTGGATGATCTACACA | 0.302 | 0.229 | 0.231 | 0.239 | |
| 3R1 – ATTCTCACCACCATCCAGTTGAA | ||||||||
| 2nd PCR | A3F2 – GGAGAATCARTCKGTGCTCATCA | |||||||
| A3R2 – CTCACCACCATCCAGTTGAACAT | ||||||||
| Rag1 | 1378 bp | 1st PCR | Rag1CF1 – ACCCTCCGTACTGCTGAGAA | 0.250 | 0.239 | 0.287 | 0.224 | |
| Rag1CR1 – CGTCGGAAGAGCTTGTTGCC | ||||||||
| Rag1CF2 – TACCGCTGAGAAGGAGCTTC | ||||||||
| 2nd PCR | Rag1CR2 – TGTTGCCAGACTCATTGCCCTC | |||||||
| Rag2 | 1029 bp | 1st PCR | 164F – AGCTCAAGCTGCGYGCCAT | 0.242 | 0.259 | 0.273 | 0.225 | |
| Rag2-R6 – TGRTCCARGCAGAAGTACTTG | ||||||||
| 176F – GYGCCATCTCATTCTCCAACA | ||||||||
| 2nd PCR | Rag2Ri - AGAACAAAAGATCATTGCTGGTCGGG |
Specimens and loci used in Melo et al. [1]. For each individual, its taxonomic designation, collection catalog number of voucher, tissue specimen number, and GenBank accession numbers are given (GenBank:KX086740 through GenBank:KX087100).
| Taxon | Voucher | Specimen | 16S | Co1 | Cytb | Myh6 | Rag1 | Rag2 |
|---|---|---|---|---|---|---|---|---|
| LBP 19,326 | 76,121 | – | – | – | – | – | ||
| ANSP 192,865 | 76,122 | – | – | – | – | – | ||
| ANSP 192,865 | 6609 | – | ||||||
| LBP 251 | 4216 | – | – | |||||
| LBP 251 | 4217 | |||||||
| LBP 2496 | 16,385 | |||||||
| LBP 2496 | 16,386 | – | ||||||
| LBP 20,269 | 79,757 | – | – | – | – | |||
| LBP 20,269 | 79,758 | – | – | – | – | – | ||
| LBP 252 | 4222 | |||||||
| LBP 252 | 4223 | – | ||||||
| LBP 7211 | 33,175 | – | – | |||||
| LBP 7211 | 33,176 | |||||||
| LBP 9104 | 42,731 | – | ||||||
| LBP 9104 | 42,732 | – | – | |||||
| LBP 45 | 3611 | – | ||||||
| LBP 2348 | 16,071 | – | – | |||||
| GR-93-1 | GR207 | – | ||||||
| GR-93-1 | GR208 | – | ||||||
| LBP 2188 | 15,561 | |||||||
| LBP 2188 | 15,562 | – | ||||||
| LBP 1690 | 12,754 | – | – | |||||
| LBP 7841 | 36,858 | |||||||
| LBP 8589 | 43,397 | |||||||
| LBP 8589 | 43,398 | |||||||
| LBP 12,865 | 53,496 | |||||||
| OS 18,792 | PE10045 | |||||||
| OS 18,792 | PE10058 | – | – | |||||
| FMNH 113,534 | T54 | |||||||
| LBP 6127 | 29,513 | |||||||
| LBP 6127 | 29,514 | |||||||
| ANSP 40,692 | P4313 | |||||||
| MHNG 2705.008 | SU07108 | |||||||
| MHNG 2717.017 | SU08776 | |||||||
| USNM 403,693 | GY11461 | |||||||
| LBP 2349 | 16,011 | |||||||
| LBP 10,180 | 47,662 | |||||||
| LBP 12,776 | 41,019 | |||||||
| LBP 12,807 | 41,171 | |||||||
| LBP 1692 | 12,761 | – | ||||||
| LBP 1692 | 12,762 | – | ||||||
| OS 18,380 | PE10001 | |||||||
| ANSP 180,205 | T43 | |||||||
| LBP 1384 | 12,734 | – | ||||||
| LBP 3041 | 19,139 | – | ||||||
| LBP 3041 | 19,140 | – | – | |||||
| ANSP 187,277 | P4298 | |||||||
| LBP 1383 | 12,727 | – | ||||||
| LBP 1383 | 12,728 | |||||||
| FMNH 113,712 | 2004BSAQ01 | |||||||
| LBP 1691 | 12,757 | – | ||||||
| LBP 1691 | 12,758 | – | ||||||
| LBP 1691 | 12,759 | – | ||||||
| MHNG uncatalogued | 15,729 | |||||||
| ANSP 187,435 | 6929 | |||||||
| AUM 54,212 | T09912 | – | ||||||
| AUM 43,700 | V5274 | |||||||
| ANSP 189,264 | 7015 | |||||||
| LBP 3180 | 16,871 | |||||||
| AUM 53,775 | T08985 | |||||||
| AUM 53,654 | T09707 | |||||||
| AUM 51,355 | T10201 | |||||||
| ANSP 180,516 | T48 | |||||||
| ANSP 178,188 | 1697 | |||||||
| USNM 402,471 | GY11-1-03 | |||||||
| OS 18,313 | PE10113 | |||||||
| LBP 8343 | 40,130 | |||||||
| LBP 4747 | 25,521 | – | – | |||||
| OS 18,724 | PE10110 | – | ||||||
| OS18345 | PE10076 | |||||||
| LBP 4591 | 24,665 | – | ||||||
| LBP 1135 | 5635 | – | ||||||
| LBP 5173 | 26,648 | – | ||||||
| LBP 7556 | 35,624 | – | – | |||||
| LBP 6282 | 29,688 | – | ||||||
| OS 18,990 | PE10044 | |||||||
| OS 18,361 | PE10072 |
Position of each gene and codon within the alignment, with their partitions and best models of nucleotide evolution as determined by PartitionFinder.
| Gene | Position | Partition | Best BIC model for MrBayes |
|---|---|---|---|
| 16S | 1–510 | 1 | SYM+I+G |
| COI 1st position | 511–1167/3 | 2 | GTR+G |
| COI 2nd position | 512–1167/3 | 1 | SYM+I+G |
| COI 3rd position | 513–1167/3 | 3 | HKY+I+G |
| Cytb 1st position | 1169–2158/3 | 4 | GTR+G |
| Cytb 2nd position | 1170–2158/3 | 1 | SYM+I+G |
| Cytb 3rd position | 1168–2158/3 | 3 | HKY+I+G |
| Myh6 1st position | 2160–2869/3 | 6 | HKY+I+G |
| Myh6 2nd position | 2161–2869/3 | 6 | HKY+I+G |
| Myh6 3rd position | 2159–2869/3 | 5 | SYM+G |
| Rag1 1st position | 2871–4248/3 | 6 | HKY+I+G |
| Rag1 2nd position | 2872–4248/3 | 6 | HKY+I+G |
| Rag1 3rd position | 2870–4248/3 | 5 | SYM+G |
| Rag2 1st position | 4249–5278/3 | 6 | HKY+I+G |
| Rag2 2nd position | 4250–5278/3 | 6 | HKY+I+G |
| Rag2 3rd position | 4251–5278/3 | 5 | SYM+G |
Fig. 1Phylogenetic relationships of Prochilodontidae based on maximum likelihood analysis of the concatenated dataset. Numbers near nodes represent bootstrap support. Colored symbols correspond to those in Fig. 3, Fig. 4 of Melo et al. [1]. (F1_RAxML_bestTree.unconstrained_result.nwk).
Fig. 5Maximum likelihood topology with intrageneric relationships within Prochilodus constrained to those hypothesized by Castro and Vari [2]. (F5_constraint3_Prochilodus_constrained_RAxML_bestTree.result.nwk).
Partitioning schemes and substitution models for *BEAST identified using the Bayesian Information Criterion in PartitionFinder.
| Gene | Position | Partition | Best BIC model for *BEAST |
|---|---|---|---|
| 16S | 1–510 | 1 | SYM+I+G |
| COI | 511–1167 | 2 | GTR+I+G |
| Cytb | 1169–2158 | 2 | GTR+I+G |
| Myh6 | 2159–2869 | 3 | TrNef+I+G |
| Rag1 | 2870–4248 | 3 | TrNef+I+G |
| Rag2 | 4249–5278 | 3 | TrNef+I+G |
Prior parameter settings for major priors applied in *BEAST. Prior names as in *BEAST/Beauti and are described in BEAST documentation [18].
| Prior | Distribution | Initial | Mean/Shape | Scale | Standard deviation | Offset | Upper | Lower |
|---|---|---|---|---|---|---|---|---|
| Species.popMean | Gamma | 1 | 1.6 | 0.5 | – | 0 | – | – |
| BirthDeath.meanGrowthRate | Uniform | 0.8 | – | – | – | – | 10,000 | 0 |
| BirthDeath.relativeDeathRate | Uniform | 0.5 | – | – | – | – | 1 | 0 |
| 16S.ucld.mean | – | – | – | – | – | – | – | – |
| 16S.ucld.stdev | Lognormal | 0.333 | 0.5 | – | – | 0 | – | – |
| COXI.ucld.mean | Lognormal | 0.003 | 0.003 | – | 1 | 0 | – | – |
| COXI.ucld.stdev | Lognormal | 0.333 | 0.5 | – | – | 0 | – | – |
| CYTB.ucld.mean | Lognormal | 0.003 | 0.003 | – | 1 | 0 | – | – |
| CYTB.ucld.stdev | Lognormal | 0.333 | 0.5 | – | – | 0 | – | – |
| MYH6.ucld.mean | Lognormal | 0.0005 | 0.0005 | – | 1 | 0 | – | – |
| MYH6.ucld.stdev | Lognormal | 0.333 | 0.5 | – | – | 0 | – | – |
| RAG1.ucld.mean | Lognormal | 0.0005 | 0.0005 | – | 1 | 0 | – | – |
| RAG1.ucld.stdev | Lognormal | 0.333 | 0.5 | – | – | 0 | – | – |
| RAG2.ucld.mean | Lognormal | 0.0005 | 0.0005 | – | 1 | 0 | – | – |
| RAG2.ucld.stdev | Lognormal | 0.333 | 0.5 | – | – | 0 | – | – |
Fig. 2Maximum likelihood topology with Ichthyoelephas constrained to be sister to Semaprochilodus. (F2_constraint4_Ichthyoelephas_constrained_RAxML_bestTree.result.nwk).
Fig. 3Maximum likelihood topology with Semaprochilodus taeniurus constrained to be sister to a clade containing S. kneri and S. insignis. (F3_constraint1_Semaprochilodus_taeniurus_constrained_RAxML_bestTree.result.nwk).
Fig. 4Maximum likelihood topology with Semaprochilodus taeniurus constrained to be sister to a clade containing S. kneri and S. insignis, and S. insignis constrained to monophyly. (F4_constraint2_Semaprochilodus_constrained_RAxML_bestTree.result.nwk).
| Subject area | Biology, Genetics and Genomics |
| More specific subject area | Phylogenetics and Phylogenomics |
| Type of data | Tables, figures, primers, sequence alignment, museum voucher information, phylogenetic trees |
| How data was acquired | DNA extraction from tissue samples, gene amplification, Sanger sequencing |
| Data format | Raw, filtered, analyzed |
| Experimental factors | DNA extraction from muscle or fin tissue using Quiagen DNeasy kit or modified NaCl protocol |
| Experimental features | Sequences concatenated and aligned in Geneious (v.7.1.7), phylogenies generated using unconstrained and constrained maximum-likelihood (RAxML), concatenated Bayesian (MrBayes), and Bayesian species tree (*BEAST) methods. |
| Data source location | South America |
| Data accessibility | Data provided with this article and in the GenBank public repository, GenBank: |
| COI: | |
| Cytb: | |
| Myh6: | |
| Rag1: | |
| Rag2: |