| Literature DB >> 29201984 |
Felix Vaux1, Simon F K Hills1, Bruce A Marshall2, Steve A Trewick1, Mary Morgan-Richards1.
Abstract
This data article provides genome statistics, phylogenetic networks and trees for a phylogenetic study of Southern Hemisphere Buccinulidae marine snails [1]. We present alternative phylogenetic reconstructions using mitochondrial genomic and 45S nuclear ribosomal cassette DNA sequence data, as well as trees based on short-length DNA sequence data. We also investigate the proportion of variable sites per sequence length for a set of mitochondrial and nuclear ribosomal genes, in order to examine the phylogenetic information provided by different DNA markers. Sequence alignment files used for phylogenetic reconstructions in the main text and this article are provided here.Entities:
Year: 2017 PMID: 29201984 PMCID: PMC5702863 DOI: 10.1016/j.dib.2017.11.021
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Maximum-likelihood mtDNA phylogeny of buccinid and buccinulid whelks. A maximum-likelihood derived phylogeny generated using RAxML 8.2.8 [9], based an alignment of 31 concatenated mitochondrial genome sequences (11,128 bp incorporating protein-encoding, tRNA and rRNA genes). No partitions were used. No outgroup or monophyly was enforced for this tree. Genera putatively belonging to Buccinulidae are shown in different colours.
Fig. 9Splits network for illustrating alternative phylogenetic signal in 45S rDNA sequence data for marine snails. The splits network of based on a 4667 bp alignment of 31 concatenated nuclear rDNA gene sequences (18S, 5.8S, 28S rRNA genes). Splits were generated using the Neighbor-Net algorithm in SplitsTree 4 [18]. The splits network presents a generalisation of all of possible topological solutions for the phylogenetic signal contained in the underlying sequence data, but it does not quantify the likelihood of alternative phylogenetic relationships. Edge length is proportional to split weight, and box structures within the network indicate signal for alternative topologies in the underlying sequence data.
A summary of statistics for the length and nucleotide composition for the concatenated DNA sequences for the nuclear ribosomal RNA genes 18S, 5.8S and 28S (the internal transcribed spacer regions are not included). All listed specimens were newly sequenced for this study.
| SFKH-TMP005 | 5337 | 23 | 24.5 | 30.0 | 22.2 | 54.5 | |
| SFKH-TMP004 | 5339 | 23 | 24.5 | 30.0 | 22.2 | 54.5 | |
| SFKH-TMP006 | 5339 | 23 | 24.7 | 30.1 | 22.0 | 54.8 | |
| SFKH-TMP014 | 5338 | 24 | 24.4 | 30.0 | 22.2 | 54.4 | |
| 20140782 | 5334 | 24 | 24.5 | 30.0 | 22.2 | 54.5 | |
| 20140781 | 5338 | 24 | 24.4 | 30.0 | 22.3 | 54.4 | |
| 20140783 | 5339 | 24 | 24.3 | 30.0 | 22.3 | 54.3 | |
| SFKH-TMP009 | 5339 | 23 | 24.6 | 30.0 | 22.0 | 54.6 | |
| SFKH-TMP010 | 5339 | 21 | 24.9 | 30.3 | 21.7 | 55.2 | |
| M.302907/2 | 5340 | 22 | 24.8 | 30.1 | 22.0 | 54.9 | |
| SFKH-TMP016 | 5340 | 22 | 24.8 | 30.1 | 22.0 | 54.9 | |
| SFKH-TMP011 | 5340 | 22 | 24.7 | 30.1 | 22.0 | 54.8 | |
| M.258277/6 | 5340 | 22 | 24.7 | 30.2 | 21.9 | 54.9 | |
| M.302870/2 | 5340 | 22 | 24.7 | 30.1 | 22.0 | 54.8 | |
| M.314755/1 | 5340 | 22 | 24.8 | 30.1 | 21.9 | 54.9 | |
| SFKH-TMP004 | 5340 | 22 | 24.7 | 30.1 | 22.0 | 54.8 | |
| M.274111 | 5340 | 22 | 24.6 | 30.1 | 22.1 | 54.7 | |
| SFKH-TMP015 | 5340 | 22 | 24.7 | 30.1 | 22.0 | 54.8 | |
| M.279437 | 5340 | 22 | 24.7 | 30.2 | 21.9 | 54.9 | |
| M.190119 | 5340 | 22 | 24.6 | 30.1 | 22.1 | 54.7 | |
| M.183832 | 5337 | 23 | 24.4 | 30.0 | 22.3 | 54.4 | |
| KK12 | 5337 | 24 | 24.4 | 29.9 | 22.3 | 54.3 | |
| KL2 | 5337 | 24 | 24.3 | 29.9 | 22.4 | 54.2 | |
| C.456980 | 5339 | 24 | 24.4 | 29.9 | 22.2 | 54.3 | |
| C.487648 | 5339 | 25 | 24.4 | 29.9 | 22.2 | 54.3 | |
| Phoenix9 | 5339 | 23 | 24.4 | 30.0 | 22.3 | 54.4 | |
| Phoenix1 | 5339 | 23 | 24.4 | 30.0 | 22.3 | 54.4 | |
| M.190085/3 | 5339 | 23 | 24.4 | 29.9 | 22.3 | 54.3 | |
| M.190082/2 | 5339 | 23 | 24.4 | 29.9 | 22.3 | 54.3 | |
| M.183792 | 5339 | 23 | 24.3 | 30.0 | 22.3 | 54.3 | |
| M.183927 | 5339 | 23 | 24.3 | 30.0 | 22.4 | 54.3 | |
A summary of the statistics for the length and nucleotide composition for the mitochondrial genomes newly sequenced as part of this study. Specimens marked with one asterisk (*) exhibit drops in read coverage for some small regions, for example K. kelletii has 54 bp missing from cox1. Specimens marked with two asterisks (**) have genomes with large gaps in genome coverage for some regions, such as B. v. vittatum that has 266, 151 and 64 bp missing from the ATP6, cox1 and ND2 genes respectively.
| SFKH-TMP005 | 15204 | 31.5 | 14 | 15.0 | 40.1 | 28.4 | ||
| SFKH-TMP004 | 15235 | 31.5 | 13 | 14.6 | 40.7 | 27.9 | ||
| SFKH-TMP006 | 15189 | 29.3 | 15.7 | 17.0 | 38.0 | 32.7 | ||
| SFKH-TMP014 | 15195 | 31.1 | 14.5 | 15.3 | 39.1 | 29.8 | ||
| 20140782 | 15158 | 30.5 | 14.9 | 15.8 | 38.7 | 30.7 | ||
| 20140781 | 15232 | 29.3 | 15.7 | 16.5 | 38.4 | 32.2 | ||
| 20140783 | 15231 | 29.5 | 15.6 | 16.3 | 38.7 | 31.9 | ||
| SFKH-TMP009 | 15251 | 30.4 | 15.7 | 16.0 | 38.0 | 31.7 | ||
| SFKH-TMP010 | 15263 | 29.6 | 15.9 | 16.7 | 37.8 | 32.6 | ||
| M.302907/2 | 15246 | 30.2 | 14.8 | 15.8 | 39.1 | 30.6 | ||
| M.258277/6 | 15247 | 30.9 | 14.1 | 15.2 | 39.7 | 29.3 | ||
| M.302870/2 | 15247 | 30.1 | 14.8 | 15.9 | 39.1 | 30.7 | ||
| M.314755/1 | 15244 | 29.6 | 15.2 | 16.1 | 39.0 | 31.3** | ||
| SFKH-TMP012 | 15244 | 29.6 | 15.1 | 16.4 | 38.9 | 31.5** | ||
| M.274111 | 15254 | 30.4 | 14.7 | 15.7 | 39.2 | 30.4 | ||
| SFKH-TMP015 | 15254 | 30.3 | 14.6 | 15.8 | 39.3 | 30.4 | ||
| M.279437 | 15249 | 30.3 | 14.7 | 15.5 | 39.5 | 30.2* | ||
| M.190119 | 15264 | 30.0 | 14.9 | 16.0 | 39.1 | 30.9 | ||
| SFKH-TMP013 | 15258 | 29.3 | 15.8 | 16.7 | 38.2 | 32.5 | ||
| M.183832 | 15229 | 29.5 | 16.2 | 17.0 | 37.3 | 32 | ||
| KK12 | 15104 | 29.3 | 16.0 | 17.1 | 37.6 | 31* | ||
| KL2 | 15225 | 29.6 | 16.1 | 16.8 | 37.5 | 32.9 | ||
| C.456980 | 15250 | 30.4 | 15.1 | 16.2 | 38.3 | 31.3 | ||
| C.487648 | 15249 | 30.6 | 15.1 | 16.0 | 38.2 | 31.1 | ||
| Phoenix9 | 15227 | 29.2 | 16.0 | 17.2 | 37.5 | 32 | ||
| Phoenix1 | 15227 | 29.2 | 16.1 | 17.2 | 37.4 | 33 | ||
| M.190085/3 | 15227 | 28.6 | 16.8 | 18.0 | 36.7 | 34.8 | ||
| M.190082/2 | 15228 | 28.5 | 16.8 | 18.0 | 36.7 | 34.8 | ||
| M.183792 | 15235 | 28.6 | 16.9 | 17.8 | 36.7 | 34.7 | ||
| M.183927 | 15241 | 28.3 | 17.1 | 18.0 | 36.6 | 35.1** | ||
| Subject area | Biology |
|---|---|
| More specific subject area | Phylogenetics; Genetics; Evolutionary Biology |
| Type of data | Table, text file, graph, figure |
| How data was acquired | High-throughput and Sanger DNA sequencing |
| Data format | Text file format for DNA sequence alignments and phylogenetic trees is.nex (nexus) and.tree respectively. |
| Experimental factors | Total DNA was extracted from specimens using CTAB buffer. DNA was paired-end sequenced using the high-throughput Illumina HiSeq. 2500 platform. Short-length DNA sequences were amplified via PCR and Sanger sequenced. |
| Experimental features | mtDNA genome and 45 S nuclear ribosomal DNA sequences were assembled using reference sequences. Sequences were aligned with gaps and poorly-aligned positions removed. Phylogenetic trees were constructed using Bayesian (BEAST 1.8.3) and Maximum-Likelihood methods (RAxML 8.2.8). The unrooted phylogenetic network of some alignments was investigated using SplitsTree 4. |
| Data source location | Most specimens originate from New Zealand waters, some were collected from the coasts of Australia, Japan, USA (California), and the UK. |
| Data accessibility | Interactive.nwk (Newick) tree files are provided here and with the main article |