| Literature DB >> 20214780 |
Achim Meyer1, Christiane Todt, Nina T Mikkelsen, Bernhard Lieb.
Abstract
BACKGROUND: The 18S rRNA gene is one of the most important molecular markers, used in diverse applications such as molecular phylogenetic analyses and biodiversity screening. The Mollusca is the second largest phylum within the animal kingdom and mollusks show an outstanding high diversity in body plans and ecological adaptations. Although an enormous amount of 18S data is available for higher mollusks, data on some early branching lineages are still limited. Despite of some partial success in obtaining these data from Solenogastres, by some regarded to be the most "basal" mollusks, this taxon still remained problematic due to contamination with food organisms and general amplification difficulties.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20214780 PMCID: PMC2841657 DOI: 10.1186/1471-2148-10-70
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Helical regions correspond with missing data or possible foreign DNA. 1a: Possible chimerical patterns in previously published sequences (yellow bars) match with regions of double stranded DNA forming helical regions (blue bars) in newly generated sequences. Epimenia sp. (Ep_sp) and Helicoradomenia sp. (He_sp) sequences have been aligned to the nucleotide positions (represented by the scale bar) of the sequences from this study, Wirenia argentea (Wi_ar) and Simrothiella margaritacea (Si_ma). The simplified schematic alignments (green bars) show high similarity (>90%) whereas other regions (yellow bars) possess lower similarity and point to contamination issues. BLASTn searches of yellow domains indicate high similarity with polychaetes (PI to III) or cnidarians (C), or resulted in no significant hit, indicated as Helicoradomenia only (H). Detailed BLASTn results are given in additional file 1: Blast result table. Below the scale bar, the double stranded helical regions of 18S sequences are indicated containing 66-82% GC islands. 1b and c: Close-up views of single stranded regions of helix 1 (1b) and helix 2 (1c) of S. margaritacea 18S. These stems have strong adhesive forces probably hampering PCR. Secondary structures were calculated using mfold, applying 72°C. G-C hydrogen bonds are indicated in red.
Figure 2Secondary structures of double stranded 18S sequences. Stitch Profiles calculated at 90.7°C showing melted and double stranded regions for the three new solenogaster 18S sequences and the caudofoveate Scutopus ventrolineatus. One short helical region in the S. ventrolineatus sequence of approximately 100 bp length and 63% GC content is at the same position as in the solenogaster sequences.
Sequence composition within selected taxa (no = number of specimens) characterized using the disparity index (I), compositional distance (Dc), percent GC content, gene length in base pairs (bp) and average branchlengths (brls) with standard deviation (σ).
| Taxa | No | Dc | GC | bp | Brls*10 | σ*10 | |
|---|---|---|---|---|---|---|---|
| Outgroup | 41 | 0.42 | 0.52 | 48.5 | 1682.9 | 0.60 | 0.41 |
| 32 | 0.08 | 0.11 | 49.9 | 1739.8 | 0.46 | 0.08 | |
| 1 | n/c | n/c | 50.4 | 1767.0 | 0.38 | n/c | |
| 22 | 0.14 | 0.21 | 53.5 | 1835.8 | 1.47 | 0.19 | |
| 3 | 0.00 | 0.01 | 55.8 | 1785.0 | 1.82 | 0.08 | |
| 3 | 0.18 | 0.23 | 63.9 | 2100.0 | 5.27 | 0.24 | |
| Coleoidea | 74 | 0.19 | 0.28 | 60.3 | 2275.3 | 7.48 | 1.37 |
| Nautiloidea | 3 | 0.31 | 0.34 | 52.9 | 2360.0 | 5.40 | 0.35 |
| Vetigastropoda | 31 | 0.03 | 0.08 | 51.2 | 1784.9 | 2.10 | 0.12 |
| Neritimorpha | 5 | 0.00 | 0.02 | 50.6 | 1736.0 | 0.43 | 0.05 |
| Neomphalida | 3 | 0.09 | 0.14 | 51.3 | 1790.7 | 1.48 | 0.02 |
| Cocculinida | 1 | n/c | n/c | 50.1 | 1749.0 | 4.11 | n/c |
| Patellogastropoda | 9 | 0.42 | 0.50 | 57.2 | 2011.9 | 5.99 | 0.05 |
| Caenogastropda | 116 | 0.01 | 0.02 | 50.7 | 1745.7 | 0.86 | 0.05 |
| Heterobranchia | 183 | 1.86 | 1.96 | 55.3 | 1796.5 | 2.18 | 1.54 |
| Protobranchia | 18 | 0.12 | 0.17 | 50.6 | 1751.6 | 0.46 | 0.24 |
| Paleoheterodonta | 13 | 0.02 | 0.05 | 49.6 | 1750.5 | 0.47 | 0.08 |
| Pteriomorpha | 111 | 0.06 | 0.10 | 49.6 | 1742.1 | 0.48 | 0.38 |
| Heterodonta (ex. Anom.) | 160 | 0.25 | 0.34 | 52.2 | 1760.2 | 2.01 | 0.48 |
| Anomalodesmata | 41 | 0.11 | 0.20 | 51.7 | 1874.7 | 2.86 | 0.27 |
* Brls and σ values were multiplied with 10 for better readability. We used all sequences available for the Mollusca, Sipuncula and Kamptozoa with a length over 1,600 bp, supplemented by some other Lophotrochozoa and Priapulus caudatus.
Sequence composition of Mollusca within selected taxa measured using the disparity index (I), compositional distance (CD), GC content [%], gene length in base pairs (bp) and average branch lengths (brls) and their standard deviation (σ). Brls and σ values were multiplied with 10 for better readability. We used all sequences available for the Mollusca, Sipuncula and Kamptozoa above 1,600 bp supplemented by several other Lophotrochozoa and Priapulus caudatus.
Results from the disparity analyses on the 123 taxon alignment with the disparity index between taxa depicted in the lower triangle and the mcmc-test results for the homogeneity of substitution pattern in the upper triangle (% of taxa).
| Taxa (No) | Anom | Caeno | Caudo | Ceph | Cocc | Hetb | Mono | Neom | Nerit | Paleo | Pate | Polyp | Protob | Pterio | Scaph | Solen | Hetd | Vetig |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anomalodesmata (7) | 0 | 100 | 85 | 0 | 39 | 0 | 0 | 0 | 5 | 90 | 0 | 0 | 0 | 46 | 37 | 0 | ||
| Caenogastropoda (7) | 0.029 | 100 | 98 | 0 | 56 | 0 | 0 | 0 | 0 | 100 | 0 | 21 | 0 | 83 | 55 | 0 | ||
| Caudofoveata (2) | 1.303 | 1.626 | 19 | 100 | 50 | 100 | 100 | 100 | 100 | 17 | 100 | 100 | 100 | 50 | 32 | 100 | ||
| Cephalopoda (16) | 3.350 | 4.198 | 0.764 | 81 | 85 | 88 | 85 | 90 | 94 | 29 | 95 | 85 | 91 | 29 | 31 | 78 | 90 | |
| Cocculinida (1) | 0.004 | 0.000 | 2.029 | 3.793 | 40 | 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 | 40 | 36 | 0 | ||
| Heterobranchia (10) | 0.898 | 1.189 | 0.719 | 2.153 | 1.155 | 50 | 43 | 58 | 73 | 43 | 70 | 57 | 62 | 54 | 90 | 47 | 44 | |
| Monoplacophora (1) | 0.003 | 0.001 | 1.685 | 4.100 | 0.000 | 1.083 | 0 | 0 | 0 | 100 | 0 | 22 | 0 | 80 | 64 | 0 | ||
| Neomphalida (3) | 0.006 | 0.121 | 1.122 | 3.182 | 0.042 | 0.886 | 0.045 | 0 | 56 | 78 | 52 | 11 | 17 | 27 | 21 | 0 | ||
| Neritimorpha (5) | 0.008 | 0.001 | 1.767 | 4.451 | 0.000 | 1.228 | 0.000 | 0.095 | 7 | 93 | 0 | 24 | 3 | 88 | 64 | 0 | ||
| Paleoheterodonta (3) | 0.075 | 0.043 | 2.412 | 5.173 | 0.007 | 1.629 | 0.054 | 0.270 | 0.047 | 100 | 0 | 33 | 0 | 100 | 81 | 0 | ||
| Patellogastropoda (3) | 2.152 | 2.488 | 0.581 | 0.586 | 2.551 | 1.041 | 2.382 | 1.808 | 2.245 | 3.077 | 100 | 85 | 100 | 40 | 44 | 69 | 100 | |
| Polyplacophora (7) | 0.114 | 0.015 | 2.430 | 5.259 | 0.007 | 1.579 | 0.030 | 0.311 | 0.035 | 0.025 | 2.974 | 30 | 0 | 100 | 84 | 0 | ||
| Protobranchia (9) | 0.049 | 0.138 | 1.419 | 3.690 | 0.091 | 1.035 | 0.102 | 0.088 | 0.129 | 0.252 | 1.972 | 0.275 | 28 | 53 | 63 | 2 | ||
| Pteriomorpha (6) | 0.056 | 0.014 | 2.112 | 4.483 | 0.000 | 1.383 | 0.028 | 0.200 | 0.021 | 0.010 | 2.560 | 0.020 | 0.207 | 90 | 80 | 0 | ||
| Scaphopoda (5) | 0.397 | 0.727 | 0.478 | 1.869 | 0.546 | 0.590 | 0.625 | 0.260 | 0.734 | 1.064 | 0.755 | 1.123 | 0.489 | 0.916 | 19 | 42 | ||
| Heterodonta s.A. (14) | 0.339 | 0.503 | 0.507 | 2.314 | 0.532 | 0.673 | 0.487 | 0.224 | 0.520 | 0.906 | 1.365 | 0.882 | 0.430 | 0.741 | 0.208 | 56 | ||
| Vetigastropoda (9) | 0.022 | 0.048 | 1.219 | 3.578 | 0.039 | 0.970 | 0.026 | 0.030 | 0.039 | 0.144 | 2.195 | 0.131 | 0.098 | 0.083 | 0.376 | 0.438 |
* We considered homogeneity as significantly rejected with P- values ≤ 0.01. The pairwise deletion option was used to compare all possible species pairs (100% = all pairwise comparisons between the species from both taxa rejected homogeneity). Anomalodesmata were treated separately and thus are excluded when showing results for the remaining Heterodonta.
Figure 3Consensus tree from the bayesian analysis using a time heterogeneous model in phase 2.0. * = indicates full support from the posterior probability (pp: 1.00). The Mollusca (pp: 0.75) and all mollusk classes (pp: 1.00) except Bivalvia were recovered. Solenogastres and Cephalopoda were assigned to different substitution models within this run. The branching pattern assembling taxa with similar sequence composition at node one to four is very robust in all analyses (see also table 2). Inserted table: ML bootstrap support (BS) is given for the nodes 1-4 using three suggested doublet models (S6A, S7B, S16). We used two different coding schemes for unpaired sites to infer the possible influence of base composition to the results. BS values are depicted in the order NT/RY: original nucleotide (NT) coded loops in followed by the results providing a recoded (G, A: R; T, C: Y) loop region.
Figure 4ML analyses with a modified taxon sampling. A: The heterobranchs are here represented by representatives of the fast evolving Aeolidina, only. All other taxa are identical to the sampling shown in figure 3, but we excluded also Patellogastropoda. B: Same alignment as used in the inference for figure 3 but excluding all other gastropods except Patellogastropoda. Both analyses were conducted on ten different starting trees with a partitioned GTR + and S16 substitution model and enabling the bootstopping criterium in RAxML (-# autoFC).