| Literature DB >> 32095363 |
Arina L Maltseva1, Marina A Varfolomeeva1, Arseniy A Lobov1,2, Polina Tikanova1, Marina Panova1,3, Natalia A Mikhailova1,4, Andrei I Granovitch1.
Abstract
BACKGROUND: The introduction of DNA-based molecular markers made a revolution in biological systematics. However, in cases of very recent divergence events, the neutral divergence may be too slow, and the analysis of adaptive part of the genome is more informative to reconstruct the recent evolutionary history of young species. The advantage of proteomics is its ability to reflect the biochemical machinery of life. It may help both to identify rapidly evolving genes and to interpret their functions.Entities:
Keywords: Cryptic species; Ecological divergence; IndVal; Littorinidae; Outliers; Phylogenetic markers; Phylogeny; Proteomics; Taxon-specific proteomic markers; Taxonomy
Year: 2020 PMID: 32095363 PMCID: PMC7024583 DOI: 10.7717/peerj.8546
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Location of sampling sites, collection seasons, and sample composition.
| Location | Geographic coordinates | Season | Collected species |
|---|---|---|---|
| Cancale, English Channel, France | 48° 70′ N, −1° 84′ W | May, 2014 | |
| Tromsø, Barents Sea, Norway | 69° 43′ N, 18° 60′ E | May, 2014 | |
| Chupa Bay, White Sea, Russia | 66° 29′ N, 33° 68′ W | June, 2014 | |
| Sheung Pak Nai, New Territories, Hong Kong, China | 22° 27′ N, 113° 58′ E | August, 2014 | |
| Oban, Scotland, UK | 56° 27′ N, 5° 27′ W | April, 2015 | |
| Eilat, Red Sea, Israel | 29° 30′ N, 34° 54′ E | August, 2015 |
Figure 1Interspecies relations within the family Littorinidae.
(A) Dendrogram of consensus species proteomes obtained via neighbor joining based on Jaccard dissimilarities of protein occurrence frequency in samples of different species. The bootstrap support values are shown. (B) The molecular phylogeny tree obtained via Bayesian inference using concatenated partial gene sequences from 28S rRNA, 12S rRNA and cytochrome oxidase C subunit I (COI). Support values are posterior probabilities. Prior to comparison, the both trees (A) and (B) were made ultrametric using non-negative least squares. Robinson–Foulds distance between unrooted trees was RF = 2 (normalized RF = 0.143). The cophenetic correlation between trees (A) and (B) is CC = 0,801; between raw NJ and Bayesian trees is 0.798 (С) Fowlkes–Mallows index comparing dendrograms (A) and (B). Black line with dots shows the change of the compositional similarity of clusters (Bk) with the number of clusters (k). Dashed line indicates Bk values under a null hypothesis of insignificant similarity of cluster’ composition in the trees under comparison). Red line depicts threshold values for rejection of the null hypothesis. (D) Matrices of cophenetic distances for the proteomic and DNA-based trees expressed as a percentage of the total tree length. L. lit Littorina (Littorina) littorea, L. obt Littorina (Neritrema) obtusata, L. fab Littorina (Neritrema) fabalis, L. sax Littorina (Neritrema) saxatilis, L. arc Littorina (Neritrema) arcana, L. com Littorina (Neritrema) compressa, L. ard Littoraria ardouiniana, L. mel Littoraria melanostoma, E. mil Echinolittorina millegrana, E. mar Echinolittorina marisrubri.
Figure 2Dendrogram of proteome UPGMA clustering from samples of the 10 Littorinidae species.
Clustering was produced using unweighted pair group method with arithmetic mean (UPGMA) algorithm based on Jaccard dissimilarity coefficients for the data on presence/absence of proteins in the samples. Sample labels indicate species (L.arc: Littorina (Neritrema) arcana; L.comp: Littorina (Neritrema) compressa; L.sax: Littorina (Neritrema) saxatilis; L.obt: Littorina (Neritrema) obtusata; L.fab: Littorina (Neritrema) fabalis; L.lit: Littorina (Littorina) littorea; L.ard: Littoraria ardouiniana; L.mel: L. melanostoma; E.mar: Echinolittorina marisrubri; E.mil: E. millegrana), location (Ru: White Sea, Russia; Fr: English Channel, France; UK: Atlantic coast, Scotland; No: Barents Sea, Norway; Cn: East-China Sea, Hong Kong; Il: Israel), sex (f: female; m: male) and body part (so: foot + head parts; pe: penis). The approximately unbiased bootstrap support values are shown. (neighbor joining-based clustering is presented in File S2).
Composition, protein markers and supporting values of sample clusters.
Selected clusters obtained from analysis of Jaccard dissimilarities for the full set of samples (the full list is presented in Table S1). The first column indicates the composition of a cluster; the second is the number of protein markers, identifying the cluster with maximal specificity and sensitivity; the third is the threshold number of clusters when the cluster of this particular composition appears (from a minimal number of clusters to maximal); the fourth is the recovery probability of the particular composition cluster when it appears for the first time (e.g., the “saxatilis”-group cluster appeared at first during partition into 6 clusters; the recovery probability after this partition was 1; while this cluster persisted through 2 more partitions, its probability dropped to 0.99 and 0.90, respectively).
| Composition of clusters (CC) | Number of markers (NC) | NC for the CC first appearance | Recovery probability |
|---|---|---|---|
| 12 | 8 | 0.98 | |
| 24 | 8 | 0.98 | |
| 50 | 5 | 0.95 | |
| 58 | 5 | 0.94 | |
| 100 | 4 | 1 | |
| 28 | 7 | 0.92 | |
| 28 | 7 | 0.96 | |
| 2 | – | – | |
| 11 | 9 | 0.99 | |
| 74 | 7 | 0.82 | |
| 28 | 4 | 1 | |
| 15 | 9 | 0.89 | |
| 33 | 6 | 1 | |
| 12 | 6 | 0.89 | |
| 153 | 2 | 1 | |
| 17 | – | – |