| Literature DB >> 35497189 |
Zeyuan Chen1,2, Michael Schrödl1,2,3.
Abstract
The Gastropoda contains 80% of existing mollusks and is the most diverse animal class second only to the Insecta. However, the deep phylogeny of gastropods has been controversial for a long time. Especially the position of Patellogastropoda is a major uncertainty. Morphology and some mitochondria studies concluded that Patellogastropoda is likely to be sister to all other gastropods (Orthogastropoda hypothesis), while transcriptomic and other mitogenomic studies indicated that Patellogastropoda and Vetigastropoda are sister taxa (Psilogastropoda). With the release of high-quality genomes, orthologous genes can be better identified and serve as powerful candidates for phylogenetic analysis. The question is, given the current limitations on the taxon sampling side, how many markers are needed to provide robust results. Here, we identified single-copy orthologous genes (SOGs) from 14 gastropods species with whole genomes available which cover five main gastropod subclasses. We generated different datasets from 395 to 1610 SOGs by allowing species missing in different levels. We constructed gene trees of each SOG, and inferred species trees from different collections of gene trees. We found as the number of SOGs increased, the inferred topology changed from Patellogastropoda being sister to all other gastropods to Patellogastropoda being sister to Vetigastropoda + Neomphalina (Psilogastropoda s.l.), with considerable support. Our study thus rejects the Orthogastropoda concept showing that the selection of the representative species and use of sufficient informative sites greatly influence the analysis of deep gastropod phylogeny.Entities:
Keywords: Conflicting topologies; Gene tree; Mollusca; Orthogastropoda; Patellogastropoda; Phylogeny; Whole genomes
Year: 2022 PMID: 35497189 PMCID: PMC9048639 DOI: 10.7717/peerj.13285
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 3.061
Summary of published gastropods genome and deep gastropods phylogeny based on corresponding genome studies using single-copy orthologous genes (SOGs).
| Sub-class | Species | GenBank assembly accession | Assembly level | Topology | Used of SOGs | References |
|---|---|---|---|---|---|---|
| H |
|
| Chromosome | P,((H,C),(V,N)) | 223 |
|
| H |
|
| Chromosome | P,(H,C) | 229 |
|
| H |
| 100647 (Gigadb) | Chromosome | (P,H) | 675 |
|
| H |
|
| Contig | – | – | BANG, 2016, Unpublished data |
| H |
|
| Scaffold | – | – | Broad Institute, 2013, Unpublished data |
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| H |
|
| Scaffold | – | – |
|
| C |
|
| Chromosome | – | – |
|
| C |
|
| Chromosome | (P,V),(H,C) | 1,357* |
|
| C |
|
| Contig | – | – | HKUST, 2021, Unpublished data |
| C |
|
| Contig | – | – | Ewha Womans University, 2021, Unpublished data |
| C |
|
| Contig | – | – | University of Konstanz, 2016, Unpublished data |
| C |
|
| Contig | – | – |
|
| C |
|
| Contig | – | – | BGI, 2021, Unpublished data |
| C |
|
| Scaffold | – | – | IRIDION GENOMES, 2020, Unpublished data |
| C |
|
| Scaffold | – | – | Fisheries and Technical, Economic College, 2020, Unpublished data |
| C |
|
| Chromosome | – | – |
|
| C |
|
| Scaffold | – | – |
|
| C |
|
| Scaffold | – | – |
|
| C |
|
| Scaffold | – | – |
|
| C |
|
| Scaffold | – | – |
|
| V |
|
| Chromosome | – | – | Wellcome Sanger Institute, 2021, Unpublished data |
| V |
|
| Scaffold | – | – |
|
| V |
|
| Scaffold | – | – |
|
| V |
|
| Scaffold | – | – |
|
| N |
|
| Chromosome | (P,(V,N)),(H,C) | 529 |
|
| N |
|
| Chromosome | (P,(V,N)),(H,C) | 1,375* |
|
| N |
|
| Scaffold | – | – |
|
| P |
|
| Scaffold | – | – | DOE Joint Genome Institute, 2012, Unpublished data |
Note:
Subclass H, C, V, N, P represents Heterobranchia, Caenogastropoda, Vetigastropoda, Neomphalina, and Patellogastropoda respectively. * Represents SOGs that can be found in at least 60% of taxa. Data without citations have been replaced by the data submitter and data publication date in NCBI. The species in bold are the species used to infer the phylogeny in this article.
Figure 1The four data sets used to infer gastropod relationships.
The first data set includes single-copy orthologous genes (SOGs) identified in all 16 species per SOG and results in 395 SOGs. 933, 1,331, 1,610 SOGs were identified while allowed 1–3 species missing per SOG respectively. The blue line represents SOGs identified by each species, and the black vertical line represents the missing of that gene in the corresponding species. The dots below represent the number of SOGs occupied by corresponding species under different degrees of species missing.
Figure 2The gastropod relationships constructed with different data sets and the corresponding posterior probabilities of different topology.
H: Heterobranchia, C: Caenogastropoda, V: Vetigastropoda, N: Neomphalina, P: Patellogastropoda, B: Bivalve.
Figure 3Gastropod phylogeny inferred from 847 single-copy orthologous genes (SOGs) identified from 15 species except C. consors.
Bootstrap support percentages are indicated for each internal branch. Branch length is marked with blue numbers.