Andrea Desiderato, Marcos Barbeitos, Clément Gilbert, Jean-Luc Da Lage.
Abstract
The subfamily GH13_1 of alpha-amylases is typical of Fungi, but it is also found in some unicellular eukaryotes (e.g., Amoebozoa, choanoflagellates) and non-bilaterian Metazoa. Since a previous study in 2007, GH13_1 amylases have been considered ancestral to the Unikonts, including animals but not Bilateria, in whose ancestor the gene was thought to have been lost. The only alpha-amylases known so far in Bilateria belong to the GH13_15 and GH13_24 subfamilies (commonly called bilaterian alpha-amylases) and were likely acquired by horizontal transfer from a proteobacterium. The taxonomic scope of eukaryotic genomes in databases has increased greatly since 2007. We surveyed GH13_1 sequences in recent data from ca. 1600 bilaterian species, 60 non-bilaterian animals, and unicellular eukaryotes. As expected, we found a number of these sequences in non-bilaterians, namely Anthozoa (Cnidaria) and sponges, confirming the previous observations, but none in jellyfishes or Ctenophora. Our main and unexpected finding is that such fungal (also called Dictyo-type) amylases were also consistently retrieved in several bilaterian phyla: hemichordates (deuterostomes), brachiopods and related phyla, some molluscs and some annelids (protostomes). We discuss evolutionary hypotheses that could explain the scattered distribution of GH13_1 across bilaterians, namely the retention of the ancestral gene in those phyla only and/or horizontal transfers from non-bilaterian donors.
Keywords: Bilateria; alpha-amylase; annelids; brachiopods; bryozoans; gene loss; glycosyl hydrolase; hemichordates; horizontal gene transfer; introns; molluscs; phoronids
Year: 2020 PMID: 31810981 PMCID: PMC7003070 DOI: 10.1534/g3.119.400826
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
GH13_1-like sequences found after BLAST searches in online databases (not comprehensive for unicellular organisms; Fungi excluded). *: sequences not characterized as protein-coding, in sequenced genomes with long contigs; (1): from short DNA sequences (except Sequence Read Archive); **: reported as GH13_1 in CAZy. Most of the SRA data are from transcriptome studies; see Tables S1 and S2.
| Phylum | Species | Database | Accession |
|---|---|---|---|
| Porifera Demospongiae Heteroscleromorpha | | GenBank proteins | XP_019851448 |
| Porifera Demospongiae Heteroscleromorpha | | | m.29963 g.29963 |
| Porifera Demospongiae Heteroscleromorpha | | GenBank TSA | GFAV01017079 |
| Porifera Demospongiae Heteroscleromorpha | | GenBank SRA | SRX470277 |
| Porifera Demospongiae Heteroscleromorpha | | | gnl\|BL_ORD_ID\|6299 |
| Cnidaria Hexacorallia Actiniaria | | GenBank TSA | GEVE01039432 |
| Cnidaria Hexacorallia Actiniaria | | GenBank TSA | GBYC01063006 |
| Cnidaria Hexacorallia Actiniaria | | | c117986_g2_i1 |
| Cnidaria Hexacorallia Actiniaria | | | c88768_g1_i1 |
| Cnidaria Hexacorallia Actiniaria | | | c66498_g1_i1 |
| Cnidaria Hexacorallia Actiniaria | | GenBank proteins | XP_020895894 |
| Cnidaria Hexacorallia Actiniaria | | GenBank proteins | XP_001629956 |
| Cnidaria Hexacorallia Actiniaria | | GenBank TSA | GGNY01117022 |
| Cnidaria Hexacorallia Actiniaria | | | C36117_g1_i2 |
| Cnidaria Hexacorallia Corallimorpharia | | | |
| Cnidaria Hexacorallia Corallimorpharia | | | |
| Cnidaria Hexacorallia Scleractinia | | GenBank proteins | XP_015760547 partial |
| Cnidaria Hexacorallia Scleractinia | | GenBank proteins | XP_029201467 |
| Cnidaria Hexacorallia Scleractinia | | | aten_0.1.m1.10359.m1 |
| Cnidaria Hexacorallia Scleractinia | | | ffun1.m4.16656.m1 |
| Cnidaria Hexacorallia Scleractinia | | | gasp1.m3.6500.m1 |
| Cnidaria Hexacorallia Scleractinia | | | TR26025\|c0_g2_i3 |
| Cnidaria Hexacorallia Scleractinia | | GenBank proteins | XP_020628431 |
| Cnidaria Hexacorallia Scleractinia | | | Sc0001227 74283-80000 |
| Cnidaria Hexacorallia Scleractinia | | GenBank genomes | XP_027058081 |
| Cnidaria Hexacorallia Scleractinia | | | plut2.m8.18618.m1 |
| Cnidaria Hexacorallia Scleractinia | | GenBank genomes | OKRP01000157 |
| Cnidaria Hexacorallia Scleractinia | | GenBank proteins | XP_022802004 |
| Cnidaria Octocorallia Pennatulacea | | GenBank SRA | SRX4364609 |
| Cnidaria Octocorallia Pennatulacea | | GenBank SRA | SRX4717871 |
| Cnidaria Octocorallia Pennatulacea | | GenBank genomes | FXAL01159338 |
| Placozoa | | GenBank proteins | XP_002114911 |
| Brachiopoda Linguliformea | | GenBank SRA | SRX731468 |
| Brachiopoda Linguliformea | | GenBank proteins | XP_013396432 |
| Brachiopoda Linguliformea | | GenBank proteins | XP_013378610 |
| Brachiopoda Craniiformea | | GenBank SRA | SRX731472 |
| Brachiopoda Rhynchonelliformea | | GenBank SRA | SRX112037 |
| Brachiopoda Rhynchonelliformea | | GenBank SRA | SRX731471 |
| Brachiopoda Rhynchonelliformea | | GenBank SRA | SRX731469 |
| Brachiopoda Rhynchonelliformea | | GenBank SRA | SRX1307070 |
| Brachiopoda Phoroniformea or Phoronida | | Marinegenomics | g9986.t1 |
| Brachiopoda Phoroniformea or Phoronida | | Marinegenomics | g16048.t1 |
| Brachiopoda Phoroniformea or Phoronida | | GenBank SRA | SRX1121914 |
| Bryozoa Flustrina | | GenBank SRA | SRX2112329 |
| Bryozoa Flustrina | | GenBank SRA | SRX6428326 |
| Bryozoa Ctenostomatida | | GenBank SRA | SRX6428327 |
| Bryozoa Cheilostomatida | | GenBank SRA | SRX1121923 |
| Hemichordata Enteropneusta | | Marinegenomics | pfl_40v0_9_20150316_1g2314.t1 |
| | | GenBank WGS | LD343027 41534-50098 |
| Hemichordata Enteropneusta | | GenBank WGS | LD343027 51007-66347 |
| Hemichordata Enteropneusta | | Marinegenomics | pfl_40v0_9_20150316_1g6997.t1 |
| | | GenBank WGS | BCFJ01022326 32811-41459 |
| Hemichordata Enteropneusta | | GenBank proteins | XP_006816582 |
| Hemichordata Enteropneusta | | GenBank proteins | XP_006816581 |
| Hemichordata Enteropneusta | | GenBank proteins | XP_006819810 |
| Hemichordata Enteropneusta | | GenBank SRA | SRX1436000 |
| Hemichordata Enteropneusta | | GenBank SRA | SRX798197 |
| Hemichordata Pterobranchia | | GenBank SRA | SRX879690 |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Apl52885 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | SRX2957288 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | SRX2753455 |
| Mollusca Gastropoda Caenogastropoda | | GenBank WGS | LFLW010536118 |
| Mollusca Gastropoda Caenogastropoda | | GenBank TSA | GELE01086894 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | SRX5277776 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | ERX3138276 |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Lny24710 |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Mco2627 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | SRX5832309 |
| Mollusca Gastropoda Caenogastropoda | | GenBank TSA | GHHQ01002371 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | SRX4378318 |
| Mollusca Gastropoda Caenogastropoda | | GenBank SRA | SRX2739536 |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Pila82769 |
| Mollusca Gastropoda Caenogastropoda | | GenBank proteins | XP_025109323 (incomplete) |
| | | AmpuBase | Pca5338 |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Pdi16479 (partial) |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Pma33988 (partial) |
| Mollusca Gastropoda Caenogastropoda | | AmpuBase | Psc4690 |
| Mollusca Gastropoda Caenogastropoda | | GenBank TSA | GDIA01047641 |
| Mollusca Gastropoda Caenogastropoda | | GenBank TSA | GGNX01073707 |
| Mollusca Gastropoda Vetigastropoda | | GenBank TSA | GFTT01038064 |
| Mollusca Gastropoda Vetigastropoda | | GenBank WGS | QXJH01001142 |
| Mollusca Gastropoda Vetigastropoda | | GenBank WGS | QGMO01000565 |
| Mollusca Gastropoda Vetigastropoda | | GenBank SRA | SRX958768 |
| Mollusca Bivalvia Mytiloida | | GenBank Assembly | MJUT01033839 |
| Mollusca Bivalvia Mytiloida | | GenBank Assembly | NFUK01006104 |
| Mollusca Bivalvia Mytiloida | | GenBank SRA | SRX1940727 |
| Mollusca Bivalvia Mytiloida | | GenBank Assembly | MJUU01021410 |
| Mollusca Bivalvia Mytiloida | | GenBank Assembly | APJB011511270 |
| Mollusca Bivalvia Mytiloida | | GenBank TSA | GHIK01025031 |
| Mollusca Bivalvia Mytiloida | | GenBank TSA | GGLA01150624 |
| Mollusca Bivalvia Mytiloida | | GenBank TSA | GFKS01035611 |
| Mollusca Bivalvia Mytiloida | | GenBank SRA | SRX2210805 |
| Mollusca Bivalvia Mytiloida | | GenBank SRA | SRX4058936 |
| Mollusca Bivalvia Pterioida | | GenBank SRA | SRX1688295 |
| Mollusca Bivalvia Pterioida | | GenBank Assembly | CM008066 |
| Mollusca Bivalvia Pterioida | | Marinegenomics | pfu_aug1.0_4142.1_01638 |
| Mollusca Bivalvia Pterioida | | GenBank TSA | GEMO01011007 |
| Mollusca Bivalvia Arcoida | | GenBank SRA | SRX323049 |
| Mollusca Bivalvia Arcoida | | GenBank TSA | GEXI01046152 |
| Mollusca Bivalvia Arcoida | | GenBank SRA | SRX1334524 |
| Mollusca Bivalvia Unionoida | | GenBank SRA | SRX1153631 |
| Annelida Oligochaeta | | GenBank SRA | SRX6596293 |
| Annelida Oligochaeta | | GenBank TSA | GBIL01075477 |
| Annelida Polychaeta | | GenBank Assembly | LQRL01141559 |
| | | | LQRL01153670 |
| | | | LQRL01157410 |
| Annelida Polychaeta | | GenBank TSA | GFPL01035490 |
| Annelida Polychaeta | | GenBank TSA | GGGS01192599 |
| Amoebozoa Mycetozoa | | GenBank proteins | XP_004351949 |
| Amoebozoa Mycetozoa | | GenBank proteins | XP_640516** |
| Amoebozoa Mycetozoa | | GenBank proteins | XP_020429468 |
| Amoebozoa Discosea | | GenBank proteins | XP_004368209 |
| Choanoflagellida Salpingoecidae | | GenBank proteins | XP_001742116 |
| Choanoflagellida Salpingoecidae | | GenBank proteins | XP_004998636 |
| Ciliata | | GenBank proteins | XP_004027176 |
| Ciliata | | GenBank proteins | AGU13046** |
| Ciliata | | GenBank proteins | AGU13047** |
| Ciliata | | GenBank proteins | XP_001462315 |
| Ciliata | | GenBank proteins | OMJ70617 |
| Ciliata | | GenBank proteins | CDW84776 |
| Ciliata | | GenBank proteins | XP_001020855** |
| Heterolobosea | | GenBank proteins | XP_002676377 |
| Apusozoa | | GenBank proteins | XP_013759080 |
| Oomycetes | | GenBank proteins | AIG56379** |
| Oomycetes | | GenBank proteins | XP_008604251 |
| Oomycetes | | GenBank proteins | AIG55673** |
Figure 1. ML tree of GH13_1 protein sequences of metazoan and non-metazoan species. The tree was rooted by placing fungi and unicellular organisms, except choanoflagellates, as outgroups. The numbers at the nodes are the aLRT supports. Dark green: hemichordates; light blue: brachiozoans; red: cnidarians; dark blue: sponges; orange: placozoans; pink: choanoflagellates; purple: amoebozoans; brown: fungi; gray: molluscs; bright green: annelids; black: other protists.
Figure 2. Intron positions compared across the sampled GH13_1 genes. The intron positions found in the studied parts of the sequences were numbered from 1 to 56. Pink: phase 0 introns; green: phase 1 introns; blue: phase 2 introns. The black horizontal bar separates bilaterians from species where GH13_1 alpha-amylases are considered native. The color code for species is the same as in Figure 1.
Figure 3. Two scenarios of HGT/gene losses of the GH13_1 genes. HGT or gene loss events were plotted on one of the proposed phylogenies of Bilateria, adapted from Plazzi ; Kocot (2016); Kocot ; Luo ; Luo ; Uribe . Fractions after the lineage names give the number of species showing GH13_1 sequences over the total number of species investigated. A: HGT hypothesis. Black diamonds represent the HGT events; crosses indicate subsequent GH13_1 loss events. B: Gene loss hypothesis. Crosses indicate GH13_1 loss events. Dashed crosses indicate lineages for which only a fraction of the available reliable genome or transcriptome data were found to contain a GH13_1 sequence. Divergence times are from Kumar .