| Literature DB >> 30711536 |
Christina Schilde1, Hajara M Lawal1, Koryu Kin1, Ikumi Shibano-Hayakawa2, Kei Inouye3, Pauline Schaap4.
Abstract
The Dictyostelid social amoebas are a popular model system for cell- and developmental biology and for evolution of sociality. Small subunit (SSU) ribosomal DNA-based phylogenies subdivide the known 150 species into four major and some minor groups, but lack resolution within groups, particularly group 4, and, as shown by genome-based phylogenies of 11 species, showed errors in the position of the root and nodes separating major clades. We are interested in the evolution of cell-type specialization, which particularly expanded in group 4. To construct a more robust phylogeny, we first included 7 recently sequenced genomes in the genome-based phylogeny of 47 functionally divergent proteins and next selected 6 proteins (Agl, AmdA, PurD, PurL, RpaA, SmdA) that independently or in sets of two fully reproduced the core-phylogeny. We amplified their coding regions from 34 Dictyostelium species and combined their concatenated sequences with those identified in the 18 genomes to generate a fully resolved phylogeny. The new AAPPRS based phylogeny (after the acronym of the 6 proteins) subdivides group 4 into 2 branches. These branches further resolve into 5 clades, rather than the progressively nested group 4 topology of the SSU rDNA tree, and also re-orders taxa in the other major groups. Ancestral state reconstruction of 25 phenotypic traits returned higher "goodness of fit" metrics for evolution of 19 of those traits over the AAPPRS tree, than over the SSU rDNA tree. The novel tree provides a solid framework for studying the evolution of cell-type specialization, signalling and other cellular processes in particularly group 4, which contains the model Dictyostelid D. discoideum.Entities:
Keywords: Ancestral state reconstruction; Dictyostelia; Dictyostelium caveatum; Phylogenetic marker genes; Phylogenomics; Polysphondylium multicystogenum
Mesh:
Substances:
Year: 2019 PMID: 30711536 PMCID: PMC6430600 DOI: 10.1016/j.ympev.2019.01.017
Source DB: PubMed Journal: Mol Phylogenet Evol ISSN: 1055-7903 Impact factor: 4.286
Fig. 1Phylogenies inferred from 47 and 6 proteins from sequenced genomes. A.47 proteins. Orthologous sequences of 47 proteins involved in a broad range of cellular functions (Singh et al., 2016) were aligned using Clustal Omega (Sievers and Higgins, 2014). Alignments were edited to remove non-consensual segments and gaps across multiple sequences. After concatenation, the 38,942 AA alignment was subjected to phylogenetic inference using Phylobayes MPI (Lartillot et al., 2013) with a CAT-GTR model. Two MCMC chains were run for 10,000 cycles, with trees sampled at every 10 cycles. The Bayesian consensus tree was generated with a burnin of 1000 cycles. B.6 full proteins. Alignments of agl, amdA, purD, purL, rpaA and smdA (AAPPRS) protein sequences, which individually either fully reproduced the earlier 12 species core phylogeny (Singh et al., 2016) or with a single non-consensual node, were concatenated, edited and analysed using MrBayes 3.2. The 6136 AA alignment was partitioned into its individual proteins with each partition run under its most likely amino-acid substitution model over 1 million generations. C.PCR products. Sequences of the same six proteins equivalent to the PCR products amplified from 34 test species (Fig. 2) were aligned, concatenated to a total of 2711 AA and analysed as for panel B. Posterior probabilities of the nodes are shown in all trees.
Fig. 2A 6 protein phylogeny of 52 Dictyostelium species. A. 6 proteins. Fragments of the agl, amdA, purD, purL, rpaA, smdA genes were amplified by PCR from 34 Dictyostelium species and retrieved from genome sequences of 18 other Dictyostelia and 3 outgroup amoebozoan species (see Fig. 1) The deduced amino acid sequences were aligned with ClustalOmega and the alignments were concatenated. A phylogeny was inferred using MrBayes 3.2 with the alignment partitioned into its amplified segments. Each segment was analysed over 1 million generations with its most likely amino acid substitution model and a gamma distribution of substitution rates. Taxon names follow those of the original species diagnoses. B. SSU rDNA. SSUrDNA sequences for all 55 species were retrieved from Genbank, aligned with mCoffee and subjected to Bayesian inference using a GTR model with a gamma distribution and a proportion of invariable sites over 1 million generations. Taxon names follow the recently proposed re-classification of Dictyostelids (Sheikh et al., 2018). Note that the genus Raperostelium in the SSU rDNA tree is paraphyletic in the AAPPRS tree.
Fig. 3Ancestral state reconstruction. The trait “stalk supports” was reduced to a binary expression (see Supplementary Data 6 Traits) and trait evolution was examined over both the AAPPRS and SSUrDNA tree using the “rerootingMethod” implemented in phytools (Revell, 2012) Small circles next to species names are colour coded to represent the presence (red) or absence (blue) of the trait, while the fractional colour coding in the larger circles represents the posterior probabilities of the trait at the internal nodes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Tree validity tests from ancestral state reconstruction.
| Discrete trait | AAPPRS Log likelihood | SSU_rDNA Log likelihood | Continuous trait | Mean variance ratio AAPPRS/SSU rDNA |
|---|---|---|---|---|
| Stalk support | −31.18 | Spore diameter | 0.22 | |
| Polar granules | −16.97 | Spore eccentricity | 0.21 | |
| Aggregation | −15.10 | Amoeba diameter | 0.23 | |
| Habit | −19.21 | Amoeba eccentricity | 0.13 | |
| Sorocarp branching | −29.74 | Aggregate diameter | 0.21 | |
| Lateral branches | −32.30 | Sorogen length | 0.54 | |
| Regular whorls | −9.69 | Sorus diameter | 0.26 | |
| Pointed stalk tip | −24.16 | Stalk area | 0.21 | |
| Broadened_tip | −27.59 | Stalk eccentricity | 0.39 | |
| Stalked_migration | Anterior prestalk | 0.30 | ||
| Free_migration | −19.81 | Posterior prespore | 0.19 | |
| Phototropism | Rearguard region | 0.15 | ||
| Microcyst | −18.99 |
Discrete traits were converted to binary expressions (Supplementary Data 6 Traits) and subjected to ancestral state reconstruction over the AAPPRS and SSU rDNA phylogenies (Fig. A4) using the “rerootingMethod” function implemented in phytools (Revell, 2012). The “goodness of fit” of trait evolution to the tree, as represented by the marginal log likelihood at the root is presented for each trait. The lower values (in bold) represent the best fit for each trait. For continuous traits ancestral state reconstruction was performed with “fastAnc” in phytools, which returns node estimates with variances and 95% confidence intervals (see Supplementary Data 7_AncestralStates). The variances for each trait were averaged here and the ratio of averaged variances obtained with AAPPRS tree over those obtained with the SSU rDNA tree was calculated. All variances were lower for ancestral trait estimates in the AAPPRS tree.
Performance of single protein phylogenies.
| Number of non-consensual nodes | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | |||||
| Protein | AA | Protein | AA | Protein | AA | Protein | AA | Protein | AA |
| aco1(1) | 882 | accA(3) | 2183 | alxA (2) | 699 | 5NT(3) | 516 | aclY(2) | 610 |
| 1474 | 678 | cinC (2) | 837 | acsa (2) | 653 | clcD (2) | 825 | ||
| 270990(1) | 601 | glnA3(1) | 732 | 287723(2) | 912 | argC (2) | 740 | grpA | 211 |
| 271904(1) | 604 | midA(1) | 422 | 290197(2) | 496 | cas1 (2) | 699 | scdA(3) | 666 |
| 289993(1) | 613 | pdhC(1) | 497 | glpD(2) | 878 | dcsA (3) | 975 | ||
| rpa2(2) | 1130 | 787 | glud2(3) | 1004 | 276321(2) | 506 | |||
| rpc3(2) | 567 | 1325 | hdaB(2) | 391 | dnmA (3) | 314 | |||
| pyr13(1) | 2144 | ogdH(3) | 975 | fcsA(2) | 646 | ||||
| 1460 | pfkA(2) | 765 | gcsa(3) | 602 | |||||
| rpb1(2) | 1541 | pgmB(4) | 577 | glpV(2) | 852 | ||||
| rpc1(2) | 1372 | sglA(3) | 526 | sdhA(3) | 611 | ||||
| 402 | |||||||||
| tkt2(4) | 661 | ||||||||
| xdH(3) | 1318 | ||||||||
Summary of the number of non-consensual nodes detected in 21-species phylogenies inferred from the individual proteins of the 47 protein phylogeny (see Appendix A, Fig. A5). Data in parentheses are the number of non-consensual nodes detected in the earlier 14-species phylogenies of the same proteins (Singh et al., 2016). The proteins used in the AAPPRS phylogeny are underlined. The DDB_G0 prefixes of genes with 12 character gene names are not shown.