| Literature DB >> 27855631 |
Reema Singh1, Christina Schilde1, Pauline Schaap2.
Abstract
BACKGROUND: Dictyostelia are a well-studied group of organisms with colonial multicellularity, which are members of the mostly unicellular Amoebozoa. A phylogeny based on SSU rDNA data subdivided all Dictyostelia into four major groups, but left the position of the root and of six group-intermediate taxa unresolved. Recent phylogenies inferred from 30 or 213 proteins from sequenced genomes, positioned the root between two branches, each containing two major groups, but lacked data to position the group-intermediate taxa. Since the positions of these early diverging taxa are crucial for understanding the evolution of phenotypic complexity in Dictyostelia, we sequenced six representative genomes of early diverging taxa.Entities:
Keywords: Dictyostelia; Evolution of multicellularity; Evolution of soma; Multi-locus phylogeny; Phylogenomics; Taxonomy
Mesh:
Substances:
Year: 2016 PMID: 27855631 PMCID: PMC5114724 DOI: 10.1186/s12862-016-0825-7
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Bioinformatics pipeline. Chain of procedures for protein selection, cognate gene identification, gene model prediction and phylogenetic analysis
Fig. 2Phylogenetic inference from 47 concatenated proteins. a Location of selected species in the previously inferred SSU rDNA phylogeny [10]. b–f Phylogenetic trees inferred by Bayesian inference (b–d), RAxML (e) or Phylobayes (f) from an alignment of 47 concatenated orthologous proteins that were identified in the species shown in a. Bayesian analyses were run for 100,000 generations with either a mixed amino-acid model (b,c) for the entire alignment, or with a partitioned alignment in which each protein was run under its optimal amino-acid substitution model (d). All analyses converged within 6000 generations (SD of split frequencies = 0). The RaxML analysis was run with 100 bootstrap replicates on an alignment partitioned as in d. Phylobayes MPI [22] was run over two chains under the CAT-GTR model (f). Trees were rooted using A. castellani as outgroup. The average GC content of the genomic DNAs encoding the 47 proteins is plotted onto the phylogeny in panel c. Posterior probabilities or bootstrap support for the nodes are shown. Abbreviated and full species names are colour-coded to reflect the taxon group to which the species belong
Fig. 3Phylogenetic inference from subsets. The 47 protein set was subdivided in sets of nine or ten proteins by joining the fifth rows of five staggered columns of protein identifiers. The protein alignments were concatenated per set, and each alignment was subjected to Bayesian inference with a mixed amino-acid model. Only set 5 yielded a tree topology that was different (red branch) from the 47 protein consensus topology (top left)
Fig. 4Correlations between protein alignment statistics and tree features. For trees inferred from alignments of orthologs for each of the individual 47 test proteins (Additional file 3), the number of non-consensual nodes in each tree was plotted against either the number of aligned (a) or variable (b) positions per alignment, or against the averaged posterior probabilities of all nodes in the tree (c). Averaged posterior probabilities were also plotted against the number of variable positions per alignment (d). Regression lines for all plots are shown with their equations and coefficients of determination. Correlations between the variables were determined by Spearman rank order and P-values are shown. All variables are listed in Additional File 2, sheet 4
Fig. 5Alternative tree topologies. Schematic of repositioning of tree branches in the 47 protein consensus tree to yield the most commonly encountered alternative topologies found in either the earlier SSU rDNA tree of Dictyostelia [10], or the trees inferred from single proteins (Additional file 3). All alternative trees are shown in Additional file 5
Alternative topology tests
| Alternative topologies |
| |
|---|---|---|
| Consensus | Unconstrained 47 protein phylogeny | 1.000 |
| Topology 1 |
| 1e-04 |
| Topology 2 |
| 1e-04 |
| Topology 3 |
| 6e-74 |
| Topology 4 | Root between group 1 and group 2, as in SSU rDNA tree | 5e-74 |
| Topology 5 |
| 3e-68 |
| Topology 6 |
| 2e-43 |
| Topology 7 |
| 4e-29 |
| Topology 8 |
| 5e-47 |
| Topology 9 |
| 5e-40 |
The probabilities (p-value) of the alternative tree topologies listed in Fig. 5 were investigated using the approximately unbiased (AU) test in CONSEL v0.20 [25]. Trees with constrained nodes were generated using RaxML and log likelihood values for the consensus tree and alternative trees were used as input for the AU test
Species and culture conditions
| Species | Strain/isolate | Medium | Temperature |
|---|---|---|---|
|
| AE2 | 1/5th SM + 0.5% charcoal | 21 °C |
|
| FG12A | 1/5th SM +0.5% charcoal | 25 °C |
|
| MexM19A | 1/3rd LP + 0.5% charcoal | 25 °C |
|
| OhioWILDS | 1/5th SM | 21 °C |
|
| MY1-1 | 1/5th SM | 25 °C |
|
| P6 | SM | 21 °C |
The species that were used for genome sequencing are listed with their preferred growth conditions. Full SM agar contains 10 g BactoTM peptone, 1 g yeast extract, 10 g glucose, 1 g MgSO4.7H2O, 2.2 g KH2PO4, 1.25 g Na2HPO4.2H2O and 15 g agar per liter, for 1/5th SM all quantities, except the agar and phosphates are five-fold reduced. Full LP agar contains 1 g lactose, 1 g BactoTM peptone, 2.2 g KH2PO4, 1.25 g Na2HPO4.2H2O and 15 g agar per liter. For 1/3rd LP the lactose and peptone quantities are three-fold reduced