| Literature DB >> 25935411 |
Miyuki Noro1, Fumiaki Sugahara2,3, Shigehiro Kuraku4.
Abstract
BACKGROUND: Vertebrate Emx genes, retained as multiple copies, are expressed in a nested pattern in the early embryonic forebrain and required for its regionalization. This pattern seems to have originated in a vertebrate common ancestor; however, a previous analysis, reporting two lamprey Emx genes, claimed independent Emx gene duplications in both cyclostome (extant jawless fish) and gnathostome (jawed vertebrate) lineages after their divergence. This scenario is neither parsimonious nor consistent with the hypothesis that genome expansion occurred before the cyclostome-gnathostome split, which is supported by recent genome-wide analyses.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25935411 PMCID: PMC4464114 DOI: 10.1186/s12862-015-0351-z
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Alternative scenarios for timing of Emx gene duplication. (a) Scenario supporting the common Emx duplication before the cyclostome-gnathostome split. In this scenario, the establishment of the nested Emx expression could have occurred once before the cyclostome-gnathostome split just after the gene duplication. (b) Scenario supporting independent Emx duplications in both of the lamprey and gnathostome lineages. In this scenario, the nested Emx gene expression (see Background) is assumed to have been established independently in the two lineages, as a result of convergence. (c) Nested expression patterns of Emx genes in the embryonic brains of sea lamprey and mouse. The ranges of the gene expressions including the pallium (pal) in the telencephalon (tel) along the anteroposterior axis are indicated with the lengths of the thickened vertical bars.
Figure 2Deduced amino acid sequences of the cyclostome Emx genes. The putative full-length amino acid sequences deduced from the nucleotide cDNA sequences are shown for EmxA and -B of the sea lamprey (P. marinus), Japanese lamprey (L. camtschaticum) and inshore hagfish (E. burgeri). Homeodomains are indicated with bold letters. HPAA tracts of size 10 or larger are indicated with gray background.
ML analysis of vertebrate gene phylogeny
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
|
| ||||
| 1 | (((((g1,g3),g2),cB),cA),outgroup) | -1909.94 | ML | 1.00 | 1.00 |
| 2 | (((((g1,g3),cB),g2),cA),outgroup) | -1910.93 | 0.99 ± 1.84 | 0.27 | 0.82 |
| 3 | ((((g1,g3),(g2,cB)),cA),outgroup) | -1910.93 | 0.99 ± 1.84 | 0.26 | 0.82 |
| 4 | (((((g1,g3),g2),cA),cB),outgroup) | -1911.62 | 1.68 ± 2.27 | 0.22 | 0.74 |
| 5 | ((((g1,g3),g2),(cA,cB)),outgroup) | -1911.62 | 1.68 ± 2.27 | 0.20 | 0.74 |
| 6 | ((((g1,g3),(cA,cB)),g2),outgroup) | -1912.39 | 2.45 ± 2.56 | 0.17 | 0.63 |
| 7 | (((g1,g3),((cA,cB),g2)),outgroup) | -1912.39 | 2.46 ± 2.55 | 0.17 | 0.63 |
| 8 | ((((g1,g3),(g2,cB)),cA),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.17 | 0.60 |
| 9 | ((((g1,g3),cB),(g2,cA)),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.16 | 0.60 |
| 10 | (((((g1,g3),cA),g2),cB),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.17 | 0.60 |
| 11 | ((((g1,g3),cA),(g2,cB)),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.17 | 0.60 |
| 12 | (((((g1,g3),cB),cA),g2),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.19 | 0.60 |
| 13 | (((g1,g3),((g2,cB),cA)),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.18 | 0.60 |
| 14 | (((g1,g3),((g2,cA),cB)),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.18 | 0.60 |
| 15 | (((((g1,g3),cA),cB),g2),outgroup) | -1912.61 | 2.68 ± 2.91 | 0.19 | 0.60 |
Abbreviation: SE standard error, 1sKH One-sided Kishino-Hasegawa test, SH Shimodaira-Hasegawa test, g1 gnathostome Emx1, g2 gnathostome Emx2, g3 gnathostome Emx3, cA cyclostome EmxA, cB cyclostome EmxB.
Figure 3Maximum-likelihood tree for vertebrate Emx genes with either cyclostome species. These ML trees were reconstructed using 132 amino acid sites in the multiple alignment in Additional file 7 under the JTT model [49] with among-site rate heterogeneity taken into account by assuming the gamma distribution [50] with four rate categories (JTT + Γ4 model). To evaluate the effect of the choice of cyclostome sequences, we performed the analysis including sea lamprey (a) or hagfish (b), with a shape parameter of the gamma distribution α of 0.37 and 0.38, respectively. The numbers at nodes indicate bootstrap probabilities with 100 replicates for the ML and NJ methods, in order.
Figure 4Evidence for orthology between cyclostome EmxB and gnathostome Emx2. (a) Synteny conservation between the genomic regions containing Emx2 of gnathostomes and EmxB of sea lamprey. Assignment of gene names in this figure was based on orthology information in Ensembl except the one in a parenthesis (“Slc18a2”) which was assigned by using BLASTP. (b) Molecular phylogeny of vertebrates Pdzd8 genes, included in the Emx-containing genomic regions in (a). This ML tree was inferred using 600 amino acid sites in the multiple alignment in Additional file 10, assuming the JTT model with amino acid frequencies taken into account (JTT + Γ4 + F model) (shape parameter α = 1.17). The numbers at nodes indicate bootstrap probabilities with 100 replicates for the ML and NJ methods, in order.
Probabilistic count of gene duplications based on the ML method
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Sea lamprey and hagfish | (((((g3,g1),g2),cB),cA),outgroup); | 1.36 | 1.32 | 0.09 | 0.18 |
| Sea lamprey | ((((g3,g1),g2),(cB,cA)),outgroup); | 1.21 | 0.93 | 0.63 | 0.21 |
| Hagfish | ((((cB,g2),(g3,g1)),cA),outgroup); | 2.02 | 0.56 | 0.02 | 0.34 |
Abbreviation: g1 gnathostome Emx1, g2 gnathostome Emx2, g3 gnathostome Emx3, cA cyclostome EmxA, cB cyclostome EmxB. See Kuraku et al. [11] for details of N bef, N aft, N cyc, and N unk.
According to the results of ProtTest3, JTT+I+Γ4 model was employed for the analysis with both sea lamprey and hagfish, while JTT+Γ4 model was employed for the analyses with either of sea lamprey and hagfish.
Figure 5Examination on the effect of HPAA tracts in lamprey Emx sequences. (a) Schematics of lamprey Emx sequences used in this analysis. The A-tract (A) and Q-tract (Q) correspond to those indicated with gray background in Figure 2. The two intact or artificially modified (ΔHPAA) lamprey Emx sequences were aligned with their homologs by using ClustalW, MAFFT, or T-Coffee. The resultant multiple alignments were passed on to selection of amino acid sites using TrimAl with different settings (see Methods). (b) Comparison of degrees of the support for the lamprey EmxA-EmxB grouping. RELL bootstrap probabilities for this relationship were computed in a ML analysis by inputting datasets prepared with the variable stringencies in site trimming with TrimAl (no gap allowed and st of 0.00001 to 0.001), as well as variable choices of alignment programs and the presence or absence of the HPAA tracts (see Methods).
Figure 6Genome-wide frequencies of homopolymeric amino acid tracts in the sea lamprey, human, and zebrafish. Peptide sequences with homopolymeric amino acid (HPAA) tracts (those with a stretch of no less than six consecutive residues) were extracted from public database and compared among the three vertebrates (see Methods for details). This figure shows the frequencies of HPAA tract-containing peptides. The results with the different criteria (stretches of no less than eight and ten consecutive residues) are shown in Additional file 12.