| Literature DB >> 21697098 |
Federico G Hoffmann1, Juan C Opazo, Jay F Storz.
Abstract
If rates of postduplication gene retention are positively correlated with levels of functional constraint, then gene duplicates that have been retained in a restricted number of taxonomic lineages would be expected to exhibit relatively low levels of sequence conservation. Paradoxical patterns are presented by gene duplicates that have been retained in a small number of taxa but which are nonetheless subject to strong purifying selection relative to paralogous members of the same multigene family. This pattern suggests that such genes may have been co-opted for novel, lineage-specific functions. One possible example involves the enigmatic globin-E gene (GbE), which appears to be exclusively restricted to birds. Available data indicate that this gene is expressed exclusively in the avian eye, but its physiological function remains a mystery. In contrast to the highly restricted phyletic distribution of GbE, the overwhelming majority of jawed vertebrates (gnathostomes) possess copies of the related cytoglobin (Cygb) and myoglobin (Mb) genes. The purpose of the present study was 1) to assess the phyletic distribution of the Cygb, Mb, and GbE genes among vertebrates, 2) to elucidate the duplicative origins and evolutionary histories of these three genes, and 3) to evaluate the relative levels of functional constraint of these genes based on comparative sequence analysis. To accomplish these objectives, we conducted a combined phylogenetic and comparative genomic analysis involving taxa that represent each of the major lineages of gnathostome vertebrates. Results of synteny comparisons and phylogenetic topology tests revealed that GbE is clearly not the product of a recent, bird-specific duplication event. Instead, GbE originated via duplication of a proto-Mb gene in the stem lineage of gnathostomes. Unlike the Mb gene, which has been retained in all major gnathostome lineages other than amphibians, the GbE gene has been retained only in the lineage leading to modern birds and has been independently lost in at least four major lineages: teleost fish, amphibians, mammals, and nonavian reptiles. Despite the restricted phyletic distribution of this gene, our results indicate that GbE is one of the most highly conserved globins in the avian genome.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21697098 PMCID: PMC3156568 DOI: 10.1093/gbe/evr055
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhyletic distribution of Cygb, GbE, and Mb in the genomes of gnathostome vertebrates. Phylogenetic relationships among vertebrates were drawn following a consensus from Meyer and Zardoya (2003) and Li et al. (2007).
FStructural alignment of avian GbE sequences to Human Mb. Amino acids are colored according to the hydrophobicity table of Kyte and Doolittle (1982), as implemented in JalView version 2.6.1 (Waterhouse et al. 2009). The most hydrophobic residues are shown in red, the most hydrophilic residues are shown in blue, and intermediate residues are shown in shades of purple. The α-helical structure is shown on top of the alignments. The functionally important phenylalanine in position CD1 and the distal and proximal histidines in positions E7 and F8 are indicated by arrows.
FMaximum likelihood phylogram describing phylogenetic relationships among vertebrate-specific globins from representative taxa obtained under the LG + I + Γ model of amino acid substitution. As outgroup sequences, we included three globin sequences from amphioxus that are sister to the complete set of vertebrate-specific globins, plus Ngb sequences from eight representative vertebrate taxa. Numbers above the nodes correspond to maximum likelihood bootstrap support values, and those below the nodes correspond to Bayesian posterior probabilities.
FSchematic representation of alternative hypotheses regarding the phylogenetic position of GbE. According to the scenario shown in panel (A), GbE represents the product of a bird-specific duplication of the Cygb gene. According to the scenario shown in panel (B), GbE represents the product of a bird-specific duplication of the Mb gene. According to the scenario shown in panel (C), GbE originated via duplication of the proto-Cygb gene in the stem lineage of gnathostomes. According to the scenario shown in panel (D), GbE originated via duplication of the proto-Mb gene in the stem lineage of gnathostomes. Our phylogenetic and synteny analyses support hypothesis D.
Likelihood Scores Associated with Alternative Phylogenetic Hypotheses Regarding the Relationships of Cygb, Mb, and GbE and Results from the Corresponding Topology Tests
| Hypothesis | log-likelihood | AU test | Parametric Bootstrap Test |
| −14,989.86 | |||
| −15,000.49 | |||
| −15,044.23 | |||
| −15,064.81 |
FPatterns of conserved synteny in the chromosomal region that harbors the Cygb gene in gnathostome vertebrates. Horizontal lines denote orthologous relationships.
FPatterns of conserved synteny in the chromosomal region that harbors the GbE and Mb genes in gnathostome vertebrates. (A) Synteny comparisons of the GbE and Mb gene regions among representative amniote taxa. (B) Synteny comparisons of the Mb gene region among representative teleost fish. Horizontal lines denote orthologous relationships.
Results from Tajima's (1993) Relative Rate Tests of Amino Acid (aa) Substitution
| Unique aa Changes in | Unique aa Changes in | Unique aa Changes in | Identical Sites | Divergent Sites | ||
| Chicken | 27 | 15 | 21 | 27 | 57 | NS |
| Duck | 27 | 13 | 20 | 27 | 60 | |
| Turkey | 28 | 15 | 20 | 27 | 57 | |
| Zebra finch | 24 | 16 | 19 | 26 | 62 | NS |
NOTE.—NS, not significant.
Sites that were different in all three sequences.
Evolutionary Comparisons for Members of the Globin Gene Superfamily in the Genomes of Neognathe Birds (Top) and Boreoeutherian Mammals (Bottom)
| ω | ω | |||
| Neognathae | Chicken versus Zebra Finch | |||
| 0.05 | 6.2 | 8.4 | 0.03 | |
| 0.08 | 6.1 | 7.3 | 0.11 | |
| 0.12 | 8.5 | 11.3 | 0.15 | |
| 0.19 | 19.1 | 25.5 | 0.15 | |
| 0.15 | 5.9 | 19.0 | 0.09 | |
| ρ-Globin | 0.15 | 9.5 | 13.6 | 0.04 |
| 0.21 | 13.2 | 17.0 | 0.07 | |
| 0.06 | 5.9 | 7.5 | 0.02 | |
| 0.11 | — | — | — | |
| 0.08 | 5.6 | 9.1 | 0.08 | |
| 0.10 | — | 11.9 | 0.08 | |
| Boreoeutheria | Human versus Dog | |||
| 0.03 | 4.8 | 4.2 | 0.09 | |
| 0.15 | 20.8 | 17.0 | 0.12 | |
| 0.11 | 12.5 | 15.6 | 0.09 | |
| 0.17 | 17.7 | 12.0 | 0.08 | |
| 0.17 | 13.7 | 12.4 | 0.11 | |
| 0.27 | 19.1 | 13.6 | 0.16 | |
| 0.10 | 16.0 | 15.6 | 0.12 | |
| 0.05 | 5.9 | 4.6 | 0.04 | |
NOTE.—aa, amino acid.
ω (=dN/dS) estimates from codeml.
Calculated in MEGA, excluding sites with alignment gaps in a pairwise manner.
ω (=dN/dS) estimates from yn00.
Neognathae is the clade that includes chicken, duck, turkey, and zebra finch (Supplementary data file 2, Supplementary Material online).
Boreoeutheria is the clade that unites Euarchontoglires and Laurasiatheria. We included sequences from mega- and microchiropteran bats, cow, dog, guinea pig, human, macaque, marmoset, mouse, pika, and rabbit (Supplementary data file 3, Supplementary Material online).
In cases where only two avian sequences were available for comparison, codeml analyses are not reported.
FA model for the duplicative origins and evolutionary history of the Cygb, GbE, and Mb genes in gnathostome vertebrates. Progenitors of the Cygb and Mb gene lineages were produced by a whole-genome duplication event in the stem lineage of vertebrates. Subsequently, the GbE and Mb genes originated via tandem duplication of the proto-Mb gene in the common ancestor of gnathostomes (prior to the split between cartilaginous fish and the common ancestor of teleost fish and tetrapods). One or more copies of the Cygb gene have been retained in all gnathostome lineages, and a single copy of Mb has been retained in all major lineages other than the amphibians, whereas GbE has only been retained in the lineage leading to modern birds. The loss of the GbE gene in teleost fish probably occurred prior to the fish-specific genome duplication, which may have given rise to the Cygb-1 and Cygb-2 paralogs. Estimated divergence dates are taken from Hedges (2009) and Shedlock and Edwards (2009).