| Literature DB >> 20573239 |
Kiwoong Nam1, Carina Mugal, Benoit Nabholz, Holger Schielzeth, Jochen B W Wolf, Niclas Backström, Axel Künstner, Christopher N Balakrishnan, Andreas Heger, Chris P Ponting, David F Clayton, Hans Ellegren.
Abstract
BACKGROUND: Obtaining a draft genome sequence of the zebra finch (Taeniopygia guttata), the second bird genome to be sequenced, provides the necessary resource for whole-genome comparative analysis of gene sequence evolution in a non-mammalian vertebrate lineage. To analyze basic molecular evolutionary processes during avian evolution, and to contrast these with the situation in mammals, we aligned the protein-coding sequences of 8,384 1:1 orthologs of chicken, zebra finch, a lizard and three mammalian species.Entities:
Mesh:
Year: 2010 PMID: 20573239 PMCID: PMC2911116 DOI: 10.1186/gb-2010-11-6-r68
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary statistics of the overall rate of non-synonymous (d) and synonymous (d) substitution, and their ratio (ω) in avian lineages
| Pairwise chicken-zebra finch | Zebra finch | Chicken | Ancestral bird lineage | |
|---|---|---|---|---|
| Overall | 0.0635 | 0.0283 | 0.0239 | 0.0288 |
| (0.0517-0.0777) | (0.0225-0.0350) | (0.0185-0.0316) | (0.0241-0.0345) | |
| Overall | 0.4184 | 0.2133 | 0.1973 | 0.2600 |
| (0.3868-0.4584) | (0.1929-0.2384) | (0.1810-0.2154) | (0.2361-0.2834) | |
| Overall ω | 0.1517 | 0.1326 | 0.1208 | 0.1107 |
| (0.1270-0.1788) | (0.1080-0.1601) | (0.0973-0.1527) | (0.0942-0.1295) |
95% confidence intervals based on resampling are given in parentheses.
Correlations of divergence at fourfold degenerate sites between avian lineages in 1-Mb windows
| Without controlling for GC | Controlling for GC | ||||||
|---|---|---|---|---|---|---|---|
|
| d.f. |
|
|
|
|
| |
| Windows based on zebra finch genome | |||||||
| Zebra finch/chicken | 0.366 | 441 | 1.89 × 10-15 | 0.134 | 0.156 | 0.001 | 0.024 |
| Zebra finch/ancestral | -0.048 | 441 | 0.309 | 0.002 | -0.146 | 0.002 | 0.021 |
| Chicken/ancestral | 0.074 | 441 | 0.119 | 0.005 | -0.046 | 0.331 | 0.002 |
| Windows based on the chicken genome | |||||||
| Chicken/zebra finch | 0.778 | 438 | 3.71 × 10-16 | 0.141 | 0.138 | 0.004 | 0.019 |
| Chicken/ancestral | 0.073 | 438 | 0.017 | 0.013 | -0.008 | 0.868 | 0.000 |
| Zebra finch/ancestral | -0.064 | 438 | 0.180 | 0.004 | -0.161 | 0.001 | 0.026 |
d.f., degrees of freedom.
The number of REB, MREB and PS genes in different avian lineages
| Ancestral lineage | Chicken lineage | Zebra finch lineage | |
|---|---|---|---|
| Rapidly evolving bird (REB) genes | 419 | 1,148 | 1,202 |
| More rapidly evolving genes in birds (MREB) than in other amniotes | 103 | 432 | 1,154 |
| Positively selected (PS) bird genes | 259 | 883 | 936 |
Over-represented Gene Ontology terms in REB, MREB and PS genes in avian lineages
| Ancestral bird lineage | Chicken lineage | Zebra finch lineage | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene Ontologya | N1b | N2c | Excess |
| N1b | N2c | Excess |
| N1b | N2c | Excess |
|
| Rapidly evolving in birds (REB) | ||||||||||||
| Biological adhesion (B 2) | 17 | 136 | 2.67 | 0.013 | ||||||||
| Cell adhesion (B 3) | 17 | 135 | 2.69 | 0.013 | ||||||||
| Cytokine secretion (B 7) | 4 | 5 | 17.06 | 0.013 | ||||||||
| Telomere organization (B 5) | 5 | 5 | 7.81 | 0.024 | ||||||||
| Telomere maintenance (B 6) | 5 | 5 | 7.81 | 0.024 | ||||||||
| Sterol transport (B 5) | 6 | 7 | 6.69 | 0.024 | ||||||||
| Cholesterol transport (B 6) | 6 | 7 | 6.69 | 0.024 | ||||||||
| More rapidly evolving in birds (MREB) than in other amniotes | ||||||||||||
| Biological adhesion (B 2) | 12 | 136 | 5.82 | 0.0002 | ||||||||
| Cell adhesion (B 3) | 12 | 135 | 5.86 | 0.0002 | ||||||||
| Blood vessel development/maturation (B5) | 2 | 2 | 65.94 | 0.061 | ||||||||
| Synapse organization and biogenesis (B 5) | 3 | 12 | 16.49 | 0.088 | ||||||||
| Integrin-mediated signaling pathway (B 6) | 3 | 11 | 17.98 | 0.088 | ||||||||
| Proteinaceous extracellular matrix (C 3) | ||||||||||||
| Cytoskeletal part (C 5) | 37 | 124 | 1.92 | 0.040 | ||||||||
| Microtubule cytoskeleton (C 7) | 27 | 83 | 2.09 | 0.040 | ||||||||
| Positively selected (PS) in birds | ||||||||||||
| Biological adhesion (B 2) | 16 | 148 | 3.27 | 0.016 | ||||||||
| Cell adhesion (B 3) | 16 | 147 | 3.29 | 0.016 | ||||||||
| Cell-cell adhesion (B 4) | 9 | 57 | 4.78 | 0.035 | ||||||||
| Homophilic cell adhesion (B 5) | 5 | 16 | 9.45 | 0.035 | ||||||||
| Calcium ion binding (M 5) | 14 | 154 | 2.74 | 0.035 | ||||||||
| Anion transmembrane transport activity (M 6) | 16 | 48 | 3.00 | 0.006 | ||||||||
Terms with a false discovery rate (FDR) of adjusted P < 0.1 are shown. Excess is the fold enrichment for significant Gene Ontology terms. aB is biological process, M is molecular function and C is cellular component. The numbers indicate hierarchical level. bNumber of genes in test sample (REB, MREB and PS, respectively). cNumber of genes in reference sample (1:1 orthologs found in the respective lineage).
OMIM search for genes implicated in neurological processes and the number of these identified as evolving under positive selection in the chicken and zebra finch lineages
| Search term* | NOMIM | Nalign | PSchicken | PSzebra |
|
|---|---|---|---|---|---|
| Learning | 159 | 74 | 5 | 10 | 0.050 |
| Neurogenesis | 472 | 211 | 15 | 27 | 0.017 |
| Neurodegeneration | 246 | 107 | 8 | 16 | 0.025 |
*See Materials and methods. 'NOMIM' is the number of human genes identified in OMIM, 'Nalign' is the number NOMIM genes for which we had data from multispecies alignments. 'PSchicken' and 'PSzebra' are the number of unique positively selected genes found in the chicken and zebra finch lineages, respectively. P is the significance level in Fisher's exact test comparing the incidence of positively selected genes in chicken and zebra finch.
Genes implicated in neurobehavioral evolution by converging lines of evidence
| Ensembl ID | Locus | Gene |
|---|---|---|
| Evolving rapidly in the primate lineage [ | ||
| ENSTGUG00000000255 | D(2) dopamine receptor | |
| ENSTGUG00000004249 | Abnormal spindle-like microcephaly-associated protein | |
| ENSTGUG00000004747 | Glutamate [NMDA] receptor subunit epsilon-1 precursor | |
| ENSTGUG00000007079 | LIM/homeobox protein Lhx2 | |
| Differentially expressed in zebra finch song control system [ | ||
| ENSTGUG00000000694 | G protein-coupled receptor 98 precursor | |
| ENSTGUG00000002176 | Mcf2 transforming sequence-like | |
| ENSTGUG00000004464 | Neurofilament triplet L protein | |
| ENSTGUG00000005484 | Glutamate receptor, ionotropic AMPA 2 | |
| ENSTGUG00000006839 | Voltage-dependent L-type calcium channel subunit alpha-1D | |
| ENSTGUG00000007224 | Protein tyrosine phosphatase receptor type F | |
| ENSTGUG00000007343 | Retinoic acid-induced protein 1 | |
| ENSTGUG00000010757 | Glutamate receptor, metabotropic 1 | |
| ENSTGUG00000015209 | Synaptonemal complex protein 1 |
Neurological genes under positive selection in the zebra finch (see also Table S3 in Additional file 1) were assessed for representation in the results of two other studies: orthologs under positive selection in the primate lineage (Dorus et al. [29]) and zebra finch genes that are differentially expressed in song nucleus the high vocal centre compared to the underlying 'shelf' region (Lovell et al. [30]).
Figure 1The relationship between ω estimated for 1-Mb intervals and chromosome size. (a) Zebra finch; (b) chicken.
Figure 2The relationship between the mean mutation rate (divergence at fourfold degenerate sites) for 1-Mb intervals and chromosome size. (a) Zebra finch; (b) chicken.
Bivariate and partial correlations (with GC content and amount of coding sequence controlled for) between ω and recombination rate in 1 Mb windows
|
| d.f. |
|
| |
|---|---|---|---|---|
| Zebra finch | ||||
| Bivariate | -4.13 | 557 | 0.00004 | 0.030 |
| Controlled for GC | -2.8 | 556 | 0.0053 | 0.014 |
| Controlled for CDS | -4.51 | 556 | 0.00001 | 0.035 |
| Controlled for GC and CDS | -2.16 | 555 | 0.0313 | 0.008 |
| Chicken | ||||
| Bivariate | -2.82 | 713 | 0.0049 | 0.011 |
| Controlled for GC | -2.15 | 712 | 0.0320 | 0.006 |
| Controlled for CDS | -2.44 | 712 | 0.0149 | 0.008 |
| Controlled for GC and CDS | -2.14 | 711 | 0.0329 | 0.006 |
CDS, coding sequence; d.f., degrees of freedom; t, t-statistic (t-score) of the slope.