| Literature DB >> 25494627 |
Morgan Wirthlin, Peter V Lovell, Erich D Jarvis, Claudio V Mello1.
Abstract
BACKGROUND: Songbirds (oscine Passeriformes) are among the most diverse and successful vertebrate groups, comprising almost half of all known bird species. Identifying the genomic innovations that might be associated with this success, as well as with characteristic songbird traits such as vocal learning and the brain circuits that underlie this behavior, has proven difficult, in part due to the small number of avian genomes available until recently. Here we performed a comparative analysis of 48 avian genomes to identify genomic features that are unique to songbirds, as well as an initial assessment of function by investigating their tissue distribution and predicted protein domain structure.Entities:
Mesh:
Year: 2014 PMID: 25494627 PMCID: PMC4377847 DOI: 10.1186/1471-2164-15-1082
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Phylogenetic analysis used to identify the set of novel genes unique to songbirds (oscine passerines). (A) Schematic diagram summarizing phylogenetic relationships among organisms included in genomic analyses, not to scale. Full details on genomes included in these analyses are reported in main avian phylogenomics and comparative genomics papers [34, 35]. (B) Simplified schematic of the Zebra finch nuclei specialized for vocal learning and their connections. Red projection: the posterior vocal motor pathway for vocal control, originating in HVC, in yellow, continuing to RA, in orange, to the hindbrain vocal motor nuclei, in black. Blue projection: the anterior forebrain pathway for vocal learning, originating in HVC, which projects to a set of interconnected nuclei (Area X, LMAN, DLM) analogous to mammalian cortical-basal ganglia-thalamo-cortical loops for somatosensory learning. Abbreviations: DLM, medial part of the dorsal lateral nucleus of the thalamus; LMAN, lateral magnocellular nucleus of the anterior nidopallium; HVC, proper name; nXIIts, tracheosyringeal portion of the hypoglossal nucleus; PAm/RAm, nucleus para-ambiguus/retroambiguus; RA, robust nucleus of the arcopallium.
Novel genes exclusive to songbirds
| Gene name ¥ | Phylogeny | Ensembl ID | Gene location | SD site | Brain-derived ESTs | Non-brain ESTs | Non-brain RNA-SEQ |
|---|---|---|---|---|---|---|---|
|
| |||||||
| A4GALT-1 | SONGBIRDS† | ENSTGUG00000012139 | chr1A:65,194,903-65,195,964 | YES | No | No | None |
| A4GALT-2 | SONGBIRDS† | ENSTGUG00000018227 | chr1A:65,202,780-65,203,835 | YES | No | No | None |
| A4GALT-3 | SONGBIRDS† | ENSTGUG00000018451 | chr1A:65,210,658-65,211,719 | YES | No | No | None |
|
| |||||||
| CASC1-1 | SONGBIRDS† | ENSTGUG00000012133 | chr1A:65,157,078-65,159,677 | FE723736√, DV948439 | JV165872, JV165873 | Embryo, spleen, testes | |
| CASC1-2 | SONGBIRDS† | ENSTGUG00000012243 | chr1A:66,373,778-66,384,478 | YES | DV948439 | JV184784, JV165872, JV165873 | Embryo, liver, muscle, testes |
|
| |||||||
| FN3KRP | ALL BIRDS‡ | ENSTGUG00000007633 | chr18:6,574624-6,580,544 | DV959265√ | No | Embryo | |
| FN3KRPL1 | SONGBIRDS | No model | chrZ:24,858,422-24,862,943 | No | No | Embryo, liver, skin | |
| FN3KRPL2 | PASSERINES | ENSTGUG00000006787 | chrZ:69,583,022-69,590,574 | YES | DV955139√, FE727948√, DV955139√ | JV168705√, JV168706√, JR864904√ | Embryo, liver, spleen, testes |
|
| |||||||
| HYDIN | ALL BIRDS‡ | No model | chr11:5,451,491-5,475,997 | No | No | Liver, muscle, skin, spleen | |
| HYDINL1 | SONGBIRDS | ENSTGUG00000009150 | chr11:16,377,691-16,404,553 | No | JV172391 | Muscle, skin, spleen | |
| HYDINL2 | PASSERINES | No model | chr11:16,633,304-16,663,944 | YES | No | No | Liver, muscle, testes |
| HYDINL3 | ZEBRA FINCH | ENSTGUG00000009256 | chr11:17,005,696-17,033,616 | YES | No | No | Muscle, testes |
| HYDINL4 | ZEBRA FINCH | No model | chr11:18,229,584-18,238,051 | YES | No | No | None |
| HYDINL5 | ZEBRA FINCH | ENSTGUG00000009737 | chr11:19,907,326-19,914,490 | No | JV172391 | Testes | |
|
| |||||||
| TMRA | SONGBIRDS | ENSTGUG00000012248 | chr1A:66,486,182-66,494,397 | YES | CK302958√ | JV159445√, JV159451√ | Embryo, liver, muscle, spleen, testes |
|
| |||||||
| RIOK2 | ALL BIRDS‡ | ENSTGUG00000001223 | chrZ:24,872,816-24,883,105 | DV956882√ | JV172474√, JR863880√ | Embryo, liver, muscle, skin, spleen, testes | |
| RIOK2L | SONGBIRDS | No model | chrZ:69,578,247-69,578,886 | YES | No | No | Skin |
|
| |||||||
| RNF4 | ALL BIRDS‡ | ENSTGUG00000010518 | chr4:62,477,216-62,484,738 | DV951366√ | JV183872√, JR867734√ | Liver, skin, testes | |
| RNF4L1 | SONGBIRDS | No model | chr4:8,201,210-8,201,765 | No | No | None | |
| RNF4L2 | PASSERINES | ENSTGUG00000018516 | chr4:20,660,938-20,751,958 | No | No | None | |
| RNF4L3 | PASSERINES | ENSTGUG00000018370 | chr4:22,411,072-22,433,579 | No | No | Embryo, muscle, skin, spleen | |
| RNF4L4 | ZEBRA FINCH | ENSTGUG00000018547 | chr4:22,445,102-22,478,943 | No | No | Testes | |
| RNF4L5 | ZEBRA FINCH | ENSTGUG00000018338 | chr4:22,507,187-22,517,977 | No | No | Muscle, spleen | |
| RNF4L6 | ZEBRA FINCH | ENSTGUG00000018226 | chr4:22,538,729-22,547,838 | No | No | None | |
| RNF4L7 | PASSERINES | No model | chr4:41,650,214-41,650,348 | No | No | None | |
|
| |||||||
| URB1 | ALL BIRDS‡ | ENSTGUG00000013442/ no model | chr1:97,555,543-97,592,096/ chr1_random:362,665-365,267 | YES | FE722167√ | No | Embryo, liver, muscle, skin, testes |
| URB1L1 | ZEBRA FINCH | ENSTGUG00000011753 | chr1A:60,924,812-60,941,816 | YES | CK301434, CK303889, DV957700 | No | Embryo, liver |
| URB1L2 | ZEBRA FINCH | No model | chr1A:63,520,638-63,528,208 | CK301434, CK303889 | No | None | |
| URB1L3 | SONGBIRDS | No model | chr5:4,764,796-4,772,823 | YES | CK301434, CK303889, DV957700 | No | Skin |
| URB1L4 | ZEBRA FINCH | No model | chr7:1,064,071-1,065,024 | No | No | None | |
| URB1L5 | ZEBRA FINCH | No model | chr23:2,319,460-2,335,606 | YES | DV957700 | No | Liver, muscle, spleen, testes |
|
| |||||||
| YTHDC2 | ALL BIRDS‡ | No model | chrZ:21,509,497-21,511,474 | YES | No | No | None |
| YTHDC2L1 | SONGBIRDS | ENSTGUG00000014232 | chr2_random:378,730-383,125 | CK309358√ | No | None | |
| YTHDC2L2 | ZEBRA FINCH | ENSTGUG00000014992 | chr3_random:766,156-785,803 | No | No | None | |
| YTHDC2L3 | PASSERINES | ENSTGUG00000000643 | chrZ:10,792,293-10,809,782 | YES | No | JV174477, JV177272 | Embyro, liver, muscle, spleen |
| YTHDC2L4 | ZEBRA FINCH | No model | chrZ:29,309,810-29,332,222 | No | No | Embryo, liver, muscle, skin, spleen, testes | |
| YTHDC2L5 | ZEBRA FINCH | ENSTGUG00000003755 | chrZ:55,988,800-56,020,229 | DV947064√ | JV174477√ | Embryo, liver, muscle, skin, spleen, testes | |
| YTHDC2L6 | ZEBRA FINCH | ENSTGUG00000004173 | chrZ:57,443,052-57,444,728 | YES | No | No | Embryo |
Families of genes where at least one member has been determined to be present uniquely in all songbirds. Tandem duplicates, where clear orthology cannot be determined, are distinguished with dashed numbers (e.g., CASC-1, CASC-2). Other duplicates are named after being ‘like’ their orthologous parent gene (e.g. RIOK2L). Ensembl IDs and chromosomal locations refer to the Zebra finch genome. Expressed sequence tags (ESTs) providing evidence of gene expression in the brain [57, 81, 82] and RNA-seq data derived from other tissues [27] are from Zebra finch, non-brain ESTs are derived from Dark-eyed junco [37]. ¥: Gene symbols have been corrected based on our curation of Ensembl annotation, *: Gene family previously reported as being expanded in Zebra finch, ‡: Parent gene, †: Parent gene cannot be determined, √: EST is specific to this gene locus.
Figure 2Example of songbird novel and duplicated genes associated with regions of syntenic disruption. Chromosomal maps of the syntenic order of genes in chicken (chr1) compared to Zebra finch (chr1A) reveal that songbird-unique de novo novel gene (TMRA, in orange) and duplicated/expanded novel genes (CASC1, A4GALT; in red) are located in chromosomal regions containing syntenic disruptions (SD) that are unique to the songbird lineage (i.e. the syntenic flanking genes in Zebra finch are different from those in other avian and non-avian species). Small black up/down arrows next to each gene indicate orientation on the minus/plus strand of DNA. Line colours denote genes in rearranged syntenic blocks, with shaded regions representing apparent chromosomal inversions.
Figure 3Syntenic and protein functional domain analysis of and (A) Schematic representation of conserved chromosomal loci in avian and non-avian vertebrate species showing the relative position of YTHDC2L1 (in red), a novel expansion of YTHDC2 that is only present in songbirds. Adjacent genes are indicated in black. The chromosome or scaffold number is indicated beneath each species common name. (B) An alignment of the protein family domains predicted for amino acid sequences derived from Chicken YTHDC2, the orthologous YTHDC2 “parent” gene in Zebra finch, as well as the copies of YTHDC2 that are only present in songbirds (YTHDC2L1), and Zebra finch (YTHDC2L5). Specific protein family domains predicted by InterProScan5 are aligned relative to Chicken YTHDC2, and are indicated by the various coloured symbols. YTHDC2 and YTHDC2L1 lack nearly all of the major protein family domains that are characteristic of YTHDC2. In contrast, YTHDC2L5 appears to be a nearly complete copy of YTHDC2.
Figure 4Schematic model depicting the predicted structure of in the plasma membrane. Protein functional domain analysis (InterProScan5) predicts the protein coding sequence of TMRA contains three transmembrane spanning domains connected to an extracellular C-type Lectin-like (CLEC) domain that is typically associated with carbohydrate binding.
Figure 5Expression of in the adult male Zebra finch brain. (A) Photomicrograph of in situ hybridization showing uniform expression of URB1 in the pallium. (B) High-power view reveals URB1 enrichment in individual pallial neurons. Note that in several of the cells indicated by the arrowheads the intracellular labelling appears more robust in the nucleus than in the surrounding cytoplasm, forming a pattern reminiscent of a fried-egg. Scale bar: 10 μm.
Figure 6Differential expression of genes in song nucleus LMAN of the adult male Zebra finch. (A) Photomicrograph of an in situ hybridization conducted with a probe (DV948439) that is not locus specific reveals expression of CASC1-1 and/or CASC1-2 throughout the brain (See Additional file 1: Figure S3 for details). The approximate positions for the photomicrographs shown in panels B-D are depicted by the black squares. (B-D) CASC1-1/2 mRNA is highly expressed in song nucleus HVC (B, left panel), the globus pallidus (C, left), and ependymal cells of the lateral ventricle (D, left; arrowheads). In contrast, CASC1-1, revealed by a probe that is specific to this locus, is differentially expressed in ependymal cells of the lateral ventricle (B, right panel), large likely GABAergic cells in globus pallidus (C; right), and ependymal cells of the fourth ventricle in the midbrain (D, right). The dashed rectangles in B indicate the approximate positions of the high-power photomicrographs depicting labelled cells in the globus pallidus. Anatomical abbreviations: HVC, proper name; GP, Globus pallidus; St, striatum; v., lateral ventricle. Scale bars: 100 μm.
Figure 7Differential expression of in song nucleus LMAN of the adult male Zebra finch brain. (A) A schematic depicting a sagittal brain section ~2 mm from the midline shows the approximate location of the in situ photomicrographs presented in panel B. (B) Photomicrograph of in situ hybridization of songbird novel gene YTHDC2L1 shows discrete expression in song nucleus LMAN. (C) Detailed view of this section reveals that expression of YTHDC2L1 is restricted to large cells of LMAN, with labelled foci evident in some cells pairs within cellular nuclei (inset). (D) A comparable view of paralogous gene YTHDC2L5 shows low levels of expression, non-differential in LMAN. Anatomical abbreviations: A, arcopallium; H, hyperpallium; HVC, proper name; LMAN, lateral magnocellular nucleus of the nidopallium; M, mesopallium; MD, dorsal mesopallium; MV, ventral mesopallium; N, nidopallium; RA, robust nucleus of the arcopallium; St, striatum. Scale bars: 100 μm in B–D; 20 μm in C inset.
Figure 8Differential expression of in song nucleus RA of the adult male Zebra finch. (A) Photomicrograph of in situ hybridization showing the distribution of TMRA expressing cells in a parasagittal brain section that includes song nucleus RA (~2.0 mm from the midline). (B) High-power view reveals the enrichment of TMRA in individual cells within RA. (C) Detailed views reveal that in some cells in RA (indicated by the arrowheads), labelling is largely restricted to nuclei, and expression is low in the surrounding cytoplasm. Anatomical abbreviations: A, arcopallium; H, hyperpallium; M, mesopallium; N, nidopallium; St, striatum. Scale bars: 100 μm in A and B; 20 μm in C.