| Literature DB >> 18366674 |
Kirstin Replogle1, Arthur P Arnold, Gregory F Ball, Mark Band, Staffan Bensch, Eliot A Brenowitz, Shu Dong, Jenny Drnevich, Margaret Ferris, Julia M George, George Gong, Dennis Hasselquist, Alvaro G Hernandez, Ryan Kim, Harris A Lewin, Lei Liu, Peter V Lovell, Claudio V Mello, Sara Naurin, Sandra Rodriguez-Zas, Jyothi Thimmapuram, Juli Wade, David F Clayton.
Abstract
BACKGROUND: Songbirds hold great promise for biomedical, environmental and evolutionary research. A complete draft sequence of the zebra finch genome is imminent, yet a need remains for application of genomic resources within a research community traditionally focused on ethology and neurobiological methods. In response, we developed a core set of genomic tools and a novel collaborative strategy to probe gene expression in diverse songbird species and natural contexts.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366674 PMCID: PMC2329646 DOI: 10.1186/1471-2164-9-131
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sequence sources and statistics for the three sequential ESTIMA:Songbird assemblies
| SB01 | 2,880 | 1,840 | √ | √ | √ |
| SB02 | 18,720 | 16,360 | √ | √ | √ |
| SB03 (a) | 19,104 | 17,032 | √ | √ | √ |
| SB03 (b) | 9,120 | 5,717 | √ | √ | |
| Duke chromatograms | 36,469 | 23,225 | √ | ||
| Duke from NCBI | 17,312 | 16,442 | √ | ||
| SB05 | 11,328 | 10,056 | √ | ||
| SB06 | 11,040 | 9,760 | √ | ||
| Rockefeller | 9,845 | 9,764 | √ | ||
| C | |||||
| Total redundant sequences | |||||
| Redundancy | |||||
ESTIMA:Songbird Build 2, presence ("yes") or absence ("no") of specific sequences of interest assessed by direct BLAST with available avian or mammalian sequences against 58,211 sequences in the ESTIMA database as of June 2005.
| Androgen Receptor | • | BDNF | √ | GABA-A alpha 1 | √ | |||
| estrogen receptor alpha | • | cannabinoid receptor | • | GABA-A alpha 2 | √ | |||
| estrogen receptor beta | • | IGF-2 | √ | GABA-A alpha 4 | • | |||
| estrogen receptor gamma | √ | N-CAM | √ | GABA-A alpha 5 | √ | |||
| RARalpha | √ | GABA-A beta 3 | √ | |||||
| RARbeta | √ | GABA-A delta | √ | |||||
| RARgamma | • | Calbindin | √ | GABA-A gamma 2 | √ | |||
| RXRalpha | √ | CAMKII | √ | GABA-A gamma 3/4 | √ | |||
| RXRbeta | • | Canarygranin | √ | GABA-B | √ | |||
| RXRgamma | • | CRABPI | √ | GABA-C rho-2B | • | |||
| CRABPII | √ | GAD-65 | √ | |||||
| Aromatase | √ | CRBPI | • | GAD67 | √ | |||
| P450-CYP26 | • | CRBPII | • | |||||
| zRalDH | √ | GAP-43 | √ | Kcnma1 | • | |||
| MAPKK | √ | Kcnmb1 | • | |||||
| ARC | √ | Narp | √ | Kcnmb2 | • | |||
| c-fos | √ | parvalbumin | √ | Kcnmb3 | • | |||
| c-jun | √ | PAX-6 | √ | Kcnmb4 | • | |||
| √ | SNAP-25 | √ | Kv3.1 | √ | ||||
| Synapsin I | • | |||||||
| Beta Tubulin | √ | synapsin IIb | √ | VGAT | • | |||
| GFAP | √ | VGLUT | • | |||||
| NFM | √ | VIAAT | √ | |||||
| vimentin | √ | |||||||
Annotation statistics for ESTIMA:Songbird Build 3
| Gga Genome | 21601 | 12396 | |||
| Chicken_TC | 20208 | 11715 | |||
| Gga Unigene | 17980 | 10490 | |||
| NCBI_Chicken_RNA | 15904 | 9110 | |||
| Ensembl_Chicken_cdna_all | 14609 | 8348 | |||
| Ensembl_Chicken_cdna_abinitio | 13223 | 7588 | |||
| Chicken IPI | 13219 | 7898 | |||
| Hs Unigene | 12373 | 7224 | |||
| NCBI_Chicken_protein | 7776 | 4517 | |||
| 1 Use IPI-annotation* | 13219 | 7553 | |||
| 2 in GGA_Unigene but not IPI | 5614 | 3364 | |||
| 3 in HS but not IPI or GGA_unigene | 265 | 104 | |||
| 11021 | |||||
| 4 additional "conserved in chicken" | 5368 | 2784 | |||
| 13805 | |||||
| 5 remainder = "TGU-specific" (no hits) | 7192 | 3409 | |||
| 31658 | 17214 | ||||
| Number of sequences with IPI identifiers | 13219 | 7898 | (Unigene used for 345) | ||
| Number of unique IPI identifiers | |||||
| Fraction of total IPI identifiers | 32% | 24% | |||
Content of SoNG 20 K array
| Unique sequences in SB1 assembly | 17878 |
| Controls and knowns | 368 |
| Spots that replicate SB clones | 967 |
| Total DNA spots on array | 19213 |
| Blanks and buffer spots | 950 |
Previously Cloned Genes and Replicated Controls spotted on 20 K Array
| AR | Art Arnold | Androgen receptor | |
| ARC | Claudio Mello | ||
| beta actin | SB02028A1F09.f1 | Replicated control | |
| CAB | Keck1 | Replicated control (soybean) | |
| ER alpha | Art Arnold | Estrogen Receptor alpha | |
| ER beta | Art Arnold | Estrogen Recetor beta | |
| GAPDH | SB02023A1H08.f1 | Replicated control | |
| Histone H3 | SB02008B2G06.f1 | Replicated control | |
| K. Wada | Wada et al.2 | ||
| K. Wada | Wada et al.2 | ||
| MSG | Keck1 | Replicated control (soybean) | |
| RAR alpha | Claudio Mello | Retinoic Acid Receptor | |
| RAR beta | Claudio Mello | Retinoic Acid Receptor | |
| RAR gamma | Claudio Mello | Retinoic Acid Receptor | |
| RUB | Keck1 | Replicated control (soybean) | |
| David Clayton | Replicated control (canary "e12" clone) | ||
| zf GluR1 | K. Wada2 | AMPA subunit, Wada et al.2 | |
| zf GluR2 | K. Wada2 | Wada et al.2 | |
| zf GluR3 | K. Wada2 | Wada et al.2 | |
| zf GluR4 | K. Wada2 | Wada et al.2 | |
| zf GluR5 | K. Wada2 | Kainate, Wada et al.2 | |
| zf GluR6 | K. Wada2 | Wada et al.2 | |
| zf GluR7 | K. Wada2 | Wada et al.2 | |
| zf mGluR1 | K. Wada2 | Metabotropic, Wada et al.2 | |
| zf mGluR2 | K. Wada2 | Wada et al.2 | |
| zf mGluR3 | K. Wada2 | Wada et al.2 | |
| zf mGluR4 | K. Wada2 | Wada et al.2 | |
| zf mGluR5 | K. Wada2 | Wada et al.2 | |
| zf mGluR8 | K. Wada2 | Wada et al.2 | |
| zf NR1 | K. Wada2 | NMDA, Wada et al.2 | |
| zf NR2A | K. Wada2 | Wada et al.2 | |
| zf NR2B | K. Wada2 | Wada et al.2 | |
| zf NR2C | K. Wada2 | Wada et al.2 | |
| zf NR2D | K. Wada2 | Wada et al.2 | |
| zf NR3A | K. Wada2 | Wada et al.2 | |
| zf | Claudio Mello |
1contributed to Keck Center by Lila Vodkin 2Wada K et al, JCN 476:44-64 (2004)
Figure 1Schematic alignment of . Five partial zenk cDNAs are spotted on the 20 K microarray. Here they are aligned against the derived contig ("mRNA") and its predicted protein coding sequence ("CDS"). Canary e12 cDNA [Genbank: EF050732] is the original songbird zenk sequence [20]. Zebra finch ZZF23 cDNA [Genbank: EF052676] was isolated using the canary sequence to probe a zebra finch cDNA library (C.V. Mello et al., unpublished). The three clones beginning in "SB..." are three independent cDNAs that did not form contigs and were spotted independently on the array; sequence extent is based on raw EST sequence. Scale bar in kb from putative 5' end of mRNA.
zenk-hybridizing sequences on the 20 K microarray (ordered as in Figure 1).
| Canary e12 cDNA | Replicated control | 2.0 | 1.5E-59 | 0.00 | 3661 |
| Zebra finch cDNA | Single-spotted control | 2.1 | 2.1E-03 | 0.16 | 3602 |
| SB02047B2F12.f1 | 5'-most EST | 1.2 | 1.0E-01 | 0.35 | 9005 |
| SB03008A1H11.f1 | Central EST | 1.7 | 3.6E-04 | 0.15 | 5551 |
| SB03026B1F05.f1 | 3'-most EST | 2.3 | 7.6E-06 | 0.05 | 4799 |
a. Mean of intensity ratios for song-stimulated sample divided by silence control sample (n = 6 pairs).
b. P-value (uncorrected for multiple testing) of null hypothesis that song and silence groups have the same distributions, derived from a 2-stage parametric linear model (Methods).
c. False Discovery Rate [81].
d. Mean of all measurements (all replicate spots against all samples); background intensities were less than 60 in all cases.
Figure 2Comparison of SoNG 20 K cDNA array and Lund-zfa array performance. A) Correlation plot comparing hybridization data (for 15,537 gene targets) on the Lund-zfa array (vertical axis) versus the SoNG 20 K array (horizontal axis). Each point shows the mean log2 ratio (male:female) for a single gene. Blue indicates significant difference between males and females (FDR = .05) on both arrays. See Results for discussion of correlation coefficients. B) Principal Components Analysis (PCA) of all data on both array platforms. Each data point represents an array, hybridized to the RNA sample indicated by the adjacent number (1–5 are males and 6–11 are females). The key indicates the array platform. For each gene on each array, expression value was scaled to the overall mean for that gene on that platform. Note that the first principal component (PC1), which explains 16.2 % of the variation, reveals a major effect of sex in the data, independent of platform.
Figure 3CGH analyses. A) For each species, genomic DNAs from each animal (n = 4) were hybridized on separate arrays against a reference DNA from pooled zebra finches. Histograms are shown of the resulting mean log2 ratios (zebra finch:test species) for all gene targets, by test species. Note that when samples from individual zebra finches were hybridized versus the pooled zebra finch reference, the log2 signal ratios cluster closely around 0 as expected. The curve becomes shorter and broader with cross-hybridization to the kingbird and the chicken. B) Histograms of log2 ratios (zebra finch:test species) for a subset of 200 spots found significantly different on multi-species analysis of CGH data. Significant spots were determined by multi-class SAM Analysis using TIGR TMEV software with a median number of false significant genes set at 0. Note the large shift of the distribution to the right with increasing phylogenetic distance. C) Hierarchical tree of all CGH arrays, in a multiclass analysis using the Euclidean distance metric with average linkage clustering [82]. Arrays from the same species are clustered together, and the hierarchy is consistent with phylogenetic species relationships. The autologous hybridization (zebra finch – zebra finch, not shown) is effectively the outgroup in this analysis. D) The distribution of log2 ratios (test species:zebra finch reference) for one spot across 28 arrays (7 species).
Proposed "Community Collaborations" for songbird microarray studies.
| 1 | AZ | A* | Zebra finch | • | ||||
| 2 | WA | B | White-crowned sparrow | • | ||||
| 3 | NY | C* | Zebra finch | • | • | |||
| 4 | NY | C* | Zebra finch | • | • | |||
| 5 | Hungary | D* | Long-tailed, great & penduline tit | • | • | |||
| 6 | CA north | E* | House finch, red crossbill | • | • | |||
| 7 | MI/CA | F | Zebra finch | • | • | |||
| 8 | MI/CA | F | Zebra finch | • | ||||
| 9 | MI/CA | F | Zebra finch | • | ||||
| 10 | MI/CA | F | Zebra finch | • | • | |||
| 11 | Netherlands | G* | Zebra finch | • | ||||
| 12 | AZ (Phoenix) | H* | House finch | • | ||||
| 13 | AZ (Tempe) | H* | White-crowned sparrow | • | ||||
| 14 | IN | I* | White-throated sparrow | • | • | |||
| 15 | FL | J* | Zebra finch | • | • | • | ||
| 16 | MD | K | European starling | • | ||||
| 17 | OR | L | Zebra finch | • | • | |||
| 18 | OR | L | Parrot, hummingbird | • | ||||
| 19 | OR | L | Zebra finch | • | ||||
| 20 | US West | M* | Thrush, Junco, House sparrow, Cordillerian flycatcher | • | ||||
| 21 | IL | N | Zebra finch | • | ||||
| 22 | IL | N | Zebra finch | • | • | |||
| 23 | IL | N | Zebra finch | • | ||||
| 24 | IL | O | Zebra finch | • | ||||
| 25 | WA | P* | Song sparrow | • | • | |||
| CGH | 14 non-Tgu species proposed | |||||||
Experiments (identified by # in first column) were nominated by investigators in the community following specific guidelines (Additional File 2). Investigator groups are indicated by Locale and a "group" letter code (columns 2 and 3); asterisk indicates investigators outside of the core group responsible for program development. Experiments were classified according to the Primary Contrast (question) defined by the experimental design, with these general contrasts as indicated by number in the table: 1, Comparative; 2, Brain Sex; 3, Critical Period; 4, Activational; 5, Environmental. CGH: Comparative Genomic Hybridizations validating zebra finch array use against the 14 other species proposed for study across all experiments.