| Literature DB >> 23145334 |
Robert Ekblom1, Lindsay L Farrell, David B Lank, Terry Burke.
Abstract
By next generation transcriptome sequencing, it is possible to obtain data on both nucleotide sequence variation and gene expression. We have used this approach (RNA-Seq) to investigate the genetic basis for differences in plumage coloration and mating strategies in a non-model bird species, the ruff (Philomachus pugnax). Ruff males show enormous variation in the coloration of ornamental feathers, used for individual recognition. This polymorphism is linked to reproductive strategies, with dark males (Independents) defending territories on leks against other Independents, whereas white morphs (Satellites) co-occupy Independent's courts without agonistic interactions. Previous work found a strong genetic component for mating strategy, but the genes involved were not identified. We present feather transcriptome data of more than 6,000 de-novo sequenced ruff genes (although with limited coverage for many of them). None of the identified genes showed significant expression divergence between males, but many genetic markers showed nucleotide differentiation between different color morphs and mating strategies. These include several feather keratin genes, splicing factors, and the Xg blood-group gene. Many of the genes with significant genetic structure between mating strategies have not yet been annotated and their functions remain to be elucidated. We also conducted in-depth investigations of 28 pre-identified coloration candidate genes. Two of these (EDNRB and TYR) were specifically expressed in black- and rust-colored males, respectively. We have demonstrated the utility of next generation transcriptome sequencing for identifying and genotyping large number of genetic markers in a non-model species without previous genomic resources, and highlight the potential of this approach for addressing the genetic basis of ecologically important variation.Entities:
Keywords: Birds; RNA-seq; genomics; lek; next generation sequencing; ruff
Year: 2012 PMID: 23145334 PMCID: PMC3492775 DOI: 10.1002/ece3.370
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Typical male color and feather pattern variation in the ruff (Philomachus pugnax). All pictures are from the captive population used in this study. Color pattern and variation differs between individuals, between and within morph types in this species. The two rightmost individuals on the lower panel have the Satellite mating strategy and the rest of the males are Independents.
Sequence data on each sample separately, Mean length of reads are shown within brackets. Information on color morph and mating strategy of the individual males is also provided. Individual 313 with color “Straw” was included in the “Rust” category in the analyses of color pattern, whereas individuals 294 and 306, which have multiple colors (barred or flecked), were included both in the “Black” and in the “Rust” categories
| Individual | No raw reads | No trimmed reads | Color | Mating strategy |
|---|---|---|---|---|
| 267 | 25,178 (275) | 19,909 (292) | Black | Satellite |
| 242 | 24,823 (276) | 19,714 (293) | Black | Independent |
| 294 | 17,508 (271) | 13,163 (298) | Rust/black | Independent |
| 318 | 37,927 (277) | 30,017 (296) | Black | Independent |
| 314 | 28,362 (272) | 21,390 (297) | Rust | Independent |
| 306 | 33,297 (278) | 26,190 (296) | Rust/black | Independent |
| 301 | 24,563 (270) | 18,987 (293) | White | Satellite |
| 190 | 37,743 (263) | 28,176 (294) | White | Satellite |
| 240 | 43,835 (273) | 33,462 (299) | White | Satellite |
| 313 | 30,617 (296) | 25,596 (307) | Straw | Satellite |
| 1241 | 46,759 (288) | 37,721 (305) | White | Satellite |
| Total | 350,648 (277) | 274,325 (297) |
Figure 2Photographs of the male ruffs and sampled feathers (inserted frames) used in this study. Clockwise numbering: 267, 294, 242, 318, 314, 306, 301, 190, 240, 313, 1241.
Figure 3Distribution of the length (base pairs) of assembled contigs of the ruff feather transcriptome. Sequence data from 11 different males were assembled together.
Figure 4Relationship between contig depth (number of reads) and length (number of base pairs) for the ruff feather transcriptome.
Figure 5Plot of expression divergence between (a) black males and males of other colors, (b) rust-colored males compared with males with different colors and (c) Independent compared with Satellite males. Genes more expressed in black (a), rust (b), and Independent (c) males have negative log-fold expression levels. The blue horizontal lines represent four-fold differences in expression between morphs. Genes that are only expressed in one of the morphs are plotted in yellow smears to the left in the graph. Red points represent genes with expression divergence between morphs (P < 0.01, without multiple test correction). None of the genes were significantly differentially expressed after applying false discovery rate correction.
Figure 6Distribution of minor allele frequencies for the 822 SNP markers identified from the feather transcriptome of 11 ruff males.
Figure 7Outputted P-values from the GenePop analysis testing for nucleotide differentiation between (a) black males and males of other colors, (b) rust-colored males compared with males with different colors, and (c) Independent compared with Satellite males. Test for differences in allele frequencies (genic differentiation) is shown on the X-axis while test for differences in haplotype frequencies (genotypic differentiation) is shown on the Y-axis. Dashed lines indicate significance thresholds of P = 0.05. Markers with significant structure inferred from the G-test in the SAM software are highlighted in red. The names of some of the most differentiated markers (see Tables 2–4) are given in the figures.
SNP markers identified as being genetically structured between black and non-black ruff males. “Allelep” and “genotypep” are outputted P-values from GenePop using the gene and genotype option. “G” and the p value for G (pG) are calculated from the SAM software. Heterozygosity (Het), FST (Fst), and P were calculated in LOSITAN and markers with pos.sel = 1 were identified as being under positive selection with this software. Markers highlighted with bold text were identified as structured using at least two of these independent tests
| Locus | Allelep | genotypep | G | pG | Het | Fst | P | pos.sel | Annotation | Gene description |
|---|---|---|---|---|---|---|---|---|---|---|
| c00052 | 0.07 | 0.06 | 6.16 | 0.05 | 0.30 | 0.22 | 0.88 | 0 | ENSTGUT00000003716 | microtubule-associated protein 1 light chain 3 alpha-like |
| c00054 | 0.08 | 0.06 | 6.16 | 0.05 | 0.30 | 0.22 | 0.88 | 0 | ENSGALP00000030189 | cytochrome c oxidase subunit Vic |
| c00279 | 0.06 | 0.20 | NA | NA | 0.42 | 0.36 | 1.00 | 1 | ENSGALP00000018498 | Putative uncharacterized protein |
| c00285 | 0.11 | 0.05 | NA | NA | 0.50 | 0.26 | 0.96 | 0 | ENSGALP00000006118 | keratin 10 |
| c00355 | 0.05 | 0.14 | NA | NA | 0.38 | 0.29 | 0.94 | 0 | gi|226823205|ref|NM_001159347.1| | neuron navigator 3 |
| c00358 | 0.05 | 0.13 | NA | NA | 0.38 | 0.29 | 0.94 | 0 | ENSGALP00000002235 | stathmin 1 |
| c00633 | 0.09 | 0.44 | NA | NA | 0.40 | 0.33 | 0.98 | 1 | ENSGALP00000023115 | FK506 binding protein 4, 59kDa |
| c00648 | 0.05 | 0.13 | NA | NA | 0.38 | 0.29 | 0.94 | 0 | ENSTGUT00000008828 | Protein Shroom2 (Apical-like protein) |
| c00777 | 0.05 | 0.03 | NA | NA | 0.38 | 0.29 | 0.94 | 0 | ENSGALP00000029159 | similar to Scale keratin (S-ker) |
| c01504 | 0.05 | 0.13 | NA | NA | 0.38 | 0.29 | 0.94 | 0 | ENSTGUT00000000045 | Lon protease homolog, mitochondrial Precursor |
| c01511 | 0.05 | 0.25 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | – | – |
| c03886 | 0.06 | 0.17 | NA | NA | 0.58 | 0.32 | 0.98 | 1 | ENSTGUT00000008476 | Tyrosine-protein kinase Tec |
SNP markers identified as being genetically structured between ruff males with Independent and Satellite mating strategies. “allelep” and “genotypep” are outputted p values from GenePop using the gene and genotype option. “G” and the P-value for G (pG) are calculated from the SAM software. Heterozygosity (Het), FST (Fst), and P were calculated in LOSITAN and markers with pos.sel = 1 were identified as being under positive selection with this software. Markers highlighted with bold text were identified as structured using at least two of these independent tests
| Locus | Allelep | Genotypep | G | pG | Het | Fst | P | pos.sel | Annotation | Gene description |
|---|---|---|---|---|---|---|---|---|---|---|
| c00052 | 0.07 | 0.06 | 6.16 | 0.05 | 0.30 | 0.22 | 0.92 | 0 | ENSTGUT00000003716 | Microtubule-associated proteins 1A/1B light chain 3A Precursor |
| c00054 | 0.08 | 0.06 | 6.16 | 0.05 | 0.30 | 0.22 | 0.92 | 0 | ENSGALP00000030189 | cytochrome c oxidase subunit Vic |
| c00063 | 0.08 | 0.06 | 6.16 | 0.05 | 0.30 | 0.22 | 0.92 | 0 | gi|46429612|emb|CR390967.1| | chicken cDNA clone |
| c00279 | 0.06 | 0.20 | NA | NA | 0.42 | 0.36 | 0.99 | 1 | ENSGALP00000018498 | Putative uncharacterized protein |
| c00358 | 0.05 | 0.13 | NA | NA | 0.38 | 0.29 | 0.92 | 0 | ENSGALP00000002235 | Stathmin |
| c00433 | 0.06 | 0.25 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSGALP00000015448 | similar to neuropeptide Y receptor Y5 |
| c00465 | 0.12 | 0.47 | NA | NA | 0.33 | 0.27 | 0.99 | 1 | ENSGALP00000015687 | Collagen alpha-2(I) chain |
| c00754 | 0.05 | 0.25 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSTGUT00000002899 | hypothetical protein LOC100219861 |
| c00880 | 0.08 | 0.14 | 6.16 | 0.05 | 0.60 | 0.27 | 0.92 | 0 | ENSGALP00000000205 | Ribosomal structure protein |
| c01021 | 0.12 | 0.47 | NA | NA | 0.33 | 0.27 | 0.99 | 1 | ENSGALP00000016629 | Peroxiredoxin-1 |
| c01159 | 0.12 | 0.08 | NA | NA | 0.33 | 0.27 | 0.99 | 1 | ENSTGUT00000006904 | Histone acetyltransferase MYST4 |
| c02176 | 0.05 | 0.14 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSTGUT00000003357 | chromodomain helicase DNA binding protein 6 |
| c05497 | 0.25 | 0.50 | NA | NA | 0.33 | 0.27 | 0.99 | 1 | gi|61098492|gb|AC147215.4| | s-adenosylmethionine decarboxylase proenzyme 2-like |
| c18140 | 0.12 | 0.47 | NA | NA | 0.33 | 0.27 | 0.99 | 1 | ENSGALP00000011539 | Non-histone chromosomal protein HMG-14A |
SNP markers identified as being genetically structured between rust colored and other ruff males. “allelep” and “genotypep” are outputted p values from GenePop using the gene and genotype option. “G” and the p value for G (pG) are calculated from the SAM software. Heterozygosity (Het), FST (Fst), and P were calculated in LOSITAN and markers with pos.sel = 1 were identified as being under positive selection with this software. Markers highlighted with bold text were identified as structured using at least two of these independent tests
| Locus | Allelep | Genotypep | G | pG | Het | Fst | P | pos.sel | Annotation | Gene description |
|---|---|---|---|---|---|---|---|---|---|---|
| c00018 | 0.15 | 0.11 | 4.86 | 0.09 | 0.29 | 0.23 | 1.00 | 1 | ENSTGUT00000001822 | cytoplasmic linker associated protein 2 |
| c00031 | 0.04 | 0.11 | 4.89 | 0.09 | 0.38 | 0.29 | 0.96 | 0 | ENSGALP00000023950 | GJA1 gap junction protein, alpha 1 |
| c00053 | 0.15 | 0.11 | 4.86 | 0.09 | 0.29 | 0.23 | 1.00 | 1 | ENSGALP00000019543 | cytoplasmic linker associated protein 2 |
| c00153 | 0.27 | 0.42 | NA | NA | 0.29 | 0.23 | 1.00 | 1 | gi|46429612|emb|CR390967.1| | Gallus gallus finished cDNA, clone ChEST542i20 |
| c00279 | 0.15 | 0.35 | NA | NA | 0.36 | 0.31 | 0.98 | 1 | ENSGALP00000018498 | Putative uncharacterized protein |
| c00291 | 0.27 | 0.42 | NA | NA | 0.29 | 0.23 | 1.00 | 1 | ENSGALP00000039989 | similar to Scale keratin (S-ker) |
| c00328 | 0.09 | 0.17 | NA | NA | 0.40 | 0.33 | 0.99 | 1 | gi|71897262|ref|NM_001030907.1 | RCAN family member 3 |
| c00371 | 0.12 | 0.06 | 9.42 | 0.01 | 0.50 | 0.29 | 0.95 | 0 | ENSGALP00000010447 | S-phase kinase-associated protein 1 |
| c00558 | 0.04 | 0.11 | 4.89 | 0.09 | 0.38 | 0.29 | 0.96 | 0 | ENSGALP00000023266 | ribosomal protein S20 |
| c00692 | 0.15 | 0.11 | 4.86 | 0.09 | 0.29 | 0.23 | 1.00 | 1 | ENSTGUT00000014027 | 39S ribosomal protein L48 |
| c00754 | 0.05 | 0.25 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSTGUT00000002899 | hypothetical protein LOC100219861 |
| c00760 | 0.35 | 0.37 | 6.78 | 0.03 | 0.50 | 0.07 | 0.74 | 0 | ENSGALP00000007502 | ATP synthase, H+ transporting, mitochondrial F0 complex (F2) |
| c00813 | 0.21 | 0.44 | NA | NA | 0.30 | 0.22 | 1.00 | 1 | ENSTGUT00000007149 | Phosducin-like protein (PHLP) |
| c00895 | 0.06 | 0.25 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSTGUT00000017859 | Novel gene |
| c01159 | 0.27 | 0.20 | NA | NA | 0.29 | 0.23 | 1.00 | 1 | ENSTGUT00000006904 | Histone acetyltransferase MYST4 |
| c01504 | 0.05 | 0.13 | NA | NA | 0.38 | 0.29 | 0.96 | 0 | ENSTGUT00000000045 | Lon protease homolog |
| c01558 | 0.35 | 0.61 | NA | NA | 0.29 | 0.23 | 1.00 | 1 | ENSTGUT00000009454 | Programmed cell death protein 5 |
| c01802 | 0.05 | 0.25 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSGALP00000001690 | similar to feather keratin |
| c12157 | 0.05 | 0.14 | NA | NA | 0.50 | 0.45 | 1.00 | 1 | ENSGALP00000012295 | Putative uncharacterized protein VTGIII |
| c18140 | 0.26 | 0.53 | NA | NA | 0.29 | 0.23 | 1.00 | 1 | ENSGALP00000011539 | Non-histone chromosomal protein HMG-14A |
Figure 8Output from FST outlier analyses performed in LOSITAN testing for structure between (a) black males and males of other colors, (b) rust-colored males compared with males with different colors, and (c) Independent compared with Satellite males. Black line represents mean FST for each given level of heterozygosity (He) and the red and green lines correspond to the 95% confidence interval thresholds of neutral expectations. Red points in the graphs represent candidates for positive selection (higher differentiation than expected by chance) and names of some of the strongest candidates are given in red labels. Note that some markers have identical FST and He, and are thus overlapping in the graphs.
Microsatellite repeats identified from the ruff transcriptome sequencing
| Repeat type | Minimum repeat number | Total number | With primer sequence |
|---|---|---|---|
| Di | >10 | 157 | 27 |
| Tri | >8 | 54 | 17 |
| Tetra | >4 | 252 | 119 |
| Penta | >4 | 76 | 35 |
| Hexa | >4 | 28 | 1 |
| Total | 567 | 199 |