| Literature DB >> 24886435 |
Eshita Sharma, Axel Künstner1, Bonnie A Fraser, Gideon Zipprich, Verena A Kottler, Stefan R Henz, Detlef Weigel, Christine Dreyer.
Abstract
BACKGROUND: Sexually dimorphic phenotypes are generally associated with differential gene expression between the sexes. The study of molecular evolution and genomic location of these differentially expressed, or sex-biased, genes is important for understanding inter-sexual divergence under sex-specific selection pressures. Teleost fish provide a unique opportunity to examine this divergence in the presence of variable sex-determination mechanisms of recent origin. The guppy, Poecilia reticulata, displays sexual dimorphism in size, ornaments, and behavior, traits shaped by natural and sexual selection in the wild.Entities:
Mesh:
Year: 2014 PMID: 24886435 PMCID: PMC4059875 DOI: 10.1186/1471-2164-15-400
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Assembly of the guppy reference transcriptome. (A) Flowchart describing read summary, assembly strategy, and assembler comparison. The high quality paired reads from each sequenced dataset, non-barcoded (orange) and barcoded (green), were assembled using genome-independent (Trinity, GIA, red) and genome-guided (Cufflinks, GGA, blue) assemblers. Venn diagram shows the total number of protein sequence orthologs identified between at least two species using translated sequences from the two guppy assemblies (red, blue), and protein sequence databases from eight teleosts, mouse, and human (yellow); (B) Inset (dotted yellow, bottom left) shows an alternate view of the ortholog comparisons. Barplots show the number of orthologs identified in two-way reciprocal best blast-hit comparison between platyfish, tilapia, medaka, stickleback, takifugu, tetraodon, zebrafish, cod, human, and mouse proteins. The stacked bars show the number of orthologs common between GGA and GIA (purple), unique to GGA (blue) and unique to GIA (red); (C) Inset rectangle (dotted blue, bottom right) summarizes the steps for merging predicted CDS from both assemblies and functional annotation of the guppy reference transcriptome (GRT).
Figure 2Phenotypic sexual dimorphism in the guppy. Males (top) are smaller than females (bottom) and have complex color patterns on the body. The encircled region (white outline) indicates the tissues that were used for preparing the barcoded libraries, 1) brain and eyes; 2) Male testis and female ovary; and 3) tail.
Comparison of transcriptome assembled with genome-guided and genome-independent assemblers
|
|
| |
|---|---|---|
| Total length (bp) | 416,036,223 | 301,476,740 |
| Length with longest isoforms per locus (bp) | 101,831,430 | 128,048,246 |
| No. of transfrags | 213,088 | 91,126 |
| No. of transcripts (Unique components/gene groups) | 105,664 | 49,971 |
| Mean length (bp) | 1,952 | 3,308 |
| Longest contig (bp) | 65,264 | 61,058 |
| Overall mapping (%) | 73.21 | 73.64 |
| Concordant and unique mapping (%) | 62.98 | 55.10 |
| Total no. of ORFs | 53,537 | 63,520 |
| No. of complete ORFs | 29,309 | 49,535 |
| Mean length ORF (bp) | 766 | 803 |
| Longest ORF (bp) | 63,897 | 54,732 |
| Total length of assembly with CDSs only (bp) | 40,889,623 | 48,745,723 |
|
| ||
| Against guppy (GGA against GIA or vice-versa) | 40,973 (24,020) | 35,147 (24,020) |
|
| 19,680 (13,399) | 19,941 (14,934) |
|
| 17,925 (11,102) | 18,197 (12,455) |
|
| 19,139 (11,758) | 19,429 (13,096) |
| Orthologs in only one assembly | 4,767 | 6,954 |
We compared a number of metrics to determine which assembler performed better. Open reading frame (ORFs) are those with a minimum length of 50 amino acids. The number of annotations obtained for each assembly are given from best Blastp hits against other protein sequence databases (E-value < 1 × 10−20). The number of orthologs (brackets) are given from reciprocal best Blastp hits identified using PROTEIN ORTHO.
Differentially expressed genes between males and females in brain, gonad, and tail tissue
| No. of expressed genes | Fold change | Sex-biased genes (%) | Male-biased genes (%) | Female-biased genes (%) | |
|---|---|---|---|---|---|
| Brain | 27612 | > 1.2 fold | 3611 (13.08) | 1305 (4.73) | 2306 (8.35) |
| > Median fold | 2466 (8.93) | 702 (2.54) | 1764 (6.39) | ||
| Tail | 18988 | > 1.2 fold | 2792 (14.70) | 1355 (7.14) | 1437 (7.57) |
| > Median fold | 1460 (7.69) | 755 (3.98) | 705 (3.71) | ||
| Gonad | 22873 | > 1.2 fold | 17740 (77.56) | 7989 (34.93) | 9751 (42.63) |
| > Median fold | 10060 (43.98) | 4891 (21.38) | 5163 (22.57) | ||
|
| |||||
| Brain, tail | 18415 | > 1.2 fold | 767 (4.17) | 118 (0.64) | 619 (3.36) |
| > Median fold | 362 (1.97) | 44 (0.24) | 305 (1.66) | ||
| Brain, Gonad | 19759 | > 1.2 fold | 1961 (9.92) | 286 (1.45) | 1024 (5.18) |
| > Median fold | 851 (4.31) | 84 (0.43) | 466 (2.36) | ||
| Tail, Gonad | 16524 | > 1.2 fold | 1704 (10.31) | 373 (2.26) | 526 (3.18) |
| > Median fold | 447 (2.71) | 100 (0.61) | 126 (0.76) | ||
| Brain, Tail, Gonad | 16396 | > 1.2 fold | 470 (2.87) | 43 (0.26) | 214 (1.31) |
| > Median fold | 143 (0.87) | 11 (0.07) | 59 (0.36) |
We report the total number of genes expressed, and those that were sex-biased at two different fold-change cutoffs (1.2 fold and median-fold difference within each tissue) and FDR < 0.1. We also report genes that were sex-biased in multiple tissues with the same direction of change between the sexes.
Figure 3Quantitative differences in gene expression between sexes. Male to female expression ratios (log2FC, Fold-change: Male/Female) plotted against the average expression intensity (log2CPM, Counts per million) in (A) brain, (C) tail, and (E) gonads. Genes with greater than median-fold bias (FDR < 0.1) are shown in red while the others are shown by black dots or smoothened. The blue lines mark a 4-fold difference in expression between the sexes. Genes with sex-limited expression are underlined in black (E). The number of male-biased and female-biased genes in each comparison is mentioned at the top-right and bottom-right respectively in each figure. Heatmaps showing the mean centered log2FPKM (Fragments per kilo base per million) for the highest differentially expressed genes (FDR < 0.001) and a 1.5 fold-change in the brain (B), 1.7 fold-change in the tail (D), and 32 fold-change in the gonad (F). The top 30 genes that show sex-biased expression in each tissue are listed and ranked by fold-change in grey text boxes at the left (female-biased genes) and at the right (male-biased genes).
Enriched Gene Ontology (GO) terms in male-bias and female-bias genes
| GO term | SB (All) | p-value | Representative genes |
|---|---|---|---|
|
| |||
|
| |||
| Ionotropic glutamate receptor signaling (GO:0035235: BP) | 13 (46) | < 0.0001 | GRIK5 (2of2); GRIK4; GRIN2A (1 of 2); Glutamate [NMDA] receptor subunit zeta-1 |
| Ion transmembrane transport (GO:0034220: BP) | 29 (529) | < 0.0001 | Potassium channels: KCNJ3; KCNJ11 (1of2); KCNH1; KCNA1 (2 of2), Calcium channels: CACNA1I (4of4); Ryanodine receptor (RYR2 (1 of 3)), Sodium channel: SCN8A(2of2) |
| Regulation of cell development (GO:0060284: BP) | 9 (203) | 0.00699 | Lens calpain-3 (CAPN3); distal-less homeobox gene-1a (DLX1a); protein tyrosine phosphatase receptor-d (PTPRD); retinal cadherin-4 (CDH4 (1 of 2)) |
| Cerebellar Purkinje cell differentiation (GO:0021702: BP) | 3 (11) | 0.00068 | LIM/homeobox protein (LHX1); voltage-dependent P/Q-type calcium channel subunit alpha-1A (CACNA1A) ; LIM domain binding-1 (LDB1(2 of 2)) |
| Integral to membrane (GO:0016021: CC) | 84 (2448) | < 0.0001 | Hypocretin (orexin) receptor-1 (HCRTR1); multidrug resistance-associated protein 9; coiled-coil domain containing 149 CCDC149 (2 of 2); protocadherin 15b; gamma-aminobutyric acid A receptor, alpha-3 (GABRA3) |
|
| |||
| Endothelial cell migration (GO:0043542: BP) | 10 (44) | < 0.00036 | Sushi-repeat containing protein, X-linked2 (SRPX2); angiopoietin1 (ANGPT1); myosin heavy chain 9,non-muscle (MYH9(2 of 2)) |
| Gonad development (GO:0008406: BP) | 8 (37) | 0.00194 | WNT10A; WNT5A; secreted frizzled related proteins (SFRP1, SFRP5) ; TGFB2; Phospholipase A2 groupIVa (PLAG4A); |
| Immune response (GO:0006955: BP) | 23 (135) | < 0.0001 | WNT5A; TGFB2; kelch-like protein 6 (KLHL6); MHC classI-E (HLA-E); complement component (C6); MHC class II invariant chain (CD74(1of2)); chemokine(C-X-C motif) ligand12 (CXCL12(2of2)) |
| Integrin-mediated signaling pathway (GO:0007229: BP) | 8 (35) | 0.00132 | Integrin b1 binding protein1 (ITGB1BP1); Integrin, alpha10 (ITGA10);Integrin beta (ITGBL1, ITGB3a); nicotinamide riboside kinase-2 (NMRK2(2of2)) |
| DNA-dependent DNA replication initiation (GO:0006270: BP) | 5 (10) | 0.0002 | Minichromosome maintenance complex components (MCM2,MCM3,MCM4,MCM5,MCM6) |
| proteinaceous extracellular matrix (GO:0005578: CC) | 61 (151) | < 0.0001 | ADAM metallopeptidase with thrombospondin family members (ADAMTS12, ADAMTS15); NID1 (1of2); FBN3; MFAP2; CYR61 (2of2); COL4A1; COL10A1; COL11A1; WNT11; matrix metalloproteinases (MMP12, MMP14,MMP2 ) |
|
| |||
|
| |||
| Neuropeptide signaling pathway (GO:0007218: BP) | 5 (15) | < 0.00016 | Tachykinin, precursor 1 (TAC1); prepronociceptin PNOC (1 of 2); secretogranin V (7B2like) (SCG5); brain-specific angiogenesis inhibitor-3 (BAI3) |
| Neurotransmitter transport (GO:0006836: BP) | 11 (46) | 0.00057 | Syntaxin1B (STX1B); solute carrier family (SLC6A2, SLC6A5); syntaxin binding protein 1 STXBP1 (1 of 2) |
| Locomotory behavior (GO:0007626: BP) | 9 (42) | 0.00072 | Glycine receptor subunit beta (GLR-b2); choline O-acetyltransferase CHAT (2 of 2); astrotactin 1 (ASTN1) |
| *melanosome (GO:0042470:CC) | 4 (15) | 0.00241 0.01329 | 5,6-dihydroxyindole-2-carboxylic acid oxidase TYRP1 (1 of 2, 2 of 2); Tyrosinase TYR (2 of 2); synaptotagmin-like-2 (SYTL2); |
| *Pigment biosynthetic process (GO:0046148: BP) | 4 (25) | ||
| Insulin receptor binding (GO:0005158: MF) | 4 (14) | 0.00156 | Sorbin and SH3 domain containing 1 SORBS1; DOK7 (1 of 2); growth factor receptor-bound protein 10 (GRB10) |
|
| |||
| Glycolysis (GO:0006096: BP) | 12 (43) | < 0.0001 | lactate dehydrogenase A (LDHA); phosphoglycerate kinase 1 (PGK1); pyruvate kinase, muscle (PKM (1 of 2)); glucose-6-phosphate isomerase (GPI (2 of 2)); 2,3-bisphosphoglycerate mutase (BPGM) |
| DNA-dependent DNA replication (GO:0006261: BP) | 10 (29) | < 0.0001 | Topoisomerase2a (TOP2a); primase, DNA, polypeptide 1 (PRIM1); tonsoku-like, DNA repair protein (TONSL); minichromosome maintenance complex components (MCM2, MCM4, MCM5, MCM6); polymerase (DNA directed), epsilon2 (POLE2) |
| Mitosis (GO:0007067: BP) | 13 (91) | 0.00105 | Aurora kinase C AURKC; cyclin-dependent kinase 1 CDK1 ; spindle apparatus coiled-coil protein 1 SPDL1; cyclin B1 CCNB1; non-SMC condensin I complex, subunitH, subunitD2 (NCAPH, NCAPD2); checkpoint kinase-1 (CHEK1) |
| Proteinaceous extracellular matrix (GO:0005578: CC) | 26 (125) | 0.00106 | Secreted protein, acidic, cysteine-rich (osteonectin) (SPARC); versican VCAN VCAN (2 of 2); tenascin(TNC (1 of 2)); collagens (COL11A1a, COL11A1b, COL27A1 (2 of 2)) |
|
| |||
|
| |||
| Cilium assembly (GO:0042384: BP) | 32 (46) | < 0.0001 | Radial spoke head 9 homolog (RSPH9); ARPC4-TTLL3 readthrough; forkhead box J1 (FOXJ1); Transmembrane proteins (TMEM237, TMEM17, TMEM231); B9 protein domain 1 (B9D1); coiled-coil and C2 domain containing-2A (CC2D2A) |
| Spermatogenesis (GO:0007283: BP) | 25 (58) | < 0.0001 | Kelch-like family member 10 KLHL10 (3 of 3); rhophilin associated tail protein 1-like (ROPN1L); phosphate cytidylyltransferase 1, choline-b ( PCYT1B (1 of 2)); Outer dense fiber protein 2/Cenexin (ODF2) |
| Meiosis I (GO:0007127: BP) | 12 (25) | 0.00025 | MutS homolog 5 (E. coli) MSH5; DMC1 dosage suppressor of mck1 homolog, meiosis-specific homologous recombination (DMC1); HORMA domain containing 1 HORMAD1; cyclin B1 interacting protein 1, E3 ubiquitin protein ligase CCNB1IP1 |
| Fertilization (GO:0009566: BP) | 8 (14) | 0.00064 | KLHL10 (3 of 3); spectrin, beta, non-erythrocytic 4 SPTBN4 (1 of 2); glycine receptor, beta GLRB (1 of 2); polypyrimidine tract-binding protein-1 (PTPB1) |
|
| |||
| Regulation of BMP signaling pathway (GO:0030510: BP) | 15 (27) | < 0.00028 | Forkhead box H1 (FOXH1); Noggin (NOG); WNT5a; activin A receptor, type I (ACVR1); follistatin-like 1 (FSTL1); bone morphogenetic protein 15 (BMP15); growth differentiation factor 9 (Gdf9) |
| Fibroblast growth factor receptor signaling (GO:0008543: BP) | 21 (47) | < 0.00094 | Kinesin family member-16Bb (KIF16bb); sal-like-4 (SALL4); serine threonine-protein phosphatase 2a (PP2a); sal-like 1 (SALL1); catenin, beta 1 (CTNNB1); fibroblast growth factor receptor-1(FGFR1 (2 of 2)); sprouty homolog 1, antagonist of FGF signaling (SPRY1); FGF20, FGF13, FGF 10 |
| Focal adhesion (GO:0005925: CC) | 26 (50) | < 0.0001 | Filamin A alpha FLNA (2 of 2); ezrin EZR (1 of 2); Rho guanine nucleotide exchange factor-7 (ARHGEF7 (al 3 paralogs)); syndecan-4 (SDC4); PDZ and LIM domain 2 (PDLIM2); talin 2 TLN2 (2 of 2) |
| Blood vessel development (GO:0001568: BP) | 90 (234) | < 0.0001 | Forkhead box H1 (FOXH1); lysophosphatidic acid receptor-2(LPAR2 (2 of 2)); angiopoietin-2 (ANGPT2); Rho guanine nucleotide exchange factor-7(ARHGEF7(3 of 3)); melanoma cell adhesion molecule (MCAM1 of 2); angiopoietin-like-1(ANGPTL1 (2 of 2)); activin A receptor, typeI (ACVR1); activin A receptor type II-like-1(ACVRL1) |
| Proteinaceous extracellular matrix (GO:0005578: CC) | 50 (111) | < 0.0001 | Netrin-4 (NTN4 (2 of 2)); sparc/osteonectin, cwcv and kazal-like domains proteoglycan (testican) (SPOCK2 (2 of 2)); collagens (COL9A2, COL5A2, COL11A1a, COL11A1b, COL27A1 (2 of 2)); ADAMTS8 |
For each tissue we show GO terms enriched in both male-biased and female-biased genes. We report the total number of genes that have sex-biased (SB) expression and the number of sex-biased genes that were annotated with Blastx against NR and with GO terms. For each enriched GO term we report the GO term, its ID, and ontology (BP, Biological Process; MF, Molecular Function, and CC, Cellular Component), the number of sex-biased sequences, number of expressed sequences (in brackets) and the p-value. Statistical significance values were calculated with the Fisher’s test using the elim algorithm for reducing comparisons. We also list representative genes associated with the enriched GO term ordered by fold change expression.
* indicates that the same genes (except the italicized gene) were associated with both the cellular component and biological process GO terms.
Figure 4Male-biased expression of guppy pigmentation orthologs in tail. Barplots show male to female expression ratios (log2 FC: Male/Female) in tail tissue for differentially expressed candidate pigmentation genes (FDR < 0.1). Horizontal grey dotted line marks a 4-fold change in gene expression. Candidate gene names and linkage groups are specified at the bottom.
Linkage groups (LGs) with over-representation or under-representation of male-biased or female-biased genes
| Male-biased genes over represented | Male-biased genes under represented | Female-biased genes over represented | Female-biased genes under represented | |
|---|---|---|---|---|
|
| ||||
| Brain | LG10 | |||
| Tail | LG16 | |||
| Gonad |
|
| ||
|
| ||||
| Brain | LG5 | LG21 | ||
| Tail | LG17 | |||
| Gonad | LG4, LG11 | LG12, LG17 |
We use two expression differences cut-offs, ‘All’, 1.2-fold difference in expression (FDR < 0.05) and ‘median’, median-fold difference in expression (FDR < 0.1) for three different tissues (brain, tail, and gonad). The LGs with significant over- or under-representation of sex-biased genes (p < 0.05) are listed and those with p < 0.05 after correcting for multiple testing are highlighted in bold.
Figure 5Linkage group distributions of sex-biased genes. Distribution of percentage of testis-biased (blue) and ovary-biased (pink) genes over all gonad-expressed genes per linkage group (LG). Sex-biased genes were identified as those that show significant difference in expression (FDR < 0.05) above the 1.2 fold-change (log2FC: Male/Female). LGs with a significant over- or under-representation of sex-biased genes are marked with an asterisk (p < 0.05, after multiple correction).
Figure 6Nucleotide substitution rates in sex-bias genes per tissue. Mean values with 95% confidence intervals for rate of nucleotide substitutions in coding sequences. (A) d /d ratios; (B) d ; and (C) d . Male-biased (MB: blue), female-biased (FB: pink), and unbiased (UB: yellow) genes for brain, gonad, and tail. Asterisks above the boxplots indicate a significant difference in substitution rate was found between the sex-biased and unbiased genes using Mann–Whitney U test for non-parametric distributions (**** p < 0.0001; *** p < 0.001; ** p <0.01; * p < 0.05).
Comparison of d /d values for sex-biased genes to unbiased genes for brain, tail, and gonad tissues
| Brain | Tail | Gonad | |||||||
|---|---|---|---|---|---|---|---|---|---|
| (N) | dN/dS | p-value | (N) | dN/dS | p-value | (N) | dN/dS | p-value | |
|
| |||||||||
| Unbiased | 7386 | 0.097 | 5466 | 0.100 | 1667 | 0.099 | |||
| Male-biased | 105 | 0.082 | 0.015 | 279 | 0.096 | 0.474 | 1350 | 0.122 |
|
| Female-biased | 653 | 0.129 |
| 266 | 0.119 |
| 1984 | 0.112 |
|
|
| |||||||||
| Unbiased | 6061 | 0.096 | 4383 | 0.100 | 1328 | 0.097 | |||
| Male-biased | 96 | 0.084 | 0.083 | 221 | 0.094 | 0.576 | 1111 | 0.122 |
|
| Female-biased | 542 | 0.131 |
| 199 | 0.119 |
| 1561 | 0.111 |
|
|
| |||||||||
| Unbiased | 2018 | 0.094 | 504 | 0.124 | 104 | 0.135 |
| ||
| Male-biased | 74 | 0.077 | 0.061 | 179 | 0.091 |
| 1070 | 0.122 |
|
| Female-biased | 310 | 0.130 |
| 98 | 0.122 | 0.337 | 1368 | 0.108 |
|
|
| |||||||||
| Unbiased | 4043 | 0.096 | 3879 | 0.097 | 1224 | 0.094 | |||
| Male-biased | 22 | 0.107 | 0.929 | 42 | 0.107 | 0.452 | 41 | 0.105 | 0.336 |
| Female-biased | 232 | 0.132 |
| 101 | 0.116 |
| 193 | 0.131 |
|
|
| |||||||||
| Unbiased | 2220 | 0.079 | 1103 | 0.090 | 550 | 0.084 | |||
| Male-biased | 23 | 0.076 | 0.328 | 50 | 0.102 | 0.120 | 684 | 0.125 |
|
| Female-biased | 183 | 0.115 |
| 40 | 0.104 | 0.191 | 499 | 0.112 |
|
We first compare mean dN/dS values for all sex-biased genes (FDR < 0.1 and median fold-change: brain 1.5 fold; tail 1.7 fold; gonad 3.5 fold) to unbiased genes (FDR > 0.1, log2CPM > 2). We then compare sex-biased genes mapped to autosomal linkage groups only. We also compare those expressed only in a single tissue and multiple tissues separately. In the single tissue gonad set we also compared dN/dS to unbiased genes on autosomal linkage groups only (p-values given in brackets). Finally, we compared only highly expressed genes (>32 CPM). For each comparison we give the number of genes (N), the mean dN/dS, and p-value, with p-values less than 0.0001 in bold.