Literature DB >> 23634199

Characterization of sex determination and sex differentiation genes in Latimeria.

Mariko Forconi1, Adriana Canapa, Marco Barucca, Maria A Biscotti, Teresa Capriglione, Francesco Buonocore, Anna M Fausto, Daisy M Makapedua, Alberto Pallavicini, Marco Gerdol, Gianluca De Moro, Giuseppe Scapigliati, Ettore Olmo, Manfred Schartl.   

Abstract

Genes involved in sex determination and differentiation have been identified in mice, humans, chickens, reptiles, amphibians and teleost fishes. However, little is known of their functional conservation, and it is unclear whether there is a common set of genes shared by all vertebrates. Coelacanths, basal Sarcopterygians and unique "living fossils", could help establish an inventory of the ancestral genes involved in these important developmental processes and provide insights into their components. In this study 33 genes from the genome of Latimeria chalumnae and from the liver and testis transcriptomes of Latimeria menadoensis, implicated in sex determination and differentiation, were identified and characterized and their expression levels measured. Interesting findings were obtained for GSDF, previously identified only in teleosts and now characterized for the first time in the sarcopterygian lineage; FGF9, which is not found in teleosts; and DMRT1, whose expression in adult gonads has recently been related to maintenance of sexual identity. The gene repertoire and testis-specific gene expression documented in coelacanths demonstrate a greater similarity to modern fishes and point to unexpected changes in the gene regulatory network governing sexual development.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23634199      PMCID: PMC3636272          DOI: 10.1371/journal.pone.0056006

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Two major processes take place in sexual development: sex determination and sex differentiation. The former process determines whether the bipotential primordium will develop into a testis or an ovary; the latter takes place after sex determination and involves the actual development of testes or ovaries from the undifferentiated gonad [1]. Sex determination is considered as a default pathway or as suppression thereof and initiation of the opposite pathway; in contrast, sex differentiation seems to result from the antagonistic relationship among the genes influencing testis or ovary development [2], [3]. Recently it has emerged that sex-specific mechanisms, which are critical to maintaining the male or female identity of the testis and ovary, also operate in adult mammalian gonads [4]–[6]. Other organs besides the gonads may also acquire elaborate male- and female-specific differences. In vertebrates—with the possible exception of birds [7]—such secondary sexual traits are generally believed to be instructed exclusively by the developing testis or ovary through sex steroids, whereas in invertebrates each somatic cell seems to have an inherent sexual identity [8]. Compared with eutherian mammals, sex steroids and the proteins involved in their metabolism and binding play an earlier role in the sex differentiation process of fish, amphibians, reptiles, birds, and marsupials [9]–[20]. In vertebrates sexual development is determined by two main factors: either the genetic makeup of the individual or the environment, through the influence of temperature during development, nutrients, pH, etc [21]–[23]. It has been demonstrated that in mammals the consecutive processes of sex determination, gonad differentiation and identity maintenance are brought about by a complex network of transcription factor interactions and signalling molecules; a master regulator upstream then directs the network towards male or female [24]. The male-determining gene in most mammals is the Y chromosome SRY gene, which however has only been detected in placental mammals [25]. In chickens (and possibly all birds) the master regulator of sexual development is Dmrt1; its homologues are dmrt1bY (or DMY) in the Japanese ricefish (medaka, Oryzias latipes) [26], [27]; and DM-W in the frog Xenopus laevis [28]. In several fish species this function is served by gonadal soma-derived factor (GSDF) [29], anti-Müllerian hormone (AMH) [30], anti-Müllerian hormone receptor (AMHR2) [31], or other genes. In contrast to the variety of upstream sex determinants, genome-wide studies and homology cloning in teleost fishes, amphibians, reptiles and birds have suggested that the downstream components of the network have a conserved function. This has inspired the paradigm that in sex determination during evolution “masters change, slaves remain” [32]–[34]. However, it is unclear how far back in the evolutionary history this applies and in particular when and how the vertebrate sex regulation network evolved and whether the relevant genes represent an ancient, conserved mechanism or else they were repeatedly and independently recruited to the process. The unique opportunity to examine high-quality RNA from the Indonesian coelacanth Latimeria menadoensis for transcriptome analysis of testis and liver tissue, and the availability of the whole genome sequence of the African coelacanth Latimeria chalumnae, enabled us to gain insights into a “living fossil” that is held to be among the nearest living relatives of tetrapods. The genes involved in the regulatory network of sexual development described so far come mainly from mammalian studies and can be divided into functional groups as follows: 1) genes required for bipotential gonad development [Wilm's tumour suppressor-1 (WT1), steroidogenic factor-1 (SF-1), and GATA-binding protein 4 (GATA-4)]; 2) genes involved in male sex determination [double sex and mab-3 related transcription factor 1 (DMRT1), SRY-related box 9 (SOX9), dosage-sensitive sex-reversal-adrenal hypoplasia congenital-critical region of X chromosome, gene 1 (DAX1), fibroblast growth factor 9 (FGF9), and desert hedgehog (DHH)]; 3) genes involved in male sex differentiation [AMH, AMHR2, and androgen receptor (AR)]; 4) genes involved in female sex determination [Wingless-type MMTV integration site family member 4 (WNT4), R-spondin-1 (RSPO-1), catenin β-1 (CTNNB1), forkhead box transcription factor L2 (FOXL2), and follistatin (FST)]; 5) genes involved in female sex differentiation [aromatase (also known as Cyp19A1 or P450arom), oestrogen receptor α (ERα), and oestrogen receptor β (ERβ)] (Figure 1).
Figure 1

Genes involved in sexual development.

These 22 genes and 11 other genes [DMRT3, DMRT6, GSDF, platelet-derived growth factors (PDGF) α and β and their receptors (PDGFRα, PDGFRβ), 11β-hydroxylase (CYP11B), and 5α-reductase 1, 2, and 3 (SRD5A1, SRD5A2, SRD5A3)], whose involvement in sex development has been documented [35]–[47], were sought in the L. chalumnae genome and in the transcriptome of L. menadoensis; subsequently their expression levels were measured in the liver and testis of an adult specimen of L. menadoensis. The coelacanth gene repertoire and expression profiles were much more similar to those of modern fish than to those of tetrapods, although they may also represent an intermediate condition; these data unexpectedly suggest that the major evolutionary changes accompanying the transition to terrestrial life were also involved in gonad development.

Methods

The genome of the African coelacanth L. chalumnae has recently been sequenced (project accession PRJNA56111) [48] and is available in the framework of the whole genome shotgun (WGS) sequencing project at http://www.ncbi.nlm.nih.gov and http://www.ensembl.org. The transcriptome of its Indonesian congener, L. menadoensis, has been described by Pallavicini and colleagues [49] and Canapa and co-workers [50]. Briefly, an adult male specimen of L. menadoensis weighing 27 kg was caught in a shark net near Talise island, Indonesia [51]. Liver and testis were collected immediately after death and preserved in RNAlater (Applied Biosystems, Warrington, UK). A good quality RNA samples, extracted using Trizol Reagent (Ambion/Life Technologies, Carlsbad, CA) following the manufacturer's instructions (RNA integrity number was 7.0 for testis and 6.6 for liver), were used to generate cDNA libraries for transcriptome sequencing on the Illumina Genome Analyzer II platform (Illumina, San Diego, CA, USA). After filtering high-quality reads, removing reads containing primer/adaptor sequences, and trimming read length, the Illumina 100-bp paired-end reads were assembled on a 4-core server (72GB RAM). CLC Genomic Workbench 4.5.1 (CLC Bio, Katrinebjerg, Denmark) and Trinity [52] were used for de novo assembly of short reads. Contigs confirmed and improved by both methods were pooled in a high-quality set. To identify the coelacanth homologues of the genes involved in sexual development, the corresponding Xenopus tropicalis, Gallus gallus, Danio rerio and Homo sapiens sequences were BLASTed on the L. menadoensis transcript dataset. The identity of each retrieved putative transcript was confirmed through NCBI BLAST by homology. BLASTx analyses allowed transcript completeness to be established (coding sequences, CDSs). The L. menadoensis sequences were then BLASTed against the WGS dataset of L. chalumnae, to identify the genomic scaffolds of the African coelacanth containing them. Species divergence was calculated with PAUP on the matching sequences as p-distance percentage; the Ka/Ks ratio was calculated with KaKs_calculator [53] using the Nei and Gojobori method [54]. The synonymous distance was calculated using MEGA5 [55] by applying the uncorrected modified Nei and Gojobori method [56] to the concatenated CDSs, aligned with ClustalW2 (http://www.ebi.ac.uk/Tools/msa/clustalw2/; [57]). The predicted transcripts of L. chalumnae were collected from ENSEMBL (http://www.ensembl.org/Latimeria_chalumnae/Info/Index). The GSDF CDS was obtained manually by aligning L. menadoensis transcripts to the L. chalumnae genome; FGF9, not found in the transcriptome and not annotated in ENSEMBL, was obtained manually by BLASTing annotated amino acid sequences of other species to the L. chalumnae WGS. GSDF and FGF9 putative transcripts were confirmed by homology through NCBI BLAST. L. chalumnae and L. menadoensis transcripts were compared by ClustalW2 alignment; a graphical representation of each sequence pair is reported in Figures S1A and S1B. Gene ontology (GO) terms involved in sex determination and sex differentiation (GO0007530 and GO0007548, respectively) were selected and L. menadoensis orthologues to D. rerio, X. tropicalis, G. gallus, Canis familiaris, Bos taurus, Sus scrofa, Mus musculus, Rattus norvegicus and H. sapiens counterparts counted. L. menadoensis liver and testis gene expression levels were calculated using the CLC Genomic Workbench 4.5.1 by mapping paired reads from the transcriptome on the assembled transcripts, and given as Fragments Per Kilobase of exon per Million sequenced fragments (FPKM). The lack of some transcripts in the assembled transcriptome may depend on poor gene expression, hence on the limited number of reads, which prevented assembly of a contig. In such cases ENSEMBL gene predictions were used to determine absence or low expression taking into account the predicted transcripts. The FPKM value is therefore still a function of transcript length rather than gene length. The FPKM value was calculated for DMRT3, FOXL2, aromatase, WNT4, and CYP11B on ENSEMBL transcript predictions as well as on the inferred sequence of L. chalumnae FGF9. Besides genes expected to be involved in sexual development, the expression levels of some house-keeping genes, i.e. phosphoglycerate kinase (PGK), heat shock protein class B (HSPCB), and the ribosomal proteins RPS27, RPL19, RPL11, RPL32, chosen according to Eisenberg and Levanon [58], were also evaluated. Correct assignment to evolutionarily related gene groups was established by phylogenetic analysis. Sequences of SOXE, FGF9/16/20, and TGF-β groups of other vertebrates were retrieved from the NCBI protein database and ENSEMBL. Multiple alignments were performed with ClustalW2 using default parameters. Phylogenetic trees were obtained using Bayesian Inference (BI) and Maximum Parsimony (MP) methods. BI analysis was performed with MrBayes 3.1.2 [59] by applying the amino acid model of Dayhoff et al. [60] to the SOXE and TGF-β groups and the one by Jones et al. [61] to the FGF9/16/20 group. Parameters were set to 1,000,000 generations, sampling every 100; burn-in was set at 2,500 and stationarity was defined when the average standard deviation of split frequencies reached a value<0.009. MP analyses were performed with PAUP [62] by applying heuristic search with tree bisection-reconnection (TBR) branch swapping and random stepwise additions with 100 replications; 1,000 bootstrap replicates were calculated. Only minimal trees were retained. The outgroup, accession numbers, and constant, parsimony informative, and parsimony non-informative sites are reported in the legend to each phylogenetic tree. Conserved syntenic blocks were inferred from ENSEMBL annotation of putative CYP11B (Figure S2), DMRT1, FGF9, FGF16, and FGF20 flanking regions from some sequenced vertebrate genomes. Gene sizes and distances were calculated on the basis of the annotated coordinates of each element. Scaffolds containing FGF9 and flanking genes (EFHA1 and ZDHHC20) conserved in tetrapods were identified by homology through tBLASTn on L. chalumnae WGS data.

Results

GO analyses of ‘sex determination’ and ‘sex differentiation’ term annotations of the L. menadoensis transcriptome were conducted and the results compared to selected vertebrate genomes (Tables S1 and S2); 25 contigs were identified as orthologues of a GO0007530 (sex determination) annotation, and 297 contigs were orthologues of the GO0007544 (sex differentiation) annotation. In this study we examined 33 genes with substantial evidence of involvement in sex determination and differentiation (Supplementary notes). CDSs were retrieved from the L. chalumnae genome and the L. menadoensis testis and liver transcriptomes (Tables 1 and 2) and their expression levels assessed. The putative orthology status of closely related genes was confirmed by tree topologies obtained by phylogenetic analysis. Furthermore the instances of micro-synteny conservation described in other vertebrates for DMRT1 [36] and FGF9/16/20 [63], [64] were analysed in the two coelacanths.
Table 1

Male sex-determining/differentiation gene inventory.

Transcript in L. menadoensis Gene location in L. chalumnae Transcript prediction in L. chalumnae
GeneAccessionLengthCDSScaffoldN° exons3 Divergence4 Ka/KsENSEMBL accessionLengthCDS
AMH HF56230213121 13122 JH126742>50.0460.000ENSLACT00000009808168916892
HF56230310391 670
AMHR2 HF5623046936932 JH126659>90.2890.343ENSLACT000000205879219212
AR HF562305259021332 JH126641>80.1650.000ENSLACT00000017177223512392
CYP11B ---JH127279---ENSLACT0000001553614224742
DAX1 HF562306966786JH12826820.2070.309ENSLACT00000007979786786
DHH HF5623079269262 JH12656320.5400.649ENSLACT0000002174912751275
DMRT1 HF56230822449982 JH12723750.1340.000ENSLACT000000150347987982
DMRT3 ---JH127237---ENSLACT0000001375714551455
DMRT6 HF5623093121957JH13092840.129NAENSLACT000000037737987982
FGF9 ---JH128123---Manually identified
FGF20 HF5623103703702 JH12713430.270NAENSLACT00000014939627627
GATA-4 HF56231116551200JH128461>60.064NAENSLACT0000000700012091209
GSDF HF5623121258693JH1276325 >30.8260.470---
PDGFα HF562313968594JH12690960.000NAENSLACT000000250361892594
PDGFβ HF5623146644832 JH128946>40.000NAENSLACT00000002931630630
PDGFRα HF5623158238232 JH128279>70.243NAENSLACT00000010417328532852
PDGFRβ HF5623169951 9952 JH126585>170.1950.954ENSLACT0000001666433123312
HF56231710551 10552
SF-1 HF56231816861401JH1265726 70.000NAENSLACT000000214045915912
SOX8 HF5623197357352 JH126713>30.000NAENSLACT0000001901614341410
SOX9 HF56232033061428JH12658130.1850.000ENSLACT0000002148419081428
SOX10 HF562321140313532 JH12730930.3590.199ENSLACT0000000503445711356
SRD5A1 HF5623223664786JH12990350.164NAENSLACT000000020476104684
SRD5A2 HF5623232711765JH12670050.112NAENSLACT000000259362918765
SRD5A3 HF5623241244945JH12725650.000NAENSLACT000000144232733945
WT1 HF56232522601257JH12665290.1340.000ENSLACT0000001873212601257

Fragmented contig. 2Partial CDS. 3Number of exons from the alignment of L. menadoensis transcripts to the L. chalumnae genome. Where the transcript carries only a partial CDS, the number of exon is partial. 4Divergence between the two coelacanth sequences calculated as p-distance x100. 5The L. chalumnae GSDF gene is split between scaffold JH127632 and contig AFYH01270444. 6 The L. chalumnae SF-1 gene is split between scaffold JH126572 and contig AFYH01271535.

Table 2

Female sex-determining/differentiation gene inventory.

L. menadoensis transcriptGene location in L. chalumnae Transcript prediction in L. chalumnae
GeneAccessionLengthCDSScaffoldN° exons3 Divergence4 Ka/KsENSEMBL accessionLengthCDS
Aromatase ---JH127307---ENSLACT00000010703132913292
CTNNB1 HF56232633252346JH127054150.7020.000ENSLACT0000001733534582346
ERα HF562327200215412 JH1292275 >80.352NAENSLACT000000050563963962
ERβ HF56232831841689JH12656490.1590.000ENSLACT0000001923524651689
FOXL2 ---JH127245---ENSLACT00000012991915915
FST HF56232923811044JH12729160.2210.000ENSLACT00000014112202710322
RSPO-1 HF5623304741 4252 JH126592>50.626NAENSLACT000000193837477472
HF5623314851 2692
WNT4 ---JH126950---ENSLACT0000001713910681068

Fragmented contig. 2Partial CDS. 3Number of exons from the alignment of L. menadoensis transcripts to the L. chalumnae genome. Where the transcript carries only a partial CDS, the exon number is partial. 4Divergence between the two coelacanth sequences calculated as p-distance x100. 5The ERα gene in the L. chalumnae genome is split among scaffolds JH129227, JH129408, JH129637, and JH133026.

Fragmented contig. 2Partial CDS. 3Number of exons from the alignment of L. menadoensis transcripts to the L. chalumnae genome. Where the transcript carries only a partial CDS, the number of exon is partial. 4Divergence between the two coelacanth sequences calculated as p-distance x100. 5The L. chalumnae GSDF gene is split between scaffold JH127632 and contig AFYH01270444. 6 The L. chalumnae SF-1 gene is split between scaffold JH126572 and contig AFYH01271535. Fragmented contig. 2Partial CDS. 3Number of exons from the alignment of L. menadoensis transcripts to the L. chalumnae genome. Where the transcript carries only a partial CDS, the exon number is partial. 4Divergence between the two coelacanth sequences calculated as p-distance x100. 5The ERα gene in the L. chalumnae genome is split among scaffolds JH129227, JH129408, JH129637, and JH133026. To establish whether the sequence information from L. menadoensis and L. chalumnae could be combined, their genetic distance was determined by comparing the transcripts of the former to the genomic sequences of the latter. The distance, calculated over all matching sequences, ranges between 0% and 0.826%, divergences being due mainly to mutations, insertions or deletions in untranslated regions (UTRs). Point mutations affecting the transcript coding region are predominantly synonymous (Tables 1 and 2). The synonymous distance calculated over the whole gene set was 0.0019 (standard error 0.0005). These findings showed that the data of the two species can be pooled and investigated together.

Genes in male sexual development

Twenty-five genes involved in male sexual development were analysed in Latimeria: 3 genes containing a double sex and mab-3 (DM) domain (DMRT1, DMRT3, and DMRT6); 3 genes belonging to the SOXE subfamily (SOX8, SOX9, and SOX10) of SRY-related HMG box transcription factors; other transcription factors including WT1, DAX1, GATA-4, DHH, SF-1; the signalling molecules PDGFα and β, GSDF, AMH, FGF9, and FGF20; 4 receptors comprising AR, AMHR2, and PDGFRα and β; and the steroidogenic enzymes SRD5A1, SRD5A2, SRD5A3 and CYP11B (Table 1). ENSEMBL prediction recovered 23 out of 25 genes in the L. chalumnae genome annotation. The two missing sequences were inferred manually from the genome assembly: one, FGF9, was identified by comparison with orthologous sequences of other species, and the other, GSDF, by aligning an L. menadoensis transcript to WGS contigs of L. chalumnae. Fifteen of the 23 predicted transcripts of L. chalumnae carried complete CDS whereas 8 were partial. The manually inferred L. chalumnae FGF9 covers the complete CDS, whereas the L. chalumnae GSDF homologue is incomplete (about 75% of the CDS). The testis and liver transcriptomes of L. menadoensis contain 22 transcripts. Half of the contigs carried a complete CDS, the other half were partial or fragmented. Transcripts of 3 genes, FGF9, CYP11B, and DMRT3, were not found in liver and testis (Table 1). The male sex development sequences of L. menadoensis and L. chalumnae are compared in Figure S1A. Since the expression of 13 male sex development transcripts in testis was<1 FPKM unit, they were considered as not being expressed above background. With the exception of AR, 11 genes (DMRT6, DMRT1, SOX9, SOX10, WT1, GSDF, AMH, SRD5A1, SRD5A3, DHH, and SF-1) were more expressed in testis than in liver, but only 3 (DMRT1, DMRT6, and SOX9) exhibited a differential expression with an FPKM difference>10. In liver 7 genes were expressed above background: SOX9, SRD5A1, AR, DAX1, PDGFα, GATA-4, and SRD5A2 (Figures 2A and 2B).
Figure 2

Expression of male development genes.

Expression levels of male sex-determining/differentiation genes in L. menadoensis liver and testis transcriptomes. Values are expressed as FPKM (Fragments Per Kilobase of exon per Million sequenced fragments). A) genes highly expressed in testis; B) genes poorly expressed in testis. The expression levels of some housekeeping genes (not represented) were also analysed: PGK 96.95 (liver), 342.41 (testis); RPS27a 152.59 (liver), 128.43 (testis); RPL19 744.01 (liver), 64.89 (testis); RPL11 457.35 (liver), 282.59 (testis); RPL32 629.83 (liver), 373.75 (testis); HSPCB 507.99 (liver), 1213.75 (testis). Threshold value = 1. * Expression level assessed on L. chalumnae orthologue.

Expression of male development genes.

Expression levels of male sex-determining/differentiation genes in L. menadoensis liver and testis transcriptomes. Values are expressed as FPKM (Fragments Per Kilobase of exon per Million sequenced fragments). A) genes highly expressed in testis; B) genes poorly expressed in testis. The expression levels of some housekeeping genes (not represented) were also analysed: PGK 96.95 (liver), 342.41 (testis); RPS27a 152.59 (liver), 128.43 (testis); RPL19 744.01 (liver), 64.89 (testis); RPL11 457.35 (liver), 282.59 (testis); RPL32 629.83 (liver), 373.75 (testis); HSPCB 507.99 (liver), 1213.75 (testis). Threshold value = 1. * Expression level assessed on L. chalumnae orthologue. DMRT6 was the most highly expressed transcript among the 25 male sex development genes analysed (37.79 FPKM in testis, no expression in liver) and one of the 2,000 most abundant transcripts among the 61,000 plus contigs measured in testis. DMRT1, a major gene in male development, plays a key function in fish [65], [66], chickens [67], [68], and reptiles [69]. Alignment of L. menadoensis transcripts to the L. chalumnae genome (Figure 3A) identified 5 exons which exceeded the ENSEMBL predicted transcript by 1,572 bp at the 3′ end (Figure 3B). The DM domain is encoded in the first annotated exon. The long 3′UTR harbours a 320-bp region containing a low-copy interspersed repeat.
Figure 3

Conserved micro-synteny and structure of the DMRT1 genomic locus and transcripts.

A) Genomic representation of DMRT1 on scaffold JH127237 of L. chalumnae. Grey box corresponds to gene. Small boxes and V signs represent the intron/exon map. B) Transcript representation of DMRT1 in L. menadoensis and L. chalumnae. Boxes: exons; V signs: introns; white box: DM domain; light grey box: 3′UTR; dashed box: putative transposable element contained in the 3′UTR. Dotted boxes represent missing exons in the ENSEMBL transcript prediction. C) Micro-syntenic conservation of genomic blocks containing the DMRT1 gene. White pentagons represent DMRT1 genes. The pentagon tip points to the relative gene orientation. Numbers near the pentagons stand for gene size expressed as kb, numbers on lines represent intergene distance expressed as kb. ENSEMBL data: H. sa (Homo sapiens), M. mu (Mus musculus), O. an (Ornithorhynchus anatinus), G. ga (Gallus gallus), A. ca (Anolis carolinensis), L. ch (Latimeria chalumnae), D. re (Danio rerio), T. ru (Takifugu rubripes). L. chalumnae DMRT1 position was clarified using the L. menadoensis transcript, by integrating the L. chalumnae ENSLACT00000015034 coordinates. *In O. anatinus DMRT1 gene size was defined by comparison with other species. **Values obtained in G. gallus from the annotation of NC_006127.3 accession.

Conserved micro-synteny and structure of the DMRT1 genomic locus and transcripts.

A) Genomic representation of DMRT1 on scaffold JH127237 of L. chalumnae. Grey box corresponds to gene. Small boxes and V signs represent the intron/exon map. B) Transcript representation of DMRT1 in L. menadoensis and L. chalumnae. Boxes: exons; V signs: introns; white box: DM domain; light grey box: 3′UTR; dashed box: putative transposable element contained in the 3′UTR. Dotted boxes represent missing exons in the ENSEMBL transcript prediction. C) Micro-syntenic conservation of genomic blocks containing the DMRT1 gene. White pentagons represent DMRT1 genes. The pentagon tip points to the relative gene orientation. Numbers near the pentagons stand for gene size expressed as kb, numbers on lines represent intergene distance expressed as kb. ENSEMBL data: H. sa (Homo sapiens), M. mu (Mus musculus), O. an (Ornithorhynchus anatinus), G. ga (Gallus gallus), A. ca (Anolis carolinensis), L. ch (Latimeria chalumnae), D. re (Danio rerio), T. ru (Takifugu rubripes). L. chalumnae DMRT1 position was clarified using the L. menadoensis transcript, by integrating the L. chalumnae ENSLACT00000015034 coordinates. *In O. anatinus DMRT1 gene size was defined by comparison with other species. **Values obtained in G. gallus from the annotation of NC_006127.3 accession. The size of the DMRT1 gene in the L. chalumnae genome is>152 kb (Figure 3A), close to the 127 kb gene of H. sapiens (ENSEMBL annotation) but spanning a much longer range than the 3 kb gene of Crocodylus palustris [70], the 45 kb gene of D. rerio [71], and 53–58 kb gene of G. gallus ([72], ENSEMBL). Moreover the lack of a 5′ UTR (Figure 3B), which in other fish is transcribed in the so-called exon 0 [73], both in sequences from the transcriptome and the ENSEMBL prediction, suggests the existence of another exon (which would further extend the genomic locus). Brunner and colleagues [36] previously reported that the gene order around the DMRT1 gene, involving two other DM domain genes, DMRT2 and DMRT3, and the gene KANK1 (KIAA0172), was strictly conserved. A similar micro-synteny conservation was also noted in the L. chalumnae genome when the genomic scaffold JH127237 (1,057,921 bp), from position 608,000 to 941,000, was compared to other vertebrate chromosomes (Figure 3C). Interestingly, this region is linked to the Z gonosome in G. gallus (where DMRT1 is pivotal in male development) and to the X5 gonosome in Ornithorhynchus anatinus, whereas in other species of the actinopterygian and sarcopterygian lineages it is located on an autosome. To date it has been impossible to identify sex chromosomes in the Latimeria karyotype [74] or to relate the scaffold containing DMRT1 to a definite chromosome. DMRT1 is the second most abundantly expressed gene in testis (11.84 FPKM units) and among the 10% most abundantly expressed transcripts (Figure 2A) of those analysed. SOX9 is a transcription factor activating AMH; together with DMRT1 it inhibits WNT4 and FOXL2. In mammals it is activated by another SOX family protein, SRY, whereas in other vertebrates it is mainly regulated by SF-1 and DMRT1; together with SOX8 and SOX10 it belongs to SOX protein subgroup E. Phylogenetic analysis (Figure 4) of SOX E proteins from several vertebrates yielded a tree topology with 3 major clades corresponding to the 3 genes. In the SOX9 and SOX10 clades Latimeria sequences comprise a sister group of tetrapods, while the relationship of the Latimeria SOX8 was not clearly resolved given its phylogenetic position. SOX9 and SOX10 were more strongly expressed in testis than in liver (Figure 2A; FPKM: 11.60 and 1.38 for SOX9, FPKM: 2.25 and 0.04 for SOX10), whereas SOX8 expression was scanty in L. menadoensis liver (Figure 2B).
Figure 4

Phylogenetic tree of SOX8, SOX9, and SOX10.

Phylogenetic analyses of vertebrate SOXE amino acid sequences. Midpoint rooting. Total characters: 592, constant: 164, parsimony non-informative: 77, parsimony informative: 351. Numbers close to nodes represent posterior probability in Bayesian Inference/bootstrap percentage in Maximum Parsimony. Danio rerio (SOX8: AAX73357.1; SOX9a: NP_571718.1; SOX9b: NP_571719.1; SOX10: AAK84872.1); Dicentrarchus labrax (SOX8: CBN81184.1; SOX9: CBN81190.1); Gallus gallus (SOX8: AAF73917.1; SOX9: BAA25296.1; SOX10: AAD38050.2); Homo sapiens (SOX8: AAH31797.1; SOX9: CAA86598.1; SOX10: CAG30470.1); Latimeria chalumnae (SOX8: ENSLACP00000018883; SOX9: ENSLACP00000021343; SOX10: ENSLACP00000004990); Latimeria menadoensis (SOX9, SOX10: this study); Mus musculus (SOX8: AAF35837.1; SOX9: AAH23953.1; SOX10: NP_035567.1); Oryzias latipes (SOX8: NP_001158342.1; SOX9a: AAX62152.1; SOX9b: AAX62151.1); Salmo salar (SOX8: ABC24688.1; SOX9: ACN10975.1); Scyliorhinus canicula (SOX9: ABY71239.1); Trachemys scripta (SOX8: AAP59791.1; SOX9: ACG70782.1; SOX10: ENSLACP00000004990); Xenopus laevis (SOX8: AAI69525.1; SOX9: NP_001084276; SOX10: NP_001082358.1). *Only a partial SOX8 sequence, perfectly matching the ENSEMBL prediction of the L. chalumnae SOX8 gene, was retrieved in the transcriptome assembly of L. menadoensis.

Phylogenetic tree of SOX8, SOX9, and SOX10.

Phylogenetic analyses of vertebrate SOXE amino acid sequences. Midpoint rooting. Total characters: 592, constant: 164, parsimony non-informative: 77, parsimony informative: 351. Numbers close to nodes represent posterior probability in Bayesian Inference/bootstrap percentage in Maximum Parsimony. Danio rerio (SOX8: AAX73357.1; SOX9a: NP_571718.1; SOX9b: NP_571719.1; SOX10: AAK84872.1); Dicentrarchus labrax (SOX8: CBN81184.1; SOX9: CBN81190.1); Gallus gallus (SOX8: AAF73917.1; SOX9: BAA25296.1; SOX10: AAD38050.2); Homo sapiens (SOX8: AAH31797.1; SOX9: CAA86598.1; SOX10: CAG30470.1); Latimeria chalumnae (SOX8: ENSLACP00000018883; SOX9: ENSLACP00000021343; SOX10: ENSLACP00000004990); Latimeria menadoensis (SOX9, SOX10: this study); Mus musculus (SOX8: AAF35837.1; SOX9: AAH23953.1; SOX10: NP_035567.1); Oryzias latipes (SOX8: NP_001158342.1; SOX9a: AAX62152.1; SOX9b: AAX62151.1); Salmo salar (SOX8: ABC24688.1; SOX9: ACN10975.1); Scyliorhinus canicula (SOX9: ABY71239.1); Trachemys scripta (SOX8: AAP59791.1; SOX9: ACG70782.1; SOX10: ENSLACP00000004990); Xenopus laevis (SOX8: AAI69525.1; SOX9: NP_001084276; SOX10: NP_001082358.1). *Only a partial SOX8 sequence, perfectly matching the ENSEMBL prediction of the L. chalumnae SOX8 gene, was retrieved in the transcriptome assembly of L. menadoensis. In mammals FGF9 has an important function in male development, creating a positive feedback cycle with SOX9 and inhibiting the WNT4 pathway in testis [75]. It has not yet been detected in teleosts and seems to be replaced by FGF20b [63], [64] in sexual development. Interestingly, we found an FGF9-like sequence in L. chalumnae. To confirm the orthology relationships of the putative Latimeria FGF9, FGF16, and FGF20, sequence comparisons were performed and the conserved synteny arrangements of the flanking regions investigated (Figure 5). In tetrapods the two blocks harbouring FGF9 or FGF20 are characterized by an EFHA and a ZDHHC gene upstream the FGF genes. Extensive gene-deserted regions are found downstream FGF9, 16 and 20. In teleosts (where FGF9 is absent) the other genes forming the micro-syntenic cluster are distributed on different chromosomes. In L. chalumnae the FGF9 cluster is split between two scaffolds whose co-localization on the same chromosome cannot as yet be confirmed. However, the proximity of a putative EFHA1 coding fragment upstream the 5′ end of FGF9 suggests that the Latimeria FGF9 follows the tetrapod pattern.
Figure 5

Analysis of micro-syntenic conservation in FGF9, FGF16 and FGF20 blocks.

Micro-syntenic conservation of genomic regions containing the FGF9, FGF20 and FGF16 genes. White pentagons represent FGF genes. The pentagon tip points to the relative gene orientation. The grey mark on the top third of the figure indicates a EFHA1 putative sequence of Latimeria chalumnae. Numbers near pentagons stand for gene size expressed as kb, numbers on lines represent intergene distance expressed as kb. ENSEMBL data: H. sa (Homo sapiens), G. ga (Gallus gallus), A. ca (Anolis carolinensis), X. tr (Xenopus tropicalis), L. ch (Latimeria chalumnae), D. re (Danio rerio), T. ru (Takifugu rubripes). Syntenic blocks for FGF20 in L. chalumnae and X. tropicalis, and FGF16 in A. carolinensis are split between two different scaffolds. The ZDHHC15 genes belonging to the syntenic block of FGF16 in H. sapiens and X. tropicalis lie on the same chromosome or scaffold, but are far removed from the genomic locus of FGF16 and ATRX. *Genes missing in the ENSEMBL prediction.

Analysis of micro-syntenic conservation in FGF9, FGF16 and FGF20 blocks.

Micro-syntenic conservation of genomic regions containing the FGF9, FGF20 and FGF16 genes. White pentagons represent FGF genes. The pentagon tip points to the relative gene orientation. The grey mark on the top third of the figure indicates a EFHA1 putative sequence of Latimeria chalumnae. Numbers near pentagons stand for gene size expressed as kb, numbers on lines represent intergene distance expressed as kb. ENSEMBL data: H. sa (Homo sapiens), G. ga (Gallus gallus), A. ca (Anolis carolinensis), X. tr (Xenopus tropicalis), L. ch (Latimeria chalumnae), D. re (Danio rerio), T. ru (Takifugu rubripes). Syntenic blocks for FGF20 in L. chalumnae and X. tropicalis, and FGF16 in A. carolinensis are split between two different scaffolds. The ZDHHC15 genes belonging to the syntenic block of FGF16 in H. sapiens and X. tropicalis lie on the same chromosome or scaffold, but are far removed from the genomic locus of FGF16 and ATRX. *Genes missing in the ENSEMBL prediction. Phylogenetic analysis of the FGF9/16/20 group (Figure 6) uncovered three major clades corresponding to the 3 genes. The exact position of L. chalumnae FGF20 sequence is unresolved; like the X. laevis orthologue it is paraphyletic to teleosts and tetrapods. As expected, the coelacanth FGF16 sequence is basal to the tetrapods. However, the position of the Latimeria FGF9, albeit firmly nested within the FGF9 tetrapod clade, does not reflect its phylogenetic position in the taxonomic group.
Figure 6

Phylogenetic tree of FGF9, FGF16, and FGF20.

Phylogenetic analysis of amino acid sequences of the vertebrate FGF9/16/20. Midpoint rooting. Total characters: 237, constant: 91, parsimony non-informative: 35, parsimony informative: 111. Numbers close to nodes represent posterior probability in Bayesian Inference/bootstrap percentage in Maximum Parsimony. Danio rerio (FGF16: ENSDART00000061928; FGF20a: NP_001032180.1; FGF20b: NP_001034261.1); Gallus gallus (FGF9: NP_989730.1; FGF16; NP_001038115.1; FGF20: XP_426335.2); Homo sapiens (FGF9: NP_002001.1; FGF16: NP_003859.1; FGF20: NP_062825.1); Latimeria chalumnae (FGF9: manually inferred from JH128123; FGF16: ENSLACT00000011509; FGF20: ENSLACT00000014939); Mus musculus (FGF9: ADL60500.1; FGF16: BAB16405.1; FGF20: NP_085113.2); Oryzias latipes (FGF16: ENSORLT00000007651; FGF20a: ENSORLT00000012578; FGF20b: ENSORLT00000025767); Takifugu rubripes (FGF16: ENSTRUT00000021181; FGF20(1): ENSTRUT00000008788; FGF20(2): ENSTRUT00000039390); Xenopus tropicalis (FGF9: XP_002938621.1; FGF16: ENSXETT00000009790; FGF20: NP_001137399.1). Latimeria menadoensis is missing in this analysis because FGF9 and FGF20 are poorly or not expressed in the transcriptomes.

Phylogenetic tree of FGF9, FGF16, and FGF20.

Phylogenetic analysis of amino acid sequences of the vertebrate FGF9/16/20. Midpoint rooting. Total characters: 237, constant: 91, parsimony non-informative: 35, parsimony informative: 111. Numbers close to nodes represent posterior probability in Bayesian Inference/bootstrap percentage in Maximum Parsimony. Danio rerio (FGF16: ENSDART00000061928; FGF20a: NP_001032180.1; FGF20b: NP_001034261.1); Gallus gallus (FGF9: NP_989730.1; FGF16; NP_001038115.1; FGF20: XP_426335.2); Homo sapiens (FGF9: NP_002001.1; FGF16: NP_003859.1; FGF20: NP_062825.1); Latimeria chalumnae (FGF9: manually inferred from JH128123; FGF16: ENSLACT00000011509; FGF20: ENSLACT00000014939); Mus musculus (FGF9: ADL60500.1; FGF16: BAB16405.1; FGF20: NP_085113.2); Oryzias latipes (FGF16: ENSORLT00000007651; FGF20a: ENSORLT00000012578; FGF20b: ENSORLT00000025767); Takifugu rubripes (FGF16: ENSTRUT00000021181; FGF20(1): ENSTRUT00000008788; FGF20(2): ENSTRUT00000039390); Xenopus tropicalis (FGF9: XP_002938621.1; FGF16: ENSXETT00000009790; FGF20: NP_001137399.1). Latimeria menadoensis is missing in this analysis because FGF9 and FGF20 are poorly or not expressed in the transcriptomes. Unexpectedly, neither FGF9 nor FGF20 expression was found in L. menadoensis testis. GSDF, a recently described gene that appears to be critically involved in the development of male teleosts [29], [39], [40], [45], has not been found in tetrapods and no sarcopterygian homologue has yet been described. However, BLAST analysis of teleost GSDF in the L. menadoensis transcript database suggested a putative GSDF gene, whose identity was confirmed by BLASTx analysis. Despite low similarity values (29% identity, 49% positive matching with Oncorhynchus mykiss GSDF NP_001118051.1, and 28% identity and 50% positive matching with O. latipes GSDF NP_001171213.1), BI and MP analyses reliably assigned the sequence to the teleost GSDF clade (Figure 7). Besides GSDF the phylogenetic analysis included two other proteins of the TGF-β family, AMH and inhibin-α, selected for their close relationships to GSDF [39]. A multiple amino acid alignment of the conserved TGF-β domain of the 3 genes disclosed that the L. menadoensis GSDF is a sister group of teleost GSDFs, with a posterior probability of 100 in BI analysis and a bootstrap value of 97 in the MP tree (Figure 8). The lack of a glycine, a diagnostic amino acid not found in the GSDF protein [45], in a cysteine knot further confirms the inclusion of the L. menadoensis sequence in the GSDF group, the first homologue to be described in the sarcopterygian lineage.
Figure 7

Phylogenetic tree of GSDF, AMH, and inhibin-α.

Phylogenetic analysis of amino acid sequences of vertebrate GSDF, inhibin-α and AMH. Total characters: 849, constant: 84, parsimony non-informative: 225, parsimony informative: 540. Outgroup: human glial-derived nerve growth factor (GDNF). Numbers close to nodes represent posterior probabilities in Bayesian Inference/bootstrap percentage in Maximum Parsimony. Anolis carolinensis (inhibin-α: ENSACAT00000014331); Danio rerio (GSDFa: AEL99890.1; GSDFb: AEL99889.1; AMH: NP_001007780.1; inhibin-α: ENSDART00000057348); Gallus gallus (AMH: NP_990361.1; inhibin-α: NP_001026428.1); Gasterosteus aculeatus (GSDF: ENSGACT00000021595; inhibin-α: ENSGACT00000018909); Homo sapiens (AMH AAC25614.1; GDNF: NP_000505.1); Latimeria chalumnae (inhibin-α: ENSLACT00000017535); Latimeria menadoensis (GSDF, AMH this study); Mus musculus (AMH: AAI50478.1; inhibin-α: AAH56627.1); Oreochromis niloticus (GSDF: BAJ78985.1); Oryzias latipes (GSDF: NP_001171213.1); Oncorhynchus mykiss (GSDF: ABF48201.1); Takifugu rubripes (GSDF: ENSTRUT00000036269; AMH: ENSTRUT00000045919); Xenopus laevis (inhibin-α: NP_001106349.1). The reliability of L. menadoensis CDSs is supported by the same sequence resulting from application of two different assembly procedures.

Figure 8

Multiple alignment of the TGF-β domain in GSDF, AMH, and inhibin-α.

Conserved amino acids of the cysteine knot are boxed. Anolis carolinensis (inhibin-α: ENSACAT00000014331); Danio rerio (GSDFa: AEL99890.1, GSDFb: AEL99889.1; AMH: NP_001007780.1; inhibin-α: ENSDART00000057348); Gallus gallus (AMH: NP_990361.1; inhibin-α: NP_001026428.1); Gasterosteus aculeatus (GSDF: ENSGACT00000021595; inhibin-α: ENSGACT00000018909); Homo sapiens (AMH: AAC25614.1; GDNF: NP_000505.1); Latimeria chalumnae (inhibin-α: ENSLACT00000017535); Latimeria menadoensis (this study); Mus musculus (AMH: AAI50478.1; inhibin-α: AAH56627.1); Oreochromis niloticus (GSDF: BAJ78985.1); Oryzias latipes (GSDF: NP_001171213.1); Oncorhynchus mykiss (GSDF: ABF48201.1); Takifugu rubripes (GSDF: ENSTRUT00000036269; AMH: ENSTRUT00000045919); Xenopus laevis (inhibin-α: NP_001106349.1). The reliability of L. menadoensis CDSs is supported by the same sequence resulting from application of two different assembly procedures.

Phylogenetic tree of GSDF, AMH, and inhibin-α.

Phylogenetic analysis of amino acid sequences of vertebrate GSDF, inhibin-α and AMH. Total characters: 849, constant: 84, parsimony non-informative: 225, parsimony informative: 540. Outgroup: human glial-derived nerve growth factor (GDNF). Numbers close to nodes represent posterior probabilities in Bayesian Inference/bootstrap percentage in Maximum Parsimony. Anolis carolinensis (inhibin-α: ENSACAT00000014331); Danio rerio (GSDFa: AEL99890.1; GSDFb: AEL99889.1; AMH: NP_001007780.1; inhibin-α: ENSDART00000057348); Gallus gallus (AMH: NP_990361.1; inhibin-α: NP_001026428.1); Gasterosteus aculeatus (GSDF: ENSGACT00000021595; inhibin-α: ENSGACT00000018909); Homo sapiens (AMH AAC25614.1; GDNF: NP_000505.1); Latimeria chalumnae (inhibin-α: ENSLACT00000017535); Latimeria menadoensis (GSDF, AMH this study); Mus musculus (AMH: AAI50478.1; inhibin-α: AAH56627.1); Oreochromis niloticus (GSDF: BAJ78985.1); Oryzias latipes (GSDF: NP_001171213.1); Oncorhynchus mykiss (GSDF: ABF48201.1); Takifugu rubripes (GSDF: ENSTRUT00000036269; AMH: ENSTRUT00000045919); Xenopus laevis (inhibin-α: NP_001106349.1). The reliability of L. menadoensis CDSs is supported by the same sequence resulting from application of two different assembly procedures.

Multiple alignment of the TGF-β domain in GSDF, AMH, and inhibin-α.

Conserved amino acids of the cysteine knot are boxed. Anolis carolinensis (inhibin-α: ENSACAT00000014331); Danio rerio (GSDFa: AEL99890.1, GSDFb: AEL99889.1; AMH: NP_001007780.1; inhibin-α: ENSDART00000057348); Gallus gallus (AMH: NP_990361.1; inhibin-α: NP_001026428.1); Gasterosteus aculeatus (GSDF: ENSGACT00000021595; inhibin-α: ENSGACT00000018909); Homo sapiens (AMH: AAC25614.1; GDNF: NP_000505.1); Latimeria chalumnae (inhibin-α: ENSLACT00000017535); Latimeria menadoensis (this study); Mus musculus (AMH: AAI50478.1; inhibin-α: AAH56627.1); Oreochromis niloticus (GSDF: BAJ78985.1); Oryzias latipes (GSDF: NP_001171213.1); Oncorhynchus mykiss (GSDF: ABF48201.1); Takifugu rubripes (GSDF: ENSTRUT00000036269; AMH: ENSTRUT00000045919); Xenopus laevis (inhibin-α: NP_001106349.1). The reliability of L. menadoensis CDSs is supported by the same sequence resulting from application of two different assembly procedures. BLAST analysis of L. menadoensis GSDF on the L. chalumnae genome allowed identification of a genomic counterpart that was found partly on contig AFYH01270444 and partly on scaffold JH127632, with an intervening gap of 171 bp. The L. menadoensis GSDF is strongly expressed in testis but is not expressed in liver (Figure 2A).

Genes in female sexual development

Eight female determining/differentiation genes were examined in the two coelacanths (Table 2): 3 genes belonging to the WNT signalling pathway (WNT4, RSPO-1, and CTNNB1), a transcription factor (FOXL2), two oestrogen receptors (ERα and ERβ), a steroidogenic enzyme (aromatase), and an activin-binding protein (FST). ENSEMBL prediction recovered all 8 gene sequences in the L. chalumnae genome. Four transcripts (ERβ, CTNNB1, WNT4, and FOXL2) have a complete CDS; only two codons are missing at the 5′ end of FST; RSPO-1 and aromatase are partial, whereas ERα, subdivided into 4 different scaffolds in the WGS, could be only partially identified. Analysis of the L. menadoensis transcriptome yielded 3 complete CDS sequences (CTNNB1, ERβ, and FST) and 2 fragmented CDSs (RSPO-1 and ERα), whereas 3 transcripts were missing (FOXL2, WNT4, and aromatase). The female sex development sequences of L. menadoensis and L. chalumnae are compared in Figure S1B. Their values in L. menadoensis testis and liver are shown in Figure 9. As expected, WNT4, FOXL2 and aromatase — held to be responsible for female development and pathway maintenance — were not expressed in testis. CTNNB1, FST, and ERβ were a strongly expressed in liver (56.08, 27.33, 12.93 FPKM, respectively); the expression of FST and CTNNB1 was expected, because their expression is ubiquitous [76]. Finally, ERβ liver expression in the L. menadoensis specimen, a male individual, was unexpected.
Figure 9

Expression of female development genes.

Expression of female-determining/differentiation genes in L. menadoensis liver and testis transcriptomes. Expression levels are reported as FPKM (Fragments Per Kilobase of exon per Million sequenced fragments). The expression levels of some housekeeping genes were also analysed: PGK 96.95 (liver), 342.41 (testis); RPS27a 152.59 (liver), 128.43 (testis); RPL19 744.01 (liver) 64.89 (testis); RPL11 457.35 (liver), 282.59 (testis); RPL32 629.83 (liver), 373.75 (testis); HSPCB 507.99 (liver), 1213.75 (testis). Threshold value = 1. * Expression level assessed on L. chalumnae orthologue.

Expression of female development genes.

Expression of female-determining/differentiation genes in L. menadoensis liver and testis transcriptomes. Expression levels are reported as FPKM (Fragments Per Kilobase of exon per Million sequenced fragments). The expression levels of some housekeeping genes were also analysed: PGK 96.95 (liver), 342.41 (testis); RPS27a 152.59 (liver), 128.43 (testis); RPL19 744.01 (liver) 64.89 (testis); RPL11 457.35 (liver), 282.59 (testis); RPL32 629.83 (liver), 373.75 (testis); HSPCB 507.99 (liver), 1213.75 (testis). Threshold value = 1. * Expression level assessed on L. chalumnae orthologue.

Discussion

In this study a set of 33 genes held to be critically involved in sexual development were isolated and characterized for the first time in coelacanths. Comparison of the gene sequences of the two Latimeria species confirmed the very slow rate of gene evolution that has recently been documented in HOX genes [77], although the latter genes are known to evolve particularly slowly. The 33 genes examined belong to a range of different families, thus providing valuable information. Interpretation of our data is of course limited by the fact that they come from a single adult individual. However, given the importance of this living fossil in understanding tetrapod and fish evolution, and the exceptional opportunity provided by the availability of high-quality RNA from a specimen of an endangered species, we nonetheless cautiously draw some conclusions. Ka/Ks analysis indicated that no gene in the set studied here is under positive selection in coelacanths. A totally unexpected finding was the very high DMRT6 expression in testis, which was actually the most abundant male-specific transcript. To date the gene had only been found in amniotes and is not annotated in Xenopus and all fish genomes. This phylogenetic pattern could be explained by its being a newly arisen paralogue of the DMRT family at the base of amniote vertebrates. Detection of a bona fide DMRT6 homologue in Latimeria points to a much earlier origin of the gene and supports a possible origin from the 1R/2R whole genome duplication events that occurred in ancestral vertebrates [78] and a subsequent, repeated loss in the teleost fish and amphibian lineages and even in basal chordates. Since information on DMRT6 expression is quite scanty the interpretation of some of these data is merely speculative. In mouse embryo it is expressed in the developing brain but not in the gonads [43]. In the human microarray database (https://www.genevestigator.com) it is highly expressed exclusively in ovary and testis, whereas studies of mouse organs have disclosed that only erythroblasts and oocytes show elevated expression. Whatever its original function, it is reasonable to assume that DMRT6 was taken over by other members of the gene family, and that it has ceased to be required in those lineages where it is no longer extant. Its persistence in Latimeria may indicate an important function in male (and possibly female) development which, according to current knowledge, was then at least partially conserved in amniotes. Our findings suggest its being a putative novel gene in the gonad regulatory network. The high DMRT1 expression found in Latimeria testis and its lack of expression in liver is in line with its expression pattern and important role in testis development and in maintenance of the male gonad identity documented in vertebrates, from fish to mammals [65], [66]. In teleost fish adult testis DMRT1 is found in germ cells, in somatic cell types or both [65]. Unfortunately, RNA-Seq transcriptome data provide no information on the cell type expressing DMRT1 in coelacanth testis. In medaka a duplicated version of DMRT1 on the Y chromosome, designated dmrt1bY, is the master male sex determining gene [26], [27]. Its major function appears to suppress germ cell proliferation at the critical sex-determining stage in males [79]. In adult testes it is dramatically downregulated [80], and its high expression suggests that only the autosomal DMRT1 (dmrt1a in medaka) may function in mature testis. As in all the teleosts studied so far, a single DMRT1 copy was found in L. menadoensis, suggesting that in coelacanths it may not serve a major role in primary sex determination, but may do so in testis differentiation and adult testis function. Unlike all the other vertebrates studied [38], [43], [81]–[83], DMRT3 is not expressed in L. menadoensis male gonad. The TGF-β family member GSDF is an important gene in teleost fish gonad development and displays much higher expression in testis than in ovary [39], [40]. A duplicate of GSDF may actually have become the master male sex determination gene in Oryzas luzonensis [29]; there is strong evidence that in O. latipes the master male sex determining gene dmrt1bY upregulates GSDF and that upregulation correlates with early testis differentiation [45]. No GSDF homologue has yet been identified outside teleosts. Identification in our study of a bona fide GSDF sequence in Latimeria and its high expression in testis (which also points at its functional conservation) suggests that the gene arose already at the base of the fish lineage, but was later lost during tetrapod evolution. GSDF thus appears to be an ancestral male sex-determining gene. In the absence of functional data on GSDF function in fish, it remains unclear whether during tetrapod testis development another TGF-β family member may have taken over the function it exerted in teleosts and coelacanths. The high expression in the L. menadoensis testis transcriptome of SOX9, SOX10, WT1, AMH, DHH, SF-1 and SDR5A1 and 3 (at least compared to liver), the low expression of AMHR2, and the absence of the female factors FST, RSPO-1, WNT4, FOXL2, aromatase, and oestrogen receptor transcripts are in line with their expression patterns documented in many vertebrate species and with their proposed function in sexual development. In particular, the AMH/AMH-receptor system is of interest for Latimeria sexual development. In mammals and most likely in all tetrapods AMH induces Müllerian duct regression. Teleosts do not have Müllerian ducts, whereas lungfish and Latimeria possess oviducts that are homologous to those of tetrapods [84]. Despite the absence of Müllerian ducts, AMH/AMH-receptor system has an important function in the manifestation of gonadal sex in teleosts, because in medaka AMH signalling is crucial in regulating germ cell proliferation in early gonad differentiation [85]. Given that AMH and AMH-receptor are expressed in L. menadoensis adult testis, the AMH signalling system is present and probably active there, as in adult teleosts [86], [87], [88], whereas in mouse testis the system is downregulated before sexual maturity [89]. Several of the 33 genes tested, all of which are involved in sex determination and differentiation in other organisms, were found to be abundantly expressed in the liver transcriptome. The high CTNNB1 levels were expected, due to the ubiquitous function of this signal transducer of the WNT pathway. High FST expression agrees with its expression in all vertebrates and with the finding that it is required for liver cell growth homeostasis in mice [90]. This non-gonadal function of the gene may be conserved in coelacanths. Similarly the transcription factor GATA-4, besides a role in gene regulation in testis development [91], is also involved in the control of a number of liver genes, explaining why transcripts of the coelacanth homologue were found in both tissues. In contrast to coelacanths, where 5α-reductase 2 is highly expressed in liver, the 5α-reductase 1 isoform is differentially regulated by androgens and glucocorticoids in rat liver, resulting in high expression in this tissue, while 5α-reductase 2 is preferentially expressed in gonads [92]. This may indicate lineage-specific sub-functionalization of the isozymes during evolution. The absence of SOX8 expression in Latimeria testis was unexpected. In other vertebrates, including teleost fish, it is readily detected in this organ, and in mammals it has been assigned an important function in the FGF9/SOX9 interaction loop to maintain Sertoli cell identity by acting redundantly to SOX9 [6], [93]. Such back-up function does not seem to be required in Latimeria testis maintenance, or may have been lost in the extant coelacanth lineage. In medaka SOX9 is required for germ cell proliferation and survival, but not for testis determination [94]. Together with the other L. menadoensis findings this may indicate that the sex-determining function was acquired later in tetrapod lineage, after the split of teleost and coelacanth lineages. Intriguing data were found for FGF9 and 20, which together with FGF16 constitute a gene subfamily of paracrine FGFs. The critical role of FGF9 in mammalian testis development is well established and appears to be conserved in all tetrapods. On the other hand, the gene is not found in any teleost genome ([63], [64], ENSEMBL), unlike FGF16 and 20 (the latter being duplicated due to the teleost genome duplication). In the amphioxus an FGF gene is basal to the three FGFs in tetrapods [95]. FGF9 could thus be a later duplicate of either FGF16 or 20, and its role in testis development could be interpreted as an innovation arising in tetrapods. However, identification of FGF9 in Latimeria supports an origin during the 1R/2R whole genome duplication events that took place in ancestral chordates and its loss in the lineage leading to teleosts. In the teleost Oreochromis niloticus (Tilapia) FGF20b and FGF16 are both expressed in ovary, whereas only FGF16 is (poorly) expressed in testis [64]. Together with the complete absence of FGF9, FGF20 and FGF16 expression in L. menadoensis liver and testis, this indicates that the function of FGF signalling in testis, in particular the central role of FGF9, was acquired later in tetrapod evolution. Surprisingly, the ERβ gene was expressed in the liver of the male coelacanth. A previous study of the same individual had disclosed expression of the vitellogenin genes vtgABI, II and III [50]. Vitellogenins are yolk proteins physiologically expressed in the liver of reproductive females upon induction by oestrogens. Thus expression of vitellogenins and oestrogen receptor indicates the presence of oestrogens in this male specimen. They could derive from environment pollutants, as reported in a number of specimens from polluted waters; however this individual lived in Bunaken Marine Park in submarine caves at a depth of 100 to 200 m, i.e. in a relatively protected environment. Alternatively, ERβ expression could be the result of a pathological condition, of a hormone imbalance due to ageing, or of a physiological feature of coelacanths.

Conclusions

Analysis of the coelacanth testis transcriptome, reported here for the first time, disclosed important new information on which genes involved in sexual development and testis differentiation in other organisms are present and expressed in this living fossil and on the evolution of this process in vertebrates. Interestingly, some genes that are generally considered critical for testis maintenance in all vertebrates, like SOX8 or a fibroblast growth factor gene from the FGF9/16/20 subfamily, do not play this role in Latimeria. This finding and the high GSDF expression found in the coelacanths make their transcript profile more similar to that of modern fish. In summary, the coelacanth testis transcriptome is expected to contribute further important information to reconstruct the ancestral tetrapod situation and indicates that evolutionary innovations for sexual development occurred already during the transition from water to land. Sequence pair comparison of male sex development genes. Sequence pair comparison of male sex-determining/differentiation transcripts from the L. menadoensis transcriptome and L. chalumnae ENSEMBL predictions. Boxes represent CDSs. Lines represent UTRs. Dashed boxes represent a missing part in the CDS. Green lines/boxes represent an inaccurate gene prediction or a mismatch between L. chalumnae and L. menadoensis sequences. Scale dimension are preserved. B) Sequence pair comparison of female sex development genes. Sequence pair comparison of female sex-determining/differentiation transcripts from the L. menadoensis transcriptome and L. chalumnae ENSEMBL predictions. Boxes represent CDSs. Lines represent UTRs. Dashed boxes represent a missing part in the CDS. Green lines/boxes represent an inaccurate gene prediction or a mismatch between L. chalumnae and L. menadoensis sequences. Scale dimension are preserved. (PDF) Click here for additional data file. Micro-syntenic conservation of . Micro-syntenic conservation of genomic regions containing CYP11B genes. Black pentagons represent CYP11B genes. The pentagon tip points to the relative gene orientation. ENSEMBL data: H. sa (Homo sapiens), M. mu (Mus musculus), B. ta (Bos taurus), L. ch (Latimeria chalumnae), D. re (Danio rerio), T. ru (Takifugu rubripes). (PDF) Click here for additional data file. Gene Ontology analysis of the “sex determination” term. (PDF) Click here for additional data file. Gene Ontology analysis of the “sex differentiation” term. (PDF) Click here for additional data file.
  89 in total

Review 1.  From the origin of sex-determining factors to the evolution of sex-determining systems.

Authors:  Tobias Uller; Heikki Helanterä
Journal:  Q Rev Biol       Date:  2011-09       Impact factor: 4.875

Review 2.  Sex determination and sexual differentiation in the avian model.

Authors:  Justin Chue; Craig A Smith
Journal:  FEBS J       Date:  2011-02-25       Impact factor: 5.542

3.  The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

Authors:  Aude Gautier; Florence Le Gac; Jean-Jacques Lareyre
Journal:  Gene       Date:  2010-10-31       Impact factor: 3.688

4.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

Review 5.  Dmrt1 genes at the crossroads: a widespread and central class of sexual development factors in fish.

Authors:  Amaury Herpin; Manfred Schartl
Journal:  FEBS J       Date:  2011-02-25       Impact factor: 5.542

6.  Sex determination: switch and suppress.

Authors:  Amaury Herpin; Manfred Schartl
Journal:  Curr Biol       Date:  2011-09-13       Impact factor: 10.834

7.  Amphioxus FGF signaling predicts the acquisition of vertebrate morphological traits.

Authors:  Stephanie Bertrand; Alain Camasses; Ildiko Somorjai; Mohamed R Belgacem; Olivier Chabrol; Marie-Line Escande; Pierre Pontarotti; Hector Escriva
Journal:  Proc Natl Acad Sci U S A       Date:  2011-05-12       Impact factor: 11.205

8.  Genome size, GC percentage and 5mC level in the Indonesian coelacanth Latimeria menadoensis.

Authors:  Daisy Monica Makapedua; Marco Barucca; Mariko Forconi; Niki Antonucci; Davide Bizzaro; Adolfo Amici; Maria Rita Carradori; Ettore Olmo; Adriana Canapa
Journal:  Mar Genomics       Date:  2011-07-08       Impact factor: 1.710

9.  DMRT1 prevents female reprogramming in the postnatal mammalian testis.

Authors:  Clinton K Matson; Mark W Murphy; Aaron L Sarver; Michael D Griswold; Vivian J Bardwell; David Zarkower
Journal:  Nature       Date:  2011-07-20       Impact factor: 49.962

10.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

View more
  21 in total

Review 1.  Plasticity of gene-regulatory networks controlling sex determination: of masters, slaves, usual suspects, newcomers, and usurpators.

Authors:  Amaury Herpin; Manfred Schartl
Journal:  EMBO Rep       Date:  2015-09-09       Impact factor: 8.807

2.  Genetic sex determination in Astatotilapia calliptera, a prototype species for the Lake Malawi cichlid radiation.

Authors:  Erin N Peterson; Maggie E Cline; Emily C Moore; Natalie B Roberts; Reade B Roberts
Journal:  Naturwissenschaften       Date:  2017-04-25

Review 3.  Genetics of sexual development: an evolutionary playground for fish.

Authors:  Corina Heule; Walter Salzburger; Astrid Böhne
Journal:  Genetics       Date:  2014-03       Impact factor: 4.562

4.  Analysis of the transcriptome of the Indonesian coelacanth Latimeria menadoensis.

Authors:  Alberto Pallavicini; Adriana Canapa; Marco Barucca; Jessica Alfőldi; Maria Assunta Biscotti; Francesco Buonocore; Gianluca De Moro; Federica Di Palma; Anna Maria Fausto; Mariko Forconi; Marco Gerdol; Daisy Monica Makapedua; Jason Turner-Meier; Ettore Olmo; Giuseppe Scapigliati
Journal:  BMC Genomics       Date:  2013-08-08       Impact factor: 3.969

5.  Gonadal soma controls ovarian follicle proliferation through Gsdf in zebrafish.

Authors:  Yi-Lin Yan; Thomas Desvignes; Ruth Bremiller; Catherine Wilson; Danielle Dillon; Samantha High; Bruce Draper; Charles Loren Buck; John Postlethwait
Journal:  Dev Dyn       Date:  2017-09-25       Impact factor: 3.780

6.  Sequencing and De Novo Assembly of the Gonadal Transcriptome of the Endangered Chinese Sturgeon (Acipenser sinensis).

Authors:  Huamei Yue; Chuangju Li; Hao Du; Shuhuan Zhang; Qiwei Wei
Journal:  PLoS One       Date:  2015-06-01       Impact factor: 3.240

7.  Divergent expression regulation of gonad development genes in medaka shows incomplete conservation of the downstream regulatory network of vertebrate sex determination.

Authors:  Amaury Herpin; Mateus C Adolfi; Barbara Nicol; Maria Hinzmann; Cornelia Schmidt; Johanna Klughammer; Mareen Engel; Minoru Tanaka; Yann Guiguen; Manfred Schartl
Journal:  Mol Biol Evol       Date:  2013-07-24       Impact factor: 16.240

Review 8.  Evolution, Expression, and Function of Gonadal Somatic Cell-Derived Factor.

Authors:  Chen-Wei Hsu; Bon-Chu Chung
Journal:  Front Cell Dev Biol       Date:  2021-07-07

9.  The sex-specific transcriptome of the hermaphrodite sparid sharpsnout seabream (Diplodus puntazzo).

Authors:  Tereza Manousaki; Alexandros Tsakogiannis; Jacques Lagnel; Elena Sarropoulou; Jenny Z Xiang; Nikos Papandroulakis; Constantinos C Mylonas; Costas S Tsigenopoulos
Journal:  BMC Genomics       Date:  2014-08-06       Impact factor: 3.969

10.  Transcriptomics of two evolutionary novelties: how to make a sperm-transfer organ out of an anal fin and a sexually selected "sword" out of a caudal fin.

Authors:  Ji Hyoun Kang; Tereza Manousaki; Paolo Franchini; Susanne Kneitz; Manfred Schartl; Axel Meyer
Journal:  Ecol Evol       Date:  2015-01-23       Impact factor: 2.912

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.