| Literature DB >> 35886982 |
Arie Yehuda Curzon1,2, Andrey Shirak1, Ayana Benet-Perlberg3, Alon Naor3, Shay Israel Low-Tanne3, Haled Sharkawi3, Micha Ron1, Eyal Seroussi1.
Abstract
Oreochromis niloticus has been used as a reference genome for studies of tilapia sex determination (SD) revealing segregating genetic loci on linkage groups (LGs) 1, 3, and 23. The master key regulator genes (MKR) underlying the SD regions on LGs 3 and 23 have been already found. To identify the MKR in fish that segregate for the LG1 XX/XY SD-system, we applied short variant discovery within the sequence reads of the genomic libraries of the Amherst hybrid stock, Coptodon zillii and Sarotherodon galilaeus, which were aligned to a 3-Mbp-region of the O. aureus genome. We obtained 66,372 variants of which six were concordant with the XX/XY model of SD and were conserved across these species, disclosing the male specific figla-like gene. We further validated this observation in O. mossambicus and in the Chitralada hybrid stock. Genome alignment of the 1252-bp transcript showed that the figla-like gene's size was 2664 bp, and that its three exons were capable of encoding 99 amino acids including a 45-amino-acid basic helix-loop-helix domain that is typical of the ovary development regulator-factor-in-the-germline-alpha (FIGLA). In Amherst gonads, the figla-like gene was exclusively expressed in testes. Thus, the figla-like genomic presence determines male fate by interrupting the female developmental program. This indicates that the figla-like gene is the long-sought SD MKR on LG1.Entities:
Keywords: cichlids; figla-like; master key regulator; sex determination; tilapia
Mesh:
Year: 2022 PMID: 35886982 PMCID: PMC9316214 DOI: 10.3390/ijms23147636
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Figure 1Genetic markers (red) and genes (blue) previously linked with sex on LG1. Positions (Mbp) of genes and genetic makers on the current On genomic map build (Genome accession number: GCA_001858045.3) are denoted. Gene symbols of genes mentioned in this article are indicated below or above the bars that delineate their positions.
Sequence variations in the SD region 1 on the O. aureus genomic map, which fit an XY model in Amherst strain (As), S. melanotheron 2 (Sm), and C. zillii (Cz).
| Position 3 | REF 4 | ALT | M | M | M | Region | 5′ End | 3′ End | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 25,672,475 | C | T, G | 0/0 | 0/1 | 0/0 | 0/2 | 0/0 | 0/1 | |||
| 26,488,670 | T | A | 1/1 | 0/1 | 1/1 | 0/1 | 1/1 | 0/1 | intergenic |
|
|
| 26,490,716 | G | C | ./. | 0/0 | ./. | 1/1 | ./. | 1/1 | |||
| 26,490,863 | C | T | ./. | 0/0 | ./. | 1/1 | ./. | 1/1 | |||
| 26,509,215 | A | C | 0/0 | 0/1 | 0/0 | 0/1 | 0/0 | 0/1 | intergenic |
|
|
| 26,510,329 | C | *, T | 0/0 | 0/1 | 0/0 | 0/2 | 0/0 | 0/1 | intergenic |
|
|
1 The SD interval was chosen for analysis based on synteny with the reference O. niloticus genome. 2 The Sm definition is according to depositors of the library; however, this is challenged in paragraph 2.5. 3 in bp on the Oa map. 4 0—reference allele (REF, O. aureus), 1—alternative (ALT) allele, 2—alternative allele, ./.—null call, M—males, F—females. A, C, G, T and *—adenine, cytosine, guanine, thymine, and deletion, respectively.
Genomic organization of the figla-like gene in O. aureus.
| Intron 1 | Exon | Intron | Size | |
|---|---|---|---|---|
| no. | Size | |||
|
…TCCAGCC | 1 | 174 |
TGGAACG | 1290 |
| cttac | 2 | 147 | TGACAAT | 122 |
|
atttt | 3 | 931 | CAGTCCT | |
1 Intron and exon sequences are written in lowercase and uppercase letters, respectively. The first and last two bases of the introns are presented in bold type (gt and ag for donor and acceptor splice sites, respectively). The initiation and stop codons are shown in bold and underlined. Considering the predicted transcript (Nucleotide accession number: XM_031726851.2), the genomic size of the figla-like gene was 2664 bp.
Figure 2Protein sequence and phylogenetic tree of the figla-like gene. (a) An alignment of the predicted proteins encoded by figla-like and figla proteins. The alignment includes five shorter polypeptides, which are of figla-like protein groups (G1–4). Protein group G1 (blue) consists of the identical polypeptides of C. zillii (Cz) (Nucleotide accession number: OW742498) and S. galilaeus (Sg) (Nucleotide accession number: OW742804). Protein group G2 (red) consists of the identical polypeptides of O. urolepis hornorum (Oh) (Nucleotide accession number: OW740593) and O. mossambicus (OmI) (Nucleotide accession number: OW739941). Protein group G3 (green) consists of the identical polypeptides of Chitralada strain (Cs) (Nucleotide accession number: OW739608), O. aureus genomic build (LOC116310109), O. aureus from Ein-Feshka (Oa Ein-Feshka) (Nucleotide accession number: OW770257), and Amherst strain (As) (Nucleotide accession number: OX031319). O. tanganicae (Ot, green) (Nucleotide accession number: OW739839) is shown out of G3, as its sequence differs by an additional D residue. P. mariae (Pm, purple) (Nucleotide accession number: OW742294) is the only member in G4. The alignment also includes three partial figla polypeptides for Oa (Protein accession number: XP_039476449.1), Danio rerio (Dre) (Protein accession number: NP_944601.2), and Mus musculus (Mm) (Protein accession number: NP_036143.1). Dashes indicate gaps introduced by the alignment program. Identical amino-acid residues in at least four of eight sequences are indicated by a black background. White boxes indicate nonconservative amino-acid changes between the proteins, whereas gray boxes indicate conservative changes. The black line represents the position of a 45-long basic helix–loop–helix (bHLH) domain found in factor-in-the-germline-alpha (FIGLA) proteins. The amino-acid numbering follows that of the full alignment of figla with figla-like genes (Figure S2). Below, exon–intron boundaries are delineated. (b,c) Comparison between the phylogenetic trees of figla-like predicted proteins and barcode cox1 DNA sequences. The trees were generated by MEGAX [66] using the Maximum Likelihood method using models with the best Bayesian information criterion (BIC) levels and default setting (5 categories, +G, parameter = 0.2071). Numbers at tree junctions indicate the percentage of trees that correspond to the consensus bootstrap tree (500 replicates) using MUSCLE. (b) JTT matrix-based model with a discrete Gamma distribution was used to model evolutionary rate differences among the figla-like predicted proteins. The scale on the X-axis represents the distance in number of amino-acid substitutions per site. (c) Hasegawa–Kishino–Yano model discrete Gamma distribution [67] was used to study evolutionary rate differences among the mitochondrial DNA sequences. The scale on the X-axis represents the distance in number of nucleotide substitutions per site. Barcode sequences and accession numbers of O. niloticus (On) of Egyptian and Ghanaian origin and of others are provided in the Supplementary Materials (Table S3).
Polymerase chain reaction (PCR) primers for generation of amplicons for fragment analysis and resequencing of the figla-like gene in S. galilaeus (Sg) and the Chitralada strain (Cs).
| Marker | Primers | Assay | GenBank | Positions | Amplicon Size (bp) | ||
|---|---|---|---|---|---|---|---|
| Start | End | ||||||
| LG1y | F | AACCAAGCCAAAATGTGAGC | Duplex fragment analysis | LOC116310109 | 1520 | 1821 | 302 |
| R | CATTCACTTGCCAGAGGTCA | ||||||
| LG1x | F | TCTGTGAAGCACTTTGGCATA | Duplex fragment analysis | NC_031965.2 | 24,979,876 | 24,980,010 | 135 |
| R | CTGCACCTCCTCCAATTGTT | ||||||
| Reseq1 | F | CTTGCACTGGCCTTGAGTTT | Resequencing of | NC_031965.2 | 26,489,072 | 26,490,461 | 1390 |
| R | AAAAATACAGCCAATACATCTGGT | ||||||
| Reseq2 | F | AAAACCAAACAAGGTCACAATTC | Resequencing of | NC_031965.2 | 26,490,237 | 26,491,052 | 816 |
| R | CATTTCAAGGACTGACAGCAA | ||||||
| Reseq3 | F | TGACCTCTGGCAAGTGAATG | Resequencing of | ERZ9148259 | 1556 | 2526 | 971 |
| R | ATGCCTGGACTGGAAACAAG | ||||||
| Reseq4 | F | TGACCTCTGGCAAGTGAATG | Resequencing of | NC_031965.2 | 26,490,991 | 26,491,772 | 782 |
| R | GCCGAGCAGAGCCTAGTTTA | ||||||
Association of sex with the figla-like sequence in two O. mossambicus (OmI) families, S. galilaeus (Sg), C. zillii (Cz), and Amherst (As) and Chitralada (Cs) strains.
| Species | Genotype 1 | Females | Males | |
|---|---|---|---|---|
| xy | 0 | 8 | 0.0002 | |
| xx | 7 | 0 | ||
| xy | 0 | 8 | 0.0003 | |
| xx | 6 | 0 | ||
|
| xy | 0 | 15 | 0.0001 |
| xx | 18 | 1 | ||
|
| xy | 0 | 9 | <0.0001 |
| xx | 13 | 0 | ||
|
| xy | 0 | 58 | <0.0001 |
| xx | 33 | 0 | ||
|
| xy | 2 | 11 | 0.0001 |
| xx | 12 | 0 |
1 xx and xy genotypes correspond to LG1x/LG1x and LG1x/LG1y, respectively (Table 3, Figure S3). 2 Fisher’s exact test. 3 Electronic PCR based on pooled samples, SRA accession numbers: SRX3638079 and SRX3638078. 4 Electronic PCR based on pooled samples, SRA accession numbers: SRX726489 and SRX726488. 5 Identical results were obtained using marker BYL018.
Figure 3Proposed model of the sex related genes’ expression during LG1-driven gonad differentiation. On the right, the table shows the observed ratio between the RPKM values of these genes calculated for As at 45 days posthatch (Table S4). With increased expression in the ovary (red), Figla upregulates a subset of transcripts orthologous to mouse germline genes (nobox, sohlh1, taf4b, and gdf9) and their downstream genes, which are essential for early oocyte development [83]. Increased expression in the testis (blue) of a subset of known testis genes (figla-like, sox9a, dmrt1, amh, wt1a, gsdf, and sf-1) [84] is compatible with diverting the default female into the male developmental program.
The schematic allelic state of the SD systems for LGs 1, 3 and 23 in O. niloticus and O. aureus 1.
| Linkage Group | ||||
|---|---|---|---|---|
| Sex | 1 | 3 | 23 | |
| Species/Proposed SD MKR | ||||
|
| Male | xx | ZZ | XY |
| Female | xx | ZZ | XX | |
|
| Male | yy | ZZ | XX |
| Female | yy | WZ | XX | |
1 This table also integrates our previously published results [12,14].